Automatic audio to title

keruxjeff wrote on 1/14/2024, 4:07 PM

I am looking for a simple solution to take spoken audio and convert it to titles (not captions). Is this a possibility with any of the Magix products?

Intel i7-3610QM @ 2.30GHz; Microsoft Windows NT 6.2.9200.0 64-bit; 1 TB Samsung SSD 860 EVO Drive; Intel(R) HD Graphics 4000 1600x900; Windows 10, Version 1909 (OS Build 18363.476); VPX 11 upgraded from MAGIX Movie Edit Pro MX Premium Download Version.

Comments

johnebaker wrote on 1/15/2024, 2:02 AM

@keruxjeff

Hi

None of the Magix video products have Speech to Text features.

For good Speech to Text conversion you are going to be looking at using third party software which can get expensive eg Nuance's Dragon Professional - formerly known as Dragon Naturally Speaking.

For one-off conversions there are several online products available, however accuracy, and text quantity, can be an issue with the 'free' and low cost options.

All will provide a text file which you then have to copy/paste into a title object in program.

HTH

John EB
Forum Moderator

VPX 16, Movie Studio 2025, and earlier versions 2015 and 2016, Music Maker Premium 2024.

PC - running Windows 11 23H2 Professional on Intel i7-8700K 3.2 GHz, 16GB RAM, RTX 2060 6GB 192-bit GDDR6, 1 x 1Tb Sabrent NVME SSD (OS and programs), 2 x 4TB (Data) internal HDD + 1TB internal SSD (Work disc), + 6 ext backup HDDs.

Laptop - Lenovo Legion 5i Phantom - running Windows 11 23H2 on Intel Core i7-10750H, 16GB DDR4-SDRAM, 512GB SSD, 43.9 cm screen Full HD 1920 x 1080, Intel UHD 630 iGPU and NVIDIA GeForce RTX 2060 (6GB GDDR6)

Sony FDR-AX53e Video camera, DJI Osmo Action 3 and Sony HDR-AS30V Sports cams.

Former user wrote on 1/15/2024, 4:12 AM

@keruxjeff You could upload your vid to YT where you can download a ,vtt, .srt, .sbv file or you can copy the Transcript, the button is in the Description,, SRT files do load into Vegas but as mentioned these files don't import into Magix. (don't think they do anyway)

JeanCollier wrote on 1/31/2024, 6:34 AM

Transcribe is an application that can assist you with this. Also, if you want to know more about the application, ask filmy4wap web experts. They have done this well.

CubeAce wrote on 1/31/2024, 6:58 AM

@JeanCollier @johnebaker @Former user @keruxjeff

Personally I hate any form of automatic transcription that can often throw up anomalies whichever way it is employed.

Not knowing the differences between row, row, and roe is just one example of many. Most people that use such services on YouTube never seem to check their work after or are oblivious to the mistakes.

Then again I may just be showing my age as speech such as 'rareerer' and 'mostest' seems to be encroaching speech more in some cultures. I know language evolves over time and when I was younger and even now, some of the grammatical errors I can accept and do not bother me are grating to other people older than myself, but it is annoying to start to find myself becoming a part of that group of discontents.

Ray.

 

Windows 10 Enterprise. Version 22H2 OS build 19045.5011

Direct X 12.1 latest hardware updates for Western Digital hard drives.

Asus ROG STRIX Z390-F Gaming motherboard Rev 1.xx with Supreme FX inboard audio using the S1220A code. Driver No 6.0.8960.1 Bios version 1401

Intel i9900K Coffee Lake 3.6 to 5.1GHz CPU with Intel UHD 630 Graphics .Driver version Graphics Driver 31.0.101.2130 for 7th-10th Gen Intel® with 64GB of 3200MHz Corsair DDR4 ram.

1000 watt EVGA modular power supply.

1 x 250GB Evo 970 NVMe: drive for C: drive backup 1 x 1TB Sabrent NVMe drive for Operating System / Programs only. 1X WD BLACK 1TB internal SATA 7,200rpm hard drives.1 for internal projects, 1 for Library clips/sounds/music/stills./backup of working projects. 1x500GB SSD current project only drive, 2x WD RED 2TB drives for latest footage storage. Total 21TB of 8 external WD drives for backup.

ASUS NVIDIA GeForce RTX 3060 12GB. nVidia Studio driver version 560.81 - 3584xCUDA cores Direct X 12.1. Memory interface 192bit Memory bandwidth 360.05GB/s 12GB of dedicated GDDR6 video memory, shared system memory 16307MB PCi Express x8 Gen3. Two Samsung 27" LED SA350 monitors with 5000000:1 contrast ratios at 60Hz.

Running MMS 2024 Suite v 23.0.1.182 (UDP3) and VPX 14 - v20.0.3.180 (UDP3)

M Audio Axiom AIR Mini MIDI keyboard Ver 5.10.0.3507

VXP 14, MMS 2024 Suite, Vegas Studio 16, Vegas Pro 18, Cubase 4. CS6, NX Studio, Mixcraft 9 Recording Studio. Mixcraft Pro 10 Studio.

Audio System 5 x matched bi-wired 150 watt Tannoy Reveal speakers plus one Tannoy 15" 250 watt sub with 5.1 class A amplifier. Tuned to room with Tannoy audio application.

Ram Acoustic Studio speakers amplified by NAD amplifier.

Rogers LS7 speakers run from Cambridge Audio P50 amplifier

Schrodinger's Backup. "The condition of any backup is unknown until a restore is attempted."

emmrecs wrote on 1/31/2024, 7:56 AM

@JeanCollier

Can you please explain more about your post? What is it that needs to be turned off?

Without a lot more context from you it is likely that your post could be hidden by a moderator since at least it appears to have nothing to do with the original question asked.

Jeff
Forum Moderator

Win 10 Pro 64 bit, Intel i7 Quad Core 6700K @ 4GHz, 32 GB RAM, NVidia GTX 1660TI and Intel HD530 Graphics, MOTU 8-Pre f/w audio interface, VPX, MEP, Music Maker, PhotoStory Deluxe, Photo Manager Deluxe, Xara 3D Maker 7, Samplitude Pro X7 Suite, Reaper, Adobe Audition 3, CS6 and CC, 2 x Canon HG10 cameras, 1 x Canon EOS 600D, Akaso EK7000 Pro Action Cam