Questions tagged [speech-to-text]
Automatically converting recordings of speech into text
39
questions
0
votes
0
answers
59
views
Add Transcript to Audacity Track
I use Audacity for post-production of a podcast. It would be nice to have a transcription below the audio tracks, so it's easier to navigate.
Is there a way to automatically transcribe a track and add ...
0
votes
0
answers
43
views
Add timestamps to output? pocketsphinx
Been using pocketsphinx to successfully grab transcripts from wav files, and it made sense to me that there would be an argument for adding timestamps. Actually, there are SO MANY arguments in the ...
1
vote
0
answers
277
views
Speaker diarization with Node js using openai
I am developing whisper transcription in the node js using openai API. I am able to get the transcriptions for chunked files using ffmpeg library. I am struggling at the point of speaker diarization ...
0
votes
0
answers
835
views
Windows 11 voice typing commands not working
So I recently discovered the Windows 11 voice typing feature (Win + H) and started using it. Allegedly it supports commands like "correct word" or "select word, but it doesn't work for ...
0
votes
1
answer
53
views
How can I use Dragon Professional Individual along with Windows 10's text prediction feature?
I use the text prediction feature in Windows 10:
It seems to prevent Dragon Professional Individual 15.6 from transcribing my speech into the current field.
How can I use Dragon Professional ...
5
votes
2
answers
5k
views
How do I change the speech recognition language (Windows 11)?
I would like to switch the speech recognition between two languages in Windows 11 (speech to text to use in text boxes). So far I tried:
Switching the language priority in the time&language/...
0
votes
2
answers
918
views
Speaker diarization for 3+ speakers using Azure
Does Azure's batch transcription support speaker diarization for more than 2 speakers?
I checked their Rest API documentation and didn't find anything relevant.
Are there other ways to do this using ...
0
votes
1
answer
92
views
Can one define one's own words and phrases in Microsoft Windows 11's voice access?
From https://beebom.com/what-is-voice-access-windows-11-how-use/:
Voice access is a new Windows 11 accessibility feature that makes it easier to control your Windows 11 PC using only your voice.
Can ...
0
votes
3
answers
2k
views
How to convert video to transcript locally on PC?
I am seeking a certain feature. At my University, the professor is recording his lectures, and uploading the videos to his website. On the website, he has some kind of software which converts his ...
4
votes
3
answers
26k
views
Google Chrome Live Caption Edit or Copy Text
Google Chrome has new feature for speech to text any video. (only english)
Are there any way to edit or select text in this box. İ tried some methods but. It's not a dom element. I can't edit or ...
1
vote
2
answers
518
views
How can you speed play mp4 files in chrome?
I was looking for ways for auto-generate subtitles for mp4 files and I ran across chrome's experimental features of live captioning. However I also want to play the mp4 file at 1.5x speed. How can I ...
0
votes
1
answer
49
views
Windows 10 Dictator not working [closed]
Windows 10 Dictator not working.
0
votes
1
answer
78
views
How to create a written transcript of a BBC radio broadcast where the media player does not provide Closed Captioning?
Am not entirely sure that this is the site to ask this question. If it is not, kindly migrate the question to the appropriate SE site.
On June 15, 2018 BBC World Service Weekend aired a program that ...
1
vote
0
answers
1k
views
Google voice speech recognition,beep(music) file location in android for rooted device
I am working on speech recognization where I am continuously listening to user input.I have made a loop through which it listens continuously.
Whenever Recognizer starts to listen it will play an ...
0
votes
1
answer
927
views
How to make a bulleted list with speech recognition?
I'm trying to outline my textbook using a speech recognition tool (windows speech recognition, google's voice typing, or any other free software).
I haven't quite been able to get full outline ...