Questions tagged [speech-to-text]

Automatically converting recordings of speech into text

0 votes
0 answers
59 views

Add Transcript to Audacity Track

I use Audacity for post-production of a podcast. It would be nice to have a transcription below the audio tracks, so it's easier to navigate. Is there a way to automatically transcribe a track and add ...
dirdi
  • 3,275
0 votes
0 answers
43 views

Add timestamps to output? pocketsphinx

Been using pocketsphinx to successfully grab transcripts from wav files, and it made sense to me that there would be an argument for adding timestamps. Actually, there are SO MANY arguments in the ...
Wolfpack'08
  • 1,251
1 vote
0 answers
277 views

Speaker diarization with Node.js using OpenAI

I am developing Whisper transcription in Node.js using the OpenAI API. I am able to get the transcriptions for chunked files using the ffmpeg library. I am struggling at the point of speaker diarization ...
Zeenath
  • 111
0 votes
0 answers
835 views

Windows 11 voice typing commands not working

So I recently discovered the Windows 11 voice typing feature (Win + H) and started using it. Allegedly it supports commands like "correct word" or "select word", but it doesn't work for ...
TheKidsWantDjent
0 votes
1 answer
53 views

How can I use Dragon Professional Individual along with Windows 10's text prediction feature?

I use the text prediction feature in Windows 10: It seems to prevent Dragon Professional Individual 15.6 from transcribing my speech into the current field. How can I use Dragon Professional ...
Franck Dernoncourt
5 votes
2 answers
5k views

How do I change the speech recognition language (Windows 11)?

I would like to switch the speech recognition between two languages in Windows 11 (speech to text to use in text boxes). So far I tried: Switching the language priority in the time&language/...
Albin
  • 10.9k
0 votes
2 answers
918 views

Speaker diarization for 3+ speakers using Azure

Does Azure's batch transcription support speaker diarization for more than 2 speakers? I checked their REST API documentation and didn't find anything relevant. Are there other ways to do this using ...
Christian Adib
0 votes
1 answer
92 views

Can one define one's own words and phrases in Microsoft Windows 11's voice access?

From https://beebom.com/what-is-voice-access-windows-11-how-use/: Voice access is a new Windows 11 accessibility feature that makes it easier to control your Windows 11 PC using only your voice. Can ...
Franck Dernoncourt
0 votes
3 answers
2k views

How to convert video to transcript locally on PC?

I am seeking a certain feature. At my university, the professor records his lectures and uploads the videos to his website. On the website, he has some kind of software which converts his ...
Galaxy
  • 223
4 votes
3 answers
26k views

Google Chrome Live Caption Edit or Copy Text

Google Chrome has a new speech-to-text feature that works on any video (English only). Is there any way to edit or select the text in this box? I tried some methods, but it's not a DOM element. I can't edit or ...
F.Penb
  • 51
1 vote
2 answers
518 views

How can you speed play mp4 files in chrome?

I was looking for ways to auto-generate subtitles for mp4 files and I ran across Chrome's experimental live captioning feature. However, I also want to play the mp4 file at 1.5x speed. How can I ...
smaillis
  • 111
0 votes
1 answer
49 views

Windows 10 Dictator not working [closed]

Windows 10 Dictator not working.
Smart Manoj
0 votes
1 answer
78 views

How to create a written transcript of a BBC radio broadcast where the media player does not provide Closed Captioning?

I am not entirely sure that this is the right site to ask this question. If it is not, kindly migrate the question to the appropriate SE site. On June 15, 2018 BBC World Service Weekend aired a program that ...
guest271314
1 vote
0 answers
1k views

Google voice speech recognition, beep (music) file location in Android for rooted device

I am working on speech recognition where I am continuously listening to user input. I have made a loop through which it listens continuously. Whenever the recognizer starts to listen it will play an ...
Maaz Patel
0 votes
1 answer
927 views

How to make a bulleted list with speech recognition?

I'm trying to outline my textbook using a speech recognition tool (Windows Speech Recognition, Google's voice typing, or any other free software). I haven't quite been able to get full outline ...
mikeLundquist
