Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think we should make standard browser API for transcribing, otherwise each website wanting to implement private voice recognition will need to download 500MB of data


Perhaps we should call it Web Speech API.


I was trolling, sorry.

This API already exists. It isn't nearly as good as Whisper.cpp (at least on macOS).

Docs: https://developer.mozilla.org/en-US/docs/Web/API/SpeechRecog...

Demo: https://codepen.io/Rumyra/pen/NWLyLe


An important limitation of Web Speech API is that it only accepts audio from a microphone, you can't transcribe an audio file or a WebRTC call.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: