
I'll try to understand some of how stuff like faster-whisper works when I've got time over the weekend, but I fear it may be too complex for me...

I was rather hoping for a guide on how to adapt either classic whisper usage or one of the optimised variants like faster-whisper (which I've just set up in a Docker container, but that has used up all the time I've got for playing around right now) so that it takes a text prompt along with the audio file.


