Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is that audio all generated? All the pauses, breaths, speed ups and everything?


From the "Help" modal:

"Illuminate is an experimental technology that uses AI to adapt content to your learning preferences. Illuminate generates audio with two AI-generated voices in conversation, discussing the key points of select papers. Illuminate is currently optimized for published computer science academic papers.

As an experimental product, the generated audio with two AI-generated voices in conversation may not always perfectly capture the nuances of the original research papers. Please be aware that there may be occasional errors or inconsistencies and that we are continually iterating to improve the user experience."


Wow. I did not pick anything in the voice as a clue that it's generated. So does it make it current best text to audio system?


I don’t know if Google’s specifically is the best, but these new GenAI-based text-to-speech systems blow away everything else.


Really? Maybe I was just listening too hard to it and could hear it pretty well in some of the weird cadence and pacing.

If it was shorter audio and I wasn't prepared for it to be AI, it would definitely be harder to notice.


GCP's text to speech options, equally amazing

https://cloud.google.com/text-to-speech/docs/voice-types#cha...




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: