Hacker News .hnnew | past | comments | ask | show | jobs | submit | bazlan's commentslogin

Sad to not see vui on the comparisons!

A 100M podcast model

https://huggingface.co/spaces/fluxions/vui-space


fluxions.ai has a similar model


As someone who has worked in TTS for over 4 years now. I can tell you that evaluation is the most difficult aspect of generative audio ML.

How will this really check that the models are performing well vs just listening?


We're focused on end-to-end evals focused on function-call accuracy, style, tone & latency of the conversations between our sims and your voice agent. Less focused on pure TTS evals at the moment!


Let it die


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: