Not affiliated with Sesame, but this is exactly what the realtime models are trying to solve. NVIDIA’s PersonaPlex release [0], for example, uses a duplex architecture. It’s based on Moshi [1], which addresses this problem by letting the model listen and generate audio at the same time.

[0] https://github.com/NVIDIA/personaplex

[1] https://arxiv.org/abs/2410.00037
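To make the duplex idea concrete, here's a minimal sketch (not the actual Moshi/PersonaPlex API; `DuplexModel` and `step` are illustrative names): instead of waiting for the user to finish a turn, the model consumes one incoming frame and emits one outgoing frame on every tick of the same clock, so listening and speaking overlap.

```python
class DuplexModel:
    """Toy stand-in for a full-duplex speech model: the real thing runs
    parallel audio token streams, but the timing pattern is the same."""

    def __init__(self):
        self.state = 0

    def step(self, user_frame):
        # Consume one incoming frame and produce one outgoing frame in
        # the SAME timestep -- no explicit turn-taking boundary.
        self.state += user_frame
        return self.state  # the model's output frame for this tick


def run_duplex(user_stream):
    model = DuplexModel()
    # Input and output advance together, frame by frame.
    return [model.step(f) for f in user_stream]


print(run_duplex([1, 2, 3]))  # one output frame per input frame
```

Contrast this with a half-duplex pipeline (ASR, then LLM, then TTS), where the model only starts generating after the user's turn ends, which is where most of the perceived latency comes from.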