Friday Fun Thread for June 26, 2026

Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.

A little over a year ago I posted about ChatGPT's voice mode, which was, at the time, not much more than text-to-speech grafted onto the chat interface. A few weeks later I posted about a startup doing some impressive language processing, but its capabilities were limited to a 5-minute demo call and some snippets they were showcasing on their website. A few weeks ago I found out that Gemini rolled out a voice mode, which leans on the power of the Gemini LLM combined with a very decent voice synthesis model. At the time of writing, it is freely available in the Gemini app.

I haven't done much more than a few minute-long conversations, but I foresee myself using it more in the future. It would be an excellent companion for long road trips. The only thing holding me back is the privacy implications, as Google is now going to have a pretty decent collection of my recent voice data. I asked it how it does voice synthesis/transcription, and it sounds like they do all the processing off-device (i.e. they take your microphone audio, analyze/transcribe it in the cloud, and generate an audio response back) rather than using on-device text-to-speech.

You guys should try it out and report back.

Gemini's voice chat has been out for a while (a few months at least) and I use it occasionally. The other day I drove to a random historic town and didn't feel like reading, so I just pulled it up to ask about the town I was in and specific things I'm interested in (the architecture, unusual facts, and so on). It did an OK job but was a bit repetitive and didn't have a super deep understanding of the town. I wanted to keep talking to it, but when I got back into my car and the audio switched to the car's speakers, it would begin to talk, apparently hear itself, immediately stop talking, and then just loop that over and over. I hope they fix that soon.

When they first came out with it, I would use it to practice German, then Spanish, then Japanese, all in the same conversation, so that was fun. It was pretty good at pointing out my mistakes in a constructive way.