site banner

Friday Fun Thread for May 10, 2024

Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.

3
Jump in the discussion.

No email address required.

So OpenAI's big Monday reveal was basically 'Her' (if you've seen the movie).

It's called gpt-4 omni. It's not smarter than the already existing gpt-4, but it is much faster and can interpret live video and audio and respond with a pretty human sounding voice with almost no delay.

https://v.redd.it/k2mrmyhfi80d1

They're going to mine so much valuable data from people with this thing.

Also it's pretty impressive and cool. Could see this being of help to lonely people. But then after getting into a dependent relationship with AI they'll be even more stuck in their own bubble than before, as far as actual human contact is concerned.

Btw, any chance this capability will be available for offline, open source models anytime soon?

Yes. If you use OA's you don't have to build your own scaffolding though.

You'd want to get completions from an LLM that's been fine tuned on conversational transcripts with timestamps and explicit markings for when the speaker changes. It should be possible to generate the dataset to fine tune on from podcast transcripts in a mostly automated fashion. Something along the lines of this. Getting the quality high enough and the latency low enough is likely to be a challenge.