site banner

Culture War Roundup for the week of November 4, 2024

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

  • Shaming.

  • Attempting to 'build consensus' or enforce ideological conformity.

  • Making sweeping generalizations to vilify a group you dislike.

  • Recruiting for a cause.

  • Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

  • Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.

  • Be as precise and charitable as you can. Don't paraphrase unflatteringly.

  • Don't imply that someone said something they did not say, even if you think it follows from what they said.

  • Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

8
Jump in the discussion.

No email address required.

OpenAI Shifts Strategy to Slower, Smarter AI as GPT Scaling Limits Emerge, OpenAI's upcoming Orion model shows how GPT improvements are slowing down

Paywalled, but here's a summary from reddit:

"Some OpenAI employees who tested Orion report it achieved GPT-4-level performance after completing only 20% of its training, but the quality increase was smaller than the leap from GPT-3 to GPT-4, suggesting that traditional scaling improvements may be slowing as high-quality data becomes limited

  • Orion's training involved AI-generated data from previous models like GPT-4 and reasoning models, which may lead it to reproduce some behaviors of older models
  • OpenAI has created a "foundations" team to develop new methods for sustaining improvements as high-quality data supplies decrease
  • Orion's advanced code-writing features could raise operating costs in OpenAI's data centers, and running models like o1, estimated at six times the cost of simpler models, adds financial pressure to further scaling
  • OpenAI is finishing Orion's safety testing for a planned release early next year, which may break from the "GPT" naming convention to reflect changes in model development

“Some researchers at the company believe Orion isn’t reliably better than its predecessor in handling certain tasks, according to the employees. Orion performs better at language tasks but may not outperform previous models at tasks such as coding, according to an OpenAI employee. That could be a problem, as Orion may be more expensive for OpenAI to run in its data centers compared to other models it has recently released, one of those people said.”

This is one of several articles/posts/tweets coming out of the LLMsphere over the past couple of weeks that are renewing concerns over LLMs hitting diminishing returns.

Of course this is just speculation until OpenAI actually releases Orion (or whatever they end up calling it). And really we would need several models past Orion too to actually extrapolate a pattern. But this does fit with my subjective impression that the leap from GPT-3 to GPT-4 was not as big as the leap from GPT-2 to GPT-3, and the leap from 4 to o1 was not as big as the leap from 3 to 4. The fact that they're considering again releasing a new model without calling it GPT-5 is also telling. They know how psychologically important the "GPT-5" moniker has become at this point and they won't give that name to a model unless it really represents a major leap forward.

Speculation: It’s interesting that the bottleneck is given as lack of data rather than architecture. That opens up the possibility that we may be able to get things moving again by finding some other method of obtaining/creating useable data.

LLMs were historically created to use next-token-prediction as a means of solving natural language processing tasks. I think we can regard that problem as provisionally solved. When people talk about GPTs limits, they aren’t talking about its ability to take English input and produce readable English output. They are talking about general intelligence: the ability to output sensible, useful English output.

In short, LLMs are general learning machines using natural language as a proxy task. Natural language is cheap and information rich but any means of conveying information about the world is fair game, provided that it can be converted into the same token space that GPT is using using CLIP or something similar.

What is needed is large quantities of data that conveys causal information about the world. Video is probably a good place to start. Some kind of simulated self-play might also be useable. What else could be useable?

(I’m not sure how next-token prediction would work here)

In effect LLMS aren't smart, they are just great at recognizing patterns they are trained on. Google is great at recognizing text strings that it remembers, LLMS don't need matching strings they match on patterns and are able to combine patterns from multiple sources. LLMs aren't truly intelligent because they are dumbfounded if there isn't a good matching pattern in the training set. They are stumped in a way a human isn't if they encounter something new.

LLMs aren't going to replace humans because the set of all data is miniscule to the set of all potential patterns in the world.

LLMs aren't going to replace humans because the set of all data is miniscule to the set of all potential patterns in the world.

I mean, you can say LLMs aren't going to replace humans...but the 'potential patterns in the world' are all reducible to data in one way or another.

So some Machine trained on language AND physics data AND biology AND etc. etc. is still a potential contender, no?

I suppose it comes down to whether or not there is a ghost in the machine.

If human intelligence is all neurons that can be modeled as a graph with weighted edges then we should be able to simulate it.

Maybe we do that and still can’t get human intelligence to pop out of the simulated brain and find that something is missing.

It would be a bit funny if they design a machine that is provably a 1:1 simulation of a human brain, switch it on, and get an error message to the effect of "Cannot Execute Commands: This unit is not ensouled."

“Humunculus not installed: please refer to manual.”

I mean, that kind of sounds like you're saying it's provably not a 1:1 simulation of a human brain.

What you're describing is measurable evidence of new physics. Every physicist in the world would want to buy you a beer.

...Please contact your local soul provider for further information

If you believe that you've received this message in error, please contact your system administrator at t0.yahweh.root.

Control thread not found.