@100ProofTollBooth comments on "Culture War Roundup for the week of August 18, 2025

Culture War Roundup for the week of August 18, 2025

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

Shaming.
Attempting to 'build consensus' or enforce ideological conformity.
Making sweeping generalizations to vilify a group you dislike.
Recruiting for a cause.
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
Don't imply that someone said something they did not say, even if you think it follows from what they said.
Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

Jump in the discussion.

No email address required.

100ProofTollBooth Dumber than a man, but faster than a dog. 3mo ago

There are two companion articles of late that I'd add to comment on this.

Why LLMs can't actually build software

This one is pretty short and to the point. LLMs, without any companion data management component, are prediction machines. They predict the next n-number of tokens based on the preceding (input) tokens. The context window functions like a very rough analog to a "memory" but it's really better to compare it to priors or biases in the bayesian sense. (This is why you can gradually prompt an LLM into and out of rabbit holes). Crucially, LLMs don't have nor hold an idea of state. They don't have a mental model of anything because they don't have a mental anything (re-read that twice, slowly).

In terms of corporate adoption, companies are seeing that once you get into complex, multi-stage tasks, especially those that might involve multiple teams working together, LLMs break down in hilarious ways. Software devs have been seeing this for months (years?). An LLM can make nice little toy python class or method pretty easily, but when you're getting into complex full stack development, all sorts of failure modes pop up (the best is when it nukes its own tests to make everything pass.)

"Complexity is the enemy" may be a cliche but it remains true. For any company above a certain size, any investment has to answer the question "will this reduce or increase complexity?" The answer may not need to be "reduce." There could be a tradeoff there that actually results in more revenue / reduced cost. But still, the question will come up. With LLMs, the answer, right now, is 100% "increase." Again, that's not a show stopper, but it makes the bar for actually going through with the investment higher. And the returns just aren't there at scale. From friends at large corporations in the middle of this, their anec-data is all the same "we realized pretty early that we'd have to build a whole new team of 'LLM watchers' for at least the first version of the rollout. We didn't want to hire and manage all of that."

AWS may have shown what true pricing looks like

TLDR for this one: for LLM providers to actually break even, it might cost $2k/month per user.

There's room to disagree with that figure, but even the pro version of the big models that cost $200+ per month are probably being heavily subsidized through burning VC cash. A hackernews comment framed it well - "$24k / yr is 20% of a $120k / yr salary. Do we think that every engineer using LLMs for coding is seeing a 20% overall productivity boost?"

Survey says no (Note: there are more than a few "AI makes devs worse" research papers floating around right now. I haven't fully developed my own evaluation of them - I think a few conflate things - but the early data, such as it is, paints a grim picture)

I'm a believer in LLMs to be a transformational technology, but I think our first attempt with them - as a society - is going to be kind of a wet fart. Neither "spacing faring giga-civilizaiton" nor "paperclips ate my robot girlfriend." Two topical predictions are 1) One of the Big AI companies is going to go to zero. 2) A Fortune 100 company is going to go nearly bankrupt because of negligent use of AI, but not in a spectacular "it sent all of our money to china" way ... it'll be about 1 - 2 years slow creep of fucked up internal reporting and management before, all of a sudden, "we've entered a death spiral of declining revenue and rising costs."

Context

RandomRanger Just build nuclear plants! 100ProofTollBooth 3mo ago

An LLM can make nice little toy python class or method pretty easily, but when you're getting into complex full stack development, all sorts of failure modes pop up

I'm using it for full stack development on a $20 plan and it works. I guess it depends on what you mean by complex full stack development, how complex is complex? I wouldn't try to make an MMO or code global air traffic controls with AI but it can definitely handle frontend (if supervised by a human with eyes), backend, database, API calls, logging, cybersecurity...

And sure it does fail sometimes with complex requests, once you go above 10K lines in one context window the quality lowers. But you can use it to fix errors it makes and iterate, have it help with troubleshooting, refactor, focus the context length on what's critical... Seems like there are many programmers who expect it to one-shot everything and if it doesn't one-shot a task they just give up on it entirely.

The metr paper is somewhat specialized. It tests only experienced devs working on repositories they're already familiar with as they mention within, the most favourable conditions for human workers over AI: https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/

Secondly, Claude 3.7 is now obsolete. I recall someone on twitter saying they were one of the devs in that study. He said that modern reasoning models are much more helpful than what they had then + people are getting better at using them.

Given that the general trend in AI is that inference costs are declining while capability increases, since the production frontier is moving outwards, then investment will probably pay off. Usage of Openrouter in terms of tokens has increased 30x within a year. The top 3 users of tokens there are coding tools. People clearly want AI and they're prepared to pay for it, I see no reason why their revealed preference should be disbelieved.

https://openrouter.ai/rankings

kky RandomRanger 3mo ago

Two small notes. First, you are almost certainly being heavily subsidized on that $20 plan. All the evidence points in that direction. You may be paying 1-2 orders of magnitude under cost. Second, the most interesting part of the METR paper was that the devs thought they were being sped up, but the opposite was true. Provably so. Intuitions on AI efficacy cannot be trusted prima facia. Many people find them enjoyable and interesting to use, which of course is their right, but we should not trust their estimates on the actual utility of the tool. Both of these facts seriously undermine the boosters’ case.

rae kky 3mo ago

If you think you’re being subsidised on a $20/month plan, switch to using the API and see the price difference. Keep in mind that providers make a profit on the API too - if you go on OpenRouter, random companies running Deepseek R1 offer tokens at a 7x cheaper rate than Claude Sonnet 4 despite Deepseek most likely being a large model.

As @RandomRanger said, it would make little sense for ALL companies to be directly subsidising users in terms of the actual cost of running the requests - inference is honestly cheaper than you think at scale. Now, many companies aren’t profitable in terms of revenue vs. R&D expenditure, but that’s a different problem with different causes, in part down to them not actually caring about efficiency and optimisation of training runs; who cares when you have billions in funding and can just buy more GPUs?

But the cat’s out of the bag and with all the open weight models out there, there’s no risk of the bigcos bumping up your $20/mo subscription to $2000/mo, unless the USD experiences hyperinflation at which point we’ll have other worries.

RandomRanger Just build nuclear plants! kky 3mo ago · Edited 3mo ago

Does anyone seriously think that these tech companies are selling $200+ worth of compute for $20? The natural assumption should be that they're making good margins on inference and all the losses are due to research/training, fixed costs, wages, capital investment. Why would a venture capitalist, who's whole livelihood and fortune depends on prudent investment, hand money to Anthropic or OpenAI so they can just hand that money to NVIDIA and me, the customer?

Anthropic is providing its services for free to the US govt but that's a special case to buy influence/cultivate dependence. If you, a normal person, mega minmax the subscription you might use more than you pay for but not by that much and the average subscriber will use less. Plus you might praise it online and encourage other people to use the product so it's a good investment.

What evidence points in this direction of ultra-benign, pro-consumer capitalism with 10x subsidies? It seems like a pure myth to me. Extraordinary claims require extraordinary evidence.

Take OpenAI. Sam Altman said he was losing money on the $200 subscription. But Sam Altman says a lot of things and he didn't say 'losing 10x more than we gain'.

The company has projected that it would record losses of about $5 billion and revenue of $3.7 billion for 2024, the New York Times reported in September. The company’s biggest cost is due to the computing power used to run ChatGPT. Not only does it require huge investments in data centers, it also demands vast amounts of electricity to run them.

If the company is losing 150% of revenue (and Anthropic is similar), not 1000% or higher, then clearly it's what I'm saying, not what you're saying. Inference/API is profitable. User subscriptions are profitable. Investment is not profitable in the short term, that's why it's called investment. And they have their fixed costs... That's why AI companies are losing money, they're investing heavily and competing for users.

Furthermore, one study of a selected group of coders doing a subset of software tasks with old models does not disprove the general utility of AI generally, it's not a major, significant fact. I could find studies that show that AI produces productivity gains quite easily. That wouldn't mean that it produces productivity gains in all settings, for all people.

Here's one such study for instance, it finds what you'd expect. Juniors gain more than seniors.

https://mitsloan.mit.edu/ideas-made-to-matter/how-generative-ai-affects-highly-skilled-workers

Or here he lists some more and finds productivity gains with some downsides: https://addyo.substack.com/p/the-reality-of-ai-assisted-software

The metr paper just tells (some) people what they want to hear, it is not conclusive any more than the other papers are conclusive. And a lot of people don't read the metr paper closely. For instance:

Familiarity and inefficiency in use: These devs were relatively new to the specific AI tools. Only one participant had >50 hours experience with Cursor; notably, that one experienced user did see a positive speedup, suggesting a learning curve effect. Others may have used the AI sub-optimally or gotten stuck following it down wrong paths.

100ProofTollBooth Dumber than a man, but faster than a dog. RandomRanger 3mo ago

A couple things;

The natural assumption should be that they're making good margins on inference and all the losses are due to research/training, fixed costs, wages, capital investment.

This is a fun way to say "If you don't count up all my costs, my company is totally making money." Secondarily, I don't know why you would call this a "natural" assumption. Why would I naturally assume that they are making money on inference? More to the point, however, it's not that they need a decent or even good margin on inference, it's that they need wildly good margins on inference if they believe they'll never be able to cut the other fixed and variable costs. You say "they aren't selling $200 worth of inference for $20" I say "Are they selling $2 of inference for $20"?

Why would a venture capitalist, who's whole livelihood and fortune depends on prudent investment, hand money to Anthropic or OpenAI so they can just hand that money to NVIDIA and me, the customer?

Because this is literally post 2000s venture capital strategy. You find product-market fit, and then rush to semi-monopolize (totally legal, of course) a nice market using VC dollars to speed that growth. Not only do VCs not care if you burn cash, they want you to because it means there's still more market out there. This only stops once you hit real scale and the market is more or less saturated. Then, real unit economics and things like total customer value and cost of acquisition come into play. This is often when the MBAs come in and you start to see cost reductions - no more team happy hours at that trendy rooftop bar.

This dynamic has been dialed up to 1,000 in the AI wars; everyone thinks this could be a winner-take-all game or, at the very least, a power low distribution. If the forecast total market is well over $1 trillion, then VCs who give you literally 10s of billions of dollars are still making a positive EV bet. This is how these people think. Burning money in the present is, again, not only okay - but the preferred strategy.

Anthropic is providing its services for free to the US govt.

No, they are not. They are getting paid to do it because it is illegal to provide professional services to the government without compensation. Their federal margins are probably worse than commercial - this is always the case because of federal procurement law - but their costs are also almost certainly being fully covered. Look into "cost plus" contracting for more insight.

See my second point above. This is the VC playbook. Uber didn't turn a profit for ever. Amazon's retail business didn't for over 20 years and now still operates with thin margins.

I don't fully buy into the "VCs are lizard people who eat babies" reddit style rhetoric. Mostly, I think they're essentially trust fund kinds who like to gamble but want to dress it up as "inNovATIon!" But one thing is for sure - VCs aren't interested in building long term sustainable businesses. It's a game of passing the bag and praying for exits (that's literally the handle of a twitter parody account). Your goal is to make sure the startup you invested in has a higher valuation in the next round. If that happens, you can mark your book up. The actual returns come when they get acquired, you sell secondaries, or they go public ... but it all follows the train of "price go up" from funding round to funding round.

What makes a price? A buyer. That's it. All you need is for another investment firm (really, a group of them) to buy into a story that your Uber For Cats play is actually worth more now then when you invested. You don't care beyond that. Margins fucked? Whatever. Even if you literally invested in a cult, or turned your blind eye to a magic box fake product, as long as there is a buyer, it's all fine.

You say "they aren't selling $200 worth of inference for $20" I say "Are they selling $2 of inference for $20"?

Why don't we try and look into this? People have tried to estimate OpenAI margins on inference and they come away with strong margins of 30, 55, 75%. We don't live in a total vacuum of information. When trying to work out their margins on inference, I base my opinion on the general established consensus of their margins.

they need wildly good margins on inference if they believe they'll never be able to cut the other fixed and variable costs

The demand for inference is rising, Openrouter records that demand for tokens rose about 30x in the last year as AI improves. Grow big enough and the margin on inference will outweigh the costs.

They are getting paid to do it

It's effectively free, they're 'selling' it for $1 per agency for a whole year. OpenAI is doing the same thing. Why are you trying to correct me on something you won't even check?

There is a significant difference between making a loss as you expand your business rapidly and try to secure a strong position in an emerging market and 'subsidized by 1-2 orders of magnitude'. No evidence has been supplied for the latter case and it's unbelievable.

Amazon wasn't making a profit because they were continuously expanding and investing in their retail business, not because the actual business was unprofitable. Investors were happy to tolerate them not making profits because they were growing. Uber wasn't making a profit but there were no 10x subsidies. We can see this immediately in how taxis weren't costing $20 while Uber was costing $2 for the same trip.

rae 100ProofTollBooth 3mo ago

If the Big AI companies try to actually implement that kind of pricing, they will face significant competition from local models. Right now you can run Qwen3-30B-A3B at ridiculous speeds on medium-end gaming rig or a decent Macbook, or if you're a decently sized company, you could rent a 8xH200 rig 8h/day, every workday, for ~$3.5k/mo, and give 64 engineers simultaneous, unlimited access to Deepseek R1 with comparable speed and performance to the big known models, so like... $55/month per engineer. And I highly doubt they're going to fully saturate it every minute of every workday, so you could probably add even more users, or use a quantized/smaller model.

Corvos rae 3mo ago

you can run Qwen3-30B-A3B at ridiculous speeds on medium-end gaming rig

How are you doing that? Qwen3-30B-A3B-Q5_K_M.gguf is 21.7Gb, are you running it at 1it/s slowly swapping off the SSD or is your idea of a medium-end gaming rig a 3/4/5090?

I mostly gave up on local models because they hit such an obvious intelligence barrier compared to the big ones, but would love to give this a shot if you explain what you're doing. I have 16Gb VRAM.

gattsuru Corvos 3mo ago

The default settings for LM Studio on a GTX 3060, i3-12100, and 48GB DDR5 memory at pretty conservative XMP, using qwen3-30b-a3b-Q_6K gave >13 token/s for this test question. That's not amazing, and I could probably squeak out a lot more performance going to 4bpw or more aggressive tuning, but it's still performant enough to work.

((Although I'd put the code quality pretty low -- in addition to the normal brittleness, there's some stupid ~~bugs~~ unexpected behaviors with filesystemwatcher, and most LLMs so far have walked straight into them. But since some of the other LLMs don't even get the question understood enough to use a filesystemwatcher, there's a bit of a curve here.))

rae Corvos 3mo ago · Edited 3mo ago

You can get 10-20 tokens/s with CPU only inference as long as you have at least 32GB of RAM. You can offload some layers to your GPU and get probably 30-40 tokens/s? Of course, a 3090 gives you >100t/s but it’s still only $800, I’d consider that mid-range compared to a $2k+ 5090.

Swapping from the SSD is only necessary if you’re running huge 100B+ models without enough RAM.

100ProofTollBooth Dumber than a man, but faster than a dog. rae 3mo ago

Yes.

Which is why the Big AI companies are looking to tightly couple with existing enterprise SaaS and/or consumer hardware as fast as possible. And I'm reasonably sure that the large hardware companies may want to aid them. NVIDIA keeps making noise about "AI first" hardware at, I think, a consumer level.

They really do want a version of Sky Net.

What is this place?

Why are you called The Motte?

New post guidelines

Rules

Recommended Posts And Communities

Recommended Realtime Chats