@SubstantialFrivolity comments on "Culture War Roundup for the week of April 28, 2025

Culture War Roundup for the week of April 28, 2025

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

Shaming.
Attempting to 'build consensus' or enforce ideological conformity.
Making sweeping generalizations to vilify a group you dislike.
Recruiting for a cause.
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
Don't imply that someone said something they did not say, even if you think it follows from what they said.
Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

Jump in the discussion.

No email address required.

SubstantialFrivolity I'm not even supposed to be here today 7mo ago

There's a real sense in which it's just getting better at everything. It started out decent at some areas of code, maybe it could write sql scripts ok but you'd need to double check it. Now it can handle any code snippet you throw at it and reliably solve bugs one shot on files with fewer than a thousand lines.

What? That just isn't true. I've tried to have it write code and it's still in the same shitty place it was three years ago. You get something which looks correct, but maybe it is and maybe it isn't, and you have to double check every time. Which is to say, AI tools still slow you down rather than speed you up.

This is why I'm so skeptical that we'll have AI any time soon. The current tools aren't even good at the things their advocates say they are good at, let alone harder things. I have yet to see any substance behind the hype, at all.

Context

Tenaz SubstantialFrivolity 7mo ago

Yeah, I keep hearing this claim from people and keep rebutting it.

It's better than it was three years ago, and the autocomplete functionality saves some time, but it definitely can't "handle any code snippet you throw at it" and its reliability for solving bugs is like 1/10 maybe.

aqouta SubstantialFrivolity 7mo ago

Have you actually used the latest tooling? What tasks have you actually had it try? This seems incredibly unlikely to me.

SubstantialFrivolity I'm not even supposed to be here today aqouta 7mo ago

I just use whatever ChatGPT has to offer, which would mean yes I'm using the latest tooling (since they keep it up to date). I've tried a variety of things - writing config files for programs we use at work, writing shell scripts, and asking it to explain how to do tasks in AWS CloudFormation. The first and the third tasks it just makes shit up (in some cases even dreaming up code which isn't even syntactically valid), I've found it to be completely useless for those. I've gotten some mileage in shell scripting, where it does fine as long as I keep the request small (like a few lines) so it can't trip over itself. But shell scripting is also an area I'm incredibly weak (essentially I can read bash but can't write it well at all), so it has the biggest gains to make over my own skill there. In cases where I actually know the language well, there's no benefit to me to use these tools. Like I said, if I have to check carefully every time I have it generate something (and you really do), then that's not actually speeding me up.

SnapDragon SubstantialFrivolity 7mo ago

Here. I picked a random easyish task I could test. It's the only prompt I tried, and ChatGPT succeeded zero-shot. (Amusingly, though I used o3, you can see from the thought process that it considered this task too simple to even need to execute the script itself, and it was right.) The code's clear, well-commented, avoids duplication, and handles multiple error cases I didn't mention. A lot of interviewees I've encountered wouldn't do nearly this well - alas, a lot of coders are not good at coding.

Ball's in your court. Tell me what is wrong with my example, or that you can do this yourself in 8 seconds. When you say "like a few lines", is that some nonstandard usage of "few" that goes up to 100?

Even better, show us a (non-proprietary, ofc) example of a task where it just "makes shit up" and provides "syntactically invalid code". With LLMs, you can show your receipts! I'm actually genuinely curious, since I haven't caught ChatGPT hallucinating code since before 4o. I'd love to know under what circumstances it still does.

phailyoor SnapDragon 7mo ago

Wow! You asked chatgpt to solve a dead simple toy problem, and it solved it! I'm so impressed!!!! Surely this means that chatgpt is definitely capable of handling actual real world tasks.

-2

Amadan Letting the hate flow through me phailyoor 7mo ago

Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.

SnapDragon phailyoor 7mo ago

Not my fault @SubstantialFrivolity chose to set the bar this low in his claims. An existence proof is all I need. But hey, you are fully free to replace that sarcasm with your example of how deficient ChatGPT/Claude is. Evidence is trivially easy to procure here!

Lizzardspawn SnapDragon 7mo ago

I think that we are in the phase of chatgpt in which calligraphs and stenographs don't see the point of typewriters. I would definitely say that chatgpt has saved me a lot of time and made me more productive.

It's sensitive to context and prompting. When having it write bash scripts have you consider just dumping the man files into the context? Don't bother actually formatting them, just dump anything that could possibly be relevant into the prompt.

You keep banging this drum. It's so divorced from the world I observe, I honestly don't know what to make of it. I know you've already declared that the Google engineers checking in LLM code are bad at their job. But you are at least aware that there are a lot of objective coding benchmarks out there, which have seen monumental progress since 3 years ago, right? You can't be completely insulated from being swayed by real-world data, or why are you even on this forum? And, just for your own sake, why not try to figure out why so many of us are having great success using LLMs, while you aren't? Maybe you're not using the right model, or are asking it to do too much (like write a large project from scratch).

monumental progress since 3 years ago, right?

On paper benchmarks number goes up, but it hasn't translated into real usefulness. Also Goodhart's law

What is this place?

Why are you called The Motte?

New post guidelines

Rules

Recommended Posts And Communities

Recommended Realtime Chats