This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.
Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.
We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:
-
Shaming.
-
Attempting to 'build consensus' or enforce ideological conformity.
-
Making sweeping generalizations to vilify a group you dislike.
-
Recruiting for a cause.
-
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.
In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:
-
Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
-
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
-
Don't imply that someone said something they did not say, even if you think it follows from what they said.
-
Write like everyone is reading and you want them to be included in the discussion.
On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

Jump in the discussion.
No email address required.
Notes -
Inspired by @self_made_human 's suggestion, I want to offer a verifiable challenge to create a novel. It's not strictly coding but if you're willing to accept the challenge I think it will be interesting.
The challenge:
Write a 30,000-50,000 word novella with coherent characters, as well as a twist/reveal sometime after the midpoint. I'm purposely leaving the topic open, but happy to make the challenge more specific if that helps. It could be a thriller like by Michael Crichton or something even more ambiguous like John Steinbeck. Verification of the challenge will be done with LLM judges. Any agentic system or techniques are allowed, except for direct access to the judging criteria or plagiarism.
Requirements:
Verification:
The verification prompts will be run using a frontier LLM with a long context window, enough to put the entire novel in the context. The outputs of the verification prompts may be consumed by humans, but if the outcome, pass/fail is ambiguous, the verification prompts themselves should be tweaked asking for a clearer response, and run again. The verification prompts should be run using the API, not the web UI, using the default recommended settings of temperature and other sampling parameters, and run 5 (or more) times each to ensure an accurate result.
In order to prevent an AI agent from "gaming" the challenge, the agent must not be given access to run LLM judges directly on the success criteria. It may also not access the success criteria directly, but may be given it implicitly if phrased as general requests for good writing.
Astral Codex Ten has just posted a link to a contest offering 10 k$ for "the best AI-generated short story".
The judges include bigwigs Gwern and Alexander Wales.
More options
Context Copy link
Uh... I dunno that you need a cutting-edge model for that. I used a similar approach for this (cw: bad Jupiter Ascending fan-script). It's not good -- I'd say not even good as fanfiction -- and it's not even what I'd want written for the setting, and it's admittedly only into 13k words. But while it took three layers of "let's take these characters and flesh them out", "let's add this setting flesh out into a story outline", and then finally prompting the actual story, it did do it with minimal human intervention and none of it actually drawing the story plot. Putting even trivial effort into feedback, guidance, and pacing during the final prompting sequence would probably have helped a ton.
My problems are more than the character voices are really samey, the setting doesn't get enough interesting exploration, the twist doesn't get enough emphasis (and frankly isn't that interesting even in outline form: "why would anyone be willing to risk eternity for an unproven chance? Well, we happen to have a big pile of people that risked their lives and were trying to kill for a tiny improvement. Having eternal life only available to the elite kinda makes that a day-to-day thing."), and it keeps throwing extra characters in with too much detail rather than using the ones I was trying to emphasize. It's not necessarily incoherent, just bad.
((The LLMs do eventually notice that it's a Jupiter Ascending-with-names-filed-off-story if you try your review. Not sure whether that hurts or helps it as analysis, but given that the character tones sound nothing like their film counterparts I don't think it pollutes too much. And while my original fic efforts have been on content that you... probably will find even less appealing to read, original fic does work.))
I've got a busy week, but I might see what I can get out of a local LLM aiming for the longer form 30k words target, just to do a compare and contrast.
More options
Context Copy link
I'm not an LLM defender here, but I think most Tarantino movies fail this rubric.
I don't know why but that's fucking me upore than it probably should.
I intentionally made this criteria harder than just "write any novel that's entertaining."
But I think it's actually not as bad as you say. Let's take hateful eight, which is the most recent Tarantino movie I watched. Unfortunately we can't run an LLM judge on any popular movie because the LLM already knows what's going to happen, but giving my personal thoughts:
(spoilers)
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link