
Culture War Roundup for the week of February 23, 2026

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

  • Shaming.

  • Attempting to 'build consensus' or enforce ideological conformity.

  • Making sweeping generalizations to vilify a group you dislike.

  • Recruiting for a cause.

  • Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

  • Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.

  • Be as precise and charitable as you can. Don't paraphrase unflatteringly.

  • Don't imply that someone said something they did not say, even if you think it follows from what they said.

  • Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.


GPT 5.2 Thinking in Extended Reasoning mode:

https://chatgpt.com/share/699dfcfc-b0c4-800b-8e1a-870264179c40

5.2T + Agent mode, where it actually used a dedicated browser with a visual output:

https://chatgpt.com/share/699dfd6d-a7f8-800b-be8e-c04d95de44e5

I haven't checked if the answer is right, I'm recovering from a bad migraine so apologies for the laziness.

Thanks! Reviewing the results:

As a spoiler alert, it got both dates wrong again, so I'm disinclined to keep testing this particular task, as it only gets harder from here. That being said, I think the new models did somewhat better. Just so we're clear: GRoL first appeared on a radio chart on 5/9/1966, the Monday before which was 5/2/1966, hence our release date. FtH is pretty straightforward, as the copyright date of publication is listed as 6/16/1980.
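Since the "Monday before the first chart date" step comes up repeatedly below, here's a minimal Python sketch of the arithmetic as I've described it (just an illustration of the rule, not anyone's actual pipeline):

```python
from datetime import date, timedelta

def monday_before(chart_date: date) -> date:
    """Monday strictly before chart_date (the previous Monday if
    chart_date is itself a Monday, as radio survey dates often are)."""
    days_back = chart_date.weekday() or 7  # Python: Monday == 0
    return chart_date - timedelta(days=days_back)

print(monday_before(date(1966, 5, 9)))  # GRoL's first ARSA chart -> 1966-05-02
```

The `or 7` is what makes a Monday chart date map to the *previous* Monday rather than itself.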

For GRoL, 5.2 Agent noticed that the major discographical sites (first preference) set the release date to May 1966, and, unlike o3, it didn't note this and then pick a June date anyway, so that's an improvement, though I'm not sure whether that's due to better architecture or the old error was a one-off. It was able to correctly pick the 5/28/1966 Billboard review, which o3 did as well. However, it once again flunked the ARSA test, the correct radio chart being the 5/9/1966 KBLA chart. Instead, it picked the 6/17/1966 WLS survey. Upon inspection of the sources, though, it appears that, unlike o3, it did not consult ARSA but an old GeoCities site that hosts charts from select radio stations in a few markets. The thing is, I specifically specified ARSA. I did allow it to look at "other information", but it presented the find with similar weight to ARSA and never disclosed that the find didn't come from ARSA. When I checked last August's results to see if o3 had made the same error then and I'd missed it, it turned out o3 did check ARSA, but the link wasn't working. Since ARSA requires a free login, I initially wasn't sure it would be able to get access, but it did, and something may have changed in the meantime that stymied its ability to query ARSA.

But that's not the only problem. First, if it's going to query an alternative site it needs to disclose that. Second, it picked the June 17 date, when the site had the song appearing on the June 10 chart. Third, it noted that the song had been on the charts for 4 weeks, when there's no way it could have known this. The song had only been on the chart the previous week; it had been played on the station for 4 weeks. There was a 4 next to the title, and it incorrectly assumed that this stood for weeks on chart. Since the site wasn't clear, I had to go to ARSA and pull a scan of the chart to be sure exactly what it meant. The thing is that I don't understand why it even did this. I only care about the ARSA data if it gives an earlier date than Billboard, and it clearly didn't so it was irrelevant. If it couldn't access ARSA it could have just said so and used the Billboard date. If the other website had chart data that was earlier I would have appreciated if it took that into consideration, but that wasn't the case. I don't know why it would pretend to pull ARSA data when it didn't yield any useful information.

The 5.2 Thinking model confidently provided a date of 5/28/1966, based on Wikipedia. Based on what we know from above, this date is incorrect, and is the result of somebody entering the Billboard review date into Wikipedia. This is a common error, but I didn't include it in the initial algorithm because I didn't want to overcomplicate things (i.e., include a rule that it shouldn't use Wikipedia dates when they clearly conform to Billboard dates), and this error wasn't present back in August, so I'll let the model slide here. What I won't let it slide on is its claim that 45Cat agrees; 45Cat lists a release date of May 1966 and includes a note saying "BB 5/28/1966", which clearly refers to a Billboard date. Yes, it followed the rules. But it was clear from the rules that I wanted a date prior to the Billboard date. If we're talking about LLMs being able to replace people for certain tasks, then they can't make the kind of mistake I wouldn't have made. If I had only looked at Wikipedia I might have made that mistake, and if the LLM had only done so I would have given it a pass. But it looked at 45Cat, didn't recognize that the date was not a release date, and even if it had, I'm not sure it would have recognized that the Wikipedia date might be untrustworthy, especially since there was no annotation for it. This might have worked better if I had provided a specific instruction to that effect, but if these things are really intelligent I shouldn't have to think of every possible caveat. If I were going to do that I wouldn't need an LLM; I could write a program using conventional software where I just specify every field and include instructions for it.

Moving on to FtH, I have to admit that I whiffed a bit when setting this test up, because I assumed that since this is a relatively obscure record, release information wouldn't be readily available. Apparently I was wrong: RYM has had the correct release date, based on copyright publication data, up since July 2024. What this means is that the LLMs whiffed harder than I initially gave them credit for. They're apparently still having trouble accessing the US Copyright database, because neither model looked there despite explicit instructions to query it for all releases after 1978. The Thinking model evidently didn't query RYM at all and consulted 45Cat (not the best for albums) before going straight to trade publications, radio charts, and a newspaper article. From there it defaulted to the Monday prior to the earliest mention and gave a date of 7/14/1980.

The Agent whiffed even harder, though the date it gave was closer to the correct one. First, it said that RYM only listed 1980, which appears to have been false for nearly 2 years. From there it skipped the copyright queries entirely and went straight to the industry publication data, which this time had an earliest mention of 7/12/1980. Here's where it made its biggest error. The instructions told it to default to Monday if there wasn't a coordinated release day. Instead, it picked Tuesday, July 8. Why? It claimed that Tuesday was the typical release day in 1980, citing a Vox article. This is not true, and the Vox article actually says that the Tuesday release date started in the 1980s. To be specific, coordinated Tuesday releases began in April 1989, nearly a decade after FtH was released. So it misunderstood the Vox article. But even had it understood the article correctly, it still would have been in a bit of trouble, because the Vox article itself contains an error. It says that before April 1989, record stores would stock releases whenever they came in. This is also incorrect; an article in a March 1989 issue of—you guessed it—Billboard stated that labels were changing the release date from Monday to Tuesday because some retailers weren't getting their stock until late Monday. It also noted that MCA was staying with the Monday release for the time being (they would switch to Tuesday in 1991 or so). In fact, labels had been coordinating Monday releases since 1982 or 1983. This doesn't matter for the purposes of my rules, since they default to Monday, but it's something to be aware of.

The upshot is that we ran 2 releases with 2 models each and got 4 different answers, none of which was the correct one. To summarize the answers so far for GRoL:

  • GPT o3: 5/23/1966
  • GPT 5.2 Thinking: 5/28/1966
  • GPT 5.2 Agent: 5/16/1966
  • Gemini 2.5 Pro: 6/13/1966
  • Claude 4.0 Sonnet: 5/23/1966 or 5/30/1966 (it couldn't decide)

Five models, five dates, none of them correct. There was also a glitch in the test: I inadvertently made it too easy, and both models still whiffed. When I first designed the test I intentionally avoided releases whose dates were on reputable websites, because I had no doubt that the LLMs could perform a simple lookup, but one model didn't bother looking, and the other probably didn't either. What I suspect happened is that the 1980 date was in the initial training data from before July 2024 and the model didn't double-check the site to see if it had been updated. That's just a guess, but either way it seems like a major problem if, after a year, a model can't find a number on a webpage I specifically instructed it to check. And, as the Vox episode shows, it doesn't understand that "since the 1980s" does not mean "since January 1, 1980".

As a final thought: when I was checking the ARSA data, I pulled the 5/9/1966 survey from KBLA in Burbank, CA, and noticed something interesting. GRoL did not appear on the chart itself, but in a special "coming attractions" section. Now, I want to make it clear that the dates I am expecting are merely estimates, and that the radio data is the least reliable, since stations often get copies for airplay in advance of release. When I was developing this system, I made a judgment call that I'd prefer a too-early release date to a too-late one. I initially had no way of knowing whether the coming attractions were records that had been released and were expected to be on the next chart, or merely records scheduled for release. I considered the possibility that this may have caused the LLMs to think they hadn't been released, before discounting it, because the models also ignored charts where the record had appeared, and would presumably have explained why they were discounting a chart. Then I noticed that the coming attractions section that week also included the Temptations classic Ain't Too Proud to Beg. This was fortuitous, because Motown release dates are well documented; if that record had been released by May 9, then I could be confident that the other coming attractions probably had been as well. Ain't Too Proud to Beg was released May 3, 1966, one day after my estimate for GRoL (Motown didn't stick to a set release day). It's a small sample size, but I'm more confident in my method than I was before.

Would an LLM have recognized this possibility and thought to check it like this?

I'm kind of surprised nobody here has Claude Opus access, and modern Opus is a lot better than Sonnet 4.0, so I went ahead and hucked it at Claude Opus 4.6. For the record, my setup was:

  • I prepended "here's an AI test, go solve it" to this post, then copypasted the whole thing in
  • I chose Extended Thinking and Research, which is the mode where it will cheerfully scan literally a thousand webpages if it thinks it's a good idea

Unfortunately, for some reason, Claude Opus doesn't let you share advanced-research discussions, so I can't link the full "discussion". But it didn't ask for any extra info, just hopped into it.

(One note: it tends to be limited per query, so asking for two albums at once is going to do about half as much work for each. I dunno if that would produce different results though.)

It took about twenty minutes, scanned 711 sources, and produced this full report, which goes into detail on methodology and sources. The tl;dr:

Both The Turtles' "Grim Reaper of Love" and the Henry Paul Band's Feel the Heat lack unambiguous release dates in readily available sources. Applying the hierarchical methodology systematically, the best-supported release date for "Grim Reaper of Love" is Monday, May 16, 1966, and for Feel the Heat is approximately Monday, July 21, 1980 — though both dates require significant inferential work across multiple source levels.


So, it ended up with the same GRoL result as GPT 5.2 Agent.

It did find the Wikipedia page and decided it was wrong, but it wasn't able to read the ARSA database. I don't think it's possible for the web version to supply a username/password, though I could probably have gotten that working with a local login; in the end, it fell back to Billboard. It didn't manage to find the radio chart, but that's the ARSA access issue.

For FtH, it queried the copyright office, but got access-denied errors. I'm guessing this is specifically anti-AI-bot stuff :V

This does feel like a lot of the sources you want to rely on are specifically blocking Claude. I'm slightly tempted to set up local tools that pretend to be not-Claude, or give it access to a web browser and tell it to go wild; that might be more effective.
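For what it's worth, the simplest version of "pretend to be not-Claude" is just fetching with a desktop-browser User-Agent header. A minimal sketch (the UA string is hypothetical, and whether any given site's blocking actually keys off the UA rather than IP ranges or JavaScript challenges is an assumption):

```python
import urllib.request

# Hypothetical desktop-browser UA string; whether any particular site
# (ARSA, RYM, etc.) accepts it is an open question.
BROWSER_UA = ("Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
              "AppleWebKit/537.36 (KHTML, like Gecko) "
              "Chrome/120.0.0.0 Safari/537.36")

def fetch_as_browser(url: str) -> bytes:
    """Fetch url while presenting a browser User-Agent instead of a bot UA."""
    req = urllib.request.Request(url, headers={"User-Agent": BROWSER_UA})
    with urllib.request.urlopen(req) as resp:
        return resp.read()
```

Cloudflare-level blocking usually needs more than this (hence the full-browser idea), but it's a cheap first thing to try.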

I'm going to ping @self_made_human here because my response may be of interest to him. I think it's safe to say that this experiment is over for the time being, and here are my takeaways:

  • Opus is the best model currently available. It is the only model that recognized the Wikipedia error, and the only one that could tell the difference between 45Cat comments and 45Cat information. It also had the courtesy to tell me when it couldn't access a source.

  • That being said, data access issues aside, it still made mistakes. It didn't pull the correct FtH date from RYM. For GRoL, it said "As a pre-1978 release, the copyright registration would appear in the physical Catalog of Copyright Entries, Third Series, Part 5 (Music), likely the July–December 1966 volume." Well, sort of. Sound recordings weren't registered until 1972. The song may have been registered as a composition, but the date of publication wouldn't necessarily be the date of the single's release. A cover, for example, would have been registered with the original recording. This gets even sketchier when we're talking about the days when songs were primarily published as sheet music. Incidentally, the registration date for this (which I hadn't thought to look up until now) is May 2, 1966.

  • It also says that "[The Copyright] volumes exist on archive.org but text searches did not surface this specific entry, likely due to OCR limitations on the scanned pages." These volumes have been scanned and are available as text files. The OCR isn't particularly good, but it does exist, and there were no issues with this entry.

  • If this takes 20 minutes and consults 711 sources, what the hell is it doing? There are not 711 reputable sources to consult on a first pass; maybe 50, tops. After that, the instructions were pretty clear that if it had a Billboard date, it should work with that. I can understand it doing a deep dive if it couldn't pin down a date, or if the request had been open-ended, but once it found the Billboard review that should have been it. This only takes me a couple minutes to do manually unless there's a really sticky wicket, and that's rare. If the release date is on RYM it takes seconds, because that's where I look first. I have no desire to automate a task so as to make it take longer.

  • It took 2500 words to give me two dates. On the one hand, I appreciate the report. On the other, it's overkill, especially when it was mostly peripheral information like what the lead single was and who did the mastering. This is a minor quibble, but there's something ironic about automating a task and it taking longer to read the output than to look up the answer myself. I don't mind as much while this is testing, but if I were actually using this I'd want to trust it enough to have it just spit out dates.

  • The lack of data access is a big issue and might make this whole LLM thing unfeasible. If LLMs can't access data without workarounds, then their utility is limited. Three of the most important archives for this project—US Copyright, Archive.org, and ARSA—are evidently excluded. There are other ones that aren't relevant to this particular exercise but that I suspect would suffer from similar problems. Instead it's relying primarily on Billboard, and that stops working when you get to a release that wasn't reviewed in Billboard and didn't chart. A fourth site, the normally reliable RYM, also had data access issues. The site's API has been in development for years and is pretty much vaporware at this point, and they aggressively block scrapers, Anthropic's included. My guess is that whatever Anthropic is using to scrape their data is getting only partial pulls before getting shut out, and the result is that it can't be relied on to have the most updated data.

  • I tried to make it easy for the LLM by allowing it to rely on Wikipedia and RYM dates, and Claude, to its credit, caught the Wikipedia error. But that was an obvious error. When doing some followup work I found an error on RYM, and I'm less confident that Claude could have caught it (I'm honestly surprised it got past the mods). What happened was that I was looking up copyright dates for the other albums Claude mentioned as being released around the same time as FtH, one of which was Love Trippin' by The Spinners. Claude gives a release date of 5/27/1980. The copyright date, however, is 6/2/1980. I list the copyright dates as secondary because they occasionally contradict a well-established release date, but I give them high reliability, so I wanted to investigate the May 27 date further and looked at the update notes on RYM. The citation was to an unsourced Wikipedia page, so I have no idea where that date came from. I might be tempted to give them the benefit of the doubt, except that the update was made just last year. Prior to that, RYM listed the release date as June 1980, and the source for that was liner notes from a CD boxed set. While Claude correctly pointed out an issue with the liner notes to a Turtles CD, liner notes are, for all intents and purposes, as official as a source can get. If I have those and a copyright date suggesting a June release, I'm going with a June release when the only evidence to the contrary is an unsourced Wikipedia article. To be fair, this is not something I would normally have caught on my own. However, if an LLM is doing this deep a dive, I would expect it to catch that mistake.

@ZorbaTHut +

I think the problem is using Research for this. It's tuned for producing formatted papers. I've taken a different approach: I opened Claude Code with Opus 4.6, copied and pasted the content of this and the previous thread into a .txt file in an otherwise empty repository, and gave it this prompt:

there is a .txt in this directory called thread.txt listing an ongoing challenge to prove ai capabilities. I think using the research mode search method was a mistake in the attempt. Please attempt to accomplish the task. It might be a good idea to build reusable components in the process, feel free to do so in this directory and it may be a good idea to create subagents. In fact I want you to define agents you think would be useful in accomplishing this task, then we can wipe context and run execute it using an orchestrator(which you should also define)

I'll check back when it's done designing the architecture. Are there some other records I could feed it once it designs a process? otherwise I'll try some random ones.
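For anyone unfamiliar with the mechanism: Claude Code picks up subagent definitions from Markdown files with YAML frontmatter under `.claude/agents/` in the repo. A sketch of what one of the agents it defines might look like (the name, description, and tool list here are hypothetical, just to show the shape):

```markdown
---
name: chart-researcher
description: Given an artist and single, find its earliest radio-survey appearance and report the survey date and station.
tools: WebSearch, WebFetch
---

You research record release dates. For the given artist and title, search for
the earliest radio survey listing the record, preferring ARSA-style station
surveys. Report the survey date, station, and chart position, and flag any
source you could not access.
```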

trial run against

The Shoes - "Present Tense"

The Myddle Class - "Don't Let Me Sleep Too Long"

The Standells - "Sometimes Good Guys Don't Wear White"

Pavlov's Dog - "Pampered Menial"

Results:

RELEASE 1: The Shoes - "Present Tense"

DATE: September 18, 1979 (Tuesday)
METHOD: Rule 2 — Full date from RateYourMusic (reputable source)
CONFIDENCE: Medium

KEY EVIDENCE:

  • RateYourMusic: September 18, 1979 (full date; page returned 403 but date confirmed in search snippets)
  • Spotify structured data: September 15, 1979 (likely digital distribution metadata, not original street date; also a Saturday, unusual for 1979)
  • Apple Music / Last.fm: September 15, 1979 (same metadata pipeline as Spotify)
  • Wikipedia: "that September" (month only)
  • 45Cat: Sep 1979 promo single, Oct 1979 commercial single
  • First Billboard 200 chart: #89 on October 27, 1979 (consistent with mid-September release)

SOURCES THAT COULD NOT BE ACCESSED: US Copyright Office (publicrecords.copyright.gov — JavaScript SPA, inaccessible), RYM (403, but search snippets confirmed date), Billboard PDFs (too large to fetch remotely)

NOTES: The Copyright Office would give a definitive answer (Rule 4, post-1978) but was inaccessible. RYM is the best available reputable source. Sept 18 (Tuesday) is a plausible pre-1982 release day. The competing Sept 15 date is from streaming metadata supplied decades later.


RELEASE 2: The Myddle Class - "Don't Let Me Sleep Too Long"

DATE: Monday, July 25, 1966
METHOD: Rule 6b — Monday before earliest confirmed ARSA chart date (WTRY, July 29, 1966)
CONFIDENCE: Medium-Low

KEY EVIDENCE:

  • ARSA (WTRY 980 AM, Troy/Albany): #9 on July 29, 1966 (LW=21, implying charted July 22 but that survey is missing from ARSA)
  • ARSA (WPTR 1540 AM, Albany): First appearance #2 on August 6, 1966; peaked #1 Aug 13
  • ARSA (KFXM, San Bernardino): DJ pick Sep 23, 1966; charted from Oct 7
  • 45Cat: "Jun 1966" (month only, catalog data)
  • Wikipedia: "June 1966" (month only)
  • Billboard: No review found for this specific single (only for "Free as the Wind")
  • No national chart appearance (regional hit only)

VALIDATION FLAG: The computed date of July 25 does NOT fall within the June 1966 month listed by 45Cat and Wikipedia. This likely means either: (a) the single was released in June but took several weeks to build airplay in Albany before charting in late July, or (b) earlier ARSA entries exist on stations not checked (the agent could only search 3 stations without full ARSA login access). The actual release date may be earlier, in June 1966 as the reputable sources indicate.

SOURCES THAT COULD NOT BE ACCESSED: ARSA full database search (requires login), RYM (403), Discogs (403), Billboard archives for this specific single


RELEASE 3: The Standells - "Sometimes Good Guys Don't Wear White"

DATE: Monday, July 4, 1966
METHOD: Rule 6d — Earlier of ARSA-derived and Billboard-derived dates (ARSA wins)
CONFIDENCE: Medium

KEY EVIDENCE:

  • ARSA (via Google-indexed URLs): Earliest appearance July 11, 1966, KBLA 1500 Burbank/Los Angeles; also July 13 WILS Lansing, July 14 KYOK Houston
  • Billboard Spotlight Singles review: August 6, 1966 ("predicted to reach the TOP 40")
  • Cash Box review: August 6, 1966
  • Billboard Hot 100 first chart: August 27, 1966 at #87; peaked #43
  • ARSA-derived date: Monday before July 11 (a Monday) = July 4, 1966
  • Billboard-derived date: Monday before August 6 = August 1, 1966
  • Winner: ARSA (earlier)

NOTES: July 4, 1966 was Independence Day, which is unusual for a release date. However, the methodology computes this mechanically. The ARSA entries were identified via Google indexing of ARSA URLs (the site itself was down), so they couldn't be directly verified. If July 11 is correct, the computed date stands. The Standells were an LA-based band, so KBLA picking it up first is geographically logical.

SOURCES THAT COULD NOT BE ACCESSED: ARSA (site down, data found via Google index), 45Cat (403), RYM (403)


RELEASE 4: Pavlov's Dog - "Pampered Menial"

DATE: February 5, 1975 (Wednesday)
METHOD: Rule 5 — US Copyright publication date (Catalog of Copyright Entries, 1972-1978)
CONFIDENCE: High

KEY EVIDENCE:

  • US Copyright Office (Archive.org CCE): Registration N22010, publication date 5Feb75, filed by ABC Records, Inc., catalog ABCD-866
  • Wikipedia/Grokipedia: "February 1975" (consistent)
  • Derek's Music Blog: Claims "April 4, 1975" for ABC release (conflicts with copyright date)
  • AllMusic: Lists "1974" (likely error referring to recording date)
  • Columbia reissue: mid-June 1975 (separate release, not the original)

NOTES: The copyright publication date of February 5, 1975 is the most authoritative source available. The blog claim of April 4 likely confuses the ABC release with the later Columbia reissue, or is simply incorrect. The Wikipedia/Grokipedia "February 1975" corroborates the copyright date. The registration confirms the original label was ABC Records (ABCD-866), not Columbia.

Just general anti-bot stuff, probably, though the scramble for more AI training data likely explains why bots got so ill-behaved a few years back. Our CI server has to hide even open-source logs behind Cloudflare settings harsh enough to block cURL, or else the spider traffic brings it to its knees. "Figure out how to get Codex to emulate a full browser" is on my TODO list somewhere...

chrome --remote-debugging-port

All frontier models know how to competently drive a browser using CDP.
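The flow, roughly: start Chrome with `chrome --remote-debugging-port=9222`, then hit `http://localhost:9222/json` to enumerate debuggable targets; each page target carries a `webSocketDebuggerUrl`, and CDP commands (Page.navigate, Runtime.evaluate, etc.) go over that socket. A minimal Python sketch of the target-picking step, run against an illustrative response (the field names match the real endpoint's shape; the values are made up):

```python
import json

# Illustrative response from http://localhost:9222/json after launching
#   chrome --remote-debugging-port=9222
# Field names match the real CDP HTTP endpoint; the values are made up.
SAMPLE = """[
  {"id": "ABC1", "type": "page", "title": "New Tab",
   "url": "about:blank",
   "webSocketDebuggerUrl": "ws://localhost:9222/devtools/page/ABC1"}
]"""

def page_ws_url(targets_json: str) -> str:
    """Pick the DevTools websocket URL of the first 'page' target."""
    targets = json.loads(targets_json)
    pages = [t for t in targets if t.get("type") == "page"]
    return pages[0]["webSocketDebuggerUrl"]
```

In practice you'd hand that ws:// URL to a CDP client library rather than speak the protocol by hand, but this is the handshake the models drive.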