site banner

Friday Fun Thread for May 22, 2026

Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.

1
Jump in the discussion.

No email address required.

I've expressed here in the past that I'm not seeing the huge gains from LLM based coding that a lot of other parties are seeing. A new research paper suggests that maybe I'm not just prompting it wrong.

We present a systematic study evaluating how well agents handle structural constraints in multi-file backend generation. By fixing a unified API contract across 80 greenfield generation tasks and 20 feature-implementation tasks spanning eight web frameworks, we isolate the effect of structural complexity using a dual evaluation with end-to-end behavioral tests and static verifiers. Our findings reveal a phenomenon of constraint decay: as structural requirements accumulate, agent performance exhibits a substantial decline. Capable configurations lose 30 points on average in assertion pass rates from baseline to fully specified tasks, while some weaker configurations approach zero.

Specifically, the paper calls out convention based frameworks and ORMs as particularly problematic, and I'm using both. It feels good to see some evidence that I'm not crazy and incompetent.

Speaking of AI disappointments, here's Claude Sonnet 4.6 today:

Prompt:

I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

Claude:

Should You Walk or Drive to the Car Wash?

You should walk, of course! 🚶

Here's why driving doesn't make much sense:

It's only 50 meters away — that's roughly half a football field, or about a 30–60 second walk.
Driving defeats the purpose — your car will get dirty again just driving those 50 meters, especially the tires and wheel wells.
Practical issues with driving:
    You'd spend more time buckling up and adjusting mirrors than actually traveling.
    Parking/maneuvering in such a short distance wastes fuel.
    You'd likely get the car wash queue dirty before you even start.

Gemini was cleverer:

Walking over is a great option if you just want to grab a detail spray, buy a voucher, or stretch your legs, but if you're planning to actually wash the vehicle, walking a block over leaves you with one minor logistical problem: your car is still in the driveway!

It went on for a couple of paragraphs about car washes, and gave me a fun tip about letting rain wash my car: spray it down with a hose after a storm before the rain dries completely to clear the dust dissolved in the rain.