Transnational Thursday for November 27, 2025

Transnational Thursday is a thread for people to discuss international news, foreign policy or international relations history. Feel free as well to drop in with coverage of countries you’re interested in, talk about ongoing dynamics like the wars in Israel or Ukraine, or even just whatever you’re reading.


In fact the models do not seem to be capable of differentiating on their own between success and pretend-success.

Of course! If there were a way to evaluate the quality of the result, the hyper-smart people earning billions of dollars would have thought of something as trivial as inserting "if the result is of low quality, try doing better" at the end of the AI pipeline. If we, as the end users, see low-quality results, that is hard evidence that their best efforts at evaluating the quality of the results are failing. Otherwise they'd have built a perfect AI chat and moved from billions to trillions.