
sarker

ketman hetman

0 followers   follows 0 users  
joined 2022 September 05 16:50:08 UTC

				

User ID: 636

because the companies running the models gave them answers?

It's not an unreasonable criticism in the abstract, but a few minutes of reading shows that it just doesn't apply in this case.

  1. The higher score was published by Symbolica AI using Opus 4.6. So it couldn't be that Opus was retrained with the answers.

  2. "This uses the same harness we previously published," so it couldn't be that they simply prompt the model with the answers.

  3. The harness is published so you can see for yourself.

  4. Benchmarks do not publish their entire problem set, so in general it's impossible for labs to simply "give the models the answers" to the problems that aren't published.

I don't see any evidence that there were refugees in the Abbey, though it also seems that there probably weren't any German soldiers in there either.

People I knew who experienced western supermarkets with truly virgin eyes (coming e.g. from the USSR) seemed to not be alienated either.

Really? I love supermarkets.

I guess if you really drill your kids in bell curves they start to wonder why they'd believe their dad (midwit gentile European) instead of their Jewish friends (+1SD Ashkenazim). Then the whole edifice falls down.

Be careful what you wish for!

Raising non-woke kids is probably easy. The question is how do you turn them into non-woke adults through the teenage years of thinking mom and dad are stupid. Bryan Caplan seems to have managed it.

f37ac702-4bc1-4f26-a6e7-cde2080eaf75

300, apparently dragged down by my literary knowledge at the 78th percentile. 58/60 for technical knowledge though. I guess I really am a codecel.

Is 420 a slang term for marijuana?

Yes. I mean, no. I mean, yes. I mean...

I can't believe there's no fähn on the autobahn.

Okay, you got me, especially since Shakes wrote the grandparent comment.

Okay. But "we got the Iranians to attack our allies with missiles" is not much of an achievement, or at least, it doesn't indicate on its own that the war is going particularly well.

You neglected eleven days ago to specify what kind of situation would make you say that the five-week special operation is going poorly. Care to update that, or do you feel that the war is basically already a success since our allies got bombed?