Contact Us
Sign In
Sign Up
Rules Admins Moderation Log Random Post Random User
What is this place?

This website is a place for people who want to move past shady thinking and test their ideas in a court of people who don't all share the same biases. Our goal is to optimize for light, not heat; this is a group effort, and all commentators are asked to do their part.

The weekly Culture War threads host the most controversial topics and are the most visible aspect of The Motte. However, many other topics are appropriate here. We encourage people to post anything related to science, politics, or philosophy; if in doubt, post!

Check out The Vault for an archive of old quality posts. You are encouraged to crosspost these elsewhere.

Why are you called The Motte?

A motte is a stone keep on a raised earthwork common in early medieval fortifications. More pertinently, it's an element in a rhetorical move called a "Motte-and-Bailey", originally identified by philosopher Nicholas Shackel. It describes the tendency in discourse for people to move from a controversial but high value claim to a defensible but less exciting one upon any resistance to the former. He likens this to the medieval fortification, where a desirable land (the bailey) is abandoned when in danger for the more easily defended motte. In Shackel's words, "The Motte represents the defensible but undesired propositions to which one retreats when hard pressed."

On The Motte, always attempt to remain inside your defensible territory, even if you are not being pressed.

New post guidelines

If you're posting something that isn't related to the culture war, we encourage you to post a thread for it. A submission statement is highly appreciated, but isn't necessary for text posts or links to largely-text posts such as blogs or news articles; if we're unsure of the value of your post, we might remove it until you add a submission statement. A submission statement is required for non-text sources (videos, podcasts, images).

Culture war posts go in the culture war thread; all links must either include a submission statement or significant commentary. Bare links without those will be removed.

If in doubt, please post it!

Rules
Recommended Posts And Communities
Recommended Realtime Chats
- Quokka's Den Telegram
- Astral Codex Ten Discord

PaperclipPerfector 1mo ago (text post) 21619 thread views

Culture War Roundup for the week of May 25, 2026

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

Shaming.
Attempting to 'build consensus' or enforce ideological conformity.
Making sweeping generalizations to vilify a group you dislike.
Recruiting for a cause.
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
Don't imply that someone said something they did not say, even if you think it follows from what they said.
Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

1376

1376
4

Jump in the discussion.

No email address required.

Soteriologian Unstammering Papageno 1mo ago

I made a comment recently speculating that Terence Tao may be a paid promoter of AI. After some off-site discussion elsewhere, our resident shaman on whom the mods cast a long-duration silence posted my comment as a circus exhibit on Twitter to demonstrate a dangerous form of psychosis. Fortunately, OpenAI deemed this a worthy time to intervene and publish an ad featuring Terence Tao, who has taken time out of his busy schedule to assist this struggling non-profit with their promotional.

I want to dissect what I think is really going on.

For starters, every time I sit down to watch WoW on YouTube, I'm greeted with my favourite streamer telling me about the super fun game Raid Shadow Legends, which I should definitely download and play because it's super fun. Is the OpenAI-Terence Tao relationship like this? Not really. Terence Tao does appear to actually spend some time playing around with LLMs. Further, he's not exactly saying a bunch of empty marketing blather, either. In fact, probably to the annoyance of some readers here, I want to take a couple paragraphs for a technical aside, because there is actually subtlety here. I'll bound this in horizontal bars so non-technical readers can skip it:

The main talking point is that automated theorem proving is a perfect fit for LLMs precisely because it's not vulnerable to their main catastrophic failure mode: hallucination. The model can hallucinate whatever it wants, but the text still goes into the theorem prover, and if it's bullshit, well, the prover just rejects it and you query the LLM again. Do this in a loop, burn whatever unholy amount of compute you want, and if the loop stops, you've got yourself a proof! (Well, or a bug in the theorem prover. Or a "You've run out of tokens on your budget" error message. But I digress). This story is largely true. There's a giant asterisk of "Uh, so how much compute we talkin' about?", and the answer is "As much as you need or can afford, whichever comes first!" Which is, of course, the business model.

I will point out one additional technical nit-pick that annoys me because Terence Tao is working in Lean 4, which is a dependently-typed theorem prover. In classical mathematics, one is concerned solely with whether a theorem is true or false, and the structure of the proof is basically irrelevant as long as it's valid. Lean 4 is not based on this model. Rather, it's based on a more computationally-motivated model of mathematics pioneered by Brouwer in the early 1900s called "constructivism." In this world, the question isn't the boolean notion of "Is this theorem true or false?" but rather the related but distinct notion of "Which proof do you have?" To ground this in practical terms, consider the following example: I can prove that True|False and Yes|No are isomorphic, but I can do so in multiple ways: I can map True to Yes and False to No, or I can map True to No and False to Yes (and then show that there are respective inverses which preserve identity, obviously). It is in this sense that one can meaningfully say "Which proof of isomorphism?" when I say I have a proof of isomorphism. Perhaps this all sounds like technobabble, but to connect it to the preceding paragraph: you can immediately see how this does reveal some cracks into the narrative being sold there. It does actually matter which proof is produced, not merely in a social sense of "can any human understand this wall of text the LLM spit out", but in a technical, computationally-relevant sense. For pure mathematics, this distinction is often not considered important -- in fact, many classical mathematicians aren't even aware of the difference, and will be confused if you try to explain it to them and think this is all a bit silly. However, it's not a silly or minor distinction for the following reason: one of the motivations of this computational model for "theorem provers" (it's really a programming language + compiler, rebranded for mathematicians) like Lean is so that formal methods can be applied not just to classical mathematics, but to software in general. And as soon as you enter software formalisation, this distinction is no mere intellectual curiosity, but of paramount relevance. The classical-style logic in the preceding paragraph does not apply to constructivist logic used for software formalisation! I'm sure this distinction is not lost on Terence Tao. But that doesn't concern OpenAI. OpenAI is more concerned with whether the distinction will be lost on the MBAs listening to Terence Tao, and the answer is "absolutely."

Ok, no more technical details like that, I promise. Back to the social level:

So, I mentioned Raid Shadow Legends is a poor metaphor for the OpenAI-Tao relationship. Let me propose some better ones: Michael Phelps and Wheaties (with the added benefit that Terence Tao never smokes weed. See, this is why mathematicians are better than athletes), or better yet, attending Harvard University. This may seem like a strange juxtaposition, but I've done so intentionally, because the marketing is obvious in one but subtle in the other, but it's actually the same trick: the goal is to misattribute performance. With Wheaties, the goal is to sell the notion that Michael Phelps is a great swimmer because he has a healthy diet of stuff like Wheaties, and if you eat Wheaties, maybe you'll perform well, too! Of course, in reality, he was eating sugar-coated french toast and chocolate chip pancakes because he needed 10k Calories/day just to break even on energy, and the reason he's such an awesome swimmer is in large part genetics. Wheaties, or anything similar to it, has virtually no relevance to Michael Phelps at all. But what about Harvard?

Well, Harvard sells the image so well that most people outside this forum outright believe the illusion. The illusion is, of course, that attending Harvard makes you smart and likely to succeed, rather than Harvard accepting only people who were smart and likely to succeed in the first place and thus redirecting credit for these future achievements to Harvard. Mark Zuckerberg may see Harvard as a pointless waste of time, but the world sees Harvard as "The university that made Mark Zuckerberg happen!"

I like the Harvard analogy because this is surely the intent with Terence Tao. There's a high chance sooner or later Terence Tao will prove something cool "using" ChatGPT, and if he does, it would be really awesome if we could make it sound like the secret ingredient in the ChatGPT-Terence Tao alliance was ChatGPT, when obviously the actual secret ingredient is Terence Tao. The analogy I always use for this is stone soup, a European folktale where starving travelers dupe gullible townsfolk into helping them make soup from stones by requesting "extra" ingredients bit by bit until they've just made actual soup, thus astonishing the gullible townsfolk.

There are a lot of other things I could say on this, especially on the technical side, as there are a lot of clever tricks you can pull to make it look like a model is doing more than it really is, but I'll stop for now and conclude with this:

Just be cognizant that OpenAI, and all the other LLM vendors, do marketing. They have an enormous budget that dwarfs anything you have ever seen before. Remember the reality distortion field of Black Lives Matter? Or trans people? Imagine that, but like... two orders of magnitude larger. That is the level of persuasive pressure we're dealing with here.

Just take it all with a grain of salt.

Context

NunoSempere Soteriologian 1mo ago

I don't follow your technical aside. Isn't the point of an isomorphism that all the isomorphisms are the same in the sense in which you are isomorphic?

I can prove that True|False and Yes|No are isomorphic, but I can do so in multiple ways: I can map True to Yes and False to No, or I can map True to No and False to Yes (and then show that there are respective inverses which preserve identity, obviously)

I don't understand this example, because if you have A|B, A+A = A, B+B=B, but if A+B=A then A maps to true and if A+B=B then B maps to true.

The classical-style logic in the preceding paragraph does not apply to constructivist logic used for software formalisation!

But how? Like, what is the failure mode that you might get if you try to prove the correctness of a program for which this disquisition about isomorphisms is relevant?

Context

Soteriologian Unstammering Papageno NunoSempere 1mo ago · Edited 1mo ago

I'm not sure I follow your confusion. In a strictly mathematical sense, you may have equivalence, but if you publish your paper in the world where electrons have positive charge and everyone else publishes their paper in the world where they have negative charge, well… you can see how this appeal to "Well they’re equivalent either way 🤷‍♂️" is both technically correct and also asinine.

But it's not just this sense: in the bool example, there's only one bit of information at play, but say you're talking about something more complicated like graph isomorphism or SAT solving. The representation of the problem matters a lot in practice! You can famously write a SAT solver in a single line of Haskell code, but you can also do it in 60,000+ lines of low-level systems code. I'm sure you can guess which is the useless party trick and which is the industrial-grade SAT solver. But even more relevant is that the technical curiosity is, in a strictly mathematical sense, more correct! The industrial-grade SAT solver has a maximum variable count just by the fact that it labels them with 32-bit ints, which is obviously ridiculous by theoretical standards: there’s no magical limit on the number of variables you can have in a SAT problem.

So, let’s say we’re interested in proving the correctness of a SAT solver. I bet there are a fair number of people who’ve played around with Lean and similar systems who would have a fair shot at proving the party trick correct.

There’s not a single person on the planet who stands a shot at proving the industrial SAT solver correct, nor is there likely to be within a thousand years.

And this is how it always is! There is this enormous disconnect between the truly formal representation of a system and "equivalent" (in any colloquial sense) real-world systems anyone cares about, even for systems that are themselves formal fields of study like SAT (much less anything an MBA would consider “real world”, like whether his website is secure).

The reason money has any interest in Lean 4 beyond "the government will give me a tax deduction for my charitable donation" is because we want systems to stop being so broken, and formal verification is supposed to help with that.

My point is just that there are… asterisks. A lot of asterisks.

Context

RandomRanger Just build nuclear plants! Soteriologian 1mo ago · Edited 1mo ago

Let's suppose AI models aren't so great for mathematics and it's Terence Tao doing most of the work.

The primary commercial usecase for AI isn't mathematics, it's coding, as you say.

I cannot code beyond the most basic Khan Academy beginner sense, my actual end to end abilities are completely worthless. And yet I can make useful AI tools for work, processing documents in various ways that saves lots of human time. In a certain sense, I'm providing most of the 'secret ingredient' since you cannot just tell an AI to do these fiddly tasks and expect them done properly in oneshot. It will usually not work the first time. So I give it some counsel and tell it what to troubleshoot, errors, differences from expected output, clarify my intentions and ultimate usefulness. Eventually it works and then I refine it to work better and better by getting AI to handle all these edgecases and Word-induced BS.

And how could I make a (still under development) 4X game with AI if I can't code? There's a fair bit going on. Space battles, ground battles, culture, technologies, buildings, resources, goods, markets, map generation, turn order, trade between provinces (intra faction), trade between factions, freighters, pops and social classes, loans, diplomacy and war plotting, coalition building... Some things are not well fleshed out but there is quite a bit there.

I was just now getting it to make an evolutionary testing system to refine ship designs and fleet compositions and so define the meta. First time it worked OK, then when trying to make it better (too many bad mutations!) it broke, then I overhauled it and now it works great and with multicore processing too. Apparently the dominant strategy is getting hundreds of incredibly cheap and terrible warships to act as chaff for a small core of high-tech warships to exploit the targeting and reinforcement logic. So clearly I need to change how reinforcement and targeting works, raise minimum costs for ships.

It was my idea to make this tool, my idea behind the overhaul and my ideas behind every mechanic but I could never have done it myself. The secret ingredient is clearly the AI.

Context

Soteriologian Unstammering Papageno RandomRanger 1mo ago · Edited 1mo ago

I don't know, my experience is kinda the opposite, but I have been programming since childhood. I find AI very frustrating to use, and rarely consult it.

Keep in mind that it has all of Github in its training data. It can go a long way by basically feeding you an existing project and not mentioning where it got it. I don’t mean to oversimplify and say that this is all models do and that there’s no reasoning, but I think people underestimate just how much is memorised: you can get mainstream models to recite entire books like Harry Potter almost word-for-word, with upwards of 95% accuracy, and to the extent that you can't, it's usually due to active sabotage by the model vendor to prevent people from getting free Harry Potter, not because the model doesn't have Harry Potter memorised. Vendors really do not want it to look like everything is getting memorised, but to a large extent it is. I just tried it with some random books I'm reading now and… yeah, they definitely have every book I'm trying memorised, even though they're very coy about it. And some of these are quite obscure books that I'm confident less than 1% of people would even recognise the name of. I then tried it on a snippet from a random library I published ten years ago on Github and… yeah, it has it memorised, down to the exact word. Amusingly, unlike the books, it does not go out of its way to tell me where this quote is from and lecture me about the importance of copyright. When I explicitly ask it where it got that, it just says it made it up—that it’s not a quote from any published book.

EDIT: to drive the point home, it is not so good at predicting the text of an unpublished library I have sitting on my PC! I suppose it's some consolation that this vendor does not appear to have access to my computer.

Context

RandomRanger Just build nuclear plants! Soteriologian 1mo ago · Edited 1mo ago

Well, the odds it has memorized the autistic game mechanics I invented are nil. But it can still implement and work with them.

Consider this paper for instance, they have the AIs devise experiments to derive physical laws in simulated universes unlike our own, Opus and the strongest GPT do OK: https://arxiv.org/abs/2605.26087

They can't have memorized imaginary physics. With sufficient repetition learning, creativity and intelligence emerge. Do we not hone mathematical talent by repetition? Github is a pretty diverse space, if there's some some set of components needed to make cool things, good odds they are on github. What isn't on github could be assembled from what is. Perhaps all one needs to do to write arbitrary software is assemble all the pieces in such a way as to meet the goal.

Context

pbmonster Soteriologian 1mo ago

It can go a long way by basically feeding you an existing project and not mentioning where it got it. I don’t mean to oversimplify and say that this is all models do and that there’s no reasoning, but I think people underestimate just how much is memorised

I have the same experience. It's just so glaringly obvious when it has to code something that is rare in its training data.

Load this folder of data (convoluted structure, some data is binary): perfect oneshot
Build a database from it: perfect oneshot
Do some signal processing on the data for cleanup: good ideas, uses the correct libraries and algorithms unprompted, but not immediately usable
Make plots for me to debug the signal processing: perfect oneshot, signal processing is now usable with minimal handholding

Data analysis, involving multiple integrals over the data along different axis: completely wrong, not even close. A first year grad student could solve this in a few hours, this is in at least a dozen of standard sub-field specific text books and has a very specific name - but it's probably not on github.

Discussion about a factor of 2 included in some text books, but not in others: completely wrong. having this discussion with an actual parrot would be equally as productive as the stochastic parrot.

Context

quiet_NaN Soteriologian 1mo ago

constructivism

The main thing about constructivists is that they do not generally accept proofs where you you assume the negation of what you are want to prove and arrive at a contradiction (Reductio ad absurdum -- though it seems that some cases remain valid in intuitionist/constructivist logic).

As a layman, my gut feeling is that the constructivists would probably accept Euclid's proof of an infinite magnitude of primes, because it gives you a rather concrete algorithm for finding a prime which is not in your list of all primes, but not Hilbert's proof of his Nullstellensatz, because it does not give you a way to determine the algebraic relationship.

WP on constructivism

These views were forcefully expressed by David Hilbert in 1928, when he wrote in Grundlagen der Mathematik, "Taking the principle of excluded middle from the mathematician would be the same, say, as proscribing the telescope to the astronomer or to the boxer the use of his fists".

Moving on further

The classical-style logic in the preceding paragraph does not apply to constructivist logic used for software formalisation!

I am not sure I follow. Suppose you have a program and a claim about that program ("this Turing machine will halt for any input", "This C code will never invoke undefined behavior", etc). In my mind, a proof which is just half a gigabyte of gibberish without rhyme or reason would still be acceptable for such practical problems. If someone formally verified that a given version of the Linux kernel did not suffer from a given class of exploits, I would not complain about them using proof by negation.

By contrast, open problems in mathematics are not things where our main interest is in knowing the answer. Few people are interested in P==NP because they think there is a practical algorithm for solving SAT to be discovered. If an ASI told us what the answer is ("P is equal to NP, but the polynomial has coefficients and exponents so large that your mathematics can't even express them using all the protons in the universe"), that would be of little value. People are interested in these big open questions because their answer sometimes lead to the development of new and exciting branches of mathematics.

The textbook example of a problem which looked promising in that regard until it was proven would be the four color theorem. "So it can just be proven by brute-forcing 1834 configurations with a computer? Seems it was not a nice problem, after all."

With most of our existing software, the problem is even identifying what formal properties we would want, or constructing it so that has the desired properties (and ideally we can easily proof them).

Context

Soteriologian Unstammering Papageno quiet_NaN 1mo ago

The Hilbert thing is more a sign of the times. Back then, there was a lot of resistance and emotional slapfighting over this, because in ye olden days, people were very concerned about which postulates were True (TM), and viewed any attempt to discuss this as an attack on Truth.

Today, we worry less about this absolute notion of Truth and more about models: is Euclid's parallel postulate True or False? Well, there's actually nothing to fight over: you just get different geometries, but they're all meaningful and useful! So rather than saying "The parallel postulate is true, therefore XYZ", you can just say "When the parallel postulate is true, XYZ holds." Even questioning logical primitives that seem "very true" like "can you appeal to a theorem more than once in the same proof?" turns out to have surprisingly useful implications! For example, if you're modeling a cookie, it makes a lot more sense to say you can only eat it once, rather than you can eat it as many times as you want. It's not wrong; it's just a different thought model. A thought model that, incidentally, is the foundation of a very popular programming language.

One can say "Ok, but what if the postulates imply a contradiction? Surely that's bad, right?" Well... yes, but actually even here there's a lot of subtlety. Not for mathematicians, but for everyone else: see, the trick to defining logical systems that dodge Russell's Paradox is to have a cumulative hierarchy of universes, which is what a lot of dependently-typed proof systems use. But it turns out this is an enormous pain in the ass to work with for writing normal software, to the point that literally nobody does it, and instead we settle for simpler systems that are much more ergonomic to write in yet still, in practice, give a pretty strong (but not rock-solid!) guarantee that you haven't contradicted yourself.

Anyway, constructivism is a more grounded model compared to classical logic, in the sense that it actually computes results, and you can do the legacy thing by just saying "When the law of the excluded middle holds, XYZ."

Context

dailydogma Soteriologian 1mo ago

I'm skeptical of math as a field and how Tao practices it. I think there's a chance he's an invention of his parents and academia and so secret shilling for OpenAI is right in his lane as far as attention seeking behavior goes. Take this with a grain of salt though, I don't want to slander him. He's certainly intelligent but yet none of his ideas have led to anything tangible and the praise heaped on hims just feels like it could be better allocated to practical problem solving, so it makes me a little suspicious that there's a confidence trick going on under the surface. Also, if there's anything to this, he wouldn't be the first, Von Neumann was suspicious too, for example his uncle mysteriously attributed to him physically impossible abilities that have been later debunked. Again, not saying these men aren't smart, I think they are extremely intelligent and what they do is difficult, but that labels like genius are political in nature and that these men and their entourages seem to do everything in their power to claim that label, behind just focusing on the best contributions to math and science they can make. To the extent that I think their contributions to humanity suffer as they chase fads and prove abstractions instead of undergoing less prestigious work to make material progress.

-2

Context

dr_analog top 1% of underdog fetishists dailydogma 1mo ago

I'm skeptical of math as a field and how Tao practices it.

This is an important concern.

Tao has almost certainly been offered dump trucks full of cash to join quant funds. Perhaps if he made global derivatives arbitrage a few points more efficient he could save every American's retirement fund a few hundred bucks a year. Would that be a better contribution to humanity than whatever he's doing now?

Context

Tretiak If you know you know, if you don’t you don’t. dr_analog 1mo ago · Edited 1mo ago

Don’t underestimate just how much you can get fatigued by money overtime. I have a known technical skill set that some of my close friends and family know about and by proxy to others who know them, I’ve been offered a good amount of money to engage in work for them in the past (nothing like Tao probably has; not even close, but around low 5 figures). I’ve even received direct phone calls about it. I declined all of them because it’s very precise, very high pressure work, plus it’s very inconsistent and intermittent which disrupts and prevents me from establishing and keeping the stable track I’m already on. It’s hard for people to believe but money really isn’t everything.

Context

ToaKraka Dislikes you Tretiak 1mo ago

by money

*by work

Context

Tretiak If you know you know, if you don’t you don’t. ToaKraka 1mo ago

Well, if someone’s going to give me free money I am going to take it, but salaries often increase with the difficulty of the work you do. I’d much rather take a lower end job where I got to pursue the things I wanted than a high end one where life outside of a paycheck was terrible.

Context

dailydogma dr_analog 1mo ago

Good point, and grim. But he was also offered those dump trucks by OpenAI, and he took it. But he's not contributing to AI research, just branding. I don't blame him, AI research sucks. There's no theory to it, it's just try random things. Same with quant research.

Context

Tretiak If you know you know, if you don’t you don’t. dailydogma 1mo ago

Does MIRI not do research anymore?

Context

YoungAchamian We walk conditioned ground and name our folly civilization. Tretiak 1mo ago

Was MIRI ever considered actual AI research? Or just a jobs program for the nerdy connected word-cell rationalist in the bay. It feels like MIRI just conned a bunch of wealthy weirdos out of some grant money and then posted alignment papers akin to science fiction musings about an AI that does not exist.

Context

Tretiak If you know you know, if you don’t you don’t. YoungAchamian 1mo ago

The latter is how it started and now I’m pretty sure that’s where it wound up. At one point they’d go around and organize research workshops among experts in the fields to make progress and produce research and at one point they did. I read some of their papers but don’t know if that’s still going on. Eliezer I think is on a crusade to bring it all to a halt now because of his epiphany about alignment but I doubt it’ll stop progress behind closed doors.

Context

magicalkittycat dailydogma 1mo ago

There is tangible real world improvements that Tao's work has led to, like MRI scanners is brought up constantly as one of the most prominent and direct examples. It's true that pure math doesn't have many easy and obvious results like that, but that's true of a lot of science. Tons of important inventions are based off things that no one was expecting like microwave ovens (just a guy discovering candy melted in his pocket while working on radar equipment) or penicillin (accidently contaminated a petri dish while studying bacteria).

For every compound found in a gila monster that cures obesity, there's hundreds (thousands? Maybe tens or hundreds of thousands?) of other compounds that aren't found to do anything usable (at least not that we currently know of) but to find the former we have to go digging through the latter. We can make some educated guesses, but we don't know ahead of time too well what will be useful and what won't be. After all, that's why we do the work and the research, to find out.

Context

dailydogma magicalkittycat 1mo ago

ChatGPT indicates that maybe 10% of the improvement to MRIs from 2000 to 2020 was due to pure math, and Tao was maybe 10% of that. So that's 1%. But it says most of the improvement came from applied engineering using math as a tool.

After all, that's why we do the work and the research, to find out.

But pure math gets way too much status compared to how likely it is to find something useful/interesting. Most of it is pretty obviously a waste of time. Quantitative science such as physics is a much more efficient way to search useful state space with math IMHO.

Context

sarker competency crisis actor Soteriologian 1mo ago

I don't really understand your overall point. To whatever extent Phelps' success was said to be thanks to wheaties, it's obvious the same can't be said for Tao for the simple reason that wheaties are from 1924 (predating Phelps) and Tao won the fields medal in 2006 (predating ChatGPT). It's obviously a logical contradiction.

The other thing is that we are at the point where LLMs are solving open problems with minimal or no involvement from humans:

Automated theorem proving across open Erdos problems. Each solved problem cost a few hundred dollars, which is hardly an unbelievably large number.
A problem relating to sumset combinatorics. Gowers is also a Fields medalist and seems pretty impressed with ChatGPT here. Reading the post, it does not seem that it takes a Fields medalist to do the prompting here: "the era where you could enjoy the thrill of having your name forever associated with a particular theorem or definition may well be close to its end." This also doesn't seem to have cost an inordinate amount of money, although there isn't a figure given.
A disproof of a conjecture related to the planar unit distance problem. I'll grant that it's not explicitly clear what the human prompting looked like, but there's no mathematician whose name is attached to this so I can't imagine it was something that only a select few could have done.

This is not an exhaustive list.

In other words, you don't have to be Tao to find new results with these tools. I am sure your response will be that nobody cares about these results, but unless you were predicting beforehand that we would get to this stage but no farther, it's hard to take this seriously.

Context

Tretiak If you know you know, if you don’t you don’t. sarker 1mo ago

In other words, you don't have to be Tao to find new results with these tools.

Sure but that still doesn’t eliminate the problem exactly. It’s somewhat reminiscent of when Fermat’s Last Theorem was proven by Andrew Wiles. The early stage formulated results he came up with he remained confident about, until it took other mathematicians joining in to show him where he was off track. You don’t have to be a Tao to “find” these results per se, but how do you validate them?

Context

sarker competency crisis actor Tretiak 1mo ago

It's frequently the case that it's easier to check a result for correctness than it is to generate the result in the first place. This is especially the case if the problem can be formalized in Lean.

Context

Tretiak If you know you know, if you don’t you don’t. sarker 1mo ago

But it still requires complex understanding. Even if the models on hand compute the equations perfectly without error, each step requires input validation and verification of the end result. In some cases this can be a time save, in others, not no much.

Context

sarker competency crisis actor Soteriologian 1mo ago · Edited 1mo ago

Wait, Teortaxes is Dase? How did he manage to stop rageposting for his twitter persona?

Context

aqouta sarker 1mo ago

He didn't stop lol.

Context

sarker competency crisis actor aqouta 1mo ago

I've read several posts by Teortaxes where he didn't say all his opps think Chinese people are bugmen, for your information.

Context

aqouta sarker 1mo ago

He has posts on this site where he doesn't do that too tho. I may just be annoyed because he blocked my on twitter after accusing me of being a Maga guy after his flame out here.

Context

sarker competency crisis actor aqouta 1mo ago

Yep. It's him alright.

Context

HereAndGone2 Soteriologian 1mo ago

Re: Tao and the "precarious financial situation", I think that's just whining over "Waaah, my funding got cut and it's all Trump's fault!" If he really isn't being paid his stipend/salary, then he should kick the university financial department up the backside, not go on Substack to Orange Man Bad. But then again, I think Tao gets treated like a living god by some because of his undoubted mathematical genius.

Which just demonstrates why people go "If you're so smart, why ain't you rich?" He prefers (and maybe is only capable in) academia, which means he is going to be dependent on that kind of funding to make a living. Nobody is forcing him or playing 4-D chess to impoverish him so he has to be the spokesmodel for AI.

As for the rest of it, I have no dog in any fight concerning universities (should you go there, are the Big Names any better than empty signalling, etc.)

Context

Brainwavez Soteriologian 1mo ago · Edited 1mo ago

I’m confident Terrace Tao is pro-AI both because it funds him and he finds it interesting and potentially useful. That’s academia (usually).

OpenAI’s model did solve a long-standing Erdos problem (not in Lean, hand-checked by mathematicians, but still)

Context

Soteriologian Unstammering Papageno Brainwavez 1mo ago

I think crypto is a good analogy. There is actual tech there, and many do indeed believe in it (me being among them). But the hype and noise was wildly disproportionate to what was realistic and done.

At this point, if you put your bet on anything other than "Slightly improved Bitcoin with privacy that was obviously intended originally but not known how to do at the time," you’re probably down 70+%, if not entirely liquidated. Entire narratives about "business on the blockchain!" were complete nonsense.

Fun fact: Sam Altman himself launched a crypto coin back in the heyday. It's down 96%.

Context

Corvos Soteriologian 1mo ago

I think crypto isn't such a good analogy. I never saw anyone get value out of crypto qua crypto. As an asset and an investment, yes, and occasionally as a way of paying for mildly shady or super-techy things, but in general the value proposition just never seems to have manifested to me.

Whereas I get massive value out of AI. For writing, for my hobby projects. My startup would be facing much larger headwinds without AI for coding and research. I think the hype is still kind of overdone, but only because the hype is so strong that only the immanent eschaton could live up to it and because it's not clear how much of a directly-related ecosystem there will be for third parties.

Context

urquan Hold! What you are doing to us is wrong! Why do you do this thing? Corvos 1mo ago · Edited 1mo ago

I have mixed feelings about AI; I have concerns about it being used to automate military decisions that should require human moral judgment (the traditional Terminator-style concern over computer command and control), and also the potential for deepfaking and manufacturing false content to mislead or manipulate. The latter has already been used in new and more sophisticated scams, and I worry about what a nation-state-level actor could do with that kind of power. Economic disruption is there as a genuine possibility, and that's difficult, but I'd prefer if people expressed that possibility directly as a livelihood threat rather than trying to launder the (genuinely sympathetic) concern into environmentalism or moral grandstanding about human creativity or interpretations of IP law in which AI training is assumed-illegal.

I've rarely actually heard someone say, "I don't like AI because doing my job without it gives me satisfaction and a good-paying job, and the introduction of AI into the workplace makes me feel like I'm losing the livelihood I prefer." Instead, I typically hear things like "AI was developed by stealing the intellectual property of hardworking people in order to enrich the billionaires and ELON MUSK and DONALD TRUMP," part of the large egregore of "all my enemies are evil rich fascists."

People would rather be angry than admit vulnerability. Our discussions over issues of social importance would be strikingly improved if people were willing to admit when their principles are self-serving -- which there's nothing wrong with, everyone deserves to advocate for themselves -- instead of trying to convert everything into an argument in which justice, law, the hand of God, and the long arc of history all militate against whoever you think is opposing your interests.

I don't agree with the environmental or land-use concerns for the most part, and it strikes me as degrowth corporate-hate and NIMBYism rather than principled objections. Energy use is not automatically immoral. I'm disappointed in the ways in which AI's demand for silicon is draining the consumer market of computer components and I worry about the impact on individual people's ability to control the means of technological production, but at least so far, this is offset to me by the increase in the ability to interface with computers using natural language.

The kind of generalized AI hate I see out there, online, occasionally in person, is hard for me to wrap my head around. I'm in the 10% of Americans who are more excited than concerned about AI. Generative AI has been great for me, in ways similar to what it's been for you. I enjoy using it. I get value out of it. I think AI slop memes are funny sometimes. I don't like when it's used to write personal messages or fill out marketing boilerplate copy, but I don't hate AI text as a general principle, especially if it's used to bolster and not replace human effort and creativity. And I dislike the invective and contempt that valid uses of AI generate in critics far, far more than I dislike the silliness or laziness of uses of AI that are in poor taste. That's the self-interested vulnerability of my own: I don't want a tool that has expanded my capability to become socially radioactive.

I don't know enough about AI to comment with any level of expertise on the research frontier. But I do have a skeptical prior towards the idea that this generation of AI will produce genuinely generalized AI that can meaningfully, affordably, and trustworthily replace human oversight. But we've gone farther with agentic AI use than I would have expected, so I might be wrong about that.

Context

Jiro urquan 1mo ago

Our discussions over issues of social importance would be strikingly improved if people were willing to admit when their principles are self-serving -- which there's nothing wrong with, everyone deserves to advocate for themselves -- instead of trying to convert everything into an argument in which justice, law, the hand of God, and the long arc of history all militate against whoever you think is opposing your interests.

Are you sure it isn't the reverse? What drives people's opposition to AI is that it gives them another excuse to call their enemies evil rich fascists, and they actually don't have any personal reasons like losing their livelihood at all.

I listened to a video game podcast where the very woke hosts said that when they talk to people in Asia, they keep getting asked why the Americans oppose AI. To which my response is exactly that: opposition to AI in the loud sections of the media is entirely for political reasons, and your politics has no effect over there.

Context

Corvos urquan 1mo ago

I agree with pretty much all of this.

Context

quiet_NaN Corvos 1mo ago

As an asset and an investment, yes, and occasionally as a way of paying for mildly shady or super-techy things, but in general the value proposition just never seems to have manifested to me.

Minor point, this seems to be the wrong way round. The asset aspect is the boring (if hyped) part, and the proposal to do financial transactions without a trusted party (which can easily coerced to block some transactions by the feds) was the innovation. Of course, this freedom to do transactions has mostly been used in darknet marketplaces and for ransomware, but that's humans for you.

I will grant you that the anarcho-libertarian utopia promised by the blockchain has not happened, though. 'crypto' is 99% get-rich-quick scams, and the 1% are probably mostly ransomware and the like, with 0.01% being nerds buying acid or donating to wikileaks. Legal crypto exchanges are very much centralized, and banking laws in the US are probably broad enough that the feds can jail you for decades if you put substantial amounts of your money through a mix or otherwise annoy them.

And gen-AI is definitely the bigger deal, sure. It might take six or eight orders of magnitudes more money to train a LLM than it takes to train an individual human, but my feeling is that if we assume that the tech will keep the current intelligence level and and simply improve on the execution, that is already enough to make the mean white collar worker obsolete. Heck, I have a PhD-level education and consider a future where I am reduced to wearing AR goggles and connecting cables to where some AI decides they should go while it takes care of the software tasks far more efficiently than I ever could distinctly possible.

This is not to say that the AI bubble bursting is not also possible. I mean, investors in the late 90s were not wrong about everything -- the internet did have an enormous effect on commerce. It was more the specifics which they were wrong about, like if pets.com would ever become profitable.

Context

Tretiak If you know you know, if you don’t you don’t. Corvos 1mo ago

Whereas I get massive value out of AI. For writing, for my hobby projects. My startup would be facing much larger headwinds without AI for coding and research.

For complex tasks though, or for cutting down the massive overhead? I just think for most of the use cases I keep coming across, AI is nothing more than a fancy lawnmower that saves you time to cut the grass, but doesn’t do anything revolutionary for you.

Context

gattsuru Tretiak 1mo ago

That line gets fuzzy.

I can't talk work examples, but a hobbyist thing I've been crunching for the last two weeks is building a couple small educational robots.

That's not revolutionary, in the sense of completely breaking the field. It's something I've even done before at smaller scales: the first DIY educational robot kit I provided for a summer camp is almost a decade old now. But it's the sort of thing that's a massive time investment, especially when you're looking at a new microcontroller architecture or building something far from the standard line-follower or simple ESP32 websocket racer. Figuring out chip documentation, finding actual sane BoM materials instead of the wacky versions people go with at unit size 10k, managing errata, sanity-checking EMI, it's a nontrivial effort at even the smallest scale. On that side, AI's probably dropped it from a month of nights-and-weekends to a week or two, and probably made it better or surfaced information I would have missed otherwise.

(though even there, being able to actually find and translate information has encouraged me to go a lot broader than I did in 2018: there's been a few chips and targets I can genuinely evaluate five or ten options now, where before it'd just be a matter of finding anything not-EOL.)

The harder part is where I want to sell these things. People did that, pre-AI, don't get me wrong. But it was insurmountable to me, and probably insurmountable at my expected business scale. The actual build and development costs are trivial compared to the compliance costs, just figuring out the order of magnitude of the compliance costs meant hiring an expert, and worst of all, there's a lot of landmines I knew about even then which could invalidate a lot of your past efforts all at once, and others I didn't.

That's not, pointedly, revolutionary to the world. But it's revolutionary for my use case.

Context

Corvos Tretiak 1mo ago · Edited 1mo ago

As Napoleon once said, quantity has a quality all its own.

My private project is a graphics thing for ricing. To get what I wanted, I would have had to become proficient in desktop compositing, OpenGL, wayland, and several disciplines around graphics and rendering. Then I would have had to write several thousand lines of fairly finicky boilerplate, including several false starts and bad assumptions.

If I were retired and had the time and the energy, I could do that. In practice, though, switching from 5% ideas 95% grind to 60% ideas 30% reading 10% grind means that it’s fun and I’m a good chunk of the way there after maybe three good evenings of work. Without AI that just wouldn’t have happened and it would go into the bin of ‘someday’.

For my startup, again, AI is not a superintelligence but it sirfaces good papers, explains the maths when I get stuck, implements diagnostics in minutes that would take me hours. It’s not like having a Nobel winner in my pocket, it’s like having a textbook that can talk to me and a bunch of PhD students on Speed. Very senior people in very serious organisations are using it for proof of concepts and your projects.

TLDR: no individual thing it does is truly revolutionary except maybe the maths from my perspective, but I find the ease and quality and speed with which it does it is revolutionary in aggregate.

Context

Tretiak If you know you know, if you don’t you don’t. Corvos 1mo ago

That just proves the point though. That also holds true for most things in the industrial world. The gap for me stems not from it providing no value, but how it differentiates itself from everything else that achieves the same thing in its respective domain. There’s one of two categories the tech falls into:

AI making existing technologies easier to use and increase productivity.
AI inventing new tools, technologies, methods, routines and research.

The problem I have with so many people who love to talk up the AI ladder is they use 1 as a way to argue for 2. 1 has been the whole long read of technological and economic progress since humanity has existed. There’s nothing “new” about that. I’m glad in your case it’s lowered the barrier to entry for you, but I don’t see that as a strongly given “new inroad” for the tech itself.

Context

Corvos Tretiak 1mo ago · Edited 1mo ago

I think that's a broadly artificial separation. In my opinion the vast majority of new tools / technologies / methods / routines / research come from some combination of:

Observation of something interesting during a routine process.
Application of something routinely used in one context to another context.
Common-sense extension that has only now become available because of advancements in another area.

I have observed AI doing (2) and it makes (1) and (3) considerably easier.

If my project works it will be an entirely new way of doing desktops, and I guess it was my idea not the AI's, which is maybe what you mean? But I got a lot of the techniques from another area and 90% of the design is the AI's suggestion and uses techniques I'd never heard of, so it's still more complicated. I'm quite happy for the top-level what to stay my job and leave the how to the machine, of course.

Context

functor Soteriologian 1mo ago

The primary smell test is if a person promoting something is only using one brand. Does he only use chatgpt and never mention something else? Then that is fishy. Does he hop between models and providers than it is probably genuine excitement.

Speaking of covert LLM marketing I was recently invited to a meetup that specifically mentioned claude code vibe coding. There was no official sponsor yet they had rented a somewhat fancy venue and there were heaps of food. They were going to another city to do the same thing the day after. According to themselves they were just passionate about claude code.

Considering the boatload of money invested into these spaces it makes sense to market their products and traditional advertising probably gives less bang for the buck.

Context

Soteriologian Unstammering Papageno functor 1mo ago

The primary smell test is if a person promoting something is only using one brand.

Typically, yes, but we're unironically talking 160+ IQ here. He certainly has negotiating power and some sense of dignity and taste. Anyway, yes, I freely concede my searches specifically for "OpenAI gives TT a bunch of money" were in vain. But they did turn up his new foundation, which comes remarkably close to confirming my original conspiracy theory right on the tin. He’s clearly salty about losing that government money (and rightly so), and is pivoting to asking for private sources.

Context

FearandLoathingintheMotte functor 1mo ago

I know little of the math / AI space but google is genuinely a player here in LLM math along with OpenAI

Has anyone ever seen Terrance speak of Gemini or use a Google LLM math product

Context

BigObjectPermanenceShill FearandLoathingintheMotte 1mo ago

Has anyone ever seen Terrance speak of Gemini or use a Google LLM math product

Yes we did, even recently:

Bogdan Georgiev, Javier Gómez-Serrano, Adam Zsolt Wagner, and I have uploaded to the arXiv our paper “Mathematical exploration and discovery at scale“. This is a longer report on the experiments we did in collaboration with Google Deepmind with their AlphaEvolve tool, which is in the process of being made available for broader use. Some of our experiments were already reported on in a previous white paper, but the current paper provides more details, as well as a link to a repository with various relevant data such as the prompts used and the evolution of the tool outputs.

or here you can see his comments on erdosproblems.com:

I asked ChatGPT and Gemini for literature review. ChatGPT DeepResearch basically just cited this web page and declared the problem open. Interestingly, Gemini DeepResearch reproduced essentially the same proof as yours (in the section under "Harmonic capacity"), but without directly citing the above argument. Also it seemed unaware that it had actually established the result, and instead devoted the rest of the report to explaining why the problem was difficult.

The entire OpenAI conspiracy theory is asinine. Tao isn't a shill, he's pretty restrained in his praise of AI, and his experiments have been going on for years now, gradually escalating from "neat paperwork automation" to "can assist in mundane bits of research" to recognition of autonomous discovery, in tune with AI gaining these capabilities. Eg here he talks about GPT-4:

Today was the first day that I could definitively say that #GPT4 has saved me a significant amount of tedious work. As part of my responsibilities as chair of the ICM Structure Committee, I needed to gather various statistics on the speakers at the previous ICM (for instance, how many speakers there were for each section, taking into account that some speakers were jointly assigned to multiple sections). The raw data (involving about 200 speakers) was not available to me in spreadsheet form, but instead in a number of tables in web pages and PDFs. In the past I would have resigned myself to the tedious task of first manually entering the data into a spreadsheet and then looking up various spreadsheet functions to work out how to calculate exactly what I needed; but both tasks were easily accomplished in a few minutes by GPT4, and the process was even somewhat enjoyable (with the only tedious aspect being the cut-and-paste between the raw data, GPT4, and the spreadsheet).

Am now looking forward to native integration of AI into the various software tools that I use, so that even the cut-and-paste step can be omitted. (Just being able to resolve >90% of LaTeX compilation issues automatically would be wonderful...)

This precedes Trump and discredits speculations about his recent financial motive.

He is also not locked into OpenAI ecosystem, as I've just demonstrated, and he's been on the board for AIMO, which is a Kaggle competition of tuning small open models for solving math problems. Progress Prize 1 and 2 were won using DeepSeek and Qwen models. I don't see how that advances OpenAI's agenda. Prize 3 seems to be won using gpt-oss, but it was unusually strong in its weight class in the relevant time window, so this is fair.

He has no reason to refuse participating in OpenAI-funded activities, he's got no ideological dog in this fight, so that proves nothing. Him, Gowers and other extremely successful mathematicians are obviously just excited about AI getting good, which is a very natural thing to be excited about. I struggle to understand the theory of mind that makes one see anything suspicious here. Probably if you're convinced that AI is a sham and all smart people ought to recognize it as well as you do… but that's such a fanatical viewpoint. More sane to just dismiss him as a washed-up crank.

Context

Soteriologian Unstammering Papageno FearandLoathingintheMotte 1mo ago

I wouldn't read too much into this. Has anyone ever seen him use a theorem prover other than Lean? If anything, he's a lot more explicit about using the Lean "brand" than the OpenAI brand. Yet obviously he's not a sponsored shill for Big Lean (because there is no Big Lean).

What's clear is his organisation is pursuing private funding since the government axed his grants. I think it's likely OpenAI is among them given his appearance on their Twitter promo. Whether there's any exclusivity contract, who knows, I don't have a confident assessment.

Context

4bpp Now I am become a Helpful, Honest and Harmless Assistant, the destroyer of jobs Soteriologian 1mo ago

I do not understand where you were trying to go with your digression about constructivism. Is this just supposed to be FUD, as in you hope non-technical readers will look at it and walk away with an understanding like "what Terry produces with his AI is not a real proof"? Because the reality is rather the opposite: any proof in a constructive system like Lean is a proof in a non-constructive system like ZFC, and certainly it is especially a proof in the unspecified system that is mathematical practice outside of a handful of particularly rigorous subfields.

Context

Deleted by author

Soteriologian Unstammering Papageno Tretiak 1mo ago

I don’t think we’re referring to the same constructivism. I’m referring to this: https://en.wikipedia.org/wiki/Constructivism_(philosophy_of_mathematics)

Context

Soteriologian Unstammering Papageno 4bpp 1mo ago

as in you hope non-technical readers will look at it and walk away with an understanding like "what Terry produces with his AI is not a real proof"?

Come on, that is not an honest reading of what I said.

In constructive systems, one proof—in the Curry-Howard sense—is not interchangeable with another. For example, you can sort in O(n^2) (or worse), which would be a perfectly valid thing to do in classical mathematics, but a big oopsie in formally-verified software. This is doubly-so in cases like cryptography, where almost none of the "incorrectness" of compromised crypto is incorrect at all in the classical mathematical sense. Further, as someone who has a fair bit of experience playing around with these systems in the past, theorem provers often induce you to write code in a way that is easy to make formal statements about, rather than in a way that actually runs well enough that anyone would want to use it.

My point is the software verification story—which is surely of high importance to businesses—is far more nuanced than "throw the clanker at Lean and let it churn". I think the bucket of cold water I’m tossing is completely justified (even as someone who does indeed buy the baseline premise—that LLMs will be a big boon to working with theorem progress).

Context

Brainwavez Soteriologian 1mo ago · Edited 1mo ago

Your general argument is correct, because proofs won't scale without efficient tactics and smart sub-lemmas, which can't be algorithmically verified.

But Cunningham's Law obligates me to point out that proofs are interchangeable, it's called proof irrelevance. A value is basically only considered a proof if it's type's type is Prop, and once the proof is verified (i.e. value is type-checked), Lean can forget it and only remember that the theorem is proven (i.e. type is inhabited).

Context

Soteriologian Unstammering Papageno Brainwavez 1mo ago

Well, if we're going to be pedantic, proofs of kind Prop are irrelevant. I haven't followed this stuff in a while, but I'm pretty sure this opens a big can of worms, especially wrt erasure. Google's AI assures me this is all totally resolved and there's zero overhead, then links me to this thread which is not that old and smells of all the swamp I remember from back when I followed this stuff.

Context

Brainwavez Soteriologian 1mo ago · Edited 1mo ago

if we're going to be pedantic, proofs of kind Prop are irrelevant

If someone's "proofs" aren't of kind Prop, they're doing something wrong.

And irrelevance implies interchangeability: the definition

proof_irrel : ∀ {a : Prop} (h₁ h₂ : a), h₁ = h₂

combined with a property of equality

Eq.subst : ∀ {α : Sort u} {motive : α → Prop} {a b : α}, a = b → motive a → motive b

allow substituting a proof with any other of the same theorem.

Context

Soteriologian Unstammering Papageno Brainwavez 1mo ago · Edited 1mo ago

If someone's "proofs" aren't of kind Prop, they're doing something wrong.

This is simply not the case, as even my example in the OP shows. Proofs do contain computational info, and many times you cannot choose to ignore it: it really makes a difference whether the off switch turns the system off or on! You can’t just say "Well, there’s two options either way, so they’re equivalent," and be done with it.

Prop isn’t some special universe where your proofs live. If you have that mental model, you’ve never built anything with these systems. That’s like a "meat comes from the supermarket"-tier take on how this works.

Even the name "Prop" is a reference to it being intended for propositional equality, which in some flavours of dependent type theory is asserted to have one constructor (and traditionally, this axiom is one that induces poor ergonomics; hence, Voevodsky’s axiom and the development of HoTT/cubical theories).

-1

Context

Brainwavez Soteriologian 1mo ago

The theorem that specifies that an off switch turns the system off cannot be proven by a proof that specifies that an off switch turns the system on. It's analogous to the theorem that off maps to false, not that Toggle and Bool are isomorphic (which is similar to your original example).

I can prove that Toggle and Bool are isomorphic by defining a function toggleToBool that maps off to true, another that maps true to off, and showing composition both ways is identity. For that theorem specifically, it doesn't matter that toggleToBool is backwards. I cannot prove that toggleToBool off = false, analogous to "the off switch turns the system off", with that definition.

Likewise, if ChatGPT proves the main theorem with backwards definitions, it doesn't matter, like QT-1 in Asimov's short story Reason. If the main theorem is "the off switch turns the system off" and ChatGPT programmed the off switch to turn the system on, it will simply fail to prove.

Context

Lost_Geometer Soteriologian 1mo ago

Are they selling these things as proof of applicability to software verification? That seems silly, for many reasons. The domains share some language, but diverge a lot too.

The role of constructivity is a bit mysterious to me here. The headline results (at least the unit distance problem) were indeed constructive. On the other hand you can verify classical logic perfectly well by assuming, for example, a double negation oracle. Your programs won't run, of course, but they'll type check fine.

Context

Soteriologian Unstammering Papageno Lost_Geometer 1mo ago

Are they selling these things as proof of applicability to software verification?

I mean, that was certainly my impression. Maybe I've self-selected into circles where that's the sort of thing people naturally care about, and incorrectly assumed this is how everyone is thinking.

Sure, if all you care about is Erdos problems, I guess much of what I'm saying is moot. But this whole "Mythos is finding all these vulnerabilities, the whole meta has shifted!" narrative sure leads me to the next part of the story of "Ok, so how do we actually build stuff correctly in the presence of tools like Mythos?" Because the mythos story, like the classical theorem prover story, is also basically invulnerable to the primary problem of LLMs, hallucination. If the model hallucinates a vulnerability that doesn't work, just throw it away and try again! (Or, in the case of low-human capital, spam the issue tracker with bogus vulnerability reports. But I digress).

Context

sarker competency crisis actor Soteriologian 1mo ago

I mean, that was certainly my impression.

I think you made up a thing that people are doing and then wrote a post about how it's dumb that they are doing that.

If the model hallucinates a vulnerability that doesn't work, just throw it away and try again!

I mean, yes?

Context

4bpp Now I am become a Helpful, Honest and Harmless Assistant, the destroyer of jobs Soteriologian 1mo ago

I still can't make sense of what you are trying to say then. What is the sense in which you claim proofs to be "interchangeable" in classical systems, why do you believe this to be a desirable property, and how do you contend with the fact that constructive proofs are classical proofs?

There may be some HoTT-like sense in which two proofs in a given system are not equivalent (because you introduced some equivalence relation on proofs that does not relate them). If you have a sound embedding from proofs in this system to proofs in another system, and the other system comes with its own equivalence relation on proofs, the images of your two proofs under the embedding might still be equivalent in the other system. Is this not just what would happen here, if you assert that all classical proofs of a given sentence are equivalent (under an equivalence relation you picked) but not all constructive ones are (under an equivalence relation you picked)?

This is doubly-so in cases like cryptography, where almost none of the "incorrectness" of compromised crypto is incorrect at all in the classical mathematical sense.

You will have to elaborate on this statement. What do you take to be an instance of compromised crypto that is not "incorrect in the classical mathematical sense"?

Context

Soteriologian Unstammering Papageno 4bpp 1mo ago · Edited 1mo ago

Well, in the classical mathematical sense, crypto doesn’t work at all: just factor the composite number. The entire premise relies on the relatively ill-specified (by mathematical standards) notion of relative computational cost disparity.

What is the sense in which you claim proofs to be "interchangeable" in classical systems

What I mean is when I sit down to write something in a theorem prover, I speak in terms of Peano nats and inductive lists. When I write software anyone would ever want to actually use, I use machine ints and arrays. There is definitely a sense in which I’m doing the same thing in both cases, but nailing this down precisely is… well, non-trivial. (aka, a royal pain in the ass). Like, the discrepancy here is show-stoppingly problematic.

EDIT: basically, the computation is relevant, and more specifically the speed of computation is also relevant, both in terms of practical usability and outright security in the case of crypto. In the classical world, there is no notion of computational relevance at all -- in fact, you outright end up with these counterintuitive weirdo theorems like Zorn's Lemma. The counterintuitiveness depends entirely on this conspicuous lack of computational awareness in the model: as soon as you put that ingredient back in, you're in the construcivist world, and silly results like Zorn's Lemma don't hold anymore unless you postulate them. My contention is the class of problems for which one can say "I don't care about which proof, just that one exists" is relatively narrow and of little practical relevance (in the sense that one can get an incomputable proof, which is, in a fundamental technical sense, useless), while the class of problems about which one does care which proof, in the Curry-Howard sense, is large and of enormous practical consequence. I want to argue that enthusiasm over the former should not translate to enthusiasm about the latter.

Context

Corvos 4bpp 1mo ago

I think he means it in the sense that "a nice man on the phone told me he was from BitPanda and asked me to read my password to him so he could check if it was secure" or "SBF stole all my money" can't be detected by enforcing code correctness. All of the badness has happened outside the code, everything inside the code is a perfectly valid transaction.

Context

Brainwavez Corvos 1mo ago

But this general argument (ChatGPT can claim something in English, then formally prove something completely different) still applies if the proofs are classical.

Context

sarker competency crisis actor Soteriologian 1mo ago

Who is saying anything about AI assisted software verification via Lean? I've literally never heard of anyone suggesting that we should do this.

Context

ChickenOverlord sarker 1mo ago

Terrence Tao literally talks about this in his Lex Fridman interview:

https://youtube.com/watch?v=hh4cjZOddQA

Context

sarker competency crisis actor ChickenOverlord 1mo ago

No, he's talking about theorem proving, not formal software verification.

Context

BurdensomeCount Thou Shalt Read BC's Writings! 4bpp 1mo ago

Yeah, I'm not sure what his point about "the proof isn't canonical" is all about. A proof is a proof, and the underlying statement is just as valid regardless. The fact that there's no canonical isomorphism between a finaite dimensional vector space and its dual doesn't mean the two spaces aren't isomorphic or any such isomorphism is less "real".

Context

Soteriologian Unstammering Papageno BurdensomeCount 1mo ago

Part of the misunderstanding is the word "proof" has different flavour in classical mathematics vs theorem prover math, which is why the word is rarely used in this world except when appealing to legacy mathematicians. The native vocabulary is "3 is a value of type Nat" or "Refl is a value of type 0 + 0 = 0". A legacy mathematician would use the word "proof" only to describe the latter, and think it odd to use the word for the former. But internal to the formal logical framework, these are same notion, and so they should have the same word.

Context

07mk Soteriologian 1mo ago

The native vocabulary is "3 is a value of type Nat" or "Refl is a value of type 0 + 0 = 0". A legacy mathematician would use the word "proof" only to describe the latter, and think it odd to use the word for the former.

Neither of these would be considered a "proof" by a mathematician. Those are just statements of fact. A mathematical proof is a logical argument of deduction that shows that some statement must be true if some set of premises are true. Statements of fact are used in proofs, but a single statement of fact wouldn't actually constitute a proof.

Context

Soteriologian Unstammering Papageno 07mk 1mo ago

I think you've never spent any time with a formal theorem prover lol. You're conflating the definition of + with the proof Refl : 0 + 0 = 0.

Mathematicians can say whatever they want, but the story here is about mechanically-verified formal verification, and this is how formal verification works, whether legacy mathematicians think this pedantically or not.

Context

07mk Soteriologian 1mo ago

Mathematicians can say whatever they want

A legacy mathematician would use the word "proof" only to describe [("Refl is a value of type 0 + 0 = 0")]

(emphasis added)

So it seems like you're walking back the claim you made about what a (legacy) mathematician would say. Again, a legacy mathematician or any mathematician wouldn't describe that as a "proof."

Context

Soteriologian Unstammering Papageno 07mk 1mo ago

An appeal to a definition is a proof ("By the definition of XYZ, we conclude..."). A definition itself is not a proof ("Let xyz be defined as ...").

The confusion is coming from the fact that Refl : 0 + 0 = 0 is the syntax of an appeal to the definition of plus. It is not the syntax of defining the + operation.

Look, I'm not going to run around in circles all day about this. I'm pointing out that mathematicians are more sloppy in their vocabulary than the theorem prover, and that there are subtle differences over how terminology is used. I do not care to bikeshed endlessly over this. It's obviously true to anyone with any experience in these fields.

Context

What is this place?

This website is a place for people who want to move past shady thinking and test their ideas in a court of people who don't all share the same biases. Our goal is to optimize for light, not heat; this is a group effort, and all commentators are asked to do their part.

The weekly Culture War threads host the most controversial topics and are the most visible aspect of The Motte. However, many other topics are appropriate here. We encourage people to post anything related to science, politics, or philosophy; if in doubt, post!

Check out The Vault for an archive of old quality posts. You are encouraged to crosspost these elsewhere.

Why are you called The Motte?

A motte is a stone keep on a raised earthwork common in early medieval fortifications. More pertinently, it's an element in a rhetorical move called a "Motte-and-Bailey", originally identified by philosopher Nicholas Shackel. It describes the tendency in discourse for people to move from a controversial but high value claim to a defensible but less exciting one upon any resistance to the former. He likens this to the medieval fortification, where a desirable land (the bailey) is abandoned when in danger for the more easily defended motte. In Shackel's words, "The Motte represents the defensible but undesired propositions to which one retreats when hard pressed."

On The Motte, always attempt to remain inside your defensible territory, even if you are not being pressed.

New post guidelines

If you're posting something that isn't related to the culture war, we encourage you to post a thread for it. A submission statement is highly appreciated, but isn't necessary for text posts or links to largely-text posts such as blogs or news articles; if we're unsure of the value of your post, we might remove it until you add a submission statement. A submission statement is required for non-text sources (videos, podcasts, images).

Culture war posts go in the culture war thread; all links must either include a submission statement or significant commentary. Bare links without those will be removed.

If in doubt, please post it!

Rules

Recommended Realtime Chats

Link copied to clipboard

Action successful!

Error, please try again later.

Culture War Roundup for the week of May 25, 2026

Jump in the discussion.

What is this place?

Why are you called The Motte?

New post guidelines

Rules

Recommended Posts And Communities

Recommended Realtime Chats