@kky's banner p

kky


				

				

				
1 follower   follows 0 users  
joined 2025 March 03 19:40:22 UTC

				

User ID: 3570

kky


				
				
				

				
1 follower   follows 0 users   joined 2025 March 03 19:40:22 UTC

					

No bio...


					

User ID: 3570

Compute can be spent in many different ways. We're moving from a paradigm of scaling up the size of a model, in terms of parameter count, in favor of scaling run-time compute (time spent thinking) and reinforcement learning.

Very interesting aside! However, it doesn't address the question of diminishing returns.

There isn't much of a market for AI playing Pokémon. There is immense demand for them to be good at coding and maths. We've seen stunning progress in that regard, as you acknowledge. You attempt to back-chain your argument, saying that they're said to be good at maths but look, they're shit at Pokémon, which apparently invalidates the former. It really doesn't.

I've used AI for coding, which you mention further down as a crowning triumph. It is... not particularly good. It struggles at anything past a very general form of the problem. It was very useful for copy-pasting similar pieces of code! Not very useful for building new features. It had a distinct habit of waiting until the interesting or important part of the problem and leaving a comment saying "Implement a function to do X!" Hmm, very interesting, if I tried that I'd get fired. So no, I think this is a valid argument. AI can be taught to the test, and indeed appears to have been, but the actual world involves far more de novo work than the test includes. That's why school-trained pre-professionals tend to need a pretty hefty ramp-up to start being really useful - they've only been working on tests so far. Pokemon is interesting precisely because it has not been trained for. You should expect more, not less, untrained situations for AI to do anything meaningful in the job market - and you should weight untrained situations in your analysis several orders of magnitude higher than trained situations.

Do you use AI to augment your work? Is it going to take your job? On what kind of a timescale? Do you think you'd be able to substitute yourself for an unmonitored AI without issue on any tasks? If not, what errors do you think it would make, and why? Honestly interested in your answers here, if nothing else. I would greatly respect you for putting your money where your mouth is on this one and bringing receipts.

I had never given it any thought before the demonstration. But plenty of people have speculated that LLMs would never be any good at video games, and now that they're not good but not terrible, it's only a matter of time before they're great. And that time can be very short.

Hmm... you think getting stuck in what appears to be a permanent loop is not terrible? Is this the behavior you'd accept from anyone working for you?

The thing that keeps puzzling me about your comments is that you seem to simultaneously view ANY capacity in a task as an impressive accomplishment at the same time as you assert that AI has overwhelming general ability. Those two don't go hand in hand, except maybe by this little quote. Any capacity seems to be, for you, an indisputable sign of unlimited future capacity - as though the only question to be answered is total disability versus infinite ability. There's no clear reason that this has to be the limit of the answer space. Line go up... forever? Like with bitcoin? There's also the rather bizarre fixation on LLMs - even though something like, say, an octopus is very obviously not an LLM and still has meaningful if primitive intelligence. The sheer gnostic power of your position is hard to argue against, and unfortunately I don't find it very convincing based on my own experience. It takes rather a lot on faith.

Glad you like it!

By "think spacially/temporally," do you mean "produce valid outputs for spacial/temporal problems" or "model space and time as first-order constructs"? I definitely believe the former, but I'm skeptical of the latter. Claude's adventures in the "tard yard" showed a real difficulty in grasping that, if the back of the house is a closed-off yard, maybe you should exit through the front. Looping is a problem, but I don't think any of us would consider this to be a particularly information-dense problem. The only way it could be is if the AI's ability to recognize the problem is hamstringed by its need to encode the state as a totally different sort of resource (linguistic tokens) - which brings us around to the top.

Battles are, of course, way easier, because they can be cast as a narrative (and I'm pretty sure every AI is trained on Smogon's ample fora).

Another interesting thing, not sure what to think of it. When I play a game like this, my default behavior on entering a new area is to explore it thoroughly and learn what there is to learn about it before seeking out objectives. Claude seems to prioritize specific objectives over general exploration, to its detriment. Wonder why that is?

You are literally erasing my existence, mods???

My culture is NOT your costume, fake-normie.

Well, if that's what you want to call an Anthropic researcher who decided to make their experiment public.

"Claude Plays Pokémon continues on as a researcher's personal project."

Ha, wow! Was not aware of that. I guess that makes sense w/r/t the funding.

You've written a lot. I think it's best to focus. (As much as I'm tempted to talk about concepts.)

What I understand to be your main point is (my words because you did not state it in concrete terms):

AI has rapidly improved in the recent past. We should expect it to continue improving at a similar rate. So if you see any success in a given metric now, you should expect to see much more success in the near future.

Which is a fair point! The only counterargument to that is on the specifics: why is it improving and what do we expect future improvements to look like? Almost all of the improvement thus far is based on throwing more compute at the problem - so if we're going to see improvements of the same kind, we should see them based on more compute. However, improvements in models are logarithmic - steps up in capacity tend to require 10x compute (by appearances you're pretty educated about AI, so I suspect that is not news to you). So although improvements in efficiency can effectively allow for somewhat more compute, like with Deepseek, we should expect that throwing more compute at the problem will get prohibitively expensive. I believe this has already happened. So while under hypothetical conditions of infinite compute we could have an LLM that infinitely approximates an AGI, similar to the implausible premise of Searle's Chinese Room (a book that allows one to construct a correct response to any input), we are unlikely to see that in practice.

So, how are we to get to AGI, in my opinion? By improving AI on completely different parameters from what currently exists - a revolution in thought about how AI should function. And tests like Claude Plays Pokemon are a fun way of showing us where the gaps in our thinking are.

For my own point of view:

This AI can strategize in battle, understand complex instructions, and process information, BUT it struggles with spatial reasoning in a poorly-rendered 2D GameBoy game, therefore it's not intelligent.

That's not the argument. The argument is: this AI is struggling in a VERY non-human way with what we would consider a pretty trivial task. This reveals that its operational parameters are not like those of a human, and that we should figure out where else it is going to perform at sub-human levels. The fact that we're seeing this at the same time as it performs at SUPERhuman levels in other tasks shows that this is not AGI, or even in the direction of AGI, but rather is tool AI. (I assume you think humans are, at the very least, general intelligence - right?)

I don't think you've addressed this point, except here:

It wasn't designed to play Pokémon. It still does a stunningly good job when you understand what an incredibly difficult task that is for a multimodal LLM.

Why should I care? AGI is supposed to be GENERAL. This is the stuff that's supposed to be taking people's jobs in a few years! And yet it gets lost in Cerulean City? As a tech demo, this is very cool - it's remarkable that someone was able to pipe these pieces together, and the knowledge base idea is very cool and is a plausible direction to take new LLMs into. A hypothetical Claude 3.8 that is explicitly trained to make knowledge base manipulation a central feature of the model could potentially perform miles better on some of these tasks. But all you've told me is that I should expect AI to struggle with these tasks. In which case: doesn't it sound like we agree? We both agree that there was no reason to expect Claude to succeed with Pokemon at the level of an eight-year-old. So, from the perspective of an uncommitted third party, given that an AI skeptic and an AI optimist have both agreed that an LLM can't play Pokemon like an eight-year-old... well, it feels pretty clear to me.

Obviously, if this becomes a big selling point for the next generation of LLMs, then we'll see them all benchmarked on Pokemon Red speedruns and you can I-told-you-so about AI being able to beat Pokemon. I don't doubt the ability of motivated corporations to "teach to the test" - it's what we've been seeing with "reasoning" AIs. It's just one of the problems with setting up real tests of ability for some of these AIs, because they get so much data that it's all but impossible to ensure you have a pure test like what the IQ test aspires to.

In other news: a streamer with deep pockets and a love of AI has decided to have Claude play Pokemon.

To get this working, ClaudeFan (as I'll be calling the anonymous streamer) set up some fairly sophisticated architecture: in addition to the basic I/O shims required to allow an LLM to interface with a GameBoy emulator and a trivial pathfinder tool, Claude gets access to memory in the form of a "knowledge base" which it can update as it desires and (presumably) keep track of what's happening throughout the game. All this gets wrapped up into prompts and sent to Claude 3.7 for analysis and decision. Claude then analyzes this data using a <thinking>reasoning model</thinking>, decides on its next move, and then starts the process over again. Finally, while ClaudeFan claims that "Claude has no special training for Pokemon," it's obvious by the goal-setting that the AI has some external knowledge of where it's supposed to go - it mentions places that it has not yet reached by name and attempts to navigate towards them. Presumably part of Claude's training data came from GameFaqs. (Check out the description on the Twitch page for more detail on the model.)

So, how has this experiment gone?

In a word: poorly. In the first week of playing, it managed to spend about two days wandering in circles around Mt Moon, an early-game area not intended to be especially challenging to navigate. It managed to leave after making a new decision for unexplained reasons. Since then, it has been struggling to navigate Cerulean City, the next town over. One of its greatest challenges has been a house with a yard behind it. It spent some number of hours entering the house, talking to the NPC inside, exhausting all dialogue options, going out the back door into the yard, exploring the yard thoroughly (there are no outlets), re-entering the house, and starting from the top. It is plausible, though obviously not possible to confirm, that ClaudeFan has updated the model some to attempt to handle these failures. It's unclear whether these updates are general bugfixes

How should we interpret this? On the simplest level, Claude is struggling with spacial modeling and memory. It deeply struggles to interpret anything it's seeing as existing in 2D space, and has a very hard time remembering where it has been and what it has tried. The result is that navigation is much, much harder than we would anticipate. Goal-setting, reading and understanding dialogue, and navigating battles have proven trivial, but moving around the world is a major challenge.

The current moment is heady for AI, specifically LLMs, buoyed up by claims by Sam Altman types of imminent AGI. Claude Plays Pokemon should sober us a little to that. Claude is a top performer on things like "math problem-solving" and "graduate-level reasoning", and yet it is performing at what appears to me below the first percentile at completing a video game designed for elementary schoolchildren. This is a sign that what Claude, and similar tools, are doing is not in fact very analogous to what humans do. LLM vendors want the average consumer to believe that their models are reasoning. Perhaps they are not doing that after all?

It's a bit of a tired point, but LLMs are known to be "next likely text" generators. Given textual input, they predict the most likely desired output and return it. Their power at doing this is quite frankly superhuman. They can generate text astonishingly quickly and with unparalleled flexibility in style and capacity for word use. It appears that they are so good at handling this that they are able to pass tests as if they were actually reasoning. The easiest way to trip them up, on the other hand, is to give them a question that is very much like a very common question in their training data but with an obvious difference that makes the default answer inappropriate. The AI will struggle to get past its training and see the question de novo, as a human would be able to. (In case anyone remembers - this is the standard complaint that AI does not have a referent for any of the words it uses. There is no model outside of the language.)

So, as you might guess, I'm pretty firmly on the AI-skeptic side as far as LLMs are concerned. This is usually where these conversations end, as the AI-skeptics believe they've proven their case and (as I understand it) the AI-optimists don't believe that the skeptics have any kind of provable, or even meaningful, model for what intelligence is. But I do actually believe that AGI (meaning: AI that can reason generally, like a human - not godlike Singularity intelligence) is possible, and I want to give an account of what that would entail.

First, and most obviously, an actual AGI must be able to learn. All our existing AI models have totally separate learning and output phases. This is not how any living creature works. An actual intelligence must be able to learn as it attempts to apply its knowledge. This is, I believe, the most natural answer for what memory is. Our LLMs certainly appear to "remember" things that they encountered during their training phase - the fault is in our design that prevents them from ever learning again. However, this creates new problems in how to "sanitize" memory to ensure that you don't learn the wrong things. While the obvious argument around Tay was whether it was racist or dangerously based, a more serious concern is: should an intelligence allow itself to get swayed so easily by obviously biased input? The users trying to "corrupt" Tay were not representative and were not trying to be representative - they were screwing with a chatbot as a joke. Shouldn't an intelligence be able to recognize that kind of bad input and discard it? Goodness knows we all do that from time to time. But I'm not sure we have any model for how to do that with AI yet.

Second, AI needs more than one capacity. LLMs are very cool, but they only do one thing - manipulate language. This is a core behavior for humans, but there are many other things we do - we think spacially and temporally, we model the minds of other people, we have artistic and other sensibilities, we reason... and so on. We've seen early success in integrating separate AI components, like visual recognition technology with LLMs (Claude Play Pokemon uses this! I can't in good faith say "to good effect," but it does open meaningful doors for the AI). This is the direction that AGI must go in.

Last, and most controversial: AI needs abstract "concepts." When humans reason, we often use words - but I think everyone's had the experience of reasoning without words. There are people without internal monologues, and there are the overwhelming numbers of nonverbal animals in the world. All of these think, albeit the animals think much less ably than do humans. Why, on first principles, would it make sense for an LLM to think when it is built on a human capability absent in other animals? Surely the foundation comes first? This is, to my knowledge, completely unexplored outside of philosophy (Plato's Forms, Kant's Concepts, to name a couple), and it's not obvious how we could even begin training an AI in this dimension. But I believe that this is necessary to create AGI.

Anyway, highly recommend the stream. There's powerful memery in the chat, and it is VERY funny to see the AI go in and out of the Pokemon center saying "Hm, I intended to go north, but now I'm in the Pokemon center. Maybe I should leave and try again?" And maybe it can help unveil what LLMs are, and aren't - no matter how much Sam Altman might wish otherwise!