
Interesting that this was released literally one hour after another discussion about Meta ( https://news.ycombinator.com/item?id=43562768 )

> At this point it does not matter what you believe about LLMs: in general, trusting LeCun's words is not a good idea. Add to this that LeCun is directing an AI lab that at the same time has the following huge issues:

1. The weakest LLM among the big labs with similar resources (and even smaller resources: DeepSeek).

2. They say they are focusing on open source models, but their license is among the least open of the available open-weight models.

3. LLMs, and the new AI wave in general, put CNNs, a field where LeCun worked (but did not start himself), a lot more in perspective; now they are just a chapter in a book composed mostly of other techniques.

It would be interesting to see antirez's opinion on this new release.



Not that I agree with all the linked points, but it is weird to me that LeCun consistently states LLMs are not the right path, yet LLMs are still the flagship models they are shipping.

Although maybe he's using an odd definition of what counts as an LLM.

https://www.threads.net/@yannlecun/post/DD0ac1_v7Ij?hl=en


> LeCun consistently states LLMs are not the right path yet LLMs are still the main flagship model they are shipping.

I really don't see what's controversial about this. If that's to mean that LLMs are inherently flawed/limited and just represent a local maximum in the overall journey toward developing better AI techniques, I thought that was a pretty universal understanding by now.


A local maximum that keeps rising, with no bar/boundary in sight.


Even a narrow AI can get better with no bar in sight, but it will never get to AGI. That is the argument here.


That is how I read it. Transformer-based LLMs have limitations that are fundamental to the technology. It does not seem crazy to me that a guy involved in research at his level would say that they are a stepping stone to something better.

What I find most interesting is his estimate of five years, which is soon enough that I would guess he sees one or more potential successors.


In our field (AI) nobody can see even 5 months ahead, including people who are training a model today to be released 5 months from now. Predicting something 5 years from now is about as accurate as predicting something 100 years from now.


Which would be nice if LeCun hadn't predicted the success of neural networks more broadly about 30 years before most others.


That could be survivor bias. What else has he predicted?


I don't know. The only point I'm trying to make is that predictions can indeed survive intervals exceeding 5 months or even 5 years.


I don't understand what LeCun is trying to say. Why does he give an interview saying that LLMs are almost obsolete just when they're about to release a model that increases the SotA context length by an order of magnitude? It's almost like a Dr. Jekyll and Mr. Hyde situation.


LeCun fundamentally doesn't think bigger and better LLMs will lead to anything resembling "AGI", although he thinks they may be some component of AGI. Also, he leads the research division; increasing context length from 2M to 10M is not interesting to him.


He thinks LLMs are a local maximum, not the ultimate one.

Doesn't mean that a local maximum can't be useful!


If that's what he said, I'd be happy, but I was more concerned about this:

> His belief is so strong that, at a conference last year, he advised young developers, "Don't work on LLMs. [These models are] in the hands of large companies, there's nothing you can bring to the table. You should work on next-gen AI systems that lift the limitations of LLMs."

It's OK to say that we'll need to scale other mountains, but I'm concerned that the "Don't" there would push people away from the engineering that would give them the relevant inspiration.


> but I'm concerned that the "Don't" there would push people away from the engineering that would give them the relevant inspiration.

You have way more yea-sayers than nay-sayers. There is never a risk that we don't go hard enough into the current trends; there is, however, a risk that we go too hard into them and ignore other paths.


But ... that's not how science works. There are myriad examples of engineering advances pushing basic science forward. I just can't understand why he'd have such a "fixed mindset" about a field where the engineering is advancing by an order of magnitude every year.


> But ... that's not how science works

Not sure where this is coming from.

Also, it's important to keep in mind the quote "The electric light did not come from the continuous improvement of candles."


Well, having candles and kerosene lamps to work late definitely didn't hurt.

But in any case, while these things don't work in a predictable way, the engineering work on lightbulbs in your example led to theoretical advances in our understanding of materials science, vacuum technology, and of course electrical systems.

I'm not arguing that LLMs on their own will certainly lead directly to AGI without any additional insights, but I do think that there's a significant chance that advances in LLMs might lead engineers and researchers to inspiration that will help them make those further insights. I think that it's silly that he seems to be telling people that there's "nothing to see here" and no benefit in being close to the action.


I don't think anyone would disagree with what you're saying here, especially LeCun.


Listening to Science Friday today on NPR, the two guests did not think AGI was a useful term; they felt it would be better to focus on how useful actual technical advances are than on some sort of generalized human-level AI, which they saw as more of an ill-defined marketing term, except in the sense of "makes the company so many billions of dollars."


A company can do R&D into new approaches while optimizing and iterating upon an existing approach.


I mean, they're not comparing with Gemini 2.5 or the o-series of models, so I'm not sure they're really beating the first point (and their best model is not even released yet).

Is the new license different? Or does it still have the same issues raised in the second point?

I think the problem with the third point is that LeCun is not leading Llama, right? So this doesn't change things, though mostly because it wasn't a good consideration before.


LeCun doesn't believe in the LLM architecture anyway.

It could easily be that he just researches the bleeding edge with his team while others work on Llama and experiment with new techniques on it.

Any blog post or YouTube documentary going into detail on how they work?



