Hacker News

"The distinction between likely and grammatical, which all humans have, is entirely foreign to ChatGPT and its fellow LLMs."

Because grammar is not determined by frequency.



Sure it is. Something is grammatical because you have frequently heard sentences constructed that way in the past; something is ungrammatical because you have never heard sentences constructed that way. It's purely based on frequency.


This is wrong. "Colorless green ideas sleep furiously" is Chomsky's famous example of a clearly grammatical but meaningless sentence that was entirely new and statistically unlikely. There are many grammatically correct sentences you will hear in the future that you've never heard before.


Sorry, but what is wrong then? The example is grammatically correct because it fits an established pattern: you may never have heard that exact phrase, but you have definitely heard a multitude of phrases exhibiting the same patterns/rules.
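To make the pattern-vs-phrase distinction concrete, here's a toy sketch (the mini-corpus and hand-labeled POS dictionary are invented for illustration; real parsers and LLMs work nothing like this). The exact sentence has frequency zero, but its abstract part-of-speech pattern occurs throughout the corpus:

```python
# Toy illustration: judge a sentence by the frequency of its abstract
# part-of-speech pattern rather than by exact-phrase frequency.
# Corpus and POS labels below are made up for the example.

corpus = [
    "restless young dogs bark loudly",
    "hungry old cats meow constantly",
    "quiet small birds sing sweetly",
]

# Hand-labeled part-of-speech tags for every word we "know".
pos = {
    "restless": "ADJ", "young": "ADJ", "dogs": "NOUN", "bark": "VERB",
    "loudly": "ADV", "hungry": "ADJ", "old": "ADJ", "cats": "NOUN",
    "meow": "VERB", "constantly": "ADV", "quiet": "ADJ", "small": "ADJ",
    "birds": "NOUN", "sing": "VERB", "sweetly": "ADV",
    "colorless": "ADJ", "green": "ADJ", "ideas": "NOUN",
    "sleep": "VERB", "furiously": "ADV",
}

def pattern(sentence):
    # Map each word to its POS tag, giving the sentence's abstract shape.
    return tuple(pos[w] for w in sentence.split())

seen_sentences = set(corpus)
seen_patterns = {pattern(s) for s in corpus}

novel = "colorless green ideas sleep furiously"
print(novel in seen_sentences)          # False: exact phrase never seen
print(pattern(novel) in seen_patterns)  # True: ADJ ADJ NOUN VERB ADV is familiar
```

So "frequency" can rescue the argument only if it means frequency of abstract patterns, not of surface strings, which is exactly the point being debated.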


LLMs have been exposed to much larger datasets than any human ever has. If it were about frequency, then humans would make the grammatical errors and the LLMs would not. The LLMs are making grammatical errors, as shown in the article, so it is not about frequency.


Humans make grammatical errors (non-native speakers, deliberate expressive effect, typos, dialects, slang, etc.) => datasets contain a percentage of grammatical errors => LLMs make them too. It can still be about frequency, with infrequent errors carried along because of it. I don't see any contradiction.


That sentence fits what I said just fine. The words are arranged in an order that I have frequently seen in the past, so I accept it as grammatical (albeit meaningless).


You've seen those words in that order in the past? Then pick an example you haven't seen ("orange ideas", etc.)



