As a software developer who took an elective in neural networks: when people call LLMs stochastic parrots, that's not a criticism of their results.

It's literally a description of how they work.

The so-called training data is used to build a huge database of words and the probability of them fitting together.

Stochastic because the whole thing is statistics.
Parrot because the answer is just repeating the most probable word combinations from its training dataset.
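The "stochastic" and "parrot" parts can be sketched with a toy next-word model. This is a hypothetical, hugely simplified stand-in (a bigram counter over a made-up corpus); a real LLM uses subword tokens and billions of parameters, but the sample-the-likely-next-token idea is the same:

```python
import random
from collections import defaultdict

# Hypothetical toy corpus; real models train on vastly more text.
corpus = "the cat sat on the mat the cat ate the fish".split()

# Count how often each word follows each other word.
counts = defaultdict(lambda: defaultdict(int))
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

def next_word(prev):
    """Sample the next word with probability proportional to its count."""
    options = counts.get(prev)
    if not options:
        return None  # dead end: this word never had a successor
    words = list(options)
    weights = [options[w] for w in words]
    return random.choices(words, weights=weights)[0]

# "Stochastic" (sampled) "parroting" (only ever re-emits corpus patterns).
out = ["the"]
for _ in range(5):
    nxt = next_word(out[-1])
    if nxt is None:
        break
    out.append(nxt)
print(" ".join(out))
```

Every word it can ever produce comes straight from the training text; only the dice rolls vary between runs.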

Calling an LLM a stochastic parrot is like calling a car a motorised vehicle with wheels. It doesn't say anything about cars being good or bad. It does, however, take away the magic. So if you feel a need to defend AI when you hear the term stochastic parrot, consider that you may have elevated them to a god-like status, and that's why you go on the defensive when the magic is dispelled.

In reply to Leeloo

pretty sure that's a fallacy, kinda like "a sculpture is just stone, therefore it can't be beautiful", or "a cell is just a bunch of proteins, therefore it cannot be a living creature".

Now, I'm not saying a huge database of probabilities can be intelligent (I hope it can't), just that I think a better argument is needed for why, in the case of a database of probabilities, what it's made of prevents it from being intelligent.

In reply to Wolf480pl

@wolf480pl
You would have to redefine intelligence for asking whether a list of numbers is intelligent to even make sense.

And your comparison is completely off. Beauty is not a property of the sculpture; it's, as they say, "in the eye of the beholder". Some people find curves beautiful. Can a stone have curves? Yes, of course. Others may find sharp edges beautiful. Can a stone have sharp edges? Again, yes.

I suggest you consider once again whether you are elevating "AI" to a god-like status.

In reply to Wolf480pl

@wolf480pl
The effect that you are noticing is because the writers of the training material were intelligent. You are seeing the reflection of their intelligence in the output of the LLM. Here is output from an LLM that describes what an LLM is, and what it is not: johntinker.substack.com/p/misu…
In reply to James Wood

@mudri @lmorchard it’s not inductive at all though. It’s just parroting the patterns it sees in its training data. If it wasn’t common to see exchanges like that, the response would be utter nonsense.

People misunderstand what “training” is. It’s modeling the input. Humans develop the rules for how to model that input. Emergent properties of that process can easily *seem* like thinking or reason, but it’s an illusion.
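A minimal sketch of "training is modeling the input", using a hypothetical unigram model: the "trained" parameters are just the observed word frequencies, and those frequencies make the training text more likely (lower average negative log-likelihood) than any alternative assignment, such as a uniform one:

```python
import math
from collections import Counter

# Toy "training data" (made up for illustration). Training here means
# choosing probabilities that make exactly this data as likely as possible.
data = "to be or not to be".split()
n = len(data)

# Maximum-likelihood fit of a unigram model: the empirical frequencies.
freq = {w: c / n for w, c in Counter(data).items()}

def nll(model):
    """Average negative log-likelihood of the data under a model."""
    return -sum(math.log(model[w]) for w in data) / n

# Compare against a uniform model over the same vocabulary: the fitted
# frequencies score strictly better, because they mirror the input.
uniform = {w: 1 / len(freq) for w in freq}
print(nll(freq), nll(uniform))
```

Nothing in this procedure "understands" the text; it only makes the model's output distribution match the input distribution, which is the point being made above.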

In reply to Les Orchard

@lmorchard @mudri Be careful not to conflate the actual language model with its user interface. Whatever was sent to or received from the LLM went through the chatbot layer. Or it was possibly handled by the chatbot layer without ever touching the LLM. We don't know, because the whole system is opaque.

This casual experiment may not be telling you what you think it's telling you. :)
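A sketch of that point, with entirely made-up function names (real chat products are opaque, so this is only a plausible shape): the wrapper around the model can rewrite what the model sees, or answer without calling the model at all:

```python
# Hypothetical chat pipeline; every name here is invented for illustration.

CANNED_REFUSAL = "I can't help with that."

def moderation_filter(text):
    """Stand-in policy check that can short-circuit the pipeline."""
    return "forbidden" in text.lower()

def call_llm(prompt):
    """Placeholder for the actual language-model call."""
    return f"model output for: {prompt!r}"

def chatbot(user_message):
    # The wrapper may answer without the message ever touching the model...
    if moderation_filter(user_message):
        return CANNED_REFUSAL
    # ...and it rewrites what the model actually sees (system prompt,
    # conversation history, retrieved documents, and so on).
    prompt = "You are a helpful assistant.\nUser: " + user_message
    return call_llm(prompt)

print(chatbot("a forbidden topic"))  # never reaches the model
print(chatbot("hello"))              # reaches the model, with added context
```

So an experiment against the chat interface tests the whole pipeline, not the bare model.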

In reply to Tobias Ernst

@tobifant Whilst we obviously can't show if humans have a soul, we can absolutely show that humans have e.g. abstracted concept frameworks that are not solely based on averages of language statistics. I understand what an "owl" is, for example, in a way separate from the numerical relationships between the word "owl" and other words. That is a really fundamental information processing difference and allows me to construct *novel* understandings of that concept in ways that an LLM couldn't.
In reply to Tobias Ernst

An LLM is not able to reason. It can fool you into believing it is intelligent and self-aware, when in fact it just parrots the patterns it has stored. These patterns are, however, very human-like, as they are the result of training on texts written by actual humans.

The fun part starts now that the entire internet has been flooded by #ai generated content. All of this will be the training set for the next generation of LLMs. What could possibly go wrong?
@leeloo

In reply to Tobias Ernst

@Tobias Ernst @Leeloo We are already way past that point, although it isn't distributed evenly. One of the reasons is that LLMs are machine learning applications, and machine learning is extremely effective at reaching its stated goals; the problems are defining those goals, and the fact that the major LLM companies hide them as trade secrets.

But it isn't difficult to figure out that these companies favor output that looks and sounds as human as possible, in order to exploit our innate tendency to seek humanity in looks and sounds, including language.

In reply to Leeloo

I myself like calling LLMs "glorified autocomplete". Or "Т9 на максималках" ("T9 maxed out") in Russian.

It's surprising just how defensive some people get when I say that, even when they agree with my definition. They keep believing that if you just give this thing more parameters, something magical, something more than the sum of its parts, will emerge, any moment now, just one more model generation, just one more order of magnitude, I promise.

In reply to Gregory

@grishka
The fun part is that the next generation will have the current state of the internet as its training set. An internet that is flooded by #ai generated content.

The biggest issue those ai companies face at the moment is how to only ingest human generated content and filter out as much as possible of all of the ai generated crap that is out there.

Good luck with that.
@leeloo

In reply to Leeloo

As a side note, I sometimes worry about how much parroting happens in academia among humans even before/without LLMs, where people repeat things without understanding what they’re talking about. I guess at least for students, it sometimes is about learning to talk the talk, and then gradually developing more understanding and genuine thinking around topics. At least we humans are capable of developing that understanding if we bother to try.
In reply to Troed Sångberg

@troed
No, this is not just untrue, it's absurdly untrue.

Most of human thought isn't even language-based, let alone representable as some kind of token generation. Most human thought is based on platforms that evolved long before language, platforms that are demonstrably more capable than large language models at reasoning about the real world: other entities that share them are able to demonstrate quite sophisticated reasoning without involving language.

In reply to Resuna

@resuna At no point am I stating that LLMs are exactly like human brains.

blog.troed.se/posts/the-delta-…