It must be a silent R

Schal330@lemmy.world · 1 year ago

It must be a silent R

1 year ago

Plenty of fun to be had with LLMs.

Melvin_Ferd@lemmy.world · 1 year ago

So ChatGPT has ADHD

tektite@slrpnk.net · 1 year ago

ADHD contains twelve “r’s”

eatCasserole@lemmy.world · edit-2 1 year ago

Copilot seemed to be a bit better tuned, but I’ve now confused it by misspelling strawberry. Such fun.

werefreeatlast@lemmy.world · 1 year ago

I’ve one upped you!

werefreeatlast@lemmy.world · 1 year ago

And this

werefreeatlast@lemmy.world · 1 year ago

This is hilarious!

Ok so how many war planes can this AI fly over our own country without attacking Georrrryiar?

Matriks404@lemmy.world · edit-2 1 year ago

That’s one example when LLMs won’t work without some tuning. What it does is probably looking up information of how many Rs there are, instead of actually analyzing it.

exocortex@discuss.tchncs.de · 1 year ago

It cannot “analyze” it. It’s fundamentally not how LLM’s work. The LLM has a finite set of “tokens”: words and word-pieces like “dog”, “house”, but also like “berry” and “straw” or “rasp”. When it reads the input it splits the words into the recognized tokens. It’s like a lookup table. The input becomes “token15, token20043, token1923, token984, token1234, …” and so on. The LLM “thinks” of these tokens as coordinates in a very high dimensional space. But it cannot go back and examine the actual contents (letters) in each token. It has to get the information about the number or “r” from somewhere else. So it has likely ingested some texts where the number of "r"s in strawberry is discussed. But it can never actually “test” it.

A completely new architecture or paradigm is needed to make these LLM’s capable of reading letter by letter and keep some kind of count-memory.

pyre@lemmy.world · 1 year ago

the sheer audacity to call this shit intelligence is making me angrier every day

VirtualOdour@sh.itjust.works · 1 year ago

That’s because you don’t have a basic understanding of language, if you had been exposed to the word intelligence in scientific literature such as biology textbooks then you’d more easily understand what’s being said.

‘Rich in nutrients?! How can a banana be rich when it doesn’t have a job or generational wealth? Makes me so fucking mad when these scientists lie to us!!!’

The comment looks dumb to you because you understand the word ‘rich’ doesn’t only mean having lots of money, you’re used to it in other contexts - likewise if you’d read about animal intelligence and similar subjects then ‘how can you call it intelligence when it does know basic math’ or ‘how is it intelligent when it doesn’t do this thing literally only humans can do’ would sound silly too.

pyre@lemmy.world · 1 year ago

this is not language mate, it’s pr. if you don’t understand the difference between rich being used to mean plentiful and intelligence being used to mean glorified autocorrect that doesn’t even know what it’s saying that’s a problem with your understanding of language.

also my problem isn’t about doing math. doing math is a skill, it’s not intelligence. if you don’t teach someone about math they’re most likely not going to invent the whole concept from scratch no matter how intelligent they may be. my problem is that it can’t analyze and solve problems. this is not a skill, it’s basic intelligence you find in most animals.

also it doesn’t even deal with meaning, and doesn’t even know what it says means, and doesn’t even know whether it knows something or not, and it’s called a “language model”. the whole thing is a joke.

VirtualOdour@sh.itjust.works · 1 year ago

Again you’re confused, it’s the same difficulty people have with the word ‘fruit’ because in botany we use it very specifically but colloquially it means a sweet tasting enable bit of a plant regardless of what role it plays in reproduction. Colloquially you’d be correct to say that corn grain is not fruit but scientifically you’d be very wrong. Ever eaten an Almond and said ‘what a tasty fruit?’ probably not unless your a droll biology teacher making a point to your class.

Likewise in biology no one expects a slug or worms or similar to analyze and solve problems but if you look up scientific papers about slug intelligence you’ll find plenty, though a lot will also be about simulating their intelligence using various coding methods because that’s a popular phd thesis recently - computer science and biology merge in such interesting ways.

The term AI is a scientific term used in computer science and derives its terminology from definitions used in the science of biology.

What you’re thinking of is when your mate down the pub says ‘yeah he’s really intelligent, went to Yale and stuff’

They are different languages, the words mean different things and yes that’s confusing when terms normally only used in textbooks and academic papers get used by your mates in the pub but you can probably understand that almonds are fruit, peanuts are legumes but both will likely be found in a bag of mixed nuts - and there probably won’t be a strawberry in with them unless it was mixed by the pedantic biology teacher we met before…

Language is complex, AI is a scientific term not your friend at the bar telling you about his kid that’s getting good grades.

pyre@lemmy.world · 1 year ago

are you AI? you don’t seem to follow the conversation at all

Matriks404@lemmy.world · 1 year ago

Exactly my point. But thanks for explaining it further.

1 year ago

I doubt it’s looking anything up. It’s probably just grabbing the previous messages, reading the word “wrong” and increasing the number. Before these messages I got ChatGPT to count all the way up to ten r’s.

portuga@lemmy.world · 1 year ago

There’s a simple explanation: LLMs are “R” agnostic because they were specifically trained to not sail the high seas

cypherix93@lemmy.world · 1 year ago

"strawberry".split('').filter(c => c === 'r').length

Anticorp@lemmy.world · 1 year ago

deleted by creator

Sluyter548@lemmy.world · 1 year ago

A zero indexed array doesn’t have a different length ;)

anti-idpol action@programming.dev · 1 year ago

(\r (frequencies “strawberry”))

SpaceNoodle@lemmy.world · 1 year ago

len([c if c == ‘r’ for c in “strawberry”])

MrLLM@ani.social · 1 year ago

deleted by creator

Tiefling IRL@lemmy.blahaj.zone · 1 year ago

'strawberry'.match(/r/ig).length

beebarfbadger@lemmy.world · 1 year ago

Q: “How many r are there in strawberry?”

A: “This question is usually answered by giving a number, so here’s a number: 632. Mission complete.”

CanadaPlus@lemmy.sdf.org · 1 year ago

A one-digit number. Fun fact, the actual spelling gets stripped out before the model sees it, because usually it’s not important.

Toneswirly@lemmy.world · 1 year ago

Ladies and gentlemen: The Future.

Comrade Rain@lemmygrad.ml · 1 year ago

“In the Future, people won’t have to deal with numbers, for the mighty computers will do all the numbers crunching for them”

The mighty computers:

glimse@lemmy.world · 1 year ago

Copilot may be a stupid LLM but the human in the screenshot used an apostrophe to pluralize which, in my opinion, is an even more egregious offense.

It’s incorrect to pluralizing letters, numbers, acronyms, or decades with apostrophes in English. I will now pass the pedant stick to the next person in line.

Beanie@programming.dev · 1 year ago

That’s half-right. Upper-case letters aren’t pluralised with apostrophes but lower-case letters are. (So the plural of ‘R’ is ‘Rs’ but the plural of ‘r’ is ‘r’s’.) With numbers (written as ‘123’) it’s optional - IIRC, it’s more popular in Britain to pluralise with apostrophes and more popular in America to pluralise without. (And of course numbers written as words are never pluralised with apostrophes.) Acronyms are indeed not pluralised with apostrophes if they’re written in all caps. I’m not sure what you mean by decades.

Mouselemming@sh.itjust.works · 1 year ago

By decades they meant “the 1970s” or “the 60s”

I don’t know if we can rely on British popularity, given y’all’s prevalence of the “greengrocer’s apostrophe.”

Beanie@programming.dev · 1 year ago

Oh right - that would be the same category as numbers then. (Looked it up out of curiosity: using apostrophes isn’t incorrect, but it seems to be an older/less formal way of pluralising them.)

Mouselemming@sh.itjust.works · edit-2 1 year ago

Now, plurals aside, which is better,

The 60s

Or

The '60s

?

ProfessorProteus@lemmy.world · 1 year ago

Never heard of the greengrocer’s apostrophe so I looked it up. https://www.thoughtco.com/what-is-a-greengrocers-apostrophe-1690826

I absolutely love that there’s a group called the Apostrophe Protection Society. Is there something like that for the Oxford Comma? I’d gladly join them!

Beanie@programming.dev · 1 year ago

Hah, I do not like the greengrocer’s apostrophe. It is just wrong no matter how you look at it. The Oxford comma is a little different - it’s not technically wrong, but it should only be used to avoid confusion.

VirtualOdour@sh.itjust.works · 1 year ago

I use it for fun, frivolity, and beauty.

Aatube@kbin.melroy.org · 1 year ago

In fact, there’s a hit song called “Oxford Comma”!

SkyezOpen@lemmy.world · 1 year ago

I will die on both of those hills alongside you.

ryannathans@aussie.zone · 1 year ago

Why use for lowercase?

GetOffMyLan@programming.dev · 1 year ago

Because English is stupid

SlopppyEngineer@lemmy.world · 1 year ago

It’s not stupid. It’s just the bastard child of Germany, Dutch, French, Celtic and Scandinavian and tries to pretend this mix of influences is cool and normal.

DragonTypeWyvern@midwest.social · 1 year ago

Victim blaming and ableism!

The French and Scandinavian bits were NOT consensual.

(Don’t forget Latin btw)

✺roguetrick✺@lemmy.world · 1 year ago

There are plenty of non Norman consensual French words and the Danes had as much a right to be there as the Angles and the Saxons did in kicking the celts out. Let’s not even talk about if the anglo-Saxons had more legitimate claim than the norse-gaels.

bisby@lemmy.world · 1 year ago

Because otherwise if you have too many small letters in a row it stops looking like a plural and more like a misspelled word. Because capitalization differences you can make more sense of As but not so much as.

psud@aussie.zone · 1 year ago

As

That looks like an oddly capitalised “as”

That really gives the reason it’s acceptable to use apostrophes when pluralising that sort of case

Bunnylux@lemmy.world · 1 year ago

Oooh, pedant stick, pedant stick! Give it to me!!

warbond@lemmy.world · 1 year ago

Thank you. Now, insofar as it concerns apostrophes (he said pedantically), couldn’t it be argued that the tools we have at our immediate disposal for making ourselves understood through text are simply inadequate to express the depth of a thought? And wouldn’t it therefore be more appropriate to condemn the lack of tools rather than the person using them creatively, despite their simplicity? At what point do we cast off the blinders and leave the guardrails behind? Or shall we always bow our heads to the wicked chroniclers who have made unwitting fools of us all; and for what? Evolving our language? Our birthright?

No, I say! We have surged free of the feeble chains of the Oxfords and Websters of the world, and no guardrail can contain us! Let go your clutching minds of the anchors of tradition and spread your wings! Fly, I say! Fly and conformn’t!

…

I relinquish the pedant stick.

Optional@lemmy.world · 1 year ago

I salute your pedantry.

Melvin_Ferd@lemmy.world · 1 year ago

English is a filthy gutter language and deserves to be wielded as such. It does some of its best work in the mud and dirt behind seedy boozestablishments.

NeatNit@discuss.tchncs.de · 1 year ago

Prescriptivist much?

I Cast Fist@programming.dev · 1 year ago

It has [2] R’s, simple!

Optional@lemmy.world · 1 year ago

Jesus hallucinatin’ christ on a glitchy mainframe.

I’m assuming it’s real though it may not be but - seriously, this is spellcheck. You know how long we’ve had spellcheck? Over two hundred years.

This? This is what’s thrown the tech markets into chaos? This garbage?

Fuck.

DragonTypeWyvern@midwest.social · 1 year ago

I was just thinking about Microsoft Word today, and how it still can’t insert pictures easily.

This is a 20+ year old problem for a program that was almost completely functional in 1995.

Rhaedas@fedia.io · 1 year ago

I tried it with my abliterated local model, thinking that maybe its alteration would help, and it gave the same answer. I asked if it was sure and it then corrected itself (maybe reexamining the word in a different way?) I then asked how many Rs in “strawberries” thinking it would either see a new word and give the same incorrect answer, or since it was still in context focus it would say something about it also being 3 Rs. Nope. It said 4 Rs! I then said “really?”, and it corrected itself once again.

LLMs are very useful as long as know how to maximize their power, and you don’t assume whatever they spit out is absolutely right. I’ve had great luck using mine to help with programming (basically as a Google but formatting things far better than if I looked up stuff), but I’ve found some of the simplest errors in the middle of a lot of helpful things. It’s at an assistant level, and you need to remember that assistant helps you, they don’t do the work for you.

kubica@fedia.io · 1 year ago

5% of the times it works every time.

some_guy@lemmy.sdf.org · 1 year ago

You can come up with statistics to prove anything, Kent. 45% of all people know that.

CrypticCoffee@lemmy.ml · 1 year ago

Garbage in, garbage out. Keep feeding it shit data, expect shit answers.

RustyNova@lemmy.world · 1 year ago

I hate AI, but here it’s a bit understandable why copilot says that. If you ask the same thing to someone else they would surely respond 2 as they my imply you are trying to spell the word, and struggle on whether it’s one or two R on the last part.

I know it’s a common thing to ask in french when we struggle to spell our overly complicated language, so it doesn’t shock me

OsrsNeedsF2P@lemmy.ml · 1 year ago

Nah it’s because AI works at the token level which is usually words. They don’t even “see” the letters in the words

Hackworth@lemmy.world · 1 year ago

Thank you. For as much as this post comes up, I hope people are at least getting an education.

Clinicallydepressedpoochie@lemmy.world · 1 year ago

Welp, it’s reached my level of intelligence.

MystikIncarnate@lemmy.ca · 1 year ago

Aww, C’mon, don’t sell yourself short like that, I’m sure you’re great at… Something…

For example, you would probably be way more useful than an AI, if there was a power outage.

Clinicallydepressedpoochie@lemmy.world · 1 year ago

Geee, you really mean that?!

beebarfbadger@lemmy.world · 1 year ago

Sure, when the chips fall, eating a computer rig won’t stave off starvation for even a minute.

Clinicallydepressedpoochie@lemmy.world · 1 year ago

Codex@lemmy.world · 1 year ago

I was curious if (since these are statistical models and not actually counting letters) maybe this or something like it is a common “gotcha” question used as a meme on social media. So I did a search on DDG and it also has an AI now which turned up an interestingly more nuanced answer.

It’s picked up on discussions specifically about this problem in chats about other AI! The ouroboros is feeding well! I figure this is also why they overcorrect to 4 if you ask them about “strawberries”, trying to anticipate a common gotcha answer to further riddling.

DDG correctly handled “strawberries” interestingly, with the same linked sources. Perhaps their word-stemmer does a better job?

ReveredOxygen@sh.itjust.works · 1 year ago

DDG’s one isn’t a straight LLM, they’re feeding web results as part of the prompt.

CommanderCloon@lemmy.ml · 1 year ago

Lmao it’s having a stroke

sus@programming.dev · edit-2 1 year ago

many words should run into the same issue, since LLMs generally use less tokens per word than there are letters in the word. So they don’t have direct access to the letters composing the word, and have to go off indirect associations between “strawberry” and the letter “R”

duckassist seems to get most right but it claimed “ouroboros” contains 3 o’s and “phrasebook” contains one c.

affiliate@lemmy.world · 1 year ago

maybe it’s using the british pronunciation of “strawbry”