- cross-posted to:
- fuck_ai@lemmy.world
- cross-posted to:
- fuck_ai@lemmy.world
Big tech boss tells delegates at Davos that broader global use is essential if technology is to deliver lasting growth
Nadella maybe knows a lot more than any of us about LLMs/GenAI tech, but one doesn’t need to know anything about LLMs (or even technology) to know that an oligarch like Nadella cannot be trusted (in any context).
Power corrupts, we’ve know this since the dawn of time
I’m kind of more-sympathetic to Microsoft than to some of the other companies involved.
Microsoft is trying to leverage the Windows platform that they control to do local LLM use. I’m not at all sure that there’s actually enough memory out there to do that, or that it’s cost-effective to put a ton of memory and compute capacity in everyone’s home rather than time-sharing hardware in datacenters. Nor am I sold that laptops — which many “Copilot PCs” are — are a fantastic place to be doing a lot of heavyweight parallel compute.
But…from a privacy standpoint, I kind of would like local LLMs to be at least available, even if they aren’t as affordable as cloud-based stuff. And at least Microsoft is at least supporting that route. A lot of companies are going to be oriented towards just doing AI stuff in the cloud.
They’re trying to leverage their windows platform to seek rent (sell premium cloud services like LLM access) for shit people don’t even want because they aren’t satisfied making very respectable money on licenses.
Is that true? I haven’t heard MS say anything about enabling local LLMs. Genuinely curious and would like to know more.
That’s why they have the “Copilot PC” hardware requirement, because they’re using an NPU on the local machine.
searches
https://learn.microsoft.com/en-us/windows/ai/npu-devices/
Copilot+ PCs are a new class of Windows 11 hardware powered by a high-performance Neural Processing Unit (NPU) — a specialized computer chip for AI-intensive processes like real-time translations and image generation—that can perform more than 40 trillion operations per second (TOPS).
It’s not…terribly beefy. Like, I have a Framework Desktop with an APU and 128GB of memory that schlorps down 120W or something, substantially outdoes what you’re going to do on a laptop. And that in turn is weaker computationally than something like the big Nvidia hardware going into datacenters.
But it is doing local computation.
Isn’t that the whole shtick of the AI PCs no one wanted? Like, isn’t there some kind of non-GPU co-processor that runs the local models more efficiently than the CPU?
I don’t really want local LLMs but I won’t begrudge those who do. Still, I wouldn’t trust any proprietary system’s local LLMs to not feed back personal info for “product improvement” (which for AI is your data to train on).
NPU neural processing unit
I wouldn’t trust a local LLM solution from a large American company. Not saying that they would try to “pull a quick one”, but they are unreliable and corrupt.
Microsoft wants developers to have local access to models but end users are 100% corralled into OneDrive and Copilot. I’m not sympathetic to them at all.
If Microsoft cared about privacy then they wouldn’t have made windows practically spyware. Even if they install AI locally in the OS, it’s still proprietary software that constantly sends data back to the mothership, consuming your electricity and RAM to do so. Linux has so many options, there’s really no reason not to switch.
Small LLMs already exist for local self-hosting, and there are open-source options which won’t steal your data and turn you into a product.
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/
Bear in mind that the number of parameters your system can handle is limited by how much memory is available, and using a quantized version can increase the number of parameters you can handle with the same amount of memory.
Unless you have some really serious hardware, 24 billion parameters is probably the maximum that would be practical for self-hosting on a reasonable hobbyist set-up. But I’m no expert, so do some research and calculate for yourself what your system can handle.
Unless you have some really serious hardware, 24 billion parameters is probably the maximum that would be practical for self-hosting on a reasonable hobbyist set-up.
Eh…I don’t know if you’d call it “really serious hardware”, but when I picked up my 128GB Framework Desktop, it was $2k (without storage), and that box is often described as being aimed at the hobbyist AI market. That’s pricier than most video cards, but an AMD Radeon RX 7900 XTX GPU was north of $1k, an NVidia RTX 4090 was about $2k, and it looks like the NVidia RTX 5090 is presently something over $3k (and rising) on EBay, well over MSRP. None of those GPUs are dedicated hardware aimed at doing AI compute, just high-end cards aimed at playing games that people have used to do AI stuff on.
I think that the largest LLM I’ve run on the Framework Desktop was a 106 billion parameter GLM model at Q4_K_M quantization. It was certainly usable, and I wasn’t trying to squeeze as large a model as possible on the thing. I’m sure that one could run substantially-larger models.
EDIT: Also, some of the newer LLMs are MoE-based, and for those, it’s not necessarily unreasonable to offload expert layers to main memory. If a particular expert isn’t being used, it doesn’t need to live in VRAM. That relaxes some of the hardware requirements, from needing a ton of VRAM to just needing a fair bit of VRAM plus a ton of main memory.
See, you have more experience in the matter than I do, hence the caveat that I’m not an expert. Thanks for sharing your experience.
Then again, I’d consider 128GB of memory to be fairly serious hardware, but if that’s common among hobbyists then I stand corrected. I was operating on the assumption that 64GB of RAM is already a lot
All in all, 106 billion parameters on 128GB of memory with quantization doesn’t surprise me all that much. But again, I’m just going off of the vague notions I’ve gathered from reading about it.
The focus of my original comment was more on the fact that self-hosting is an option, I wasn’t trying to be too precise with the specs. My bad if it came off that way
If he wanted people to like it then he should have made it do things people want it to do.
It is the new metaverse.
Hell I’d almost settle for just “making it work”. No disclaimers, no bullshitting. Computers should be optimized and accurate. AI is neither.
Ai does work great, at some stuff. The problem is pushing it into places it doesn’t belong.
It’s a good grammar and spell check. It helps me get a lot of English looking more natural.
It’s also great for troubleshooting consumer electronics.
It’s far better at search than google.
Even then it can only help, not replace folks or complete tasks.
It only looks good in comparison to Google search because they trashed Google search.
Which of course, Google did just so you’d have to search more, so you’d see more ads.
Ai does work great, at some stuff. The problem is pushing it into places it doesn’t belong.
I can generally agree with this, but I think a lot of people overestimate where it DOES belong.
For example, you’ll see a lot of tech bros talking about how AI is great at replacing artists, but a bunch of artists who know their shit can show you every possible way this just isn’t as good as human-made works, but those same artists might say that AI is still incredibly good at programming… because they’re not programmers.
It’s a good grammar and spell check.
Totally. After all, it’s built on a similar foundation to existing spellcheck systems: predict the likely next word. It’s good as a thesaurus too. (e.g. “what’s that word for someone who’s full of themselves, self-centered, and boastful?” and it’ll spit out “egocentric”)
It’s also great for troubleshooting consumer electronics.
Only for very basic, common, or broad issues. LLMs generally sound very confident, and provide answers regardless of if there’s actually a strong source. Plus, they tend to ignore the context of where they source information from.
For example, if I ask it how to change X setting in a niche piece of software, it will often just make up an entire name for a setting or menu, because it just… has to say something that sounds right, since the previous text was “Absolutely! You can fix x by…” and it’s just predicting the most likely term, which isn’t going to be “wait, nevermind, sorry I don’t think that’s a setting that even exists!”, but a made up name instead. (this is one of the reasons why “thinking” versions of models perform better, because the internal dialogue can reasonably include a correction, retraction, or self-questioning)
It will pull from names and text of entirely different posts that happened to display on the page it scraped, make up words that never appeared on any page, or infer a meaning that doesn’t actually exist.
But if you have a more common question like “my computer is having x issues, what could this be?” it’ll probably give you a good broad list, and if you narrow it down to RAM issues, it’ll probably recommend you MemTest86.
It’s far better at search than google.
As someone else already mentioned, this is mostly just because Google deliberately made search worse. Other search engines that haven’t enshittified, like the one I use (Kagi), tend to give much better results than Google, without you needing to use AI features at all.
On that note though, there is actually an interesting trend where AI models tend to pick lower-ranked, less SEO-optimized pages as sources, but still tend to pick ones with better information on average. It’s quite interesting, though I’m no expert on that in particular and couldn’t really tell you why other than “it can probably interpret the context of a page better than an algorithm made to do it as quickly as possible, at scale, returning 30 results in 0.3 seconds, given all the extra computing power and time.”
Even then it can only help, not replace folks or complete tasks.
Agreed.
I find that people only think its good when using it for something they dont already know, so then they believe everything it says. Catch 22. When they use it for something they already know, its very easy to see how it lies and makes up shit because its a markov chain on steroids and is not impressive in any way. Those billions could have housed and fed every human in a starving country but instead we have the digital equivalent of funko pop minions.
I also find in daily life those who use it and brag about it are 95% of the time the most unintelligent people i know.
Note this doesnt apply to machine learning.
Fundamentally due to it’s design, LLMs are digital duct tape.
The entire history of computer science has been making compromises between efficient machine code and human readable language. LLM’s solve this in a beautifully janky way, like duct tape.
But it’s ultimately still a compromise, you’ll never get machine accuracy from an LLM because it’s sole purpose is to fulfill the “human readable” part of that deal. So it’s applications are revolutionary in the same way as “how did you put together this car engine with only duct tape?” kind of way.
We’ll have to agree to disagree. To go through your points, spell check I don’t find particularly impressive. That was solved previously without requiring the power demands of a small town. Grammer, maybe - but in my experience my “LLM powered” keyboard’s suggestions are still worse than old T9 input.
I’ve had no luck troubleshooting anything with AI. It’s often trained on old data, tries to instruct you to change settings that don’t exist, or dreams up controls that might appear on “similar” hardware. Sure you can perhaps infer a solution, maybe, but it’s rarely correct at first response. It’ll happily run you through steps that are inconsequential to fixing a problem.
Finally, it might be better than indexed search NOW - but mostly because LLMs wrecked that too. I used to be able to use a couple search operators and get directly to the information I needed - now search is reduced to shifting through slop SEO sites.
And it does all this half assing while using enough power to justify dedicated nuclear reactors. I cant help but feel we’ve regressed on so many fronts.
“We’re not seeing the numbers we want, so we’re putting copilot in everything to expand our active user base. Hell, rename everything copilot!”
Who needs separate apps when you can just tell copilot what you want and it can put the slop straight into your trough?
You could have every single piece of technology on the planet using AI and it would still falter, because HUMANS DON’T WANT AI! Time and time again it’s been shown that people don’t like this shit. You’re spending money that hasn’t been made, on ram that hasn’t been produced, to be installed in AI data centers that haven’t been built, to run AI farms that have zero interest from humans, to chase profits that will never come.
I would normally say “congratulations, you fell for it again”, except nobody is tricking you here. YOU are the one tricking yourself. Every expert has stated that CEOs everywhere report no actual benefit from their AI use. Tech experts everywhere report that customers don’t want AI in their toilet. Or their toaster. Or their TV. Or their cell phone.
So who is this for?
Butlerian Jihad here we come!
You mean you’re telling me that the technology that can’t even correctly predict my next word in my text messages, is untrustworthy at an even bigger scale? I remember when AI first came out and I talked about that, they went on and on about how they’re different.
Also, please don’t anyone forget that the CEOs of these corporations were firing and replacing their workers in proportion to the amount of trust they gave LLMs. Do not forget.
It’s simple, really.
You’ll save everything to the cloud. Your work really likes this, as they don’t have to maintain their own share drives anymore. All your files are on OneDrive.
Home users can no longer afford their own computing. The PC is dead. All the parts have been scooped up to build out massive AI data centers. All they can do is afford “dumb terminals”: cheap tablets and laptops that can’t do much on their own, but can stream cloud apps, videos, games exceedingly well. Most don’t care because they don’t want to be bothered with “all this IT stuff. God, those computer guys are annoying!”
LLMs are only actually “good” at one thing, analysis of text (including non-English languages and also computer file formats).
Microsoft and others will use their new trove of information, with their new tools of analysis, to get ahead of the market. Imagine what companies would pay for the “intelligence” of knowing what their competitors are thinking, as they type it out. Imagine what governments would pay for near instant knowledge of who disagrees with them. Imagine what advertisers might get for bids for when they can guarantee someone’s thought pattern will result in a successful sale.
And big tech will charge us all for the privilege, or we will get left behind and unable to participate in society. When Microsoft turns off the lights, you’re basically homeless and unable to work.
It needs to crash and burn before this can ever happen. It needs to be ended now.
You had me until:
Imagine what companies would pay for the “intelligence” of knowing what their competitors are thinking
Right there is the reason why it will fall apart once it hits critical mass.
The little guys might not have a choice but the big players will run away from the cloud if it means they lose their edge.
I kind of agree with you but from experience I know when businesses get way waaay too big, they kind of trip over themselves cause of all the tech debt and spaghetti code
Oh, I don’t think it’ll work. They’ve sold their own snake oil to themselves. It’s a modern day Mechanical Turk. They’re in love with Eliza.
LLMs are quite fast at summarizing large blobs of data though, and they’re so desperate to be first to market with this capability. They think they can become the all-seeing eye of Sauron (I mean…Palantir? Cmon), and they are salivating at the chance that it will work out for them.
But they will fuck up computing for everyone else to get the chance. Already have, in fact.
oooh, promise?
If nobody is using your product then you made a shit product and should scrap it before you lose any more money.
Follow me for more business tips.
but what about FORCING everyone to use the product whether they want to use it or not? I heard that if you’re powerful enough there’s no need to ask permission, “they let you do it.”
Follow me for more business tips.
I am so here for this.
Dear Rooty,
We promised investors adoption we could not achieve, and we had to rebrand our best product to create the illusion of adoption.
This leads to our question:
How can we make more of our product offering icons look like butt holes?
Thank you in advance for your wisdom!
Oh dear, what a shame, never mind
haha lol
🤞
He means the LLM boom. Their conflating LLMs with all of AI is nonsense.
There are plenty of other machine learning projects, even ones using Transformer-based neural networks), that are doing just fine because they are not built on top of ridiculous business models like ‘buy every bit of computer hardware and hope someone makes something to run on it’.
The faster this thing crashes, the faster all consumer electronics becomes cheaper.
I think the backlash to LLM is already hurting all AI projects and will be especially bad when that bubble pops.
Everyone’s going to think all AI, and AGI, projects are LLMs and scoff at them.
The magic that used to be tied to the term AI is dead by fraud.
Just one more GPU rack I swear, were totally right on the cusp of AGI. Just need you to lend me a couple hundred more million.
2025-2026 - Year of the Linux Desktop
As soon as we get AI toilet paper, then we’ll finally start making a profit!
Hey, that sounds great. Let me write it down. I’m going to be billionaire in no time.
I had a machine re-write your sentence into a business proposal and now everyone except me is doing coke in the bathroom
Is it a “you’re holding it wrong” moment? Or a “no, it’s the consumers who are wrong” moment?
We’ll try to keep shoving it down your throats anyway though.
-Tech companies as evidenced by how many new places it keeps cropping up












