all 62 comments

[–]heyoniteglo 52 points53 points  (5 children)

This is a good reminder for all the new people showing up and maybe even a reminder to some of us who have been around a while. Sometimes old faithful is all you need. Sometimes it's all you need to fall back on. Thanks for your thoughts.

[–]utf16 12 points13 points  (4 children)

I was listening to a new AI engineer talk about how he was having difficulty getting ChatGPT to call a tool consistently at specific times. I then taught him the wonders of cron, and he was amazed.

Sometimes, all you need are simple tools.
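For anyone in the same boat, the whole thing fits in one crontab entry. A minimal sketch (the script path is a made-up example; point it at whatever actually calls your tool):

```shell
# Schedule a (hypothetical) tool-runner script with plain cron.
# Field order: minute hour day-of-month month day-of-week command.
ENTRY='0 9 * * * /usr/local/bin/run_agent_tool.sh'   # fires every day at 09:00

# Install it non-interactively, preserving any existing jobs:
( crontab -l 2>/dev/null; echo "$ENTRY" ) | crontab -

# Confirm it's there:
crontab -l | grep run_agent_tool.sh
```

No agent framework, no polling loop, no scheduler service to babysit.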

[–]beerpancakes1923 3 points4 points  (1 child)

This can’t be real 🤦🏻‍♂️

[–]IpppyCaccy 3 points4 points  (0 children)

We all have our blind spots. Also, sometimes you're so focused on a problem that you can't see the simple solution. This is why explaining it to the Teddy bear often helps.

Another effective technique is to explain your code to someone else. This will often cause you to explain the bug to yourself. Sometimes it takes no more than a few sentences, followed by an embarrassed “Never mind, I see what’s wrong. Sorry to bother you.” This works remarkably well; you can even use non-programmers as listeners. One university computer center kept a teddy bear near the help desk. Students with mysterious bugs were required to explain them to the bear before they could speak to a human counselor. --- Brian Kernighan and Rob Pike, in The Practice of Programming

[–]primordiaI_gloop 3 points4 points  (1 child)

Had a similar interaction just the other day. A guy was stressing over the best way to consistently crawl for new content on a blog—with an RSS feed and its feed link prominently displayed, complete with a nice, large icon at the top of the site. It struck me how some of the newer “devs,” armed with No/Low code tools and LangChain, could easily get confused by this. But hey, more power to them, right? If they can build what they need for their personal use case without any dev experience, I have nothing against that.

Though I couldn’t help but wonder why they didn’t ask GPT for one of the many many ways to achieve these types of goals.

...It’s almost as if people are forgetting that automation, data scraping, cloud functions, process batching, etc., were billion-dollar industries long before LLMs came into the picture.
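To underline the point, the entire "crawler" for a site with an RSS feed can be a handful of standard Unix tools. A rough sketch (the feed URL and file names are placeholders):

```shell
# Poll an RSS feed for new entries with curl, grep, sed, and comm.
# (https://example.com/feed.xml is a placeholder; use the blog's real feed.)
touch seen.txt
curl -s https://example.com/feed.xml \
  | grep -o '<link>[^<]*</link>' \
  | sed 's/<[^>]*>//g' \
  | sort > latest.txt

# Lines in latest.txt but not in seen.txt are new posts:
comm -13 seen.txt latest.txt

mv latest.txt seen.txt
```

Drop that in a cron job and you have a "consistent crawler" without a single line of LangChain.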

[–]ActiveAvailable2782 0 points1 point  (0 children)

Same as how some younger people can type on a screen but not on a physical keyboard.

[–]jollizee 58 points59 points  (3 children)

You left out what I personally consider the most important. Owning the model means you can tinker with it at will, i.e. finetune however you want on anything you want. Privacy is secondary to me.

OpenAI has limited finetuning, as does Gemini. Claude has none. (We're not counting the fact that you don't own your finetune since it runs on their servers and could go poof any time.)

I haven't had the bandwidth to try finetuning yet, but that's something I really want to get into... maybe if more companies like together.ai come up with easy finetuning-for-dummies platforms. I think you can do up to 8x7B MoE finetuning there.

[–]SomeOddCodeGuy[S] 24 points25 points  (2 children)

Oh man, that's a really good point. I can't believe I forgot that.

Yea, finetuning is especially useful for people with automated workflows, or perhaps someone who wants to automate some of their own writing (finetuning on their style).

The ownership also plays in when you don't want to finetune someone else's thing in your style/voice. I'm really not keen on training some website on how to talk and write like me =D

[–]ainz-sama619 13 points14 points  (1 child)

Fine-tuning is probably the biggest feature of open-source imo. Most people don't care about privacy that much, and privacy arguments have never turned off anybody from using a product. Having control over the model is what makes it exciting.

[–]SiEgE-F1 4 points5 points  (0 children)

Some people are not PC-literate enough to understand why the lack of privacy here is bad, or to see that they've been "dumped into the open sea" and that nothing they do will save them. Not only can they not protect themselves even if they knew where the punches were coming from, they are simply unable to see the attack vectors at all. They understand that sharing your bank account password with your cloud password manager is a bad idea, but they're left with nothing but hopes and prayers, closing their eyes to the fact that some bored guy might read through the database once or twice and "note a neat and easy password" to use a few years later, after he gets fired. All for the sake of convenience.

[–]elilev3 23 points24 points  (2 children)

Yeah. New people should realize how much more hopeless the open-source scene felt as recently as 2022. The best option back then was GPT-NeoX... which was hopelessly incoherent, and it still required insane hardware to run despite its low parameter count (there were no 2-bit/4-bit quants back then).

We went from that, to GPT-4 level performance on local hardware. If you showed Llama 3 8b to someone in 2022 they'd probably shit themselves.

[–]Merosian 1 point2 points  (1 child)

Let's not exaggerate; Llama 3 8B is nowhere near GPT-4 in terms of performance. It's the 70B model that's getting compared, and the unquantized version at that. It does not run on local hardware unless you have a lot of money to throw around.

[–]Due-Memory-6957 2 points3 points  (0 children)

I don't think they're comparing Llama 3 8b to GPT-4. GPT-4 was launched in 2023.

[–]Ylsid 7 points8 points  (0 children)

Yes, but there are definitely cases where open-source AI outperforms commercial AI.

[–]jerieljan 4 points5 points  (0 children)

Perfectly said. The only thing I'll add to these is that local models are free for you to use as you see fit as long as your hardware permits you to. You don't have to deal with token / mtok usage, budgeting between models, etc. Just let the model ramble on and on, and let your creativity flow from it.

[–]Eastwindy123 3 points4 points  (0 children)

Also if you're a startup or a business. Using a fine-tuned open source model at scale is still wayyy cheaper than using openai. And there's no reason not to enjoy both. AI is winning in general. There's so many options and you can pick and choose what you like.

[–]lywyu 4 points5 points  (1 child)

Great post, and you're absolutely right. But an "open source isn't going anywhere" mentality can lead to people taking things for granted. The open-source community needs involvement from people now more than ever, especially in AI-related research and development. While Meta is doing great work, it's ultimately still a public company that needs to post profits for its shareholders. I don't expect this free ride to continue for long.

[–]SomeOddCodeGuy[S] 1 point2 points  (0 children)

You're very right about that. I probably should be careful not to push complacency too much.

I don't expect this free ride to continue for long.

Very true. There are times I wish we could at least give donations to them. I don't use open-source models because they're free; I use open source because it's of higher value to me than closed. I'm glad it brings value as-is to companies, but I'd happily throw money at a company for a model, even as a donation.

[–]Red_Redditor_Reddit 20 points21 points  (13 children)

I don't mean to spoil the mood here, but why exactly are you writing like there's been some great defeat here? I'm being serious.

Maybe I'm out of the loop, but all I've seen recently is that GPT-4o thing. The voice thing would work with Llama if it weren't a bunch of Unix pipes all jerry-rigged together. I'm not really sure how the vision works, but I'm sure the output can be repeatedly analyzed or triggered by the LLM.

Like what am I missing here? I was way more impressed by the text to video demo.

[–]a_beautiful_rhind 9 points10 points  (4 children)

There are vision models around. There will be more.

A ton of people came over from singularity and started dunking on us because our <100B models don't have the gimmicks of 200-300B+ corporate models.

So we should just give up because we can't have low latency two way voice; even though most people type to AI rather than talking to it like some dingus.

[–]skrshawk 1 point2 points  (3 children)

Which, given how some of us use our LLMs, is probably a good thing, because another set of people would be trying to see how badly they could get the AI at the Wendy's drive-thru to behave.

As much as we responsible users of AI dislike guardrails and would much rather accept responsibility for our choices, some people will not do that and will make a public nuisance of themselves instead. In that scenario, the guardrails mean not letting them hold up the line.

Or letting kids interact with it. I remember Midnight Miqu got evaluated a while ago and randomly inserted "you horny devil" into a business memo. Funny as it was to us, that kind of thing needs to not happen when it's a five year old talking to it on their iPad.

[–]a_beautiful_rhind 2 points3 points  (2 children)

Kid-friendly LLMs need to be separate. Character.ai tried an "all ages" LLM and it's wrecking their service.

make a public nuisance of themselves

A lot of those people do that with no LLM at all. There are tons of videos of people pranking others, getting violent, etc. in public places. Using some TTS seems like the most benign form of it compared to starting fistfights with the people behind the counter.

randomly inserted "you horny devil" into a business memo

That's the bane of all LLMs. Check the outputs or you'll be sorry. If it's not horny devils, it's hallucinations. Some lawyer got caught citing nonexistent cases.

[–]skrshawk 4 points5 points  (1 child)

Oh, I agree. LLMs for specific sensitive use cases should be different, with the general-use model assuming an adult using it in good faith. Gardening tools don't need to be redesigned to prevent malicious use even though I can whack someone over the head with a shovel.

No disagreement either that people misbehaved in public before LLMs and that this isn't going to change, or that people do ignorant things with technology all the time. I know an older couple who've fallen for the MS tech-support scam three times already and lost over $10k to it. It's easy to say such people should be given a brick phone and have their internet access cut off, but when your doctor's office won't return phone calls and makes people go through their online portal, that's not very realistic.

We can get pretty biased around here because we're on the bleeding edge of tech, especially when we try to forget just how stupid people have always been.

[–]a_beautiful_rhind 0 points1 point  (0 children)

makes people go through their online portal,

I hate the appification of the world and fight it as much as possible. You shouldn't need a smartphone for basic life things. At the base of it, what if you lose it or it breaks?

I enjoy LLMs immensely, but don't want to see them and other models being misused for surveillance or manipulation. That I think is the real danger and not some nebulous AGI who will decide to take over the world.

The saddest thing for me is not that old people fall for scams but that the younger generation lacks computer literacy; they literally have no excuse. Their refusal to learn has negative consequences for the rest of us when ignorance is the majority.

[–]one-joule 3 points4 points  (1 child)

The voice thing would work with llama if it wasn't a bunch of Unix pipes all Jerry rigged together.

Why would it? It's not like you can get GPT-4o levels of conversational latency by tacking on STT and TTS models. You need a multi-modal model that directly understands and emits speech. That requires new training data for the new modality at minimum, as well as optimizing the design to drop latency to almost nothing to reach conversational levels end-to-end. And out of the 250-350ms budget for feeling conversational, you're going to lose 50ms at minimum just shuffling audio around.
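The arithmetic in that budget, spelled out as a quick sketch (the specific numbers come from the comment above, not a spec):

```shell
# Back-of-the-envelope budget for "conversational" voice latency.
budget_ms=300      # midpoint of the 250-350 ms window for feeling conversational
audio_io_ms=50     # minimum lost just shuffling audio around
left=$(( budget_ms - audio_io_ms ))
echo "left for STT + inference + TTS, end to end: ${left} ms"
```

That remainder has to cover every stage of a pipelined STT -> LLM -> TTS setup, which is why a natively multimodal model is the more plausible route.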

[–]smallfried 0 points1 point  (0 children)

You're right, we won't reach those very low latencies without a new base model. But anything below about 2 seconds latency is fine with me, which is doable right now. Reminds me of calling long distance in the past.

[–]AlanCarrOnline 2 points3 points  (2 children)

No great defeat but it's like that Raiders of the Lost Ark thing, where we're getting real good, really really good at the whole sword-fighting thing, and then that motherfucker over there pulled a revolver... and ka-plooey?

So while I'm one of the first to post "Where gguf?", I'm not posting "Where .revolver?" or "Where .9mm?" 'cos I know there ain't gonna be any for a long time.

That could cause a case (or many cases) of the sads?

I just ordered a $3K PC for local AI, which hasn't even arrived yet and I'm already getting people asking "So will your new PC do that talking thing, like my phone?"

Grrr....

[–]TwilightWinterEVE (koboldcpp) 1 point2 points  (1 child)

With an alltalk finetune, your PC can do that talking thing, like their phone. It won't be at the same level because open source TTS just isn't, but you can see how far it's come in the past 6 months so it won't be long.

[–]AlanCarrOnline 0 points1 point  (0 children)

I'm gonna have to list down all the things I wanna install on the beast, and that sounds like maybe one of them?

I have some vague idea what fine-tuning is but not really...

[–]SomeOddCodeGuy[S] 0 points1 point  (0 children)

When I get bored I browse through LocalLlama, and I've noticed a lot of new faces seem a little deflated about open source not being at that level, or are trying to understand why it isn't, etc.

There are a lot of new folks in the sub that are still finding their feet and looking for their personal value of open source. And depending on why they are here, I can absolutely get that. I think Llama3 really put this place on public radar.

But there's no defeat here: that's the whole point. Comparing Open Source to Proprietary is like comparing a pickup to a semi. The semi hauling a huge trailer isn't a defeat for the pickup; the pickup wasn't ever meant to do that. But it'll fit nice and snug into parking decks that the Semi never will, and it can tow a lot of other stuff. Open Source and Proprietary are a bit similar in that regard.

If folks come in expecting to download something that has even 80% of the capability of current proprietary models, they are in for a bad time. But if they come in looking for something stable, secure, and completely private, even if it's a good bit smaller, does less out of the box, and requires more tinkering to get it close? Well, that's what this sub's all about.

[–]MrVodnik 2 points3 points  (0 children)

It can be cheaper. There is no throttling (the famous "you've reached your daily limit", even for paid users on GPT-4). You can edit the model's replies, which allows a quick jailbreak of any model, or just a slight tweak to the conversation when needed. You can use it as a backend for your personal project without fear of being cut off from it for whatever reason (like an FB account ban, or Google killing another project).

And more, depending on your use case. If you need to be convinced about open-source / local AI, then you don't know enough about it yet.
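On the backend point: many local servers (llama.cpp's llama-server, Ollama, and others) expose an OpenAI-compatible HTTP API, so your project just talks to localhost instead of a vendor. A sketch, where the port and model name are assumptions for illustration:

```shell
# Query a local model through an OpenAI-compatible endpoint.
# (Assumes something like llama-server listening on localhost:8080.)
curl -s http://localhost:8080/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
        "model": "local-model",
        "messages": [{"role": "user", "content": "Hello"}]
      }'
```

Because the request shape matches the cloud APIs, swapping providers later is a one-line URL change rather than a rewrite.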

[–]windozeFanboi 2 points3 points  (0 children)

I don't know, man. Llama 3 is great at 70B, and Gemma 2, perhaps getting close at 27B, is not too shabby, but the promise of Phi 14B is unmatched in today's landscape...

If Phi 14B delivers according to its (preview) stats, then it's going to do at 14B what Gemma 2 does at 27B, which is close to Llama 3 70B...

And it's not that far-fetched to say that if a 14B Phi-3 can do so much just for text, we might as well get an incredible multimodal Phi-4o at 20B...

Or a Bitnet Phi-4o at 40B...

All of these might be a struggle for most people to run in 2024, but in 2025 it will be better, and in 2026 they will be a breeze.

[–]SiEgE-F1 2 points3 points  (0 children)

Compliance: you can find local models that won't immediately lecture you about animal cruelty when you ask how to kill a Python process. Sometimes you need an answer, not a lecture.

The whole "talking down to you" thing is the worst part. Just drop the connection or block the request. Period. Don't waste my token limits.
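For the record, the literal answer the cloud models dance around is one line of shell (my_script.py is a stand-in name):

```shell
# Find and stop a Python process by script name.
pgrep -af my_script.py      # list matching processes first, so you kill the right one
pkill -f my_script.py       # send SIGTERM to every match
pkill -9 -f my_script.py    # escalate to SIGKILL only if it refuses to die
```

`-f` matches against the full command line, which is what lets you target the script name rather than just `python3`.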

It's always available. There's no maintenance downtime or global outages.

And not just that. The ability of local AI to work in an off-grid environment, with no internet, no "cloud supercomputers", no big company's supervision, and no limit on the number of requests, is just TOO GOOD. People really, really underestimate that.

It's like having a luxury, full-leather, super quiet, super fast car available at the price of a second-hand, roughed-up Volkswagen, or even cheaper, except the car would refuse to go off-road or out of the city because your subscription doesn't support that, and would revoke itself if you tried.

AND we have yet to figure out the best use cases for LLMs, and their actual value. Their being available off-grid should become the very thing that helps us do so.

[–]AlanCarrOnline 5 points6 points  (0 children)

Totally agree with everything, and just feel you missed an important point that I'd like to add, if I may?

That being PLEASE HELP AND WELCOME THE NOOBS.

In fairness, that's what your post is about, but it can be stressed more: this new field is full of jargon, abbreviations, and concepts entirely new to most people. People asking about the hardware they need or how to navigate GitHub need help, not downvotes.

The more people embracing locally-run AI the better, for many, many reasons, from companies wanting the karma to consumer hardware development.

Noobs are friends, not food.

[–]danigoncalves (Llama 3) 1 point2 points  (0 children)

Nice post; I can only subscribe to what's written there. If we share and help each other, great things could happen, and who knows, maybe in the future it's OpenAI who looks at us and sees some features of their "preview" 🙂

[–]SikinAyylmao 1 point2 points  (2 children)

I prefer open models; however, the compute share you get from closed companies is worth it in its own right. Depending on your setup, closed models can be between 2x and 100x faster than your open model simply because of the compute share.

[–]SomeOddCodeGuy[S] 0 points1 point  (1 child)

Oh, I agree. Even though I'm staunchly pro-open-source, I've kept a ChatGPT sub this whole time simply because there are some technical questions that my local models just couldn't answer properly. Realistically, I always will keep one (not necessarily OpenAI, but some proprietary model).

I just treat their models a bit more carefully, only telling it things I'd be fine to post openly on reddit.

[–]TwilightWinterEVE (koboldcpp) 1 point2 points  (0 children)

I'm the same, I keep a Claude sub even though I prefer open source, because there are some things where I really need the accuracy and speed of Claude Opus vs the Q2 70B I can run locally... but like you, I don't like to tell it anything I wouldn't post on Reddit.

[–]Next_Program90 1 point2 points  (0 children)

Very well written. I feel the same way and it's good that you took the time to properly put it into words for everyone who needs to read them.

[–]hashmiabrar1 1 point2 points  (0 children)

Can you guide us toward making local LLMs?

[–]crazyenterpz 1 point2 points  (0 children)

OpenAI could train its model on a bazillion parameters, but it will never be good enough to use in our day-to-day work. For example, it will never know what Lydia from the warehouse ordered last week, especially when Jamal, our vendor manager, has contracted new prices. Yes, this data is in some big and dumb inventory management system, which is painful to use.

Only a homegrown RAG-based system will know that. And for that purpose, smaller Llama or Mistral models are good enough.
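The "homegrown RAG" doesn't even have to be fancy. A sketch of the shape of it, using plain grep over a hypothetical orders.csv export (file, columns, and names are all made up for illustration):

```shell
# Minimal retrieval-augmented prompt: pull matching rows out of a data
# export and paste them into the prompt ahead of the question.
question="What did Lydia from the warehouse order last week?"
context=$(grep -i 'lydia' orders.csv)
printf 'Context:\n%s\n\nQuestion: %s\n' "$context" "$question"
# Pipe the resulting prompt into your local Llama/Mistral endpoint of choice.
```

Real setups swap grep for embedding search, but the pattern is the same: retrieve your private rows, then let a small local model do the reading.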

[–]mikebrave 1 point2 points  (1 child)

Initially I thought this was going to be a list of model recommendations for specific tasks "this one is good at labelling art, this one is good at python code, this one at C# code, this one for math" etc.

[–]SomeOddCodeGuy[S] 0 points1 point  (0 children)

lol I bet that was disappointing. I can't offer much there, but I can give this for coding which I've been using =D

https://prollm.toqan.ai/leaderboard

Also, someone posted this for creative writing

https://www.reddit.com/r/LocalLLaMA/comments/1csj9w8/the_llm_creativity_benchmark_new_leader_4x_faster/

The short answer atm looks like it's "WizardLM-2-8x22b for both"

EDIT: And Deepseek 67b if you use Excel lol

[–]multiverse_fan 3 points4 points  (1 child)

I designed a crotch fan. It was... revolutionary. Thanks, LocalLLaMA 😄

[–]one-joule 5 points6 points  (0 children)

Does it have an off-center weight as part of the rotating mass by any chance?

[–]ab2377 (Llama 8B) 3 points4 points  (0 children)

💯 pin this post damnit.

[–]Lemgon-Ultimate 1 point2 points  (1 child)

There's nothing more dystopian than the thought of a single entity owning almost all AI capability, and that's not going to happen. I'm a bit scared because OpenAI recently stated their antipathy toward open source and their view that AI should be in the hands of a few corporations, which is really evil. I can only hope they change their stance and accept open source. Other than that, I view the new GPT-4o more as a preview of what's possible and what's coming to open source next year.

[–]TheMissingPremise 1 point2 points  (0 children)

I can only hope they change their stance on this and accept open source.

I'll give you another option: abandon that hope and push for regulations that preserve open source (or at least don't create a moat for OpenAI's and other AI companies' investors).

[–]New-Database-7703 0 points1 point  (2 children)

What makes open source better? I get using it locally, which is nice (on my $2500 PC), but other than that, what are the benefits?

[–]beerpancakes1923 1 point2 points  (1 child)

He just explained it

[–]smallfried 0 points1 point  (0 children)

But why male models?

[–]obsoletesatellite -1 points0 points  (0 children)

Open source is cool, but free software is better.

https://www.youtube.com/watch?v=fKUwfFcrVjU