Cory Doctorow's linkblog · @pluralistic
46708 followers · 44437 posts · Server mamot.fr

Some "open AI" is much more open than the industry-dominating offerings. There's EleutherAI, a donor-supported nonprofit whose model comes with documentation and code, licensed Apache 2.0. There are also some smaller academic offerings: Vicuna (UCSD/CMU/Berkeley); Koala (Berkeley) and Alpaca (Stanford).

These are indeed more open (though Alpaca - which ran on a laptop - had to be withdrawn because it "hallucinated" so profusely).

40/

#eleutherai #apache2 #vicuna #koala #alpaca

Last updated 1 year ago

IT News · @itnewsbot
3096 followers · 256330 posts · Server schleuss.online

“A really big deal”: Dolly is a free, open source, ChatGPT-style AI model

On Wednesday, Databricks released... - arstechnica.com/?p=1931693

#ai #meta #llama #dolly #pythia #biz #finetuning #eleutherai #databricks #apachespark #textsynthesis #machinelearning #largelanguagemodels

Last updated 2 years ago

OpenBioML · @OpenBioML
1 followers · 4 posts · Server sigmoid.social

💻 We are ready to train with massive compute resources and state-of-the-art open source models from our partner community 4/5

#eleutherai

Last updated 2 years ago

Stella Biderman · @stellaathena
104 followers · 11 posts · Server sigmoid.social

There are some really good papers that have sought to make the best of the current situation, but we had the compute to do it the right way, and so we did.

arxiv.org/abs/2211.08411
arxiv.org/abs/2202.07646
arxiv.org/abs/2202.07206
arxiv.org/abs/2207.14251

We hope that this work will empower more people to work on questions in interpretability, especially the causal impact of training data on model behavior!

#eleutherai

Last updated 2 years ago

Stella Biderman · @stellaathena
103 followers · 8 posts · Server sigmoid.social

What do LLMs learn over the course of training? How do these patterns change as you scale? To help answer these questions, we are releasing Pythia, a suite of LLMs + checkpoints designed for research on interpretability and training dynamics!

The models have sizes ranging from 19M to 13B parameters, contain 143 intermediate checkpoints, and were trained on the same exact data in the same exact order.

github.com/EleutherAI/pythia

#ml #ai #nlproc #interpretability #eleutherai

Last updated 2 years ago
