Leshem Choshen · @LChoshen
1000 followers · 258 posts · Server sigmoid.social

You know what?
I will stop sharing any LLM "news" if they don't share with me first (models or code)



Thanks delip rao for inspiration
And the new vision and language that did open unlike XXXX-e
twitter.com/DrJimFan/status/16

#thereoridontcare #scientivism #usharefirst #machinelearning #cv #nlproc #nlp

Last updated 2 years ago

Leshem Choshen · @LChoshen
970 followers · 245 posts · Server sigmoid.social

So often we are reminded that good work goes unnoticed
I share others' papers to change that
What else could we do?
What mechanisms better allow propagation by value rather than by fame?
Is there something we can do to make science better?
blog.samaltman.com/you-and-you

#scientivism #ScienceMastodon #PR #nlproc #machinelearning #cv

Last updated 3 years ago

Leshem Choshen · @LChoshen
938 followers · 174 posts · Server sigmoid.social

A surprising take on why we should open LLMs:
otherwise empirical research would suffocate and
rule-based (nativist) would return

Not sure I am buying it or even that it is dreadful, but more the reason to share and hear opinions
arxiv.org/abs/2301.05272
Patrick Perrine

#llm #nlp #nlproc #machinelearning #ml #scientivism

Last updated 3 years ago

Leshem Choshen · @LChoshen
770 followers · 146 posts · Server sigmoid.social

I can't understand how this paper is so overlooked

Human annotation was a dreadful thing to me all my PhD, costly, cumbersome, requires my constant supervision
this is a game changer (and its not even mine...)
But generalized reranking get's double the PR...

#scientivism

Last updated 3 years ago

Leshem Choshen · @LChoshen
753 followers · 95 posts · Server sigmoid.social

We want to pretrain🤞
Instead we finetune🚮😔
Could we collaborate?🤗

ColD Fusion:
🔄Recycle finetuning to multitask
➡️evolve pretrained models forever

On 35 datasets
+2% improvement over RoBERTa
+7% in few shot settings
🧵

#nlproc #machinlearning #nlp #ml #modelrecyclying #CollaborativeAI #scientivism #pretrain

Last updated 3 years ago

Leshem Choshen · @LChoshen
614 followers · 85 posts · Server sigmoid.social

🔖Reviewing has so many faults📖
Finally, there is a dataset of reviews, edits and everything else!

5 venues 5K papers 11K reviews
Enjoy!

arxiv.org/abs/2211.06651
Nils Dycke, Ilia Kuznetsov, Iryna Gurevych

#nlproc #review #cv #machinelearning #scientivism

Last updated 3 years ago

Leshem Choshen · @LChoshen
605 followers · 82 posts · Server sigmoid.social

Are findings as good as ACL?

years since the first findings papers were introduced
since chris manning & ani nenkova called for a yearly analysis
since they were first done

Who's game for the yearly analysis?

twitter.com/chrmanning/status/

For earlier analysis and code (old, not on :mastodondance: , next year links from here?)

twitter.com/gneubig/status/145
twitter.com/ryandcotterell/sta
twitter.com/wilkeraziz/status/
twitter.com/sanxing_chen/statu

#nlproc #findings #scientivism #ACL

Last updated 3 years ago

Leshem Choshen · @LChoshen
578 followers · 66 posts · Server sigmoid.social

What do we know about using a fine-tuned model rather than the pretrained
They are sometimes much better, but what else?

A story of great hypotheses and their rejections

The story of a field
Survey 🧵

#scientivism

Last updated 3 years ago

wwydmanski · @wwydmanski
20 followers · 192 posts · Server qoto.org

Happy to be a part of this!
---
RT @LChoshen
Data augmentation? Look no further.

Framework of 100+ "transformations" (augmentations\paraphrasing functions\filters)
Many types:emojis, linguistic... see Fig
Extendable!
A vast effort, constructed by almost a hundred authors!
arxiv.org/abs/2112.02721

twitter.com/LChoshen/status/14

#scientivism

Last updated 4 years ago