Leshem Choshen · @LChoshen
756 followers · 114 posts · Server sigmoid.social

What neurons determine agreement in multilingual LLMs?

but some answers:
Across languages: 2 distinct ways to encode syntax
Share neurons, not info

Autoregressive models have dedicated syntax neurons (in MLMs they are just spread across)

@amuuueller@twitter.com yu xia @tallinzen@twitter.com

#deepread #conlllivetweet2022

Last updated 3 years ago

Leshem Choshen · @LChoshen
757 followers · 113 posts · Server sigmoid.social

I will be at @emnlpmeeting@twitter.com & @conll_conf@twitter.com, say hi

I will be tweeting under #emnlp2022livetweet or #emnlp2022 or #conll or #conlllivetweet2022

If it spams you, mute it (or wait a week 😉)
help.twitter.com/en/using-twit

#emnlp2022livetweet #emnlp2022 #conll #conlllivetweet2022

Last updated 3 years ago