Leshem Choshen · @LChoshen
1059 followers · 326 posts · Server sigmoid.social

Predictions throughout training, across hyperparams and architectures, are yet again shown to lie on

a small manifold

which means models learn their classification outputs similarly
arxiv.org/abs/2305.01604
Mao ... @pratikac

#machinelearning #enough2skim

Last updated 2 years ago

Leshem Choshen · @LChoshen
965 followers · 224 posts · Server sigmoid.social

Few-shot learning almost matches traditional machine translation

arxiv.org/abs/2302.01398

#enough2skim #nlproc #neuralempty

Last updated 3 years ago

Leshem Choshen · @LChoshen
949 followers · 193 posts · Server sigmoid.social

20 questions can now be played by computers
You probably all know @akinator_team@twitter.com, which can guess who you are thinking of

arxiv.org/pdf/2301.08718.pdf
proposes the other role:
the model picks a character and answers yes or no
(basically, QA over wiki + tweaks)

#enough2skim

Last updated 3 years ago