Leshem Choshen · @LChoshen
965 followers · 224 posts · Server sigmoid.social

Few-shot learning almost reaches traditional machine translation

arxiv.org/abs/2302.01398

#enough2skim #nlproc #neuralempty

Last updated 2 years ago

Leshem Choshen · @LChoshen
948 followers · 184 posts · Server sigmoid.social

3 reasons for hallucinations started
only 2 prevailed

Finding how networks behave while hallucinating, they
filter hallucinations (with great success)

arxiv.org/abs/2301.07779

#nlproc #neuralempty #nlp #deepread

Last updated 2 years ago

Jindřich Libovický · @jlibovicky
17 followers · 13 posts · Server sigmoid.social

In my latest blog post jlibovicky.github.io/2023/01/1, I look back at a paper on character-level machine translation aclanthology.org/2022.findings that I finished over a year ago.

#neuralempty #nlproc

Last updated 2 years ago

The Data Therapist · @datatherapist
371 followers · 554 posts · Server mastodon.social

Bold statement (need to think about it more), especially when coming from a machine translation person.

I’d claim MT was no less revolutionary once it became pervasive in industry. But @marian_nmt seems to dismiss it now given ChatGPT

twitter.com/marian_nmt/status/

#nlp #nlproc #neuralempty #nmt

Last updated 2 years ago

Leshem Choshen · @LChoshen
756 followers · 112 posts · Server sigmoid.social

@ talk to me about
ColD Fusion & ibm.github.io/model-recycling/
BabyLM shared task
label-sleuth.org/
Enhancing decoders with syntax

And guided work (talk to them too)
Estimating quality with source only
Controlling structure in - neuron level
Details:

#conll #EMNLP #neuralempty

Last updated 2 years ago

Jindřich Libovický · @jlibovicky
3 followers · 1 posts · Server sigmoid.social

A brief overview 🗞️ of what I found most interesting on arXiv in November is now on my blog jlibovicky.github.io/2022/12/0 (just four papers this month 🤷‍♂️)

#nlproc #neuralempty #mtmlhighlights

Last updated 2 years ago