New Submissions to TMLR · @tmlrsub
198 followers · 689 posts · Server sigmoid.social

One-Round Active Learning through Data Utility Learning and Proxy Models

openreview.net/forum?id=8HQCOM

#labeled #labeling #annotators

Last updated 1 year ago

beSpacific · @bespacific
893 followers · 1703 posts · Server newsie.social

The secret to making AI chatbots sound smart and spew less toxic nonsense is to use a technique called reinforcement learning from human feedback, which uses input from people to improve the model’s answers. It relies on a small army of human data annotators who evaluate whether a string of text makes sense and sounds fluent and natural. They decide whether a response should be kept in the AI model’s database or removed. technologyreview.com/2023/06/1

#aichatbots #smart #spew #toxic #humanfeedback #human #data #annotators

Last updated 1 year ago

New Submissions to TMLR · @tmlrsub
161 followers · 376 posts · Server sigmoid.social

Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios

openreview.net/forum?id=dXnccp

#labeling #sampling #annotators

Last updated 2 years ago