Representations and Computations in Transformers that Support Generalization on Structured Tasks
Yuxuan Li, James McClelland
Action editor: Stefan Lee.
#attention #learns #representations
When Less is More: Simplifying Inputs Aids Neural Network Understanding
#simplicity #distractors #learns
FairGrad: Fairness Aware Gradient Descent
Gaurav Maheshwari, Michaël Perrot
Action editor: Novi Quadrianto.
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
#learns #reinforcement #exploration
Simplifying and Understanding State Space Models with Diagonal Linear RNNs
Two-Level Actor-Critic Using Multiple Teachers
#learns #reinforcement #teachers
#AI Is a Lot of Work. As the technology becomes #ubiquitous, a vast #tasker #underclass is emerging and not going anywhere. #AI #learns by finding #patterns in enormous quantities of #data, but first that data has to be sorted, #tagged by people, a vast #workforce mostly hidden behind machines. It’s difficult and #repetitive work. A several-second blip of footage took eight hours to annotate, for which a college graduate in #Nairobi was paid about $10. https://www.theverge.com/features/23764584/ai-artificial-intelligence-data-notation-labor-scale-surge-remotasks-openai-chatbots?tpcc=NL_Marketing
#ai #ubiquitous #tasker #underclass #learns #patterns #data #tagged #workforce #repetitive #nairobi
Reinforcement Teaching
Calarina Muslimani, Alex Lewandowski, Dale Schuurmans, Matthew E. Taylor, Jun Luo
Action editor: Marcello Restelli.
#learns #reinforcement #learnable
DCP: Learning Accelerator Dataflow for Neural Network via Propagation
GraphPNAS: Learning Probabilistic Graph Generators for Neural Architecture Search
Lightweight Learner for Shared Knowledge Lifelong Learning
Yunhao Ge, Yuecheng Li, Di Wu et al.
Action editor: Edward Grefenstette.
Do Vision-Language Pretrained Models Learn Composable Primitive Concepts?
Tian Yun, Usha Bhalla, Ellie Pavlick, Chen Sun
Action editor: Zhe Gan.
#concepts #learns #Recognition
FL Games: A federated learning framework for distribution shifts
Lightweight Learner for Shared Knowledge Lifelong Learning
L-SVRG and L-Katyusha with Adaptive Sampling
Boxin Zhao, Boxiang Lyu, mladen kolar
SLM: End-to-end Feature Selection via Sparse Learnable Masks
Recurrent networks, hidden states and beliefs in partially observable environments
Gaspard Lambrechts, Adrien Bolland, Damien Ernst
#reinforcement #learns #recurrent
Do Vision-Language Pretrained Models Learn Composable Primitive Concepts?
#concepts #learns #Recognition
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Han Wang, Archit Sakhadeo, Adam M White et al.
#hyperparameters #hyperparameter #learns