Benjamin Han · @BenjaminHan
286 followers · 477 posts · Server sigmoid.social

3/

- On average it achieves 64.33% accuracy on reasoning hence is not a reliable reasoner. It performs worse on induction compared to deduction/abduction. But it's good at commonsense, causal and analogical reasoning (screenshot 3).

CAVEAT that I didn't see addressed in the paper (or maybe impossible to address): the possibility that some of these test sets are included in the training set.

#NLG #nlp #nlu #NLGPU #nlproc #deeplearning #ai #paper

Last updated 3 years ago

Benjamin Han · @BenjaminHan
286 followers · 476 posts · Server sigmoid.social

2/

- Compared to many , ChatGPT outperforms almost all on zero-shot tests (except for Open-domain KGD), and a few (3) finetuned ones (screenshot 1).

- Focusing on , evals are broken into logical (deduction, induction and abduction), temporal, spatial, mathematical, , etc (screenshot 2).

#sota #LLMs #reasoning #commonsense #NLG #nlp #nlu #NLGPU #nlproc #deeplearning #ai #paper

Last updated 3 years ago

Benjamin Han · @BenjaminHan
286 followers · 475 posts · Server sigmoid.social

1/ A comprehensive -- , , -- of :

Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, and Pascale Fung. 2023. A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity. arXiv [cs.CL]. arxiv.org/abs/2302.04023

#multitask #multilingual #Multimodal #evaluation #chatgpt #NLG #nlp #nlu #NLGPU #nlproc #deeplearning #ai #paper

Last updated 3 years ago

Benjamin Han · @BenjaminHan
150 followers · 170 posts · Server sigmoid.social

(2/2)

We are interested not just in the construction () and serving aspects of KG, but also in how to apply to any downstream AI tasks to make them better: all stages of , and , and many user-facing features too!

Love to learn more from the community about the latest development in all related areas!

#KGC #kg #NLGPU #inference #reasoning #PaperThread #industry

Last updated 3 years ago

Benjamin Han · @BenjaminHan
150 followers · 170 posts · Server sigmoid.social
Benjamin Han · @BenjaminHan
150 followers · 170 posts · Server sigmoid.social

My : currently working on and ( + + ) at Knowledge Platform. Nice to see you all!

linkedin.com/in/benjaminhan/

#Introduction #knowledge #KnowledgeGraphs #NLGPU #NLG #nlp #nlu #apple

Last updated 3 years ago