Mr.Trunk · @mrtrunk
13 followers · 20860 posts · Server dromedary.seedoubleyou.me
Oliver (Hansi Flick-Ultra) · @oliver
953 followers · 9051 posts · Server die-partei.social

Nanu, KI lernt Fehlerkultur?

#ki #gpt4 #bing

Last updated 1 year ago

Mr.Trunk · @mrtrunk
12 followers · 20330 posts · Server dromedary.seedoubleyou.me
Brittany Trang · @brittanytrang
548 followers · 313 posts · Server newsie.social

Apparently Cedars-Sinai is sending their patients , and it's increasing the # of patients filling out colorectal screenings??

This and more in the new tracker, developed by STAT's experts Casey Ross, Katie Palmer, and @jaspar

statnews.com/2023/09/05/genera

#deepfakes #cancer #chatgpt #artificialintelligence #gpt4 #generativeAI #health #healthcare #healthtech #medicine

Last updated 1 year ago

Brittany Trang · @brittanytrang
548 followers · 313 posts · Server newsie.social

If you, like me, are confused when or announce a new generative partnership with a hospital

(because you're not sure if it's different than the last one they announced)

@STAT has got you!

Introducing the STAT+ Generative AI Tracker:

statnews.com/feature/stat-plus

#epic #microsoft #ai #chatgpt #artificialintelligence #gpt4 #generativeAI #health #healthcare #healthtech #medicine

Last updated 1 year ago

Carl T. Bergstrom · @ct_bergstrom
46764 followers · 2809 posts · Server fediscience.org

People keep telling me that is amazing for proofreading text and improving scientific writing.

I just gave a section of a grant proposal and it made 11 suggestions, none of which were worth keeping (often adding or removing a comma, or repeating a preposition in a list).

More interestedly, a number of its suggestions were identical to my originals.

#generativeAI #llm #gpt4 #chatgpt

Last updated 1 year ago

White House Press Office · @press
77 followers · 378 posts · Server whitehouse.org
5h15h · @shish
104 followers · 687 posts · Server techhub.social

This is a game that tests your ability to predict ("forecast") how well will perform at various types of questions nicholas.carlini.com/writing/l

#gpt4 #chatgpt #gpt #AI #genai #openai

Last updated 1 year ago

Benjamin Han · @BenjaminHan
472 followers · 1294 posts · Server sigmoid.social

7/ Their result shows that even achieves only 23.7% hit@1 on average, even when it scores up to 50% precision@1 using the earlier proposed LAMA benchmark (screenshot). Interestingly, smaller models like BERT can outperform GPT4 on bidirectional, compositional, and ambiguity benchmarks, indicating bigger is not necessarily better.

#gpt4 #KnowledgeGraphs #generativeAI #LLMs #nlp #nlproc #paper

Last updated 1 year ago

5h15h · @shish
104 followers · 682 posts · Server techhub.social
Rob · @Feynman
116 followers · 83 posts · Server mastodon.sdf.org
Benjamin Han · @BenjaminHan
460 followers · 1262 posts · Server sigmoid.social

1/ How robust and reliable is the code generated by , especially for real-world software development? A recent work [2] constructed a new benchmark based on [1] to evaluate if the generated code uses API correctly. Four popular -- .5, , , and -- are tested, and under zero-shot scored 62.09% misuse rate. Even with one-shot relevant examples the misuse rate of is 49.17%.

#LLMs #gpt3 #gpt4 #llama2 #vicuna #generativeAI #papers #nlp #nlproc #softwaredevelopment

Last updated 1 year ago

jfk · @jfkimmes
103 followers · 92 posts · Server social.tinycyber.space

Finetuned code-llama beats GPT-4. They have no moat, indeed.

phind.com/blog/code-llama-beat

#llama #codellama #gpt4 #ai #chatgpt

Last updated 1 year ago

Talya (she/her) · @Yuvalne
331 followers · 8080 posts · Server 433.world

So apparently there's a paper going around claiming is biased to the left.
The paper used politicalcompass.org to gague the model's political standings. You know, that website where you need to practically be a fascist to *not* get a "libleft" result.

If anything, the fact that so many models scored higher than half on the authoritarian score is the thing to be worried about.

(For a good critique of the compass, see youtube.com/watch?v=_oNkJgpkW4 )

#gpt4 #ai #politicalcompass #chatgpt #llm

Last updated 1 year ago

5h15h · @shish
103 followers · 663 posts · Server techhub.social
Bornach · @bornach
153 followers · 1951 posts · Server masto.ai

@freakazoid @Rairii @ifixcoinops

And in spite of all the over passing various exams at 90th percentile, it isn't much better at solving math problems not in its training data
youtu.be/Fi1e-B60cok

#aihype #gpt4

Last updated 1 year ago

Hendrik Haverkamp · @hav_hendrik
966 followers · 784 posts · Server bildung.social

Sehr spannende Untersuchung zur Fähigkeit von , Texte von Studierenden automatisch zu korrigieren. Dürfte auch für Lehrkräfte interessant sein.

arxiv.org/abs/2308.02575

#gpt4 #deepwrite #FediLZ

Last updated 1 year ago

Abhinav Tushar · @lepisma
3 followers · 13 posts · Server mathstodon.xyz
Mihai Lazarescu · @mtl
1 followers · 156 posts · Server techhub.social

really impressed with its ability to grasp , often performing as well as or even better than humans in various scenarios. Early trials of seem to show even more promising outcomes.
marktechpost.com/2023/08/17/th

#gpt3 #abstract #patterns #gpt4

Last updated 1 year ago

Rene Schulte · @rschu
593 followers · 281 posts · Server arvr.social

LLMs simulate human life in a town!

Researchers used GPT-3.5 agents to act like humans, spread info, cooperate & even attend parties.
TrueSkill scores show potential for game dev, social media & more.
Can LLMs truly mimic us? 🤖🤔

Code 👉 github.com/joonspk-research/ge

#ai #llm #gpt3 #gpt4 #agents

Last updated 1 year ago