FedSearch - Federated network search engine

FedSearch

Ric Raftis :obsidian: 🇺🇦 · @ricraftis

152 followers · 337 posts · Server pkm.social

Conclusions are the models are varying significantly over time. Continuous monitory required. Perhaps a website publishing regular benchmarking tests would be useful. #AI #ArtificialIntelligence #AIResearch #BenchmarkingTests https://arxiv.org/pdf/2307.09009.pdf
5/5

#ai #artificialintelligence #airesearch #benchmarkingtests

Last updated 2 years ago

Original post

Ric Raftis :obsidian: 🇺🇦 · @ricraftis

152 followers · 337 posts · Server pkm.social

New research shows accuracy of GPT 3.5 and GPT 4 declining! The paper evaluates performance of both models at March 2023 and June 2023.#AI #ArtificialIntelligence #AIResearch #BenchmarkingTests https://arxiv.org/pdf/2307.09009.pdf
1/5

#ai #artificialintelligence #airesearch #benchmarkingtests

Last updated 2 years ago

Original post