Giulio · @giuliohome
19 followers · 345 posts · Server mastodon.world

On top of tokens and embeddings, are transformers and self-attention the secrets behind ChatGPT? Interesting that they are simple, low-level parallelism primitives, independent from compilers and pipeline models.

twitter.com/ItakGol/status/165

#transformers #selfattention #chatgpt #tokens #embeddings
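
For the curious, a minimal sketch of the primitive in question, assuming standard scaled dot-product self-attention (the NumPy below is illustrative, not from the linked thread). The whole operation is a handful of dense matrix multiplies plus a softmax, which is why it parallelizes so naturally without any special compiler or pipeline support:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of embeddings.

    X:  (seq_len, d_model) token embeddings
    Wq, Wk, Wv: (d_model, d_k) learned projection matrices
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv            # three independent matmuls
    scores = Q @ K.T / np.sqrt(K.shape[-1])     # all-pairs similarity, one matmul
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V                          # weighted mix of value vectors

# toy usage: 4 tokens, 8-dim embeddings
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
out = self_attention(X, *(rng.normal(size=(8, 8)) for _ in range(3)))
print(out.shape)  # (4, 8)
```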

Last updated 2 years ago

Lynd Bacon · @lyndbacon
1 follower · 6 posts · Server masto.ai

Maybe "attention" as used in common transformer models isn't all you need, or or need at all. Microsoft researchers describe "focal modulation networks" that aid interpretation of image processing:

arxiv.org/abs/2203.11926

#transformer #computervision #selfattention
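
A heavily simplified sketch of the idea, assuming the three stages the paper describes (hierarchical contextualization, gated aggregation, elementwise modulation of the query). The paper's learned depthwise convolutions are swapped here for plain average pooling over growing windows just to keep the sketch short, so shapes and details are illustrative only:

```python
import numpy as np

def focal_modulation(X, Wq, Wg, Wm, levels=2):
    """Simplified focal modulation over a 1-D token sequence.

    X: (seq_len, d) features; Wq, Wm: (d, d); Wg: (d, levels + 1).
    Average pooling stands in for the paper's depthwise convolutions.
    """
    n, d = X.shape
    q = X @ Wq                                  # query projection
    gates = 1 / (1 + np.exp(-(X @ Wg)))         # sigmoid gate per focal level

    ctx_sum = np.zeros_like(X)
    ctx = X
    for l in range(levels):                     # hierarchical contextualization
        k = 2 * l + 3                           # growing receptive field
        ctx = np.stack([
            ctx[max(0, i - k // 2): i + k // 2 + 1].mean(axis=0)
            for i in range(n)
        ])
        ctx_sum += gates[:, l:l + 1] * ctx      # gated aggregation
    ctx_sum += gates[:, -1:] * X.mean(axis=0)   # global context level

    m = ctx_sum @ Wm                            # modulator projection
    return q * m                                # elementwise modulation of query

# toy usage: 6 tokens, 8-dim features
rng = np.random.default_rng(1)
X = rng.normal(size=(6, 8))
out = focal_modulation(X, rng.normal(size=(8, 8)),
                       rng.normal(size=(8, 3)), rng.normal(size=(8, 8)))
print(out.shape)  # (6, 8)
```

Unlike attention's all-pairs score matrix, each token ends up with a single aggregated modulator, which appears to be the map the interpretability claim refers to.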

Last updated 2 years ago

DavΞ MacDonald (admin) · @dave
383 followers · 38 posts · Server mastodon.solar