Giorgio Sidari · @ideaferace
36 followers · 410 posts · Server mastodon.uno

I made a first test of Llama 2 13B, in a 6-bits quantized version (thanks, )

It's good-to-excellent in various tasks: summarization, translation (I tried EN, IT, FR), NER with semantic filters.

AND it runs on a CPU-only installation on an Intel, at decent speed. 👏

huggingface.co/localmodels/Lla

#ggml #LLM #webui

Last updated 1 year ago

Jar2Eau :cursed_verified: · @jar2eau
2 followers · 18 posts · Server masto.ai

Probably one of the best model I tested so far. Really good for , more logical than -vicuna. Can't wait for a 8K . :cat_typing:

huggingface.co/TheBloke/MythoL

#ai #roleplay #wizard #context #koboldcpp #ggml

Last updated 1 year ago

Dr James Ravenscroft · @jamesravey
324 followers · 934 posts · Server fosstodon.org

Just did a bunch of merges of upstream repo and managed to get the StarCoder and WizardCoder running in - there are definitely some opportunities to accelerate it to make it more useful.

#ggml #turbopilot

Last updated 1 year ago