mnl mnl mnl mnl mnl · @mnl
853 followers · 1016 posts · Server hachyderm.io

Wrote (let gpt write) a tiny tool that scans my repos for prompt generators and allows me to quickly generate custom prompts for new apps. For example “api docs for x, 3 example files, 2 doc pages”. Really useful for 50 lines of code.

#LLMs

Last updated 1 year ago

jdelahanty · @science_is_hard
86 followers · 93 posts · Server social.coop

Oh geez, the LLMs are going to be digesting their own text in publications even sooner.

nature.com/articles/d41586-023

#LLMs

Last updated 1 year ago

Tim Kellogg · @kellogh
943 followers · 3608 posts · Server hachyderm.io

well i guess there’s Falcon 180b if you have an extra **400 GB** of GPU memory

#falcon #LLMs #ai

Last updated 1 year ago

Giorgio Robino · @solyarisoftware
62 followers · 79 posts · Server sigmoid.social

Question for prompt engineers (anyone):

What are the most important metrics for measuring LLM completion?

I start with some basic "system" variables:

1. Latency (or response time)
measures the elapsed time of a completion

2. Token throughput = tokens/latency

3. Prompt/Completion tokens ratio

4. There are many more cross ratios that may be useful, involving LLM settings (max_tokens, temperature, etc.)

The final goal is to define a list of common variables for evaluating foundation LLMs.
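
A minimal sketch of how the first three metrics above could be computed. The names `CompletionStats` and `measure` are hypothetical, and "tokens" in the throughput formula is assumed here to mean completion tokens, which the post does not specify.

```python
import time
from dataclasses import dataclass


@dataclass
class CompletionStats:
    """Basic "system" variables for one completion (hypothetical container)."""
    prompt_tokens: int
    completion_tokens: int
    latency_s: float  # elapsed wall-clock time of the completion, in seconds

    @property
    def throughput(self) -> float:
        # Assumption: throughput counts completion tokens only.
        return self.completion_tokens / self.latency_s

    @property
    def prompt_completion_ratio(self) -> float:
        return self.prompt_tokens / self.completion_tokens


def measure(complete, prompt):
    """Call any completion function and return (result, latency in seconds)."""
    start = time.perf_counter()
    result = complete(prompt)
    return result, time.perf_counter() - start


# Stand-in for a real model call, only here to exercise the timing helper.
reply, latency = measure(lambda p: p.upper(), "hello")
stats = CompletionStats(prompt_tokens=120, completion_tokens=60, latency_s=1.5)
print(stats.throughput, stats.prompt_completion_ratio)
```

Token counts would normally come from the provider's usage data or a tokenizer; here they are just passed in by hand.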

#LLMs

Last updated 1 year ago

Osma Suominen · @osma
300 followers · 971 posts · Server sigmoid.social

Berlin, here I come!

It's now four years since the last in-person SWIB conference. SWIB23 may well be the last one, so let's try to cherish the opportunity to meet in one place!

I will be an instructor for the Annif tutorial on Monday, moderate a session & hopefully give a Lightning talk on Tuesday. Looking forward to great talks & ofc dinner and coffee breaks, which are often the most interesting!

Feel free to stop by if you want to talk about Skosmos, LLMs, metadata, Fennica, etc.

#swib #swib23 #annif #skosmos #LLMs #metadata #fennica

Last updated 1 year ago

Ben Waber · @bwaber
704 followers · 2741 posts · Server hci.social

Next was an amazing pair of talks by @armedchile (an ingenious method for combining high- and low-resource language data to build LLMs that significantly improves translation performance) and Orevaoghene Ahia (subword tokenization effects on LLM costs/performance in different languages) at Indaba 2023. I'll be thinking about both talks for a long time - they have profound implications for the design of tech and business models. Highly recommend youtube.com/watch?v=EVi9qB_1Cc (3/11)

#LLMs #indaba2023 #generativeAI #ai

Last updated 1 year ago

Tim Kellogg · @kellogh
943 followers · 3597 posts · Server hachyderm.io

Another one: why not just use gzip? This paper uses tiny compression algorithms that run great even on embedded devices, and the performance comes close to where LLMs are at. That would be a massive game changer from the current state hendrik-erz.de/post/why-gzip-j
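
A rough sketch of the general idea, assuming the approach is nearest-neighbour classification over a gzip-based normalized compression distance (NCD). The post itself does not spell out the method, and the helper names and sample data here are made up for illustration.

```python
import gzip
from collections import Counter


def clen(text: str) -> int:
    """Length in bytes of the gzip-compressed text."""
    return len(gzip.compress(text.encode("utf-8")))


def ncd(a: str, b: str) -> float:
    """Normalized compression distance: smaller means more shared structure."""
    ca, cb = clen(a), clen(b)
    cab = clen(a + " " + b)
    return (cab - min(ca, cb)) / max(ca, cb)


def classify(query: str, labeled: list[tuple[str, str]], k: int = 1) -> str:
    """Majority label among the k training texts closest to the query."""
    ranked = sorted(labeled, key=lambda item: ncd(query, item[0]))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data; real use needs far more and far longer examples.
training = [
    ("the striker scored a late goal in the cup final", "sports"),
    ("the midfielder was injured during the league match", "sports"),
    ("the central bank raised interest rates again", "finance"),
    ("shares fell as the bank reported quarterly losses", "finance"),
]
print(classify("the goalkeeper saved a penalty in the final", training, k=1))
```

On strings this short the gzip header overhead dominates, so the output is noisy; the sketch only shows the mechanics.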

#LLMs

Last updated 1 year ago

Tim Kellogg · @kellogh
943 followers · 3596 posts · Server hachyderm.io

Now that neural networks have had repeated big successes over the last 15 years, we are starting to look for better ways to implement them. Some new ones for me:

Groq notes that NNs are bandwidth-bound from memory to GPU. They built an LPU specifically designed for LLMs
groq.com/

A wild one — exchange the silicon for moving parts, good old Newtonian physics. Dramatic drop in power utilization and maps to most NN architectures (h/t @FMarquardtGroup)

idw-online.de/de/news820323

#neuralnetworks #groq #LLMs

Last updated 1 year ago

mnl mnl mnl mnl mnl · @mnl
849 followers · 964 posts · Server hachyderm.io

Furthermore, LLMs are absolute monsters at slicing and refactoring legacy code, thus improving the longevity of already long-living software that might otherwise be painfully phased out.

2/

#LLMs

Last updated 1 year ago

mnl mnl mnl mnl mnl · @mnl
849 followers · 963 posts · Server hachyderm.io

summarizing a train of thought from a conversation with @promovicz yesterday, which helped me formulate some of my ideas.

I think LLMs allow us to write *less code*. Because they make it easy to generate the boilerplate that ensures longevity and sustainability of software projects (documentation, clean commits, unit tests, tooling), they allow projects to actually live on.

1/

#LLMs

Last updated 1 year ago

mnl mnl mnl mnl mnl · @mnl
848 followers · 938 posts · Server hachyderm.io

I don’t think Mathematica is a great target language, nor does prompting in the chat-enabled Mathematica seem very good, and it churns through tokens like the king of Spain. My third attempt to get something going yesterday was just as much of a mess (trying to get Hilbert curves computed and displayed with @defn) as my first sessions (doing some geospatial computation / dataset computation).

Not fun.

#LLMs

Last updated 1 year ago

Nicole Hennig · @nic221
353 followers · 1792 posts · Server techhub.social

Anthropic \ Introducing Claude Pro anthropic.com/index/claude-pro ($20/mo alternative to ChatGPT Plus)

#AI #LLMs #claude

Last updated 1 year ago

Wendy M. Grossman · @wendyg
1284 followers · 713 posts · Server mastodon.xyz

This week's net.wars, "Small data", summarizes the talk I gave with Jon Crowcroft at this year's Gikii, arguing that large language models will ultimately prove to be a distraction: netwars.pelicancrossing.net/20

#gikii #LLMs #ai #netwars

Last updated 1 year ago

Tim Kellogg · @kellogh
942 followers · 3559 posts · Server hachyderm.io

a while back i recall there being some tool for exploring a database of embeddings that lets you visualize and locate duplicates, etc. anyone know what it's called?

#llm #LLMs #ai #llama2

Last updated 1 year ago

mnl mnl mnl mnl mnl · @mnl
847 followers · 901 posts · Server hachyderm.io

I literally have thousands of conversations like these now, from designing video game physics to transaction protocols to monad design patterns for music sequencing to numerous DSLs for everything that strikes my fancy (zine layout? comic book storyboard generation? worldbuilding CMS?).

#LLMs

Last updated 1 year ago

mnl mnl mnl mnl mnl · @mnl
848 followers · 880 posts · Server hachyderm.io

chatgpt trick when applied to programming: don't ask "does X do Y" or "is X Y"; instead, ask it to "give me a test program to show that X does Y".

So for example, if you wonder about the size of a struct in memory, ask it to write the program to compute or measure struct sizes.
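
As an illustration of the kind of test program one might ask for, here is a small sketch that measures a struct's size instead of guessing it. The `Sample` layout and the `describe` helper are made up for the example; it uses Python's `ctypes`, which follows the platform's C alignment rules.

```python
import ctypes


class Sample(ctypes.Structure):
    # Hypothetical struct: a 1-byte flag followed by a 4-byte integer.
    _fields_ = [
        ("flag", ctypes.c_uint8),
        ("value", ctypes.c_uint32),
    ]


def describe(struct_type):
    """Return the total size and per-field byte offsets of a ctypes struct."""
    offsets = {
        name: getattr(struct_type, name).offset
        for name, *_ in struct_type._fields_
    }
    return {"size": ctypes.sizeof(struct_type), "offsets": offsets}


info = describe(Sample)
raw = sum(ctypes.sizeof(ftype) for _, ftype in Sample._fields_)
print("sum of field sizes:", raw)
print("actual struct size:", info["size"])
print("field offsets:", info["offsets"])
```

Any gap between the sum of the field sizes and the reported size is alignment padding, which is exactly the kind of detail a yes/no question tends to get wrong.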

always think meta.

#LLMs

Last updated 1 year ago

Daniel Hoelzgen · @dhoelzgen
30 followers · 7 posts · Server ruhr.social

For a medical & caretaking project, I experimented with combining symbolic logic with LLMs to mitigate their tendency toward nondeterministic behavior and hallucinations. It still leaves a lot of work to be done, but it's a promising approach for situations requiring higher reliability.

medium.com/9elements/using-sym

#logic #LLMs #hallucinations #ai #artificialintelligence #llm #chatgpt

Last updated 1 year ago

Jeroen SZ 🦣 · @JeroenSH
254 followers · 222 posts · Server lingo.lol
dragfyre · @dragfyre
839 followers · 4858 posts · Server mastodon.sandwich.net

I don't know what I expected.

#LLMs #ai #salami #stephenhawking

Last updated 1 year ago

Nicole Hennig · @nic221
352 followers · 1778 posts · Server techhub.social

Timeline History of Large Language Models - Voicebot.ai voicebot.ai/large-language-mod (nice!)

#AI #LLMs #history

Last updated 1 year ago