An Exhaustive Test Program for the Legendary Höhrmann UTF-8 Decoder
“The Höhrmann Decoder, implemented as a deterministic finite state machine, needs only a handful of lines of C code and 364 bytes for a combined character class and state transition table.
[...]
hoehrmann-utf8-test.c exhaustively tests all possible inputs to the decoder, that makes 269492416 different byte sequences[^1], out of which 1112063[^2] are accepted.”
https://git.sr.ht/~slink/hoehrmann-utf8
http://bjoern.hoehrmann.de/utf-8/decoder/dfa/
#unicode #utf8
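For readers who want to see what "accepted" means in the post above, here is a minimal Python sketch of a strict UTF-8 decoder. It is not the Höhrmann table-driven DFA and does not reproduce its 364-byte table; it checks the same well-formedness rules (no overlong forms, no surrogates, nothing above U+10FFFF) with explicit range tests. The function name decode_utf8 is made up for this illustration.

```python
def decode_utf8(data: bytes) -> list[int]:
    """Strictly decode UTF-8 into code points; raise ValueError if ill-formed.

    Illustrative only: explicit range checks, not the Höhrmann DFA table.
    """
    out = []
    i, n = 0, len(data)
    while i < n:
        b0 = data[i]
        if b0 < 0x80:                      # ASCII, one byte
            out.append(b0)
            i += 1
            continue
        # Pick sequence length and the allowed range of the FIRST continuation
        # byte; the narrowed ranges rule out overlongs, surrogates, > U+10FFFF.
        if 0xC2 <= b0 <= 0xDF:
            need, cp, lo, hi = 1, b0 & 0x1F, 0x80, 0xBF
        elif 0xE0 <= b0 <= 0xEF:
            need, cp = 2, b0 & 0x0F
            lo = 0xA0 if b0 == 0xE0 else 0x80   # E0 80..9F would be overlong
            hi = 0x9F if b0 == 0xED else 0xBF   # ED A0..BF would be surrogates
        elif 0xF0 <= b0 <= 0xF4:
            need, cp = 3, b0 & 0x07
            lo = 0x90 if b0 == 0xF0 else 0x80   # F0 80..8F would be overlong
            hi = 0x8F if b0 == 0xF4 else 0xBF   # F4 90.. would exceed U+10FFFF
        else:
            raise ValueError(f"invalid lead byte 0x{b0:02X} at offset {i}")
        if i + need >= n:
            raise ValueError(f"truncated sequence at offset {i}")
        for k in range(1, need + 1):
            b = data[i + k]
            if not lo <= b <= hi:
                raise ValueError(f"invalid byte 0x{b:02X} at offset {i + k}")
            cp = (cp << 6) | (b & 0x3F)
            lo, hi = 0x80, 0xBF            # later continuation bytes: full range
        out.append(cp)
        i += need + 1
    return out
```

Calling decode_utf8("à€😀".encode("utf-8")) yields the three code points, while inputs such as b"\xc0\x80" (overlong) or b"\xed\xa0\x80" (surrogate) raise ValueError.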
@vermaden IBTD ... #Unicode has been around for long enough and the #UTF8 representation became the de-facto standard everywhere. The #FreeBSD #ports tree is not restricted to #ASCII, but anything outside ASCII needs to be UTF8 to ensure interoperability. What you see here is, well, a bug 🙈
#unicode #utf8 #freebsd #ports #ascii
The Dominant Anti-Pattern in Text UX's Online
new blog post by me
https://news.ycombinator.com/item?id=37101669
please upboat it on HN if you can
THANK YOU!
#Roguelike
#Roguelikes
#RoguelikeGames
#RoguelikeDev
#textgames
#textadventures
#TUI
#text
#plaintext
#ASCII
#Unicode
#UTF8
#UI
#UX
#UIAntiPatterns
#UXAntiPatterns
#GUIAntiPatterns
#WebAntiPatterns
#accessibility
#VisionImpaired
#VisionDisability
#glare
#highcontrast
#resizabletext
#fonts
#roguelike #roguelikes #roguelikegames #roguelikedev #textgames #textadventures #tui #text #plaintext #ascii #unicode #utf8 #ui #ux #uiantipatterns #uxantipatterns #guiantipatterns #webantipatterns #accessibility #visionimpaired #visiondisability #glare #highcontrast #resizabletext #fonts #curses #ncurses #terminal #terminals #shells #cli #clis
is there an ASCII/ANSI/UTF-8 art editor people would recommend?
#roguelike #roguelikes #roguelikedev #asciiart #ansiart #utf8 #tui #cogmind
#unicodesymboloftheday #unicode #unicodelove #utf8
@forumstandaardisatie
Welcome! Yes indeed, #OpenStandaarden 👌 Made possible in part by #ActivityPub and all the standards 🔌 it relies on https://activitypub.rocks/ But also #UTF8 for emoji 😉 I look forward to your updates here.
#utf8 #activitypub #openstandaarden
Is there a command-line method for converting UTF-8 values into Unicode values? #commandline #unicode #utf8 #formatconversion
#commandline #unicode #utf8 #formatconversion
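One possible answer to the question above, as a small sketch: Python's built-in codec can turn hex-encoded UTF-8 bytes into U+XXXX code point notation. The function name utf8_hex_to_codepoints is made up here, and the input format (hex digits, optionally separated by spaces) is an assumption about what "UTF-8 values" means in the question.

```python
def utf8_hex_to_codepoints(hex_bytes: str) -> list[str]:
    """Convert hex-encoded UTF-8 bytes (e.g. "e2 82 ac") to U+XXXX strings."""
    raw = bytes.fromhex(hex_bytes)       # fromhex ignores spaces between bytes
    text = raw.decode("utf-8")           # raises UnicodeDecodeError if invalid
    return [f"U+{ord(ch):04X}" for ch in text]
```

From a shell this could be wrapped in a one-off script or a python3 -c call; for example, the input "e2 82 ac" gives U+20AC, the euro sign.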
You might've heard of ASCII or UTF-8. These character encodings were built by very smart people.
I just built “UTF-21”, an impractical alternative that only a fool would use. Read about it (with a short Unicode crash course) here: https://evanhahn.com/utf-21/
#unicode #utf8 #ascii #characterencoding #programming
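Why 21 bits? The highest Unicode code point is U+10FFFF, which fits in 21 bits. The sketch below illustrates that idea by packing each code point into a fixed 21-bit field. It is not necessarily the format described in the linked post; the bit order, the zero padding to a byte boundary, and the names pack21 and unpack21 are all assumptions made for this example.

```python
def pack21(text: str) -> bytes:
    """Pack each code point into 21 bits, big-endian, zero-padded to bytes."""
    bits = 0
    for ch in text:
        bits = (bits << 21) | ord(ch)
    nbits = 21 * len(text)
    pad = (-nbits) % 8                   # padding bits to reach a whole byte
    return (bits << pad).to_bytes((nbits + pad) // 8, "big")


def unpack21(data: bytes, count: int) -> str:
    """Inverse of pack21; the caller must supply the number of code points."""
    pad = len(data) * 8 - 21 * count
    bits = int.from_bytes(data, "big") >> pad
    return "".join(
        chr((bits >> (21 * (count - 1 - i))) & 0x1FFFFF) for i in range(count)
    )
```

Three ASCII letters take 63 bits, so 8 bytes, where UTF-8 needs 3. That is one reason a fixed 21-bit scheme is impractical for most text.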
Generate a UTF8 QR code for connecting to a WiFi network, using the console
#Bash #consola #pass #passphrase #password #PasswordStore #QR #qrencode #terminal #tty #txt #UTF8 #WiFi #wifi2qr
#bash #consola #pass #passphrase #password #passwordstore #qr #qrencode #terminal #tty #txt #utf8 #wifi #wifi2qr
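A rough sketch of the string such a QR code usually carries. It assumes the widely used "WIFI:T:...;S:...;P:...;;" payload convention read by most phone cameras, with backslash escaping of special characters. The function name wifi_payload and the WPA default are illustrative and not taken from the linked post.

```python
def wifi_payload(ssid: str, password: str, auth: str = "WPA") -> str:
    """Build a WiFi QR payload string (assumed de facto WIFI: convention)."""

    def esc(value: str) -> str:
        # Backslash-escape characters that have meaning in the payload syntax.
        return "".join("\\" + c if c in '\\;,:"' else c for c in value)

    return f"WIFI:T:{auth};S:{esc(ssid)};P:{esc(password)};;"
```

The resulting string could then be piped to qrencode -t UTF8 to draw the code in the terminal, with the passphrase coming from a password manager such as pass rather than being typed on the command line.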
They should make #UTF8 the default and, if a battery or capacitor is too expensive for them, put a flash EEPROM on it, like the one for a BIOS, where that gets configured...
If need be, a row of DIP switches...
So far I have simply been able to buy standard printers myself!
You know what is so cool about Mastodon+fediverse? There is a setting to feature posts in ᐊᓂᔑᓈᐯᒧᐎᓐ!! Yeah, my native tongue (which I don't know yet). I think we need to set up a FNiverse! #firstnations #ᐊᓂᔑᓈᐯᒧᐎᓐ #IDN #UTF8
#firstnations #ᐊᓂᔑᓈᐯᒧᐎᓐ #idn #utf8
Is this a #custom #emoji or a #UTF8 #character?
Depending on your device / operating system, not all current UTF8 characters may be implemented.
#custom #emoji #utf8 #character
Well, I have now solved the issue (apart from one small detail). It came down to:
- installing IBus (# pacman -S ibus) and
- activating IBus ($ ibus start)
Other keyboard configurations can be added with the IBus Preferences application.
What I have not managed (various problems) is to get KDE to activate IBus before I log in, so I have to do it manually (a minor problem). After that, I have to restart Yakuake or it will not recognize accents; and I also have to remember to start Tmux with the command:
$ tmux -u
to force it to use UTF-8 correctly; otherwise it will also cause problems with accents (instead of «à», it writes «_»).
The solution is partial, then, but it already lets me work: all applications seem to recognize accents normally.
We'll keep investigating.
#endeavouros #plasma #kde #utf8 #accents
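A small diagnostic sketch related to the tmux -u step above. According to the tmux manual, -u forces UTF-8 output even if the first of LC_ALL, LC_CTYPE, or LANG that is set does not contain "UTF-8". The hypothetical helper below loosely approximates that check, so one can see whether the environment already declares UTF-8 or whether -u is likely needed. It is not tmux's actual code.

```python
import os


def locale_declares_utf8(env=None) -> bool:
    """True if the first set variable among LC_ALL, LC_CTYPE, LANG names UTF-8.

    Loose approximation of the check described in the tmux manual for -u.
    """
    if env is None:
        env = os.environ
    for name in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(name)
        if value:                         # the first non-empty variable decides
            lowered = value.lower()
            return "utf-8" in lowered or "utf8" in lowered
    return False
```

If this returns False inside the terminal emulator, setting a UTF-8 locale such as ca_ES.UTF-8 in the session that launches Yakuake may remove the need for tmux -u; that is a guess about this particular setup, not a verified fix.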