Nils Goroll · @slink
69 followers · 574 posts · Server fosstodon.org

An Exhaustive Test Program for the Legendary Höhrmann UTF-8 Decoder

“The Höhrmann Decoder, implemented as a deterministic finite state machine, needs only a handful of lines of C code and 364 bytes for a combined character class and state transition table.

[...]

hoehrmann-utf8-test.c exhaustively tests all possible inputs to the decoder, that makes 269492416 different byte sequences[^1], out of which 1112063[^2] are accepted.”
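
The quoted total of 269492416 inputs can be reproduced by simple counting. The sketch below assumes (this is a reading of the number, not something the quote states) that each sequence's length is fixed by its first byte, with 0xF0 through 0xFF all counted as four-byte leads. It also computes the number of Unicode scalar values for comparison. That figure is one more than the quoted count of accepted sequences, and the quote's own footnote presumably accounts for the difference.

```python
# Sketch: reproduce the quoted input count by lead-byte class.
# Assumption (not from the quote): the first byte alone decides the sequence
# length, and 0xF0..0xFF are all treated as four-byte leads.

one_byte = 0x80 - 0x00                 # 0x00..0x7F: ASCII, length 1
stray = 0xC0 - 0x80                    # 0x80..0xBF: continuation byte in lead position
two_byte = (0xE0 - 0xC0) * 256         # 0xC0..0xDF plus 1 trailing byte
three_byte = (0xF0 - 0xE0) * 256**2    # 0xE0..0xEF plus 2 trailing bytes
four_byte = (0x100 - 0xF0) * 256**3    # 0xF0..0xFF plus 3 trailing bytes

total_inputs = one_byte + stray + two_byte + three_byte + four_byte

# Unicode scalar values: all code points minus the surrogate range.
scalar_values = 0x110000 - (0xE000 - 0xD800)

print(total_inputs)    # 269492416
print(scalar_values)   # 1112064
```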

git.sr.ht/~slink/hoehrmann-utf
bjoern.hoehrmann.de/utf-8/deco

#unicode #utf8

Last updated 1 year ago

Felix Palmen 📯 · @zirias
60 followers · 237 posts · Server techhub.social

@vermaden IBTD ... Unicode has been around for long enough, and the UTF-8 representation became the de facto standard everywhere. The FreeBSD ports tree is not restricted to ASCII, but anything outside ASCII needs to be UTF-8 to ensure interoperability. What you see here is, well, a bug 🙈

#unicode #utf8 #freebsd #ports #ascii

Last updated 1 year ago

Nils Goroll · @slink
69 followers · 573 posts · Server fosstodon.org

Serious question to Unicode people: is there a current consensus about how to decode UTF-8 correctly in the various error scenarios?
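
This does not settle the question of consensus, but as a point of comparison, here is a minimal sketch of the three error policies that Python's built-in decoder offers for one malformed input. The sample bytes and the helper name `decode_strict` are illustrative choices, not anything from the post.

```python
# Sketch: three ways Python's UTF-8 decoder can treat a malformed byte.
data = b"a\xffb"  # 0xFF never appears in well-formed UTF-8


def decode_strict(raw: bytes):
    """Return the decoded text, or None if the input is not valid UTF-8."""
    try:
        return raw.decode("utf-8")  # errors="strict" is the default
    except UnicodeDecodeError:
        return None


strict = decode_strict(data)                        # reject the whole input
replaced = data.decode("utf-8", errors="replace")   # substitute U+FFFD
ignored = data.decode("utf-8", errors="ignore")     # silently drop the byte

print(strict, repr(replaced), repr(ignored))
```

Rejecting or substituting U+FFFD keeps the damage visible. Silently dropping bytes changes the text without a trace, which is why it is usually the least safe of the three.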

#unicode #utf8

Last updated 1 year ago

Chuso Pérez · @chuso
15 followers · 316 posts · Server fosstodon.org

Something makes me think this is an automated email.

#unicode #utf8

Last updated 1 year ago

Carolina Koehn · @carolina
245 followers · 733 posts · Server norden.social

UTF-8 encoding really isn't easy, though.

#utf8

Last updated 1 year ago

synlogic · @synlogic
179 followers · 2498 posts · Server toot.io

Is there an ASCII/ANSI/UTF-8 art editor people would recommend?

#roguelike #roguelikes #roguelikedev #asciiart #ansiart #utf8 #tui #cogmind

Last updated 1 year ago

Nico Rikken · @nicorikken
244 followers · 593 posts · Server mastodon.nl

@forumstandaardisatie
Welcome! Absolutely 👌 Made possible in part by ActivityPub and all the standards 🔌 it relies on: activitypub.rocks/ But also UTF-8 for emoji 😉 I look forward to your updates here.

#utf8 #activitypub #openstandaarden

Last updated 1 year ago

Alain MICHEL 🤓 · @alainmi11
3915 followers · 3740 posts · Server mamot.fr

Ha ha!
A big fat UTF-8 fail from Pôle Emploi.

They visibly have some problems displaying accented characters!
It would be nice if the site could be fixed.
😂
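
Display failures of this kind usually come from bytes written in one encoding and read in another. The sketch below shows both directions, assuming Latin-1 as the other encoding. The post does not say which encodings the site actually mixes up, so that choice is a guess made for illustration.

```python
# Sketch: how accented characters get mangled by an encoding mismatch.
# Assumption: the mismatch is between Latin-1 and UTF-8 (not stated in the post).

# Direction 1: Latin-1 bytes read as UTF-8.
latin1_bytes = "être".encode("latin-1")             # b'\xeatre'
misread_as_utf8 = latin1_bytes.decode("utf-8", errors="replace")

# Direction 2: UTF-8 bytes read as Latin-1.
utf8_bytes = "é".encode("utf-8")                    # b'\xc3\xa9'
misread_as_latin1 = utf8_bytes.decode("latin-1")

print(misread_as_utf8)     # replacement character followed by "tre"
print(misread_as_latin1)   # two characters instead of one
```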

#utf8 #fail

Last updated 1 year ago

Jan ☕🎼🎹☁️ · @jan
396 followers · 3296 posts · Server fedi.kcore.org

The Skoda Superb dash does not support UTF-8 properly.

Waze displays text with country flags, the dash just shows two squares with the text.
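
One possible explanation for the "two squares" (a guess, not something the post confirms) is that a country flag is not a single character. It is a pair of regional indicator symbols, and a display whose font lacks them shows one placeholder box per code point. The German flag below is only an example.

```python
# Sketch: a flag emoji is two code points, each a regional indicator symbol.
import unicodedata

flag = "\U0001F1E9\U0001F1EA"  # regional indicators D + E, rendered as a German flag

code_points = [f"U+{ord(ch):04X}" for ch in flag]
names = [unicodedata.name(ch) for ch in flag]
utf8_length = len(flag.encode("utf-8"))

print(len(flag), code_points)  # two code points
print(names)
print(utf8_length)             # four bytes per code point in UTF-8
```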

#skoda #superb #utf8

Last updated 1 year ago

AskUbuntu · @askubuntu
131 followers · 1866 posts · Server ubuntu.social

Is there a command-line method for converting UTF-8 values into Unicode values?

askubuntu.com/q/1473060/612
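
As one possible approach (not taken from the linked question), here is a minimal Python sketch that turns UTF-8 bytes written as hex into Unicode code points. The function name `utf8_hex_to_codepoints` is a hypothetical helper introduced for illustration.

```python
# Sketch: convert UTF-8 byte values (given as hex) into Unicode code points.

def utf8_hex_to_codepoints(hex_bytes: str) -> list[str]:
    """Decode hex-encoded UTF-8 bytes and return their code points as U+XXXX strings."""
    text = bytes.fromhex(hex_bytes).decode("utf-8")  # raises on malformed input
    return [f"U+{ord(ch):04X}" for ch in text]


print(utf8_hex_to_codepoints("e282ac"))    # euro sign
print(utf8_hex_to_codepoints("f09f9880"))  # an emoji outside the Basic Multilingual Plane
```

From a shell, the same idea fits in a one-liner such as `python3 -c 'print(hex(ord(bytes.fromhex("e282ac").decode("utf-8"))))'`.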

#commandline #unicode #utf8 #formatconversion

Last updated 1 year ago

Evan Hahn · @EvanHahn
859 followers · 226 posts · Server bigshoulders.city

You might've heard of ASCII or UTF-8. These character encodings were built by very smart people.

I just built “UTF-21”, an impractical alternative that only a fool would use. Read about it (with a short Unicode crash course) here: evanhahn.com/utf-21/

#unicode #utf8 #ascii #characterencoding #programming

Last updated 1 year ago

Kevin Karhan :verified: · @kkarhan
1061 followers · 66483 posts · Server mstdn.social

@manawyrm @LaF0rge EXACTLY THAT!

They should make UTF-8 the standard and, for all I care, if a battery or capacitor is too expensive for them, put a flash EEPROM on it, like for a BIOS, where that gets configured...

If need be, a row of DIP switches...

So far I myself have simply been able to buy standard printers!

#utf8

Last updated 2 years ago

Shane · @shane
48 followers · 2 posts · Server social.futurnumerique.com

You know what is so cool about Mastodon+fediverse? There is a setting to feature posts in ᐊᓂᔑᓈᐯᒧᐎᓐ!! Yeah, my native tongue (which I don't know yet). I think we need to set up a FNiverse!

#firstnations #ᐊᓂᔑᓈᐯᒧᐎᓐ #idn #utf8

Last updated 2 years ago

Nordnick :verified: · @nick
1141 followers · 9588 posts · Server norden.social

@z428 @dentaku @helpers

Is this a custom emoji or a UTF-8 character?

Depending on your device / operating system, not all current UTF8 characters may be implemented.

#custom #emoji #utf8 #character

Last updated 2 years ago

L'exiliat professor Grappa · @giorgiograppa
232 followers · 222 posts · Server mastodon.la

Well, I've now solved the issue (apart from one small detail). It came down to:

- installing IBus (# pacman -S ibus) and
- activating IBus ($ ibus start)

Other keyboard configurations can be added with the IBus Preferences application.

What I haven't managed (various problems) is getting KDE to activate IBus before I log in, so I have to do it manually (a minor problem). Afterwards, I have to restart Yakuake or it won't recognize accents; and I also have to remember to start Tmux with the command:

$ tmux -u

to force it to use UTF-8 correctly; otherwise it will also cause problems with accents (instead of «à», it writes «_»).

So the solution is partial, but it already lets me work: all applications seem to recognize accents normally.

We'll keep investigating.

#endeavouros #plasma #kde #utf8 #accents

Last updated 2 years ago

Gemma 👽 · @prettyhuman
176 followers · 1565 posts · Server piipitin.fi

UTF-8 character names that could be band names…
NO-BREAK SPACE
BROKEN BAR
SECTION SIGN
DIAERESIS
FEMININE ORDINAL INDICATOR
SOFT HYPHEN
VULGAR FRACTION ONE HALF
APOSTROPHE
AMPERSAND
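
These names come from the Unicode character database, so they can be looked up programmatically. A minimal sketch using Python's standard `unicodedata` module maps each name in the list back to its code point. The variable names are illustrative.

```python
# Sketch: resolve the character names from the list to their code points.
import unicodedata

band_names = [
    "NO-BREAK SPACE",
    "BROKEN BAR",
    "SECTION SIGN",
    "DIAERESIS",
    "FEMININE ORDINAL INDICATOR",
    "SOFT HYPHEN",
    "VULGAR FRACTION ONE HALF",
    "APOSTROPHE",
    "AMPERSAND",
]

# unicodedata.lookup() returns the character for an exact Unicode name.
code_points = {
    name: f"U+{ord(unicodedata.lookup(name)):04X}" for name in band_names
}

for name, code_point in code_points.items():
    print(f"{code_point}  {name}")
```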

#utf8 #unicode

Last updated 2 years ago