#unicode — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #unicode, aggregated by home.social.
-
Il faudrait des symboles standardisés même intégré à #unicode pour :
1. Clique droit ;
2. Clique gauche ;
3. Variante de 1 et 2 avec double clique.
4. Clique central ;
5. Composés des divers possibilités entre 1 et 4 ;
6. Maintiens de bouton de souris enfoncé associé à un déplacement ;
7. Simple survol du curseur (hover).De pareils symboles pourraient avantageusement s’intégrer dans les syntaxes de description des formations de touches. Comme `Ctrl+Alt+X`.
-
I think I understand what's going on concerning the "Small Seal" vs. "Seal" inconsistency issue:
I guess I got confused by the #Unicode blog post mentioning a new "Small Seal" script, which made me believe that data files with "Seal" scripts and "Seal" blocks were inconsistent then, since the Unicode 18.0 *Beta* CodeCharts.pdf and NamesList.txt were also showing a new "Small Seal" block; in Unicode 18.0 *Alpha*, it was still a "Seal" block, so they are the ones to revert to the previous name... -
Finally, an #emoji to represent my permanent state of mind!
Thank you #Unicode Consortium for 1FAEB: https://www.unicode.org/emoji/charts-18.0/emoji-released.html#1faeb
-
Finally, an #emoji to represent my permanent state of mind!
Thank you #Unicode Consortium for 1FAEB: https://www.unicode.org/emoji/charts-18.0/emoji-released.html#1faeb
-
Finally, an #emoji to represent my permanent state of mind!
Thank you #Unicode Consortium for 1FAEB: https://www.unicode.org/emoji/charts-18.0/emoji-released.html#1faeb
-
I just found incidentally this interesting document about Unicode code charts; still a draft, I believe it is new in Unicode 18.0...
🔗 https://www.unicode.org/Public/draft/charts/About.html
And of course, there is chapter 24 of the outstanding "Core Spec": About the Code Charts.
🔗 https://www.unicode.org/versions/Unicode18.0.0/core-spec/chapter-24/
-
媽 CJK unified ideograph for an uma musume
-
Proud to announce another release of my little text editor, kg, now at v1.1.0
Read more about it on my blog https://troglobit.com/post/2026-05-26-long-time-no-blog/
#unicode #microemacs #emacs #terminal #console #embedded #linux
-
Currently, in Unicode 18.0 Beta, it seems that there is some kind of mess around the names of the new "Seal" script and new "Seal" block introduced in this announcement.
The name of the new script is "Small Seal" on the Unicode blog:
https://blog.unicode.org/2026/05/unicode-180-beta-review-opens-for.html
and the name of the new block is also "Small Seal" in its dedicated code chart:
https://www.unicode.org/Public/draft/charts/PDF/U3D000.pdf
(See also https://www.unicode.org/Public/draft/ucd/NamesList.txt) -
Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:
Check these guys on iOS and Linux
⬆⬇
On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.
-
Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:
Check these guys on iOS and Linux
⬆⬇
On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.
-
Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:
Check these guys on iOS and Linux
⬆⬇
On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.
-
Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:
Check these guys on iOS and Linux
⬆⬇
On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.
-
Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:
Check these guys on iOS and Linux
⬆⬇
On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.
-
One of the most complicated questions in modern unicodeology is, what to do with the characters that IBM 437 and similar codepages mapped into 0x00..0x1F, where they'd be avilable for memory-mapped display, but not necesarily for encoding into ordinary text formats.
-
You may have a file with a name like /usr/share/X11/locale/en_US.UTF-8/Compose that contains a *long* list of characters that you can type with compose-key combinations, along with their unicode code points.
References:
https://unicodefyi.com/guide/type-special-chars-linux/
https://symbolfyi.com/glossary/compose-key/
If you use i3, the magic word is Multi_key (this is also the name used in the file I mentioned above):
https://adamsimpson.net/writing/compose-key-and-i3
Here, it's about 11° and raining (see what I did there?)
-
When a top domain only allow a certain set of characters, like #Norid for .no-domain https://www.norid.no/en/om-domenenavn/regelverk-for-no/#3.-General-requirements-for-the-domain-name---what-can-be-applied-for%3F allowing only:
á
à
ä
č
ç
đ
é
è
ê
ï
ŋ
ń
ñ
ó
ò
ô
ö
š
ŧ
ü
ž
æ
ø
åIf there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:
ルートビア.vørterøl.no
?
-
When a top domain only allow a certain set of characters, like #Norid for .no-domain https://www.norid.no/en/om-domenenavn/regelverk-for-no/#3.-General-requirements-for-the-domain-name---what-can-be-applied-for%3F allowing only:
á
à
ä
č
ç
đ
é
è
ê
ï
ŋ
ń
ñ
ó
ò
ô
ö
š
ŧ
ü
ž
æ
ø
åIf there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:
ルートビア.vørterøl.no
?
-
When a top domain only allow a certain set of characters, like #Norid for .no-domain https://www.norid.no/en/om-domenenavn/regelverk-for-no/#3.-General-requirements-for-the-domain-name---what-can-be-applied-for%3F allowing only:
á
à
ä
č
ç
đ
é
è
ê
ï
ŋ
ń
ñ
ó
ò
ô
ö
š
ŧ
ü
ž
æ
ø
åIf there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:
ルートビア.vørterøl.no
?
-
When a top domain only allow a certain set of characters, like #Norid for .no-domain https://www.norid.no/en/om-domenenavn/regelverk-for-no/#3.-General-requirements-for-the-domain-name---what-can-be-applied-for%3F allowing only:
á
à
ä
č
ç
đ
é
è
ê
ï
ŋ
ń
ñ
ó
ò
ô
ö
š
ŧ
ü
ž
æ
ø
åIf there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:
ルートビア.vørterøl.no
?
-
When a top domain only allow a certain set of characters, like #Norid for .no-domain https://www.norid.no/en/om-domenenavn/regelverk-for-no/#3.-General-requirements-for-the-domain-name---what-can-be-applied-for%3F allowing only:
á
à
ä
č
ç
đ
é
è
ê
ï
ŋ
ń
ñ
ó
ò
ô
ö
š
ŧ
ü
ž
æ
ø
åIf there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:
ルートビア.vørterøl.no
?
-
Question for the #unicode nerds: how is 'Ç' sorted in the Alphabet, before/after 'C'? or completely differently? I have an Application here where it is sorted after 'Z'.
-
A friend of mine is creating a unicode table website - like the old unicode-table before it turned slow and useless for programmer lookups: https://symbl.dev
#unicode
It's a very fast virtualized list, more features and permalinks will follow -
Il existe une version en français des "Code Charts Unicode", dont peu de gens soupçonnent même l'existence...
Français: https://www.unicode.org/Public/17.0.0/charts/fr/CodeCharts.pdf
(Anglais: https://www.unicode.org/Public/17.0.0/charts/CodeCharts.pdf)Aujourd'hui, je viens de trouver, un peu par hasard, la version française ListeNoms.txt (apparemment québécoise) du fichier NamesList.txt utilisé justement pour générer les données des "code charts":
Français: https://hapax.qc.ca/ListeNoms-17.0.0.txt
(Anglais: https://www.unicode.org/Public/17.0.0/ucd/NamesList.txt) -
-
Nifty:
“All Of The String Types”, Lemon Donnell (https://lambdalemon.gay/posts/string-types).
Via Lobsters: https://lobste.rs/s/khf0ye/all_string_types
#Programming #PLDI #String #ProgrammingLanguages #Unicode #UTF8 #Characters
-
Nifty:
“All Of The String Types”, Lemon Donnell (https://lambdalemon.gay/posts/string-types).
Via Lobsters: https://lobste.rs/s/khf0ye/all_string_types
#Programming #PLDI #String #ProgrammingLanguages #Unicode #UTF8 #Characters
-
Nifty:
“All Of The String Types”, Lemon Donnell (https://lambdalemon.gay/posts/string-types).
Via Lobsters: https://lobste.rs/s/khf0ye/all_string_types
#Programming #PLDI #String #ProgrammingLanguages #Unicode #UTF8 #Characters
-
Nifty:
“All Of The String Types”, Lemon Donnell (https://lambdalemon.gay/posts/string-types).
Via Lobsters: https://lobste.rs/s/khf0ye/all_string_types
#Programming #PLDI #String #ProgrammingLanguages #Unicode #UTF8 #Characters
-
When you didn’t check whether your sign maker could handle orders with non-ASCII characters. #unicode
-
The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).
🔗 https://codeberg.org/tonton-pixel/unicopedia-symbolica
The linguistic data comes from the Unicode CLDR Project:
And all contributions to it are much welcome!
-
The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).
🔗 https://codeberg.org/tonton-pixel/unicopedia-symbolica
The linguistic data comes from the Unicode CLDR Project:
And all contributions to it are much welcome!
-
The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).
🔗 https://codeberg.org/tonton-pixel/unicopedia-symbolica
The linguistic data comes from the Unicode CLDR Project:
And all contributions to it are much welcome!
-
The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).
🔗 https://codeberg.org/tonton-pixel/unicopedia-symbolica
The linguistic data comes from the Unicode CLDR Project:
And all contributions to it are much welcome!
-
The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).
🔗 https://codeberg.org/tonton-pixel/unicopedia-symbolica
The linguistic data comes from the Unicode CLDR Project:
And all contributions to it are much welcome!
-
UTF-16 reintroduced the old byte split bugs on two byte quantities.
#unicode #utf16
https://george.mand.is/2026/05/my-favorite-bugs-invalid-surrogate-pairs/ -
RE: https://flipboard.com/@courrierinter/asie-lsc1hno6z/-/a-w9ciSPQxSLyq8hQhDuX5XA%3Aa%3A1808141830-%2F0
卢比奥 (lú bǐ ào) ➔ 鲁比奥 (lǔ bǐ ào)
U+5362 卢
U+5362 kDefinition cottage, hut; surname; black
U+5362 kMandarin lú
🔗 https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=5362➔
U+9C81 鲁
U+9C81 kDefinition foolish, stupid, rash; vulgar
U+9C81 kMandarin lǔ
🔗 https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=9C81 -
Eu tenho um adesivo que é uma piada com Unicode. Ele não ia caber na capa do meu celular, então eu o transformei numa piada sobre CSS. #unicode #webdesign #css
-
i could spend hours messing around with this nonsense:
-
ASCII Chessboard, No HTML Required - Sometimes, when I have absolutely nothing to do, I play with ASCII characters in vim. Today I made an ASCII chess board with black and white chess pieces. I'm pretty sure I'm not the first one to make an ascii chessboard and I won't be the last. I thought it looks pretty nice so I wanted to share it on my blog.
Full blog post at https://sava.rocks/blog/ascii-chessboard-no-html-required/
-
Curious what character limit can actually mean. On Twitter it's bytes, so two-byte Unicode characters eat up the allocations fast but on Bluesky it seems to by glyphs so what had to be trimmed for Twitter has lots of legroom on Bluesky.
-
The latest version 2.0.0 of the open-source application "Unicopedia Symbolica" (previously part of the "Unicopedia Plus" application) adds a new "Emoji Taxonomy" utility.
-
@dbattistella
@inthehands
A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019These hard-working birds would be a great addition to the other avian emoji 🦆🐦⬛🦅🦉🪿🐦🐧🐔🐥🐣
https://docs.google.com/document/d/1hU8yWK8U6jcMjjxR8DKYA8VI3F0xsQsCcpNaufnBnh0/ -
@dbattistella
@inthehands
A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019These hard-working birds would be a great addition to the other avian emoji 🦆🐦⬛🦅🦉🪿🐦🐧🐔🐥🐣
https://docs.google.com/document/d/1hU8yWK8U6jcMjjxR8DKYA8VI3F0xsQsCcpNaufnBnh0/ -
@dbattistella
@inthehands
A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019These hard-working birds would be a great addition to the other avian emoji 🦆🐦⬛🦅🦉🪿🐦🐧🐔🐥🐣
https://docs.google.com/document/d/1hU8yWK8U6jcMjjxR8DKYA8VI3F0xsQsCcpNaufnBnh0/ -
@dbattistella
@inthehands
A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019These hard-working birds would be a great addition to the other avian emoji 🦆🐦⬛🦅🦉🪿🐦🐧🐔🐥🐣
https://docs.google.com/document/d/1hU8yWK8U6jcMjjxR8DKYA8VI3F0xsQsCcpNaufnBnh0/