home.social

#unicode — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #unicode, aggregated by home.social.

  1. Il faudrait des symboles standardisés même intégré à #unicode pour :

    1. Clique droit ;
    2. Clique gauche ;
    3. Variante de 1 et 2 avec double clique.
    4. Clique central ;
    5. Composés des divers possibilités entre 1 et 4 ;
    6. Maintiens de bouton de souris enfoncé associé à un déplacement ;
    7. Simple survol du curseur (hover).

    De pareils symboles pourraient avantageusement s’intégrer dans les syntaxes de description des formations de touches. Comme `Ctrl+Alt+X`.

    #it #ergonomie #ui

  2. I think I understand what's going on concerning the "Small Seal" vs. "Seal" inconsistency issue:
    I guess I got confused by the #Unicode blog post mentioning a new "Small Seal" script, which made me believe that data files with "Seal" scripts and "Seal" blocks were inconsistent then, since the Unicode 18.0 *Beta* CodeCharts.pdf and NamesList.txt were also showing a new "Small Seal" block; in Unicode 18.0 *Alpha*, it was still a "Seal" block, so they are the ones to revert to the previous name...

  3. I just found incidentally this interesting document about Unicode code charts; still a draft, I believe it is new in Unicode 18.0...

    🔗 unicode.org/Public/draft/chart

    And of course, there is chapter 24 of the outstanding "Core Spec": About the Code Charts.

    🔗 unicode.org/versions/Unicode18

    #Unicode #CodeCharts #CoreSpec

  4. Proud to announce another release of my little text editor, kg, now at v1.1.0

    Read more about it on my blog troglobit.com/post/2026-05-26-

  5. @hackernews

    Currently, in Unicode 18.0 Beta, it seems that there is some kind of mess around the names of the new "Seal" script and new "Seal" block introduced in this announcement.

    The name of the new script is "Small Seal" on the Unicode blog:
    blog.unicode.org/2026/05/unico
    and the name of the new block is also "Small Seal" in its dedicated code chart:
    unicode.org/Public/draft/chart
    (See also unicode.org/Public/draft/ucd/N)

    @codepoints
    @lianghai

    #Unicode #Seal #SmallSeal

  6. Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:

    Check these guys on iOS and Linux

    ⬆⬇

    On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.

    #unicode #browser #webdev #html

  7. Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:

    Check these guys on iOS and Linux

    ⬆⬇

    On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.

    #unicode #browser #webdev #html

  8. Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:

    Check these guys on iOS and Linux

    ⬆⬇

    On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.

    #unicode #browser #webdev #html

  9. Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:

    Check these guys on iOS and Linux

    ⬆⬇

    On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.

    #unicode #browser #webdev #html

  10. Don't use nice looking Unicode arrows in your web application. They may come out quite differently depending on OS, browser and what not:

    Check these guys on iOS and Linux

    ⬆⬇

    On Linux they currently look like two ordinary arrows. On iOS they are rendered like buttons.

    #unicode #browser #webdev #html

  11. One of the most complicated questions in modern unicodeology is, what to do with the characters that IBM 437 and similar codepages mapped into 0x00..0x1F, where they'd be avilable for memory-mapped display, but not necesarily for encoding into ordinary text formats.

    #retrocomputing #unicode

  12. You may have a file with a name like /usr/share/X11/locale/en_US.UTF-8/Compose that contains a *long* list of characters that you can type with compose-key combinations, along with their unicode code points.

    References:

    unicodefyi.com/guide/type-spec

    symbolfyi.com/glossary/compose

    If you use i3, the magic word is Multi_key (this is also the name used in the file I mentioned above):

    adamsimpson.net/writing/compos

    Here, it's about 11° and raining (see what I did there?)

    #linux #specialcharacters #compose #unicode

  13. When a top domain only allow a certain set of characters, like #Norid for .no-domain norid.no/en/om-domenenavn/rege allowing only:

    á
    à
    ä
    č
    ç
    đ
    é
    è
    ê
    ï
    ŋ
    ń
    ñ
    ó
    ò
    ô
    ö
    š
    ŧ
    ü
    ž
    æ
    ø
    å

    If there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:

    ルートビア.vørterøl.no

    ?

    #IDN #unicode #rfc5890 #rfc5891 #rfc5892 #rfc5893 #rfc5894

  14. When a top domain only allow a certain set of characters, like #Norid for .no-domain norid.no/en/om-domenenavn/rege allowing only:

    á
    à
    ä
    č
    ç
    đ
    é
    è
    ê
    ï
    ŋ
    ń
    ñ
    ó
    ò
    ô
    ö
    š
    ŧ
    ü
    ž
    æ
    ø
    å

    If there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:

    ルートビア.vørterøl.no

    ?

    #IDN #unicode #rfc5890 #rfc5891 #rfc5892 #rfc5893 #rfc5894

  15. When a top domain only allow a certain set of characters, like #Norid for .no-domain norid.no/en/om-domenenavn/rege allowing only:

    á
    à
    ä
    č
    ç
    đ
    é
    è
    ê
    ï
    ŋ
    ń
    ñ
    ó
    ò
    ô
    ö
    š
    ŧ
    ü
    ž
    æ
    ø
    å

    If there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:

    ルートビア.vørterøl.no

    ?

    #IDN #unicode #rfc5890 #rfc5891 #rfc5892 #rfc5893 #rfc5894

  16. When a top domain only allow a certain set of characters, like #Norid for .no-domain norid.no/en/om-domenenavn/rege allowing only:

    á
    à
    ä
    č
    ç
    đ
    é
    è
    ê
    ï
    ŋ
    ń
    ñ
    ó
    ò
    ô
    ö
    š
    ŧ
    ü
    ž
    æ
    ø
    å

    If there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:

    ルートビア.vørterøl.no

    ?

    #IDN #unicode #rfc5890 #rfc5891 #rfc5892 #rfc5893 #rfc5894

  17. When a top domain only allow a certain set of characters, like #Norid for .no-domain norid.no/en/om-domenenavn/rege allowing only:

    á
    à
    ä
    č
    ç
    đ
    é
    è
    ê
    ï
    ŋ
    ń
    ñ
    ó
    ò
    ô
    ö
    š
    ŧ
    ü
    ž
    æ
    ø
    å

    If there exist a domain `vørterøl.no`, would it then be considered rude (or not allowed) to add non supported characters in sub domains like:

    ルートビア.vørterøl.no

    ?

    #IDN #unicode #rfc5890 #rfc5891 #rfc5892 #rfc5893 #rfc5894

  18. Question for the #unicode nerds: how is 'Ç' sorted in the Alphabet, before/after 'C'? or completely differently? I have an Application here where it is sorted after 'Z'.

  19. A friend of mine is creating a unicode table website - like the old unicode-table before it turned slow and useless for programmer lookups: https://symbl.dev
    #unicode

    It's a very fast virtualized list, more features and permalinks will follow

  20. Il existe une version en français des "Code Charts Unicode", dont peu de gens soupçonnent même l'existence...

    Français: unicode.org/Public/17.0.0/char
    (Anglais: unicode.org/Public/17.0.0/char)

    Aujourd'hui, je viens de trouver, un peu par hasard, la version française ListeNoms.txt (apparemment québécoise) du fichier NamesList.txt utilisé justement pour générer les données des "code charts":

    Français: hapax.qc.ca/ListeNoms-17.0.0.t
    (Anglais: unicode.org/Public/17.0.0/ucd/)

    #Unicode #CodeCharts #Français #Québec

  21. When you didn’t check whether your sign maker could handle orders with non-ASCII characters. #unicode

  22. The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).

    🔗 codeberg.org/tonton-pixel/unic

    The linguistic data comes from the Unicode CLDR Project:

    🔗 cldr.unicode.org/

    And all contributions to it are much welcome!

    #Unicopedia #Emoji #Languages #Unicode

  23. The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).

    🔗 codeberg.org/tonton-pixel/unic

    The linguistic data comes from the Unicode CLDR Project:

    🔗 cldr.unicode.org/

    And all contributions to it are much welcome!

    #Unicopedia #Emoji #Languages #Unicode

  24. The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).

    🔗 codeberg.org/tonton-pixel/unic

    The linguistic data comes from the Unicode CLDR Project:

    🔗 cldr.unicode.org/

    And all contributions to it are much welcome!

    #Unicopedia #Emoji #Languages #Unicode

  25. The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).

    🔗 codeberg.org/tonton-pixel/unic

    The linguistic data comes from the Unicode CLDR Project:

    🔗 cldr.unicode.org/

    And all contributions to it are much welcome!

    #Unicopedia #Emoji #Languages #Unicode

  26. The latest version 2.3.0 of the open-source application "Unicopedia Symbolica" introduces a new Language drop-down menu in the "Emoji Data Finder" utility, which lets you display the short name and keywords of all the emoji in 170 languages, including the ones whose direction is Right-To-Left (RTL).

    🔗 codeberg.org/tonton-pixel/unic

    The linguistic data comes from the Unicode CLDR Project:

    🔗 cldr.unicode.org/

    And all contributions to it are much welcome!

    #Unicopedia #Emoji #Languages #Unicode

  27. Eu tenho um adesivo que é uma piada com Unicode. Ele não ia caber na capa do meu celular, então eu o transformei numa piada sobre CSS. #unicode #webdesign #css

  28. What are the major features in this release?

    - We moved all matrix operations into the standard calculation and deprecated `matop`
    - is now aware and handles that by encoding all strings and files using

  29. What are the major features in this release?

    - We moved all matrix operations into the standard calculation and deprecated `matop`
    - #NumeRe is now #Unicode aware and handles that by encoding all strings and files using #UTF8

  30. What are the major features in this release?

    - We moved all matrix operations into the standard calculation and deprecated `matop`
    - #NumeRe is now #Unicode aware and handles that by encoding all strings and files using #UTF8

  31. What are the major features in this release?

    - We moved all matrix operations into the standard calculation and deprecated `matop`
    - #NumeRe is now #Unicode aware and handles that by encoding all strings and files using #UTF8

  32. ASCII Chessboard, No HTML Required - Sometimes, when I have absolutely nothing to do, I play with ASCII characters in vim. Today I made an ASCII chess board with black and white chess pieces. I'm pretty sure I'm not the first one to make an ascii chessboard and I won't be the last. I thought it looks pretty nice so I wanted to share it on my blog.

    Full blog post at sava.rocks/blog/ascii-chessboa

    #ascii #unicode #chess

  33. Curious what character limit can actually mean. On Twitter it's bytes, so two-byte Unicode characters eat up the allocations fast but on Bluesky it seems to by glyphs so what had to be trimmed for Twitter has lots of legroom on Bluesky.

    #Tech #character #Unicode

  34. The latest version 2.0.0 of the open-source application "Unicopedia Symbolica" (previously part of the "Unicopedia Plus" application) adds a new "Emoji Taxonomy" utility.

    🔗 codeberg.org/tonton-pixel/unic

    #Unicopedia #Symbolica #Unicode #Emoji #Taxonomy

  35. @dbattistella
    @inthehands
    A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019

    These hard-working birds would be a great addition to the other avian emoji 🦆🐦‍⬛🦅🦉🪿🐦🐧🐔🐥🐣
    docs.google.com/document/d/1hU

  36. @dbattistella
    @inthehands
    A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019

    These hard-working birds would be a great addition to the other avian emoji 🦆🐦‍⬛🦅🦉🪿🐦🐧🐔🐥🐣
    docs.google.com/document/d/1hU

  37. @dbattistella
    @inthehands
    A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019

    These hard-working birds would be a great addition to the other avian emoji 🦆🐦‍⬛🦅🦉🪿🐦🐧🐔🐥🐣
    docs.google.com/document/d/1hU

  38. @dbattistella
    @inthehands
    A #VultureEmoji has been officially “Under Consideration” by the #Emoji people at #Unicode since 2019

    These hard-working birds would be a great addition to the other avian emoji 🦆🐦‍⬛🦅🦉🪿🐦🐧🐔🐥🐣
    docs.google.com/document/d/1hU