home.social

#unicode — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #unicode, aggregated by home.social.

  1. Eu tenho um adesivo que é uma piada com Unicode. Ele não ia caber na capa do meu celular, então eu o transformei numa piada sobre CSS. #unicode #webdesign #css

  2. What are the major features in this release?

    - We moved all matrix operations into the standard calculation and deprecated `matop`
    - is now aware and handles that by encoding all strings and files using

  3. The latest version 2.0.0 of the open-source application "Unicopedia Symbolica" (previously part of the "Unicopedia Plus" application) adds a new "Emoji Taxonomy" utility.

    🔗 codeberg.org/tonton-pixel/unic

    #Unicopedia #Symbolica #Unicode #Emoji #Taxonomy

  4. Today I learned that the 64 I Ching hexagram symbols are all included in Unicode.

    I'm fairly sure I'll never need to use them so I'll add it to my ever growing list of probably useless bits of knowledge.

    en.wikipedia.org/wiki/List_of_

    #unicode #hexagram #symbols

  5. 🚀 Look out world, Turbo Vision 2.0 is back, now with Unicode! Because clearly, what every 21st-century developer dreams of is a text-based UI from the 90s resurrected ✨ and made cross-platform. Just what we needed—now we can all relive the days of DOS in style! 😂
    github.com/magiblot/tvision #TurboVision2.0 #Unicode #TextBasedUI #RetroGaming #CrossPlatform #DOSNostalgia #HackerNews #ngated

  6. Как (не) перевернуть строку, или Ох, уж этот Unicode

    Это текстовая версия доклада с Java Rock Star Meetup, с которым выступал Александр Ланцов — ведущий разработчик Мир Plat.Form. Если вы больше любите смотреть видео, то смотрите запись доклада на YouTube или VK Видео . Читать далее 🔍

    habr.com/ru/companies/nspk/art

    #java #unicode #utf8 #utf16 #кодировки #обработка_строк #emoji #суррогатные_пары #графемные_кластеры

  7. Full members (voting) of the Unicode Consortium (2026-04-20): Adobe, Airbnb, Apple, Google, Meta, Microsoft, Salesforce, Translated.

    🔗 home.unicode.org/membership/me

    Compared to the full members list dated 2026-04-04, Amazon has disappeared and Google (re-)appeared. Great "substitution" magic trick indeed!

    On a side note, the HTML page source code indicates:

    <!-- List generated: 2026-04-20, 16:07:01 GMT -->

    and tomorrow starts the UTC #187 meeting (2026-04-21 to 2026-04-23)...

    #Unicode #Members

  8. Long before standards bodies agreed on how to encode diverse scripts and symbols, computers used incompatible character sets.

    Understanding how modern systems evolved from ASCII to Unicode—and what UTF-8, UTF-16, and UTF-32 mean—helps any developer, writer, or interested reader grasp the complexity hidden inside what looks like ordinary text.

    tiniacoleyba.com/blog/unicode- #WebStandards #ASCII #Unicode #basics

  9. While implementing a file drag-and-drop feature in one of my Electron-based apps, I fortuitously found an issue in the Electron framework which I believe could be a major security hole... Fortunately, this was not too difficult to fix, but I still don't understand why this has been overlooked so far...

    All applications have been corrected and can be downloaded from my Codeberg repository:
    🔗 codeberg.org/tonton-pixel/

    #OpenSource #DesktopApps #ElectronJS #Unicode #Unicopedia

  10. The latest version v18.1.0 of the open-source application "Unicopedia Sinica" is now available, embedding all data files required to display CJK ideographs as SVG glyphs in the "CJK Sources" and "CJK Variations" utilities...

    🔗 codeberg.org/tonton-pixel/unic

    #Unicopedia #Unicode #Unihan #CJK #CJKV #SVG #Love

  11. The latest version v18.1.0 of the open-source application "Unicopedia Sinica" is now available, embedding all data files required to display CJK ideographs as SVG glyphs in the "CJK Sources" and "CJK Variations" utilities...

    🔗 codeberg.org/tonton-pixel/unic

    #Unicopedia #Unicode #Unihan #CJK #CJKV #SVG #Love

  12. is there a unicode proposal yet for "request left-facing variant" and "request right-facing variant", like how we have the image/text variant selectors for ☃ and ☃︎?

    because i want all my animal emoji to face in the forwards text direction like in sitelen pona!!

    #unicode #emoji #tokipona

  13. Full members (voting) of the Unicode Consortium (2026-04-04): Adobe, Airbnb, Amazon, Apple, Meta, Microsoft, Salesforce, Translated.

    🔗 home.unicode.org/membership/me

    Adobe is back too! Just in time for Easter Day. Maybe a sign from heaven...

    #Unicode #Members

  14. RE: mastodon.social/@mikaeru/11630

    - Technically speaking, Khitan Small Script and Yi script are not included (yet) in the data for non-Han ideographic scripts.

    - The Jurchen and Seal scripts are poised to be officially added to Unicode 18.0 in September 2026...

    - BabelStone (Andrew West) reference links:
    🔗 babelstone.co.uk/Jurchen/
    🔗 babelstone.co.uk/Khitan/
    🔗 babelstone.co.uk/Yi/

    #Unicode #Ideographic #Unihan #CJK #CJKV #Jurchen #Khitan #Nüshu #Seal #Tangut #Yi

  15. RE: mastodon.social/@mikaeru/11630

    - Technically speaking, Khitan Small Script and Yi script are not included (yet) in the data for non-Han ideographic scripts.

    - The Jurchen and Seal scripts are poised to be officially added to Unicode 18.0 in September 2026...

    - BabelStone (Andrew West) reference links:
    🔗 babelstone.co.uk/Jurchen/
    🔗 babelstone.co.uk/Khitan/
    🔗 babelstone.co.uk/Yi/

    #Unicode #Ideographic #Unihan #CJK #CJKV #Jurchen #Khitan #Nüshu #Seal #Tangut #Yi

  16. 🚨 Oh no! The dreaded #Glassworm is back, like a transparent hacker on a mission to confuse developers with invisible #Unicode attacks. With 150 #GitHub repositories compromised, the solution is a dizzying list of acronyms and jargon that promises to protect your code, but only if you squint hard enough to see it! 🐛🔍 #SecurityTheater
    aikido.dev/blog/glassworm-retu #InvisibleAttacks #SecurityThreat #DeveloperConfusion #HackerNews #ngated

  17. Je viens d'apprendre un truc.
    #Typographie #Unicode

    Le petits symboles de drapeaux qu'on trouve sur nos claviers avec tous les autres émojis… eh bien ce ne sont PAS des caractères uniques (comme les autres émojis) mais des combinaisons de 2 caractères pris dans la famille des « Regional Indicator Symbol » (compart.com/fr/unicode/search? ) selon la codification des pays avec 2 caractères de la norme ISO fr.wikipedia.org/wiki/ISO_3166

    1/2

  18. Beware of blank lines and white spaces — Supply-chain attack using invisible code hits GitHub and other repositories

    Unicode that’s invisible to the human eye was largely abandoned - until attackers took notice.

    🧑‍💻 arstechnica.com/security/2026/

    #hacking #blankline #whitespace #github #supplychain #unicode #hack #git #code #coding #invisible #gitrepo #itsecurity #it #itsec

  19. Beware of blank lines and white spaces — Supply-chain attack using invisible code hits GitHub and other repositories

    Unicode that’s invisible to the human eye was largely abandoned - until attackers took notice.

    🧑‍💻 arstechnica.com/security/2026/

    #hacking #blankline #whitespace #github #supplychain #unicode #hack #git #code #coding #invisible #gitrepo #itsecurity #it #itsec

  20. Beware of blank lines and white spaces — Supply-chain attack using invisible code hits GitHub and other repositories

    Unicode that’s invisible to the human eye was largely abandoned - until attackers took notice.

    🧑‍💻 arstechnica.com/security/2026/

    #hacking #blankline #whitespace #github #supplychain #unicode #hack #git #code #coding #invisible #gitrepo #itsecurity #it #itsec

  21. Beware of blank lines and white spaces — Supply-chain attack using invisible code hits GitHub and other repositories

    Unicode that’s invisible to the human eye was largely abandoned - until attackers took notice.

    🧑‍💻 arstechnica.com/security/2026/

    #hacking #blankline #whitespace #github #supplychain #unicode #hack #git #code #coding #invisible #gitrepo #itsecurity #it #itsec

  22. When I say "IT mostly just runs in circles" I mean it: arstechnica.com/security/2026/

    This article from 2026 describes something I've been fighting with ~17 years ago. Sure, slightly more clever payload and different delivery method, but in principle nothing new: github.com/MichalBryxi/Apache-

    #Glassworm #Unicode #InvisibleCharacters #Whitespace #Hack

  23. L’année dernière, @MoritzBrouhaha nous a raconté l’histoire du standard informatique Unicode, utilisé par tout le monde à travers le globe dans nos communications quotidiennes. Visionnez ou revisionnez sa conférence en ligne.

    paris-web.fr/2025/conference/a

    #unicode #standards #typographie #internationalisation

  24. Thanks to @jlhwung, the so beautifully crafted 'BabelStone Han' font by Andrew West (魏安), is alive and well!

    The latest version 17.0.0, made of 'BabelStoneHanBasic.ttf' and 'BabelStoneHanExtra.ttf', is available from:

    🔗 github.com/babelstone/babelsto

    #BabelStone #BabelStoneHan #Unicode #Unihan #CJK #CJKV

  25. Thanks to @jlhwung, the so beautifully crafted 'BabelStone Han' font by Andrew West (魏安), is alive and well!

    The latest version 17.0.0, made of 'BabelStoneHanBasic.ttf' and 'BabelStoneHanExtra.ttf', is available from:

    🔗 github.com/babelstone/babelsto

    #BabelStone #BabelStoneHan #Unicode #Unihan #CJK #CJKV

  26. Musqueam language literally uses the North American Phonetic Alphabet.

    Is #Unicode going to add a hən̓q̓əmin̓əm̓ block?

    - Of course not!

    Saanich language uses a modified version of IPA.

    Is #Unicode going to add a SENĆOŦEN block?

    - Of course not! Saanich gets five supplementary characters and they'll be happy about it.

    Would it be possible to represent both of these phonetic alphabets by sharing the same #IPA code points?

    Yes! You would literally just need to change the fonts.

  27. CW: ALL CAPS TO IMPLY YELLING

    NOT EVERY LANGUAGE HAS AN ALPHABET.

    THERE ARE EXISTING LANGUAGES TODAY THAT JUST WRITE DOWN THE SOUNDS IN IPA.

    WHAT ARE THE PEOPLE WHO USE THESE LANGUAGES SUPPOSED TO DO WITHOUT IPA IN UNICODE.

    WE COULD HAVE A BASICALLY UNIVERSAL ALPHABET IN UNICODE.

    YOU COULD CONVERT BETWEEN PHONETIC-BASED SCRIPTS BY CHANGING A FONT.

    #IPA #Unicode

  28. IPA uses a basically random assortment of characters from whatever existing Unicode blocks had similarly-shaped scripts.

    There's no consistent IPA in Unicode. Just a patchwork.

    Why does any of this matter?

    Well, for one is makes linguistics more difficult.

    Unicode is fine with adding a bunch of dead or even undeciphered languages to Unicode to help out academics, but linguists I guess can get fucked.

    But also there's a bigger and more obvious problem.

    #IPA #Unicode

  29. #IPA is the alphabet used to less ambiguously represent sounds.

    Just like Latin, Greek, and Cyrillic, it's an alphabet.

    The IPA "a" doesn't have the same meaning as the Latin "a" or the Cyrillic "а". Instead it represents the "open front unrounded vowel".

    en.wikipedia.org/wiki/Open_fro

    So what #Unicode character are IPA users supposed to use?

    Just the Latin one.

  30. I like #Unicode.

    If you happened to have followed me on Twitter, you'll know that I know way more about how emoji work than most people.

    But holy crap, did Unicode manage to mess up how they handled #IPA.

    For anybody who knows what this means: I think Unicode's handling of IPA is more serious stumble than CJK Unification.

  31. RE: mastodon.social/@mikaeru/11550

    The latest post on the Unicode Consortium blog gives an exhaustive list of all the new Unicode properties in regular expressions (regex), and explains why all the supported properties are so important and can be so useful:

    blog.unicode.org/2026/03/uts-1

    #Unicode #Properties #Regex #RegularExpressions

  32. The unicode standard is over 3000 pages and supports a penguin #emoji but not a wildebeest. #RMS probably does not like that one bit.

    Seriously, how can the #GNU mascot be missing from the #unicode amid all the junk that is in there?

  33. @[email protected]

    Seeing as you're making dice that are fighting the current situation in the USA right now, I think
    this would be up your alley. https://stgiga.github.io/gigaware/TarouijaD120files.zip would be up your alley. It is a 3D model with OpenSCAD for tweaks, of a d120 but instead of the numbers 1-120, it has extended Tarot and extended Ouija as its symbols, via Unicode shenanigans, following this mapping https://www.reddit.com/r/d120Lists/comments/17mr2uv/d120_tarot_and_spirit_board/

    Roll: Result
    1: Ace of Spades

    2: Two of Spades

    3: Three of Spades

    4: Four of Spades

    5: Five of Spades

    6: Six of Spades

    7: Seven of Spades

    8: Eight of Spades

    9: Nine of Spades

    10: Ten of Spades

    11: Jack of Spades

    12: Knight of Spades

    13: Queen of Spades

    14: King of Spades

    15: Ace of Hearts

    16: Two of Hearts

    17: Three of Hearts

    18: Four of Hearts

    19: Five of Hearts

    20: Six of Hearts

    21: Seven of Hearts

    22: Eight of Hearts

    23: Nine of Hearts

    24: Ten of Hearts

    25: Jack of Hearts

    26: Knight of Hearts

    27: Queen of Hearts

    28: King of Hearts

    29: Ace of Diamonds

    30: Two of Diamonds

    31: Three of Diamonds

    32: Four of Diamonds

    33: Five of Diamonds

    34: Six of Diamonds

    35: Seven of Diamonds

    36: Eight of Diamonds

    37: Nine of Diamonds

    38: Ten of Diamonds

    39: Jack of Diamonds

    40: Knight of Diamonds

    41: Queen of Diamonds

    42: King of Diamonds

    43: Black Joker

    44: Ace of Clubs

    45: Two of Clubs

    46: Three of Clubs

    47: Four of Clubs

    48: Five of Clubs

    49: Six of Clubs

    50: Seven of Clubs

    51: Eight of Clubs

    52: Nine of Clubs

    53: Ten of Clubs

    54: Jack of Clubs

    55: Knight of Clubs

    56: Queen of Clubs

    57: King of Clubs

    58: White Joker

    59: Fool

    60: Individual

    61: Childhood

    62: Youth

    63: Maturity

    64: Old Age

    65: Morning

    66: Afternoon

    67: Evening

    68: Night

    69: Earth and Air

    70: Water and Fire

    71: Dance

    72: Shopping

    73: Open Air

    74: Visual Arts

    75: Spring

    76: Summer

    77: Autumn

    78: Winter

    79: The Game

    80: Collective

    81: 0

    82: 1

    83: 2

    84: 3

    85: 4

    86: 5

    87: 6

    88: 7

    89: 8

    90: 9

    91: A

    92: B

    93: C

    94: D

    95: E

    96: F

    97: G

    98: H

    99: I

    100: J

    101: K

    102: L

    103: M

    104: N

    105: O

    106: P

    107: Q

    108: R

    109: S

    110: T

    111: U

    112: V

    113: W

    114: X

    115: Y

    116: Z

    117: Yes

    118: No

    119: Hello

    120: Goodbye

    And in Unicode

    🂡🂢🂣🂤🂥🂦🂧🂨🂩🂪🂫🂬🂭🂮🂱🂲🂳🂴🂵🂶🂷🂸🂹🂺🂻🂼🂽🂾🃁🃂🃃🃄🃅🃆🃇🃈🃉🃊🃋🃌🃍🃎
    🃏🃑🃒🃓🃔🃕🃖🃗🃘🃙🃚🃛🃜🃝🃞🃟🃠🃡🃢🃣🃤🃥🃦🃧🃨🃩🃪🃫🃬🃭🃮🃯🃰🃱🃲🃳🃴🃵𝟶𝟷𝟸𝟹𝟺𝟻𝟼𝟽𝟾𝟿𝙰𝙱𝙲𝙳𝙴𝙵𝙶𝙷𝙸𝙹𝙺𝙻𝙼𝙽𝙾𝙿𝚀𝚁𝚂𝚃𝚄𝚅𝚆𝚇𝚈𝚉👍👎⎆⎋

    The first section of characters is the contents of the Playing Cards block in Unicode, minus Red Joker (white is kept) and Playing Card Back. So that means the 52 cards (jokers included) in an English/American deck of playing cards, plus Tarot's Knight cards, so 56 cards (and these are basically a graphical suit with the value above it, in a 12pt cell), plus the 22 cards in the Major Arcana, with "Fool" as XXII as is done on some decks. That section is rendered as a 12pt card with Roman numerals I through XXII with IX and XI having disambiguation dots. The naming I used for the cards is the
    alias names Unicode gives the cards. So none of the "The Hanged Man" or the generic numbered-only names that Unicode gives as their official codepoint names. After that is Ouija's 0-9 and uppercase A-Z, using Unicode's Mathematical Monospaced characters (Courier) from Mathematical Alphanumeric Symbols block, in order to fit the 1800s playbill font commonly seen on Ouija boards, also 12pt. Now the next ones are the interesting ones. To represent Yes and No, I used the Thumbs-Up and Thumbs-Down emoji respectively, and the real interesting part is what I did for Hello and Goodbye. For those, I used two characters from the Miscellaneous Technical block, namely the Enter Symbol and the Escape Symbol, both seen on old Mac keyboards. The first one is a diamond with an arrow pointing inwards, and the second one is a circle with an arrow pointing outwards. The metaphor here is that "Hello" is entering a conversation, and "Goodbye" is leaving one, obviously with a spirit. And all this fills ALL 120 slots on a d120, with no empty or duplicate entries. A unique glyph for each side. The only fonts usable for this by the way are Unifont Smooth (bundled) or UnifontEX. No other font, even Unifont itself, has all the characters together, due to the fact that Hello and Goodbye symbols are in Plane 0, meanwhile the rest of the characters are in Plane 1 AND even include emoji, never mind that some fonts do not support the Major Arcana part of the Playing Cards block. So basically, you're stuck with these two forks of GNU Unifont, but UnifontEX is pixel and so is not exactly a fitting theme unless you're a hacker like I am. Plus, by a bout of sheer chance, ALL the characters after vectorization turned out fine (though White Joker's J is too skeletal in the loop), something that related characters (some of the other stuff in the same block as the thumbs up and thumbs down emoji didn't vectorize well) have trouble with. I was very pleasantly surprised that the emoji and the Roman numerals turned out fine. But ultimately this was a feat of engineering I did when I was bored from 2023 to nowadays.

    Anyways, what makes this a compelling protest product is that it combines several things that fundamentalist Christians are very prone to hating. It takes Tarot cards and Ouija boards and shoves them onto dice that are literally divisible into an entire set of common and rare TTRPG dice, on top of the shape being a D&D d20 but divided into 6 triangles (putting a d4 on each face and then dividing by 2), a D&D d12 but divided into 10 triangles for each pentagon, as well as being a derivative shape of the d30 and d60. So basically, this "Tarouija" d120 combines multiple things that fundamentalist Christians consider "demonic" into one divination ritual item and thus is a great form of protest against the religious right. For the record I live in California. Hopefully this is interesting. Oh the OpenSCAD file needs the nightly build of OpenSCAD.
    #dicemaking #dicemaker #dice #d120 #unicode #unifontex #tarotcard #tarotdecks #tarotcards #tarotcardsreading #ouijaboard #ouija #3d #3dp #3dprinting #3dprinter #spiritboard #majorarcana #fuckice #protest #unifont #openscad #scad #3dart #art #tech #technology #code #font #fontdev #fonts #3dmodel #3dmodeling #3dmodels #3dmodeled #computerscience #compsci #boredom #activism #ice

  34. The icon of the new #Unicopedia #Sigilla application shows the provisional #Seal character U+3FBB5 𿮵 whose equivalent #CJK #ideograph is U+5B57 字, meaning "letter, character, word".

    #Unicode

  35. The icon of the new #Unicopedia #Sigilla application shows the provisional #Seal character U+3FBB5 𿮵 whose equivalent #CJK #ideograph is U+5B57 字, meaning "letter, character, word".

    #Unicode

  36. Very interesting insights into currency symbols in Unicode and how their implementation involves decisions and actions on so many different levels:

    🔗 blog.unicode.org/2026/02/from-

    #Unicode #Currency #Symbols #Roadmap

  37. I found this reply that I made in 1984 to Dennis Ritchie in the net.followup newsgroup. I was at the time lobbying Sun to add 8-bit character set support to the firmware, but they wanted to hold out for a 16-bit system, like the as yet unnamed Unicode. There was eventually an interim solution but my memory of that is a bit foggy.

    #Usenet #DennisRitchie #C #Pascal #emacs #VT100 #charactersets #ISO8859 #languages #Swedish #programming #unicode #SunMicrosystems #Värmland

  38. Un outil pour styliser un texte personnalisé à l'aide de caractères unicodes.

    🔗 textstyler.com/

    #unicode #utf8 #texte #style

  39. Taught Invisible XML today to my undergrads for the first time (last year had a guest speaker do it). This is lots of fun, to write a deliberate grammar instead of grabbing at patterns haphazardly with ad hoc regular expressions. Both are fun, but ixml kinda makes you feel like a serious wielder of Unicode lore. Or a Cheshire Cat.

    invisiblexml.org
    #ixml #digitalhumanities #xml #unicode

  40. All documents published by the Ideographic Research Group (IRG) are now available on the Unicode web site, and can be easily and efficiently found through the new search bar provided on the IRG homepage.

    🔗 unicode.org/irg/

    This long-awaited search feature is very convenient, and so useful to find what you're interested in, and even more (ah, the wonderful power of serendipity!)...

    #Unicode #IRG #IdeographicResearchGroup #CJK #Ideographs #Unihan