#unicode — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #unicode, aggregated by home.social.
-
RE: https://flipboard.com/@courrierinter/asie-lsc1hno6z/-/a-w9ciSPQxSLyq8hQhDuX5XA%3Aa%3A1808141830-%2F0
卢比奥 (lú bǐ ào) ➔ 鲁比奥 (lǔ bǐ ào)
U+5362 卢
U+5362 kDefinition cottage, hut; surname; black
U+5362 kMandarin lú
🔗 https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=5362
➔
U+9C81 鲁
U+9C81 kDefinition foolish, stupid, rash; vulgar
U+9C81 kMandarin lǔ
🔗 https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=9C81
-
I have a sticker that is a Unicode joke. It wasn't going to fit on my phone case, so I turned it into a joke about CSS. #unicode #webdesign #css
-
The latest version 2.0.0 of the open-source application "Unicopedia Symbolica" (previously part of the "Unicopedia Plus" application) adds a new "Emoji Taxonomy" utility.
-
Today I learned that the 64 I Ching hexagram symbols are all included in Unicode.
I'm fairly sure I'll never need to use them, so I'll add this to my ever-growing list of probably useless bits of knowledge.
https://en.wikipedia.org/wiki/List_of_hexagrams_of_the_I_Ching
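For reference, a minimal Python sketch that enumerates these symbols. It assumes the Yijing Hexagram Symbols block spans U+4DC0 to U+4DFF; the names `HEXAGRAM_START`, `HEXAGRAM_END` and `hexagrams` are illustrative, not taken from the post.

```python
import unicodedata

# Assumption: the Yijing Hexagram Symbols block covers U+4DC0..U+4DFF.
HEXAGRAM_START = 0x4DC0
HEXAGRAM_END = 0x4DFF


def hexagrams() -> list[str]:
    """Return the hexagram characters in code point order."""
    return [chr(cp) for cp in range(HEXAGRAM_START, HEXAGRAM_END + 1)]


if __name__ == "__main__":
    for ch in hexagrams():
        # unicodedata.name() returns the character name known to this Python build.
        print(f"U+{ord(ch):04X} {ch} {unicodedata.name(ch)}")
```

Whether the glyphs actually display depends on the fonts installed on the machine running the script.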
-
🚀 Look out world, Turbo Vision 2.0 is back, now with Unicode! Because clearly, what every 21st-century developer dreams of is a text-based UI from the 90s resurrected ✨ and made cross-platform. Just what we needed—now we can all relive the days of DOS in style! 😂
https://github.com/magiblot/tvision #TurboVision2.0 #Unicode #TextBasedUI #RetroGaming #CrossPlatform #DOSNostalgia #HackerNews #ngated
-
How (not) to reverse a string, or Oh, that Unicode
This is the text version of a talk from Java Rock Star Meetup, given by Alexander Lantsov, lead developer at Mir Plat.Form. If you prefer watching video, the recording of the talk is available on YouTube or VK Video.
https://habr.com/ru/companies/nspk/articles/1024668/
#java #unicode #utf8 #utf16 #кодировки #обработка_строк #emoji #суррогатные_пары #графемные_кластеры
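The pitfalls named in the hashtags (surrogate pairs, grapheme clusters) can be sketched briefly. This is not code from the talk: it is a hedged Python illustration, and all function names are hypothetical. `reverse_keeping_marks` handles only combining marks; it is not a full grapheme-cluster segmentation and will still break flags and ZWJ emoji sequences.

```python
import unicodedata


def utf16_code_units(text: str) -> int:
    """Number of UTF-16 code units, i.e. what Java's String.length() reports."""
    return len(text.encode("utf-16-le")) // 2


def naive_reverse(text: str) -> str:
    """Reverse code point by code point; combining marks get detached."""
    return text[::-1]


def reverse_keeping_marks(text: str) -> str:
    """Reverse while keeping each combining mark attached to its base character.

    Simplification: a "cluster" is a character plus any following characters
    with a non-zero canonical combining class.
    """
    clusters: list[str] = []
    for ch in text:
        if clusters and unicodedata.combining(ch):
            clusters[-1] += ch  # attach the mark to the previous base character
        else:
            clusters.append(ch)
    return "".join(reversed(clusters))


if __name__ == "__main__":
    sample = "cafe\u0301"  # "café" written as e + COMBINING ACUTE ACCENT
    print(ascii(naive_reverse(sample)))          # the accent ends up first
    print(ascii(reverse_keeping_marks(sample)))  # the accent stays on the "e"
    print(utf16_code_units("\U0001F600"))        # one emoji, two UTF-16 code units
```

In Java the same problem shows up one level lower, because a `char` is a UTF-16 code unit and a character outside the Basic Multilingual Plane occupies two of them.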
-
Full members (voting) of the Unicode Consortium (2026-04-20): Adobe, Airbnb, Apple, Google, Meta, Microsoft, Salesforce, Translated.
🔗 https://home.unicode.org/membership/members/
Compared to the full members list dated 2026-04-04, Amazon has disappeared and Google (re-)appeared. Great "substitution" magic trick indeed!
On a side note, the HTML page source code indicates:
<!-- List generated: 2026-04-20, 16:07:01 GMT -->
and tomorrow starts the UTC #187 meeting (2026-04-21 to 2026-04-23)...
-
Long before standards bodies agreed on how to encode diverse scripts and symbols, computers used incompatible character sets.
Understanding how modern systems evolved from ASCII to Unicode—and what UTF-8, UTF-16, and UTF-32 mean—helps any developer, writer, or interested reader grasp the complexity hidden inside what looks like ordinary text.
https://tiniacoleyba.com/blog/unicode-utf8-ascii-encodings/ #WebStandards #ASCII #Unicode #basics
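To make the difference between the three encoding forms concrete, here is a small Python sketch (not taken from the linked article; `encoded_sizes` is an illustrative name) that reports how many bytes the same text occupies in each of them. The little-endian codec variants are used so that no byte order mark is counted.

```python
def encoded_sizes(text: str) -> dict[str, int]:
    """Byte length of `text` in UTF-8, UTF-16 and UTF-32 (no byte order mark)."""
    return {
        "utf-8": len(text.encode("utf-8")),
        "utf-16": len(text.encode("utf-16-le")),
        "utf-32": len(text.encode("utf-32-le")),
    }


if __name__ == "__main__":
    # ASCII letter, accented Latin letter, CJK ideograph, emoji outside the BMP.
    for sample in ("A", "\u00e9", "\u5b57", "\U0001F600"):
        print(f"U+{ord(sample):04X}", encoded_sizes(sample))
```

UTF-8 grows from 1 to 4 bytes as code points get larger, UTF-16 uses 2 bytes inside the Basic Multilingual Plane and 4 outside it, and UTF-32 always uses 4.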
-
Unicode Variation Selectors, by @CSSence:
-
While implementing a file drag-and-drop feature in one of my Electron-based apps, I fortuitously found an issue in the Electron framework which I believe could be a major security hole... Fortunately, this was not too difficult to fix, but I still don't understand why this has been overlooked so far...
All applications have been corrected and can be downloaded from my Codeberg repository:
🔗 https://codeberg.org/tonton-pixel/
-
A cool little guide to Unicode characters that I found (now I want to make a font that has them all):
https://antofthy.gitlab.io/info/data/utf8_demo.txt
#typography #unicode #characters #glyphs #guide #unicodeBlocks
-
The latest version v18.1.0 of the open-source application "Unicopedia Sinica" is now available, embedding all data files required to display CJK ideographs as SVG glyphs in the "CJK Sources" and "CJK Variations" utilities...
-
Charcuterie: A visual Unicode picker
https://charcuterie.elastiq.ch/#A66E
#via:kottke #unicode #design #emoji #+
-
Full members (voting) of the Unicode Consortium (2026-04-04): Adobe, Airbnb, Amazon, Apple, Meta, Microsoft, Salesforce, Translated.
🔗 https://home.unicode.org/membership/members/
Adobe is back too! Just in time for Easter Day. Maybe a sign from heaven...
-
An Introduction to Writing Systems and Unicode
https://r12a.github.io/scripts/tutorial/part2
#HackerNews #WritingSystems #Unicode #Tutorial #Linguistics #TechEducation
-
The art of text rendering ("L’art du rendu de texte"), Thursday 2 April 2026, 8:00 pm to 9:00 pm. https://www.agendadulibre.org/events/34895 #typographie #glyphe #police #unicode #gutenberg #latex #tex #libreEnFete2026 #video
-
RE: https://mastodon.social/@mikaeru/116300905939195821
- Technically speaking, Khitan Small Script and Yi script are not included (yet) in the data for non-Han ideographic scripts.
- The Jurchen and Seal scripts are poised to be officially added to Unicode 18.0 in September 2026...
- BabelStone (Andrew West) reference links:
🔗 https://www.babelstone.co.uk/Jurchen/
🔗 https://www.babelstone.co.uk/Khitan/
🔗 https://www.babelstone.co.uk/Yi/
#Unicode #Ideographic #Unihan #CJK #CJKV #Jurchen #Khitan #Nüshu #Seal #Tangut #Yi
-
🚨 Oh no! The dreaded #Glassworm is back, like a transparent hacker on a mission to confuse developers with invisible #Unicode attacks. With 150 #GitHub repositories compromised, the solution is a dizzying list of acronyms and jargon that promises to protect your code, but only if you squint hard enough to see it! 🐛🔍 #SecurityTheater
https://www.aikido.dev/blog/glassworm-returns-unicode-attack-github-npm-vscode #InvisibleAttacks #SecurityThreat #DeveloperConfusion #HackerNews #ngated
-
Glassworm Is Back: A New Wave of Invisible Unicode Attacks Hits Repositories
https://www.aikido.dev/blog/glassworm-returns-unicode-attack-github-npm-vscode
#HackerNews #Glassworm #Invisible #Unicode #Attacks #Cybersecurity #GitHub #Repositories
-
I just learned something.
#Typographie #Unicode
The little flag symbols found on our keyboards alongside all the other emoji… well, they are NOT single characters (like the other emoji) but combinations of 2 characters taken from the "Regional Indicator Symbol" family (https://www.compart.com/fr/unicode/search?q=regional%20indicator#characters ), following the 2-character country codes of the ISO standard https://fr.wikipedia.org/wiki/ISO_3166-1
1/2
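The mechanism described above can be sketched in a few lines of Python. This is an illustration, not part of the original post: `flag` and `REGIONAL_INDICATOR_A` are hypothetical names, and the function only maps letters to Regional Indicator Symbols without checking that the code is an assigned ISO 3166-1 code.

```python
# U+1F1E6 is REGIONAL INDICATOR SYMBOL LETTER A; B..Z follow contiguously.
REGIONAL_INDICATOR_A = 0x1F1E6


def flag(country_code: str) -> str:
    """Map a two-letter country code to its pair of Regional Indicator Symbols."""
    code = country_code.upper()
    if len(code) != 2 or not all("A" <= c <= "Z" for c in code):
        raise ValueError(f"expected two ASCII letters, got {country_code!r}")
    return "".join(chr(REGIONAL_INDICATOR_A + ord(c) - ord("A")) for c in code)


if __name__ == "__main__":
    fr = flag("FR")
    # Two code points, rendered as a single flag where the font supports it.
    print(fr, [f"U+{ord(ch):04X}" for ch in fr])
```

Whether the pair is drawn as a flag or as two boxed letters is up to the font and platform.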
-
Beware of blank lines and white spaces — Supply-chain attack using invisible code hits GitHub and other repositories
Unicode that’s invisible to the human eye was largely abandoned - until attackers took notice.
#hacking #blankline #whitespace #github #supplychain #unicode #hack #git #code #coding #invisible #gitrepo #itsecurity #it #itsec
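A hedged Python sketch of how a source file could be scanned for characters that normally render as nothing. The selection criteria are an assumption made for illustration (format characters, private-use characters and variation selectors); the post does not say which characters the attack relies on, and the function names are hypothetical.

```python
import unicodedata

# Assumption: variation selectors (U+FE00..U+FE0F, U+E0100..U+E01EF) are worth flagging.
VARIATION_SELECTOR_RANGES = ((0xFE00, 0xFE0F), (0xE0100, 0xE01EF))


def is_suspicious(ch: str) -> bool:
    """True for variation selectors, format (Cf) and private-use (Co) characters."""
    cp = ord(ch)
    if any(lo <= cp <= hi for lo, hi in VARIATION_SELECTOR_RANGES):
        return True
    return unicodedata.category(ch) in {"Cf", "Co"}


def find_invisible(text: str) -> list[tuple[int, str]]:
    """Return (index, 'U+XXXX') for every suspicious character in `text`."""
    return [(i, f"U+{ord(ch):04X}") for i, ch in enumerate(text) if is_suspicious(ch)]


if __name__ == "__main__":
    line = "ab\u200bc\ufe0f"  # hides ZERO WIDTH SPACE and VARIATION SELECTOR-16
    print(find_invisible(line))
```

This heuristic produces false positives: emoji sequences legitimately contain U+FE0F and ZERO WIDTH JOINER, so a real check would need an allow-list.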
-
When I say "IT mostly just runs in circles" I mean it: https://arstechnica.com/security/2026/03/supply-chain-attack-using-invisible-code-hits-github-and-other-repositories/
This article from 2026 describes something I was fighting ~17 years ago. Sure, a slightly more clever payload and a different delivery method, but in principle nothing new: https://github.com/MichalBryxi/Apache-fork-hack-finder-cleaner/tree/master
-
Last year, @MoritzBrouhaha told us the story of the Unicode computing standard, used by everyone around the globe in our daily communications. Watch or rewatch the talk online.
https://www.paris-web.fr/2025/conference/a-la-decouverte-du-monde-au-travers-de-lunicode
-
Thanks to @jlhwung, the so beautifully crafted 'BabelStone Han' font by Andrew West (魏安), is alive and well!
The latest version 17.0.0, made of 'BabelStoneHanBasic.ttf' and 'BabelStoneHanExtra.ttf', is available from:
🔗 https://github.com/babelstone/babelstonehan-ufo/releases/latest
-
Musqueam language literally uses the North American Phonetic Alphabet.
Is #Unicode going to add a hən̓q̓əmin̓əm̓ block?
- Of course not!
Saanich language uses a modified version of IPA.
Is #Unicode going to add a SENĆOŦEN block?
- Of course not! Saanich gets five supplementary characters and they'll be happy about it.
Would it be possible to represent both of these phonetic alphabets by sharing the same #IPA code points?
Yes! You would literally just need to change the fonts.
-
CW: ALL CAPS TO IMPLY YELLING
NOT EVERY LANGUAGE HAS AN ALPHABET.
THERE ARE EXISTING LANGUAGES TODAY THAT JUST WRITE DOWN THE SOUNDS IN IPA.
WHAT ARE THE PEOPLE WHO USE THESE LANGUAGES SUPPOSED TO DO WITHOUT IPA IN UNICODE.
WE COULD HAVE A BASICALLY UNIVERSAL ALPHABET IN UNICODE.
YOU COULD CONVERT BETWEEN PHONETIC-BASED SCRIPTS BY CHANGING A FONT.
-
IPA uses a basically random assortment of characters from whatever existing Unicode blocks had similarly-shaped scripts.
There's no consistent IPA in Unicode. Just a patchwork.
Why does any of this matter?
Well, for one, it makes linguistics more difficult.
Unicode is fine with adding a bunch of dead or even undeciphered languages to Unicode to help out academics, but linguists I guess can get fucked.
But also there's a bigger and more obvious problem.
-
#IPA is the alphabet used to less ambiguously represent sounds.
Just like Latin, Greek, and Cyrillic, it's an alphabet.
The IPA "a" doesn't have the same meaning as the Latin "a" or the Cyrillic "а". Instead it represents the "open front unrounded vowel".
https://en.wikipedia.org/wiki/Open_front_unrounded_vowel
So what #Unicode character are IPA users supposed to use?
Just the Latin one.
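The point about look-alike letters can be checked with Python's standard `unicodedata` module. A minimal sketch (`describe` is an illustrative name): the Latin and Cyrillic letters are separate code points even though they usually look identical, while, as the post says, IPA text reuses the Latin one.

```python
import unicodedata


def describe(ch: str) -> str:
    """Return 'U+XXXX NAME' for a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


if __name__ == "__main__":
    latin_a = "a"          # also what IPA text uses for the open front unrounded vowel
    cyrillic_a = "\u0430"  # visually similar, but a different code point
    print(describe(latin_a))
    print(describe(cyrillic_a))
    print(latin_a == cyrillic_a)
```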
-
I like #Unicode.
If you happened to have followed me on Twitter, you'll know that I know way more about how emoji work than most people.
But holy crap, did Unicode manage to mess up how they handled #IPA.
For anybody who knows what this means: I think Unicode's handling of IPA is a more serious stumble than CJK Unification.
-
RE: https://mastodon.social/@mikaeru/115502251163025356
The latest post on the Unicode Consortium blog gives an exhaustive list of all the new Unicode properties in regular expressions (regex), and explains why all the supported properties are so important and can be so useful:
https://blog.unicode.org/2026/03/uts-18-more-unicode-properties-in.html
-
@[email protected]
Seeing as you're making dice that are fighting the current situation in the USA right now, I think this would be up your alley: https://stgiga.github.io/gigaware/TarouijaD120files.zip It is a 3D model, with an OpenSCAD file for tweaks, of a d120 that, instead of the numbers 1-120, has extended Tarot and extended Ouija as its symbols, via Unicode shenanigans, following this mapping: https://www.reddit.com/r/d120Lists/comments/17mr2uv/d120_tarot_and_spirit_board/
Roll: Result
1: Ace of Spades
2: Two of Spades
3: Three of Spades
4: Four of Spades
5: Five of Spades
6: Six of Spades
7: Seven of Spades
8: Eight of Spades
9: Nine of Spades
10: Ten of Spades
11: Jack of Spades
12: Knight of Spades
13: Queen of Spades
14: King of Spades
15: Ace of Hearts
16: Two of Hearts
17: Three of Hearts
18: Four of Hearts
19: Five of Hearts
20: Six of Hearts
21: Seven of Hearts
22: Eight of Hearts
23: Nine of Hearts
24: Ten of Hearts
25: Jack of Hearts
26: Knight of Hearts
27: Queen of Hearts
28: King of Hearts
29: Ace of Diamonds
30: Two of Diamonds
31: Three of Diamonds
32: Four of Diamonds
33: Five of Diamonds
34: Six of Diamonds
35: Seven of Diamonds
36: Eight of Diamonds
37: Nine of Diamonds
38: Ten of Diamonds
39: Jack of Diamonds
40: Knight of Diamonds
41: Queen of Diamonds
42: King of Diamonds
43: Black Joker
44: Ace of Clubs
45: Two of Clubs
46: Three of Clubs
47: Four of Clubs
48: Five of Clubs
49: Six of Clubs
50: Seven of Clubs
51: Eight of Clubs
52: Nine of Clubs
53: Ten of Clubs
54: Jack of Clubs
55: Knight of Clubs
56: Queen of Clubs
57: King of Clubs
58: White Joker
59: Fool
60: Individual
61: Childhood
62: Youth
63: Maturity
64: Old Age
65: Morning
66: Afternoon
67: Evening
68: Night
69: Earth and Air
70: Water and Fire
71: Dance
72: Shopping
73: Open Air
74: Visual Arts
75: Spring
76: Summer
77: Autumn
78: Winter
79: The Game
80: Collective
81: 0
82: 1
83: 2
84: 3
85: 4
86: 5
87: 6
88: 7
89: 8
90: 9
91: A
92: B
93: C
94: D
95: E
96: F
97: G
98: H
99: I
100: J
101: K
102: L
103: M
104: N
105: O
106: P
107: Q
108: R
109: S
110: T
111: U
112: V
113: W
114: X
115: Y
116: Z
117: Yes
118: No
119: Hello
120: Goodbye
And in Unicode
🂡🂢🂣🂤🂥🂦🂧🂨🂩🂪🂫🂬🂭🂮🂱🂲🂳🂴🂵🂶🂷🂸🂹🂺🂻🂼🂽🂾🃁🃂🃃🃄🃅🃆🃇🃈🃉🃊🃋🃌🃍🃎🃏🃑🃒🃓🃔🃕🃖🃗🃘🃙🃚🃛🃜🃝🃞🃟🃠🃡🃢🃣🃤🃥🃦🃧🃨🃩🃪🃫🃬🃭🃮🃯🃰🃱🃲🃳🃴🃵𝟶𝟷𝟸𝟹𝟺𝟻𝟼𝟽𝟾𝟿𝙰𝙱𝙲𝙳𝙴𝙵𝙶𝙷𝙸𝙹𝙺𝙻𝙼𝙽𝙾𝙿𝚀𝚁𝚂𝚃𝚄𝚅𝚆𝚇𝚈𝚉👍👎⎆⎋
The first section of characters is the contents of the Playing Cards block in Unicode, minus Red Joker (white is kept) and Playing Card Back. That means the 52 cards of an English/American deck, the Black and White Jokers, and Tarot's four Knight cards, so 58 cards (these are basically a graphical suit with the value above it, in a 12pt cell), plus the 22 cards of the Major Arcana, with "Fool" as XXII, as is done on some decks. That section is rendered as a 12pt card with Roman numerals I through XXII, with IX and XI having disambiguation dots. The names I used for the cards are the alias names Unicode gives them. So no "The Hanged Man", and none of the generic numbered-only names that Unicode gives as the official code point names.
After that come Ouija's 0-9 and uppercase A-Z, using Unicode's Mathematical Monospace characters (Courier) from the Mathematical Alphanumeric Symbols block, in order to fit the 1800s playbill font commonly seen on Ouija boards, also at 12pt.
Now the next ones are the interesting ones. To represent Yes and No, I used the Thumbs-Up and Thumbs-Down emoji respectively, and the really interesting part is what I did for Hello and Goodbye. For those, I used two characters from the Miscellaneous Technical block, namely the Enter Symbol and the Escape Symbol, both seen on old Mac keyboards. The first one is a diamond with an arrow pointing inwards, and the second one is a circle with an arrow pointing outwards. The metaphor here is that "Hello" is entering a conversation, and "Goodbye" is leaving one, obviously with a spirit.
All this fills ALL 120 slots on a d120, with no empty or duplicate entries: a unique glyph for each side. The only fonts usable for this, by the way, are Unifont Smooth (bundled) or UnifontEX. No other font, even Unifont itself, has all the characters together, because the Hello and Goodbye symbols are in Plane 0 while the rest of the characters are in Plane 1 AND even include emoji, never mind that some fonts do not support the Major Arcana part of the Playing Cards block. So basically, you're stuck with these two forks of GNU Unifont, but UnifontEX is a pixel font and so is not exactly a fitting theme unless you're a hacker like I am.
Plus, by sheer chance, ALL the characters turned out fine after vectorization (though White Joker's J is too skeletal in the loop), something that related characters have trouble with (some of the other stuff in the same block as the thumbs-up and thumbs-down emoji didn't vectorize well). I was very pleasantly surprised that the emoji and the Roman numerals turned out fine. But ultimately this was a feat of engineering I did when I was bored, from 2023 to now.
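The "120 unique glyphs, two in Plane 0 and the rest in Plane 1" claim can be checked with a short Python sketch. The code point ranges below are my reading of the blocks named in the post (Playing Cards, Mathematical Monospace, the two thumbs emoji, U+2386 and U+238B), not data shipped with the model, and all function names are illustrative.

```python
from collections import Counter


def code_point_range(first: int, last: int) -> list[str]:
    """Characters from `first` to `last`, inclusive."""
    return [chr(cp) for cp in range(first, last + 1)]


def tarouija_glyphs() -> list[str]:
    """Assumed glyph list for the d120, in roll order 1..120."""
    glyphs: list[str] = []
    glyphs += code_point_range(0x1F0A1, 0x1F0AE)  # Spades, Ace..King incl. Knight
    glyphs += code_point_range(0x1F0B1, 0x1F0BE)  # Hearts (Red Joker U+1F0BF skipped)
    glyphs += code_point_range(0x1F0C1, 0x1F0CE)  # Diamonds
    glyphs.append("\U0001F0CF")                   # Black Joker
    glyphs += code_point_range(0x1F0D1, 0x1F0DE)  # Clubs
    glyphs.append("\U0001F0DF")                   # White Joker
    glyphs += code_point_range(0x1F0E0, 0x1F0F5)  # Fool plus the 21 numbered trumps
    glyphs += code_point_range(0x1D7F6, 0x1D7FF)  # Mathematical Monospace digits 0-9
    glyphs += code_point_range(0x1D670, 0x1D689)  # Mathematical Monospace capitals A-Z
    glyphs += ["\U0001F44D", "\U0001F44E"]        # Yes / No (thumbs up / thumbs down)
    glyphs += ["\u2386", "\u238B"]                # Hello / Goodbye (Enter / Escape symbols)
    return glyphs


def plane(ch: str) -> int:
    """Unicode plane of a character: code point divided by 0x10000."""
    return ord(ch) >> 16


if __name__ == "__main__":
    glyphs = tarouija_glyphs()
    print(len(glyphs), len(set(glyphs)))
    print(Counter(plane(ch) for ch in glyphs))
```

A font for this die therefore has to cover both the Basic Multilingual Plane and the Supplementary Multilingual Plane, which matches the font constraint described in the post.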
Anyways, what makes this a compelling protest product is that it combines several things that fundamentalist Christians are very prone to hating. It takes Tarot cards and Ouija boards and shoves them onto dice that are literally divisible into an entire set of common and rare TTRPG dice, on top of the shape being a D&D d20 but divided into 6 triangles (putting a d4 on each face and then dividing by 2), a D&D d12 but divided into 10 triangles for each pentagon, as well as being a derivative shape of the d30 and d60. So basically, this "Tarouija" d120 combines multiple things that fundamentalist Christians consider "demonic" into one divination ritual item and thus is a great form of protest against the religious right. For the record, I live in California. Hopefully this is interesting. Oh, and the OpenSCAD file needs the nightly build of OpenSCAD.
#dicemaking #dicemaker #dice #d120 #unicode #unifontex #tarotcard #tarotdecks #tarotcards #tarotcardsreading #ouijaboard #ouija #3d #3dp #3dprinting #3dprinter #spiritboard #majorarcana #fuckice #protest #unifont #openscad #scad #3dart #art #tech #technology #code #font #fontdev #fonts #3dmodel #3dmodeling #3dmodels #3dmodeled #computerscience #compsci #boredom #activism #ice
-
The icon of the new #Unicopedia #Sigilla application shows the provisional #Seal character U+3FBB5 whose equivalent #CJK #ideograph is U+5B57 字, meaning "letter, character, word".
-
Very interesting insights into currency symbols in Unicode and how their implementation involves decisions and actions on so many different levels:
🔗 https://blog.unicode.org/2026/02/from-central-bank-to-code-point-roadmap.html
-
Unicopedia Sigilla is a developer-oriented set of #Unicode utilities related to Seal characters, wrapped into one single app, built with #Electron.
Repository: 🔗 https://codeberg.org/tonton-pixel/unicopedia-sigilla
#Unicopedia #Seal #Characters #JavaScript #CodePoints #Glyphs #OpenSource #DesktopApplication
-
I found this reply that I made in 1984 to Dennis Ritchie in the net.followup newsgroup. I was at the time lobbying Sun to add 8-bit character set support to the firmware, but they wanted to hold out for a 16-bit system, like the as yet unnamed Unicode. There was eventually an interim solution but my memory of that is a bit foggy.
#Usenet #DennisRitchie #C #Pascal #emacs #VT100 #charactersets #ISO8859 #languages #Swedish #programming #unicode #SunMicrosystems #Värmland
-
Taught Invisible XML today to my undergrads for the first time (last year had a guest speaker do it). This is lots of fun, to write a deliberate grammar instead of grabbing at patterns haphazardly with ad hoc regular expressions. Both are fun, but ixml kinda makes you feel like a serious wielder of Unicode lore. Or a Cheshire Cat.
https://invisiblexml.org
#ixml #digitalhumanities #xml #unicode
-
All documents published by the Ideographic Research Group (IRG) are now available on the Unicode web site, and can be easily and efficiently found through the new search bar provided on the IRG homepage.
🔗 https://www.unicode.org/irg/
This long-awaited search feature is very convenient, and so useful to find what you're interested in, and even more (ah, the wonderful power of serendipity!)...
#Unicode #IRG #IdeographicResearchGroup #CJK #Ideographs #Unihan