home.social

#rfc9839 — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #rfc9839, aggregated by home.social.

  1. Ah, RFC 9839: The quest for the "less bad" #Unicode #subset 🤦‍♂️. Because, clearly, specifying which Unicode characters to avoid is humanity's greatest achievement since sliced bread. Tim Bray and Paul Hoffman bravely venture into the Unicode abyss, emerging with a groundbreaking revelation: not all characters are created equal. Who knew? 🙄
    tbray.org/ongoing/When/202x/20 #RFC9839 #LessBad #TimBray #PaulHoffman #UnicodeAbyss #HackerNews #ngated

  2. «Unicode is good. If you’re designing a data structure or protocol that has text fields, they should contain #Unicode characters encoded in #UTF8. There’s another question, though: “Which Unicode characters?” The answer is “Not all of them, please exclude some.”

    This issue keeps coming up, so [ @paulehoffman and @timbray ] put together an individual-submission draft to the IETF and now (where by “now” I mean “two years later”) it’s been published as #RFC9839. It explains which characters are bad, and why, then offers three plausible less-bad subsets that you might want to use.»

    tbray.org/ongoing/When/202x/20 by @timbray

    #programming #CharacterEncoding #LML