home.social

#ngrams — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #ngrams, aggregated by home.social.

  1. Oh, the irony! 🤖 An article that promises to demystify #Transformers with N-grams but really just masquerades as a job listing for a #DevOps Engineer at #arXiv. 📜 Because nothing says "deep understanding" like *skipping to the main content* for career opportunities. 🌟
    arxiv.org/abs/2407.12034 #Ngrams #irony #careeropportunities #HackerNews #ngated

  3. I was even thinking of using #ngrams data from marcoxbresciani.codeberg.page/ but even with those numbers, I have no idea how to use them to create a better #Italian-based #ColemakDH layout.

    Also, is it worth it?

    Hints? Ideas? Help!

    @mechanicalkeyboards

  6. @joeyh I've read elsewhere that Google's Ngrams (and book scanning) is heavily skewed toward academic publishing from roughly 1920 -- 1990 or so. It's a combination of books falling into the copyright hole, emphasis on academic corpora (e.g., University of Michigan) as major contributors to the scanning project, and the emergence of digital book formats in the very late 20th century.

    Douglas Harper of the Online Etymology Dictionary addresses this in a blog post:

    etymonline.com/columns/post/wh

    #Etymology #ngrams #GoogleNgramViewer

  7. @johnwehrle I'm defending the notion of effective and fact-based criticism here, not longtermism ...

    ... but note that the term "existential risk" LONG predates the emergence of "longtermism", and through 2000 is also far more prevalent. See screenshot, and note that "longtermism" is multiplied 3x to scale equivalently to "existential risk".

    I've strong concerns with any argument which leans heavily on such readily-refuted claims. The viewpoint may well be justified, but a bit less hyperventilating hyperbole and poor scholarship would greatly help the case.

    The notion of "existential risk" was originally applied in a religious context (by Paul Tillich) and to nuclear weapons.

    See:

    #longtermism #ExistentialRisk #GoogleNgramViewer #Ngrams #WeakArguments #EmilePTorres

  8. Pondering the Big Questions:

    When did "meet-cute" become A Thing?

    Ngram Viewer says ... mostly post-2010:

    books.google.com/ngrams/graph?

    It seems recent to me.

    (Both "meet cute" and "meet-cute" plotted. I suspect the unhyphenated version will have numerous false positives as in "meet cute (girl(s)|guy(s))".)

    #ngrams #NgramViewer

  13. On the changing of language usage patterns over time, homelessness is an interesting case.

    I'd discovered some time back that the term broke into usage suddenly in 1980. It wasn't entirely unknown before, but the concept often appeared as a compound verb, "made homeless", rather than as a noun, "homeless (man|woman|person)", and almost always as an immediate consequence of some disaster, such as a structural fire, hurricane, flood, or earthquake. Earlier terms that had been used to describe long-term lack of reliable housing include vagrant, itinerant, and the like (I'd need to look these up again).

    Part of this seems to be due to changes in how housing was approached in the US, and especially the elimination of alternatives to single-family dwellings (e.g., rooming houses, residence hotels) in many areas. But some also seems to be a linguistic, social, and political change in usage.

    Ngram: "homelessness": books.google.com/ngrams/graph?

    Ngram: "homeless, vagrant, itinerant": books.google.com/ngrams/graph?

    The message is that ngrams and the Google corpus are useful but also require interpretation.

    #ngrams #NgramViewer #homelessness

  18. Google Ngrams: "white nationalist"

    Apropos some recent discussions, I've been looking into a number of aspects of this term and matters related to it.

    Google Ngram Viewer is a powerful, if occasionally problematic, tool for exploring language and terms used within it.

    An ngram of the headline phrase of this toot ... shows an immense rise in prevalence of the term through 2019 (the most recent data in the corpus), roughly 10 times the 2010 level.

    What's driving that isn't necessarily clear --- language and usage reflect both the real-world phenomena that terms describe and preferences for certain terms over others.

    But it's attention-grabbing all the same. And a bit sobering.

    books.google.com/ngrams/graph?

    #ngrams #NgramViewer #racism

  23. In case you were wondering, Christmas is in fact doing just fine.

    If there ever was in fact a war against it, that ran from 1950--1980.

    Via Google Ngram Viewer US English Corpus

    books.google.com/ngrams/graph?

    #Christmas #Ngrams #Language

  24. College campuses, academics, and professors have been radical liberal elites ... mostly since the 1960s.

    Ngram Viewer

    #ngrams