home.social

#stylometry — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #stylometry, aggregated by home.social.

  1. #Romanistiktag #Konstanz This morning, my favourite topics in our section: romanistiktag.de/xxxix-romanis : our keynote by la famosa Laura Hernández Lopez on the morphosyntactic, stylistic and stylometric analysis of the poetry of women authors of the Siglo de Oro. #stylometry #cls #dh

  6. My own talk, which is coming up / just happened, is on #Multilingual #Stylometry.

    Based on our work for #CHR2024, we've moved on from the influence of language and translation on stylometric attribution accuracy to the influence of corpus composition. I'll be presenting both parts, but for the corpus composition issue, we are still at the stage of preliminary results.

    With a special thanks to @artjomshl.bsky.social!

    Slides: dhtrier.quarto.pub/icla/

    @rebsim #ICLA2025

  11. Now we're kicking off our "Digital Comparative Literature" track at #ICLA2025 with the first session. Three talks on social reading / Goodreads, on #multilingual #stylometry, and on visualisation of visual data.

    See the session programme here: conftool.pro/icla2025/index.ph

    @rebsim

  16. Great to be at the "Comparative Literature Goes Digital" session at #DH2025!

    Session info here: conftool.pro/dh2025/index.php?

    Full programme here: dls.hypotheses.org/1952

    Including a talk by Evgeniia Fileva, with Julia Havrylash, myself, and Artjoms Šeļa on "#Multilingual #Stylometry: The influence of corpus composition and language on the performance of authorship attribution using corpora from the European Literary Text Collection (#ELTeC)".

    #ICLA #ADHO #SIG_DLS #CLS @tcdh

  17. Reminder for those who may not realize this, but #Stylometry is kind of an insane field of study, and you can be uniquely identified based on your writing style alone.

    This has, in the past, been applied to open source developers and programming code too, and it was found that using stylometry techniques you can identify the author of a compiled binary based on their open source code style ~78% of the time.

    https://arxiv.org/pdf/1512.08546v1

    Luckily, there are some techniques to avoid this, which involve fairly basic changes to your writing style and structure that can very effectively anonymize things again:

    https://en.wikipedia.org/wiki/Adversarial_stylometry

  18. In our #StabiLab you can try out Digital Humanities hands-on! On Tuesday, 21 January, you'll learn with us how to explore literature using the tool #Stylo 👉 sbb.berlin/59m32

    #StabiBerlin #Stylometry #Stylometrie #Digitalisierung #Forschung #Workshop

  23. Later today at #CHR2024, we are going to present our work on #Multilingual #Stylometry!

    We isolated the influence of #language on #authorship #attribution #accuracy by translating multiple #corpora into each other's languages while keeping #corpus composition stable.

    Interactive showcase: showcases.clsinfra.io/stylomet

    Full paper: ceur-ws.org/Vol-3834/paper9.pd

    This work was developed within the @CLSinfra project in #Trier, #Krakow and #Prague with Artjoms Šeļa, Evgeniia Fileva and Julia Dudar.

  24. Agapitos and van Cranenburgh use computational #stylometry to show that while 'Octavia' and 'Hercules Oetaeus' were largely written by #Seneca, a closer analysis of the text segments reveals signs of mixed #authorship. doi.org/10.48694/jcls.3919 #CLS #CCLS24 #Classics #AuthorshipVerification

  25. Look what landed on my doorstep 😍 The book is also available #OpenAccess online at #heiUP: heiup.uni-heidelberg.de/catalo. I would like to thank the very patient editors, who had to deal with switching the publisher and coming up with ways to improve the quality of the illustrations in my article about #stylometry in #French and #Spanish for #Picasso's writings: @christof @josecalvo @u_henny, Robert Hesselbach and Daniel Schlör

  26. New #paper out: « Code #stylometry vs formatting and minification » peerj.com/articles/cs-2142/ , where we show how resistant current code stylometry techniques (i.e., techniques for automatically detecting the author of a source code snippet) are to automatic code formatting and minification. (Spoiler: quite a bit; authors can still be identified after those source-to-source transformations.) Available #openaccess on #PeerJ CS.

  27. Interesting! Dominika Weronska on "A Stylometric Glance at Basque Novels" at #DH2024. #stylometry

    The author did stylometric analyses on 57 Basque novels, a first!

  28. Now up at #DH2024, Maciej Eder, developer of #stylo and co-organizer of #DH2016 in #Krakow, on various distance measures for #Stylometry: "Manhattan, Euclidean and their Siblings. Exploring Exotic Measures of Text Similarities...".

    Key idea: Manhattan distance is L1-norm based, Euclidean is L2. But we can vary this parameter for a wide range of values, from 0.1 to 10. Then evaluate accuracy for authorship attribution.

    Result: For longer vectors, it pays off to use a value of less than 1!
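
The parameterised family described in this post is commonly known as the Minkowski (L_p) distance. A minimal sketch, assuming plain NumPy feature vectors such as word-frequency profiles; the function name `minkowski_distance` and the toy vectors are illustrative, and the talk's actual preprocessing (for example any scaling of the frequencies) is not stated in the post:

```python
import numpy as np


def minkowski_distance(a, b, p):
    """L_p distance between two feature vectors.

    p = 1 gives the Manhattan distance, p = 2 the Euclidean distance.
    Values of p below 1 are allowed here as well (hypothetical helper,
    not the API of any stylometry package).
    """
    if p <= 0:
        raise ValueError("p must be positive")
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Sum the p-th powers of the absolute differences, then take the p-th root.
    return float(np.sum(np.abs(a - b) ** p) ** (1.0 / p))


# Toy vectors, purely illustrative.
x = [0.0, 0.0]
y = [3.0, 4.0]

# Sweep the parameter over part of the range mentioned in the post.
for p in (0.1, 0.5, 1, 2, 10):
    print(p, minkowski_distance(x, y, p))
```

With p = 1 the toy vectors are 7 apart, with p = 2 they are 5 apart. For p below 1 the function is no longer a true metric, since the triangle inequality can fail, but it can still be used to rank candidate texts by similarity.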

  33. Just ran a quick test: stylo() can distinguish the different verse forms in Goethe quite reliably: dramas in alexandrines, Knittelvers, blank verse and mixed verse, as well as the two hexametric epics.

    (Full texts via #DraCor and @gutenberg_org, respectively.)

    #DigitalHumanities #Stylometry

  34. @dvergano … until you start using techniques to defend against #stylometry

    whonix.org/wiki/Stylometry

    (One of the many reasons I love and support the #whonix project)

  35. @jcls Another paper we would like to highlight, again for the lovers of #novels

    Dorothy Henriette Modrall Sperling, Mike Kestemont & Vincent Neyt (2023), “The Authorship of Stephen King’s Books Written Under the Pseudonym “Richard #Bachman”: A Stylometric Analysis”, Journal of Computational Literary Studies 2(1), 1–35. doi: doi.org/10.48694/jcls.3594

    Keywords: #Stephen_King, #stylometry, #pop_culture, #authorship verification, contemporary English-language #fiction

  36. This next paper is about #stylometry in a #translation setting involving novels in #Swedish and #Dutch:

    Martje Wijers (2023), “Why the Daisy sisters are different. A stylometric study on the oeuvre of Swedish author Henning #Mankell and the Dutch translations of his work”, Journal of Computational Literary Studies 2 (1), 1–27. doi: doi.org/10.48694/jcls.3585

    Keywords: #stylometry, #cluster analysis, #PCA, #delta, #zeta, #translation

  37. AI Forensic Linguistic Circuit Boards

    So how’s that AI analysis coming along on the Zodiac Killer cards and letters?

    Don Seawater posts to OPORDAnalytical.com, Nov. 12, 2013, explaining there are as many as “40-50 letters” from Arthur Leigh Allen to Phyllis Seawater.

    Well, come on, Law Enforcement and Netflix, what is taking so long?!

    — Please remind me again on how the Unabomber was eventually identified. —

    The community Forensic Linguists are ready . . .

    Data! Data! We cannot make bricks without clay (or straw).

    BRING ON THE LETTERS!

    Here’s some Ackerman Industry analysis for you. Check out these forensic linguistic circuit boards.

    Any questions?

    People,

    (“If it . . . quacks like a duck . . .”)

    The Zodiac killer was Arthur Leigh Allen.

    Word Choice Not By Chance! Phraseology Not Random! Non-Contextual AND Contextual Idiolect Matches Not Accidental!

    Aston, Berkeley, Cambridge, Cardiff, Carnegie Mellon, Chicago, Fresno State, Glasgow, Harvard, Hofstra, ILE, MIT, Princeton, Stanford, UMass, Yale, York . . . ?

    What say ye? . . .

    Put this in your stylometric chatbots and vape it.

    [Performs mic drop with disembodied robot arm in place of mic]

    Arthur Leigh Allen’s First letter (p.1) from Atascadero State Hospital
    to Phyllis Hensley Seawater (1976)
    R.P. Ackerman performs Diction and Phraseology Parallel Analysis on scans from originals
    uploaded to YouTube by the Seawater family, Oct. 26, 2021

    Arthur Leigh Allen’s First letter (p.2) from Atascadero State Hospital
    to Phyllis Hensley Seawater (1976)
    R.P. Ackerman performs Diction and Phraseology Parallel Analysis on scans from originals
    uploaded to YouTube by the Seawater family, Oct. 26, 2021

    Arthur Leigh Allen’s Second letter (six pages) from Atascadero State Hospital
    to Phyllis Hensley Seawater (Dec. 7, 1976)
    R.P. Ackerman performs Diction and Phraseology Parallel Analysis on scans from originals
    uploaded to YouTube by the Seawater family, Oct. 31, 2021

    Arthur Leigh Allen’s letter to Phyllis Hensley Seawater re: Phone Tapping and Melvin Belli
    (June 13, 1992 – 2 ½ mos. before Allen’s death)
    R.P. Ackerman performs Diction and Phraseology Parallel Analysis on scans from originals
    uploaded to YouTube by the Seawater family, Nov. 8, 2021

    © 2024-2026 Robert Peter Ackerman
    zodiacconfessed.wordpress.com

    #AI #allen #arthur #articulation #ArtificialIntelligence #author #benicia #bot #cards #chatbot #collocation #corpus #envelopes #evidence #ForensicCriminology #ForensicLinguistics #gaviota #idiolect #lee #leigh #letters #linguistic #machineLearning #mailed #murder #Napa #neuralNetwork #phoneme #phonic #psycholinguistics #Riverside #SanFrancisco #scrape #scraping #sent #serial #solution #stylometry #Vallejo #WebDataExtraction #WebHarvesting #WebScraping #writing #ZodiacKiller

  38. Very happy to participate in today's workshop on "Potentials and Limits of #Stylometry for Early Modern Text in #Romance Languages". It's co-organized by the "Pamphlets and Patrons" #PAPA project in Early Modern French History and the Trier Center for Digital Humanities @tcdh today.

    The programme is here: tcdh.uni-trier.de/en/event/hyb

    #CLS #Romanistik #Trier