home.social

581 results for “shawnzone”

  1. Wolfgang Kircheis is presenting "Mining the History Sections of Wikipedia Articles" at the #JCDL2023 AI / ML / Entity Extraction session

  2. @ibnesayeed is kicking off the final #JCDL2023 paper session: "AI / ML / Entity Extraction"

  3. Now we have the #JCDL2023 panel “Who can submit an excellent review for this manuscript in the next 30 days? — Peer Reviewing in the age of overload” with Hamed Alhoori, Edward A. Fox, Ingo Frommholz, Haiming Liu, Corinna Coupette, Bastian A. Rieck, Tirthankar Ghosal, and Jian Wu

  4. At #JCDL2023 Triet Ho Anh Doan presented "MINE - A Text Analysis Service for Digital Humanities Scientists", closing out the Digital Humanities and Teaching session

    Demo: mine-graph.de/
    Python package: pypi.org/project/minetext/

  5. Pavlos Fafalios is presenting "FastCat Catalogues: Interactive Entity-based Exploratory Analysis of Archival Documents" #jcdl2023

    Preprint: arxiv.org/abs/2302.02635
    Demo: catalogues.sealitproject.eu/
    Code: github.com/isl/FastCat-Catalog

  6. Do you want to discuss and explore a new or emerging issue? Bring together communities of interest at #JCDL2023 for a workshop or tutorial. Our Call for Workshops and Tutorials is live! Submission deadline is February 12, 2023.

    #RethinkingDigitalRecords

    2023.jcdl.org/calls/workshops-

  7. This #sunday, I signed up to test the @ivory #macOS #Mastodon Desktop client (Ivory for Mac) through #Apple's #TestFlight.

    I'm enjoying the experience. It’s like the experience I have on #iOS and #iPadOS, but it has support for windowing, keyboard sequences, and more.

    It is quite polished! Kudos to @tapbots! I look forward to its release!

    #LeaveTwitterDay #Fediverse #TwitterMigration #IdesOfMusk #MastodonMigration #JoinMastodon #JohnMastodon #LeaveTwitter #TwitterExodus

    tapbots.com/ivory/mac/

  8. Having another post-holiday, pre-caffeine morning over here, as I stand at the kitchen counter, tea bag in cup, hot water unpoured, breakfast ingredients out and half-assembled. Forgot what I was doing while scrolling Mastodon.

    Welcome to Tuesday, I guess.

    #DayInTheLife #WhatAmIDoingHere #CaffeineAddiction

  9. At #TPDL2023 right now, @martinklein is presenting “It's Not Just GitHub: Identifying Data and Software Sources Included in Publications”

    The authors trained a classifier to label open-access data and software (OADS) URLs from research papers as either dataset or code. Archivists can then take these URLs and preserve the referenced datasets and code for reproducibility.

    Paper: doi.org/10.1007/978-3-031-4384
    Preprint: arxiv.org/abs/2307.14469

  10. Beatrice Alex is giving the second #TPDL2023 keynote “AI language technologies and digital collections: the need for interdisciplinary communication and co-design and training”

    * How can we invite #AI into the #archive?
    * AI can provide a lot of positive opportunities.
    * To improve its application, we need #interdisciplinary collaborations going forward.
    * AI #literacy needs to be taught early in education.

    Ref:
    * ed.ac.uk/profile/dr-beatrice-a
    * ltg.ed.ac.uk
    * ed.ac.uk/usher/clinical-natura

  11. Yesterday at #TPDL2023 Gianmaria Silvello presented “How to Cite a Web Ranking and Make it #FAIR”

    Researchers often need to cite #SearchEngine results. Unfortunately, search engines change their algorithms and their index all the time. Alessandro Lotta and Gianmaria Silvello presented a #prototype that captures this ranking in a human- and machine-readable #format and posts it to #Zenodo for citing with a #DOI.

    My suggestion: include #webarchiving and #webarchives

    Ref: doi.org/10.1007/978-3-031-4384
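
To illustrate what a human- and machine-readable capture of a ranking might look like, here is a minimal Python sketch. The field names (`query`, `engine`, `captured_at`, `results`), the `capture_ranking` helper, the example URLs, and the timestamp are all hypothetical placeholders; this is not the format used by the prototype described above, and the Zenodo deposit step is left out.

```python
import json
from datetime import datetime, timezone


def capture_ranking(query, engine, result_urls, captured_at=None):
    """Freeze a search result list into a plain, JSON-serializable record.

    The schema here is an assumption made for illustration only.
    """
    captured_at = captured_at or datetime.now(timezone.utc)
    return {
        "query": query,
        "engine": engine,
        "captured_at": captured_at.isoformat(),
        # Store the rank explicitly so the order survives any reprocessing.
        "results": [
            {"rank": rank, "url": url}
            for rank, url in enumerate(result_urls, start=1)
        ],
    }


# Placeholder query, engine name, URLs, and date.
snapshot = capture_ranking(
    "web archiving",
    "example-engine",
    ["https://example.org/a", "https://example.com/b"],
    captured_at=datetime(2023, 1, 1, tzinfo=timezone.utc),
)

print(json.dumps(snapshot, indent=2))
```

Recording the capture time alongside the ranked URLs is what makes the citation meaningful, since the same query may return a different list later.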

  12. #TPDL2023 @hkroll and Mirjam Cuper from the Institute for Information Systems presented “Aspect-Driven Structuring of Historical Dutch Newspaper Archives”

    The authors discussed the challenges of automatically organizing and structuring content in a corpus when the #OCR is unreliable, the #metadata might be inconsistent, and the #licensing restrictions dictate who can see the content.

    Ref: doi.org/10.1007/978-3-031-4384

  13. Laura Hollink is giving the first #TPDL2023 keynote “Responsible AI & GLAM: challenges and opportunities”:
    * defining “diversity” and “fairness” for #GLAM
    * producer fairness vs. user fairness
    * treatment equality vs. counterfactual fairness
    * popularity bias in recommender systems
    * publication country bias in datasets
    * identifying contentious terminology

    More information:
    * cwi.nl/en/groups/human-centere
    * cultural-ai.nl
    * aim4dem.nl

  14. Zvjezdan Penezić, Marijana Tomić, and Gianmaria Silvello are kicking off #TPDL2023!

    64 submissions
    * 39 full papers
    * 25 short papers

    Acceptance:
    * 33% of full papers were accepted for oral presentation
    * 18% of full papers were accepted as short papers
    * 10 short papers were accepted for oral presentation (40%)

    The authors of 3 best papers from the International Journal of Digital Libraries were also invited to present.

    Proceedings are available now: doi.org/10.1007/978-3-031-4384
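
As a rough check on the numbers above, the following Python sketch converts the reported percentages back into approximate paper counts. The counts it derives (about 13 and about 7 full papers) are inferred from rounded percentages, so they are estimates and not official figures.

```python
full_papers = 39
short_papers = 25

# The two categories should add up to the reported 64 submissions.
total_submissions = full_papers + short_papers

# Percentages in the post are rounded, so these counts are approximate.
full_as_oral = round(0.33 * full_papers)
full_as_short = round(0.18 * full_papers)

# Ten short papers accepted out of 25 submitted.
short_oral_rate = 10 / short_papers

print(f"Total submissions: {total_submissions}")
print(f"Full papers accepted for oral presentation: about {full_as_oral}")
print(f"Full papers accepted as short papers: about {full_as_short}")
print(f"Short paper acceptance rate: {short_oral_rate:.0%}")
```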

  15. Zadar, Croatia is pretty. The #TPDL2023 conference starts later today. I cannot wait.

    (I also have jet lag, but I’m fighting through it.)

    Conference website: tpdl2023.dei.unipd.it/index.ht

  16. @kentborg @Dianora (I’m on a plane & the WiFi is spotty.)

    I’m going to #TPDL2023 and will not live-tweet it like past conferences. I’ll post to #Mastodon and #Bluesky, but not #Twitter.

    But I ask myself, why didn’t I quit when #Musk:
    * made using the word #cisgender grounds for suspension?
    * was promoting #antivaxx, #antisemitism, #racism, #sexism & hurting #BlackTwitter?
    * suspended #journalists?

    I stayed for the people who stayed. I stayed to help promote & support them.

    #TwitterMigration

  17. The new copies of Rights of Use are in the DragonCon vendor hall, booth 1320!
    #AmReadingScifi

  18. I was listening to the Don't Panic podcast yesterday about why Diane Morrison loves military science fiction: pca.st/40qo8w98
    #MilitarySF #AmReadingScifi #IfThisGoesOnDontPanic

  19. Having a great time at the In Your Write Mind Conference so far!

    It's not too late to join for the full day tomorrow if you're near Greensburg, PA.

    #IYWM #WritingConference #MariaSnyder #SallyBosco #CarrieGessner #BrianaSmith #KevanPeterson

  20. CW: Sexual assault, Govt repression, fascists

    Just finished watching #argentina1985, about the trial of the fascist military junta that had recently been thrown out of power and had committed countless atrocities against leftists and centrist democratic folk (and, I'll add, were funded by the CIA via Operation Condor, the violent US and South American federation of fascists). It's a courtroom drama, and it's gut-wrenching. It features (possibly dramatized) testimonies from victims, so content warning: if you have trauma around sexual abuse or government repression, I'd advise evaluating whether this is the right film for you.

    But for everyone else, it's a must-watch.

    Oh, and the English dub is well done, so it's quite watchable if you don't speak Spanish.

  21. This #Caturday Malala continued her exploration of the wall dividing our stairwell from the loft on the second floor. I keep telling myself that she should survive a two-story drop, but she still makes me nervous whenever she does this.

    #Cat #CatsOfMastodon #Catstodon #GreyCatsRule #Cats #CatsOfTheFedi #GreyCat

  22. What about that metadata that is present? Grusky et al. (doi.org/10.18653/v1/N18-1065 ) realized that, because page authors create that metadata, it can serve as ground truth to evaluate #Automatic #Summarization.

    We analyzed pages from #WebArchiving and saw how this metadata evolved. By 2010 we saw a metadata explosion with the use of #Twitter Cards, Open Graph Protocol, #Facebook Tracking, and more. Things like Twitter cards created a metadata renaissance for HTML.

    Ref: doi.org/10.1109/JCDL52503.2021

  23. Social cards are generated based on #metadata present in web pages. If the author does not create the metadata, the service will not create the card. What do we do for web pages that predate this metadata?

    A lot of #Automatic #Summarization techniques can help us create the description part of social cards, but what about the image? In 2021, we found that Random Forest #MachineLearning can help choose the correct image using easy-to-calculate features.

    Ref: doi.org/10.1145/3447535.346250
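
The dependence of social cards on page metadata can be sketched in Python with only the standard library. This is a minimal illustration, assuming a service that reads Open Graph (`og:*`) and Twitter Card (`twitter:*`) `<meta>` tags; the `CardMetadataParser` class, the `build_card` helper, and the sample pages are hypothetical, and real services likely apply more fallbacks than shown here.

```python
from html.parser import HTMLParser


class CardMetadataParser(HTMLParser):
    """Collect Open Graph and Twitter Card <meta> values from a page."""

    def __init__(self):
        super().__init__()
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        # Open Graph uses property="og:..."; Twitter Cards use name="twitter:...".
        key = attrs.get("property") or attrs.get("name") or ""
        content = attrs.get("content")
        if key.startswith(("og:", "twitter:")) and content:
            # Keep the first value seen for each key.
            self.metadata.setdefault(key, content)


def build_card(html):
    """Return card fields, or None when the page carries no card metadata."""
    parser = CardMetadataParser()
    parser.feed(html)
    meta = parser.metadata
    if not meta:
        return None
    return {
        "title": meta.get("og:title") or meta.get("twitter:title"),
        "description": meta.get("og:description") or meta.get("twitter:description"),
        "image": meta.get("og:image") or meta.get("twitter:image"),
    }


# Hypothetical pages: one with card metadata, one that predates it.
modern_page = """<html><head>
<meta property="og:title" content="Example title">
<meta property="og:description" content="Example description">
<meta name="twitter:image" content="https://example.com/card.png">
</head><body></body></html>"""

older_page = "<html><head><title>No card metadata</title></head><body></body></html>"

print(build_card(modern_page))
print(build_card(older_page))
```

The second page yields no card at all, which is the gap that automatic summarization and image selection aim to fill for pages that predate this metadata.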

  24. In 2020, we developed a special tool, MementoEmbed, for generating/extracting metadata from archived web pages. We presented this tool at the Web Archiving and Digital Libraries Workshop (WADL2020).

    We found out that #Twitter, #Facebook, #Tumblr, and others could not reliably create cards for archived web pages. We use MementoEmbed’s cards in #Storytelling with our tool Raintale to create a #Visualization of this #Summarization.

    #WebArchiving #DigitalPreservation

    Ref: arxiv.org/abs/2008.00137

  25. Elon is planning to effectively kill social cards on #Twitter. Social cards were a big part of my dissertation work. I published a few papers about generating them via #ComputerVision, #NLP, and #MachineLearning because they make for nice bits of document #Summarization and #Storytelling. Now Musk wants them gone to force journalists to write articles directly on Twitter.

    Ref (paywall): fortune.com/2023/08/21/elon-mu
    Ref (article about paywalled article): 9to5mac.com/2023/08/21/twitter

    #TwitterMigration