home.social

#digitalarchiving — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #digitalarchiving, aggregated by home.social.

  1. Maintenance begins at creation, so why are we not creating better?

    by @beet_keeper

    The beats are the same. You work for government, or academia (lets face it, that’s probably where 90% of the work is) you have a deliverable; you save it; you print to PDF; you store it on an institutional repository with some metadata (or Zenodo, OSF or equivalent) and its done.

    There’s a small chance that it’s FAIR (Findable, Accessible, Interoperable, Reusable) right? It has metadata that can be discovered by an audience looking for it and can be indexed by search engines. The data is potentially accessible if published correctly. They’re not particularly interoperable or easily converted, and PDFs aren’t really designed for reuse, even if tools like Apache Tika help ease the burden of extracting artifacts. It’s just a PDF, why are we even talking about FAIR? There begins a story…

    The beats are the same, yet, we work in digital preservation, our backgrounds are in GLAM or software, why do we want to shoot ourselves in the foot? Why are we not using our skills to create better?


    #Archives #BetterPoster #ContinuumModel #createToMaintain #digipres #DigitalArchiving #DigitalContunuity #digitalLiteracy #DigitalPreservation #FAIR #FileFormats #GLAM #informationRecordsMangagement #NationalDigitalStewardshipAlliance #NDSA #OpenAccess #OpenData #PDF #RDM #ResearchDataLifecycle #RIM
  2. Maintenance begins at creation, so why are we not creating better?

    by @beet_keeper

    The beats are the same. You work for government, or academia (lets face it, that’s probably where 90% of the work is) you have a deliverable; you save it; you print to PDF; you store it on an institutional repository with some metadata (or Zenodo, OSF or equivalent) and its done.

    There’s a small chance that it’s FAIR (Findable, Accessible, Interoperable, Reusable) right? It has metadata that can be discovered by an audience looking for it and can be indexed by search engines. The data is potentially accessible if published correctly. They’re not particularly interoperable or easily converted, and PDFs aren’t really designed for reuse, even if tools like Apache Tika help ease the burden of extracting artifacts. It’s just a PDF, why are we even talking about FAIR? There begins a story…

    The beats are the same, yet, we work in digital preservation, our backgrounds are in GLAM or software, why do we want to shoot ourselves in the foot? Why are we not using our skills to create better?

    Continue reading “Maintenance begins at creation, so why are we not creating better?”


    #Archives #BetterPoster #ContinuumModel #createToMaintain #digipres #DigitalArchiving #DigitalContunuity #digitalLiteracy #DigitalPreservation #FAIR #FileFormats #GLAM #informationRecordsMangagement #NationalDigitalStewardshipAlliance #NDSA #OpenAccess #OpenData #PDF #RDM #ResearchDataLifecycle #RIM
  3. Maintenance begins at creation, so why are we not creating better?

    by @beet_keeper

    The beats are the same. You work for government, or academia (lets face it, that’s probably where 90% of the work is) you have a deliverable; you save it; you print to PDF; you store it on an institutional repository with some metadata (or Zenodo, OSF or equivalent) and its done.

    There’s a small chance that it’s FAIR (Findable, Accessible, Interoperable, Reusable) right? It has metadata that can be discovered by an audience looking for it and can be indexed by search engines. The data is potentially accessible if published correctly. They’re not particularly interoperable or easily converted, and PDFs aren’t really designed for reuse, even if tools like Apache Tika help ease the burden of extracting artifacts. It’s just a PDF, why are we even talking about FAIR? There begins a story…

    The beats are the same, yet, we work in digital preservation, our backgrounds are in GLAM or software, why do we want to shoot ourselves in the foot? Why are we not using our skills to create better?


    #Archives #ContinuumModel #createToMaintain #digipres #DigitalArchiving #DigitalContunuity #digitalLiteracy #DigitalPreservation #FAIR #FileFormats #GLAM #informationRecordsMangagement #NationalDigitalStewardshipAlliance #NDSA #OpenData #PDF #RDM #ResearchDataLifecycle #RIM
  4. Maintenance begins at creation, so why are we not creating better?

    by @beet_keeper

    The beats are the same. You work for government, or academia (lets face it, that’s probably where 90% of the work is) you have a deliverable; you save it; you print to PDF; you store it on an institutional repository with some metadata (or Zenodo, OSF or equivalent) and its done.

    There’s a small chance that it’s FAIR (Findable, Accessible, Interoperable, Reusable) right? It has metadata that can be discovered by an audience looking for it and can be indexed by search engines. The data is potentially accessible if published correctly. They’re not particularly interoperable or easily converted, and PDFs aren’t really designed for reuse, even if tools like Apache Tika help ease the burden of extracting artifacts. It’s just a PDF, why are we even talking about FAIR? There begins a story…

    The beats are the same, yet, we work in digital preservation, our backgrounds are in GLAM or software, why do we want to shoot ourselves in the foot? Why are we not using our skills to create better?


    #Archives #ContinuumModel #createToMaintain #digipres #DigitalArchiving #DigitalContunuity #digitalLiteracy #DigitalPreservation #FAIR #FileFormats #GLAM #informationRecordsMangagement #NationalDigitalStewardshipAlliance #NDSA #OpenData #PDF #RDM #ResearchDataLifecycle #RIM
  5. Maintenance begins at creation, so why are we not creating better?

    by @beet_keeper

    The beats are the same. You work for government, or academia (lets face it, that’s probably where 90% of the work is) you have a deliverable; you save it; you print to PDF; you store it on an institutional repository with some metadata (or Zenodo, OSF or equivalent) and its done.

    There’s a small chance that it’s FAIR (Findable, Accessible, Interoperable, Reusable) right? It has metadata that can be discovered by an audience looking for it and can be indexed by search engines. The data is potentially accessible if published correctly. They’re not particularly interoperable or easily converted, and PDFs aren’t really designed for reuse, even if tools like Apache Tika help ease the burden of extracting artifacts. It’s just a PDF, why are we even talking about FAIR? There begins a story…

    The beats are the same, yet, we work in digital preservation, our backgrounds are in GLAM or software, why do we want to shoot ourselves in the foot? Why are we not using our skills to create better?


    #Archives #BetterPoster #ContinuumModel #createToMaintain #digipres #DigitalArchiving #DigitalContunuity #digitalLiteracy #DigitalPreservation #FAIR #FileFormats #GLAM #informationRecordsMangagement #NationalDigitalStewardshipAlliance #NDSA #OpenAccess #OpenData #PDF #RDM #ResearchDataLifecycle #RIM
  6. Digital dark times: Salaries in digital preservation

    by @beet_keeper

    The Serpentine is one of the world’s most renowned art galleries. Their exhibitions as varied as Gerhard Richter, Damien Hirst, and Marina Abramović. They don’t hold a permanent collection, instead, they provide a space for temporary collections and an annual pavilion, the pavilion designed by luminaries such as Zaha Hadid, Frank Gehry, and Ai Weiwei.

    Given a recent job posting it looks like they are looking at maintaining their memory better and branching out into digital preservation.

    Here’s the kicker — its salary band is GBP 35,000 to GBP 38,000. So it must be an entry level position, especially in London, right?

    Well, let’s see what they want you to do for that price tag…


    #Archives #Art #artGalleries #artHistory #ArtsCouncilEngland #career #digipres #DigitalArchiving #DigitalHumanities #DigitalPreservation #digitalPreservationIsPeople #dpip #earlyCareer #employment #GLAM #jobPostings #london #outreach #salaries #Serpentine #SerpentineGalleries #SerpentineGallery
  7. Digital dark times: Salaries in digital preservation

    by @beet_keeper

    The Serpentine is one of the world’s most renowned art galleries. Their exhibitions as varied as Gerhard Richter, Damien Hirst, and Marina Abramović. They don’t hold a permanent collection, instead, they provide a space for temporary collections and an annual pavilion, the pavilion designed by luminaries such as Zaha Hadid, Frank Gehry, and Ai Weiwei.

    Given a recent job posting it looks like they are looking at maintaining their memory better and branching out into digital preservation.

    Here’s the kicker — its salary band is GBP 35,000 to GBP 38,000. So it must be an entry level position, especially in London, right?

    Well, let’s see what they want you to do for that price tag…


    #Archives #Art #artGalleries #artHistory #ArtsCouncilEngland #career #digipres #DigitalArchiving #DigitalHumanities #DigitalPreservation #digitalPreservationIsPeople #dpip #earlyCareer #employment #GLAM #jobPostings #london #outreach #salaries #Serpentine #SerpentineGalleries #SerpentineGallery
  8. Digital dark times: Salaries in digital preservation

    by @beet_keeper

    The Serpentine is one of the world’s most renowned art galleries. Their exhibitions as varied as Gerhard Richter, Damien Hirst, and Marina Abramović. They don’t hold a permanent collection, instead, they provide a space for temporary collections and an annual pavilion, the pavilion designed by luminaries such as Zaha Hadid, Frank Gehry, and Ai Weiwei.

    Given a recent job posting it looks like they are looking at maintaining their memory better and branching out into digital preservation.

    Here’s the kicker — its salary band is GBP 35,000 to GBP 38,000. So it must be an entry level position, especially in London, right?

    Well, let’s see what they want you to do for that price tag…


    #Archives #Art #artGalleries #artHistory #ArtsCouncilEngland #career #digipres #DigitalArchiving #DigitalHumanities #DigitalPreservation #digitalPreservationIsPeople #dpip #earlyCareer #employment #GLAM #jobPostings #london #outreach #salaries #Serpentine #SerpentineGalleries #SerpentineGallery
  9. Densho: Building the Densho Digital Repository: Three Decades of Digital Preservation. “How did Densho’s digital archives begin, and how have they evolved over nearly three decades? Densho Archives Director Caitlin Oiye Coon traces the journey from the creation of Densho’s first ‘Digital Archive’ in 1998 to today’s ‘Densho Digital Repository,’ highlighting the people, technologies, and […]

    https://rbfirehose.com/2026/03/11/building-the-densho-digital-repository-three-decades-of-digital-preservation-densho/
  10. Internet Archive Blog: Recording Now Available from “Protect Our Future Memory” Webinar. “Held on January 27, the event brought together legal experts, library leaders, and advocates to talk about Our Future Memory and the global coalition working to secure the protections that memory institutions need in our increasingly digital and networked world.”

    https://rbfirehose.com/2026/02/08/internet-archive-blog-recording-now-available-from-protect-our-future-memory-webinar/
  11. Evolving Web: Designing a digital archive in partnership with an Indigenous community. “At Evolving Web, we recently collaborated with the University of Denver on the Our Stories, Our Medicine Archive (OSOMA), a community-owned digital archive that centres traditional Indigenous knowledge related to health, wellness, culture, and identity. Built in close collaboration with community partners, […]

    https://rbfirehose.com/2026/02/01/evolving-web-designing-a-digital-archive-in-partnership-with-an-indigenous-community/
  12. "The FBI is attempting to unmask the owner behind archive.today, a popular archiving site that is also regularly used to bypass paywalls on the internet and to avoid sending traffic to the original publishers of web content, according to a subpoena posted by the website. The FBI subpoena says it is part of a criminal investigation, though it does not provide any details about what alleged crime is being investigated. Archive.today is also popularly known by several of its mirrors, including archive.is and archive.ph.

    The subpoena, which was posted on X by archive.today on October 30, was sent by the FBI to Tucows, a popular Canadian domain registrar. It demands that Tucows give the FBI the “customer or subscriber name, address of service, and billing address” and other information about the “customer behind archive.today.”"

    404media.co/fbi-tries-to-unmas

    #DigitalArchiving #Archiving #Paywalls #FBI

  13. Indicator: The Indicator Guide to tools for capturing webpages and social media content. “We tested 11 tools ranging from full-featured continuous capture apps to one-off screenshot extensions for grabbing long webpages.”

    https://rbfirehose.com/2025/09/23/indicator-the-indicator-guide-to-tools-for-capturing-webpages-and-social-media-content/

  14. ⚡️Linkwarden: The Self-Hosted Bookmark Manager That Solved a Problem I Didn’t Know I Had

    Thank you, Linux Unplugged and Jupiter Broadcasting @ironicbadger, for introducing me to Linkwarden—a FOSS gem that will change how I save, share, and preserve the web.

    Like many of you, I’ve been using browser bookmarks for years. I’d save articles, tutorials, and interesting links, only to find them gone when I finally got around to reading them. Link rot is real, and it’s frustrating. But until I heard about Linkwarden linkwarden.app/ on Linux Unplugged jupiterbroadcasting.com/, I didn’t realize how much I needed a better solution.

    I used to think, “Browser bookmarks are fine,” and honestly, backing them up manually from time to time isn’t a real trouble—just a slight inconvenience. My problem is that I experience massive link rot when looking into two-year-old links, often with interesting subjects on small sites—they are often just gone when I want to recall them. The problem is that saving the link isn’t saving any of the information.

    But Linkwarden @linkwarden isn’t just another bookmark manager—it’s a preservation powerhouse, a collaborative hub, and a self-hosted dream. And thanks to the folks at Jupiter Broadcasting, I now understand why it’s a game-changer.

    I haven’t started hosting it yet, but I definitely will, and I hope some of you out there will find it useful too.
    Thanks to @daniel31x13 for making a awesome tool :heart_cyber: ⚡️.
    ---
    • Linkwarden github.com/linkwarden/linkwarden —  Self-hosted collaborative bookmark manager to collect, read, annotate, and fully preserve what matters, all in one place.
    • Announcing Linkwarden 2.11 blog.linkwarden.app/releases/2.11
    • Linkwarden Browser Extension github.com/linkwarden/browser-extension

    @[email protected] @selfhosting @[email protected] @selfhost #OpenSourceSoftware #TechForGood #Linkwarden #SelfHosted #FOSS #OpenSource #WebPreservation #Fediverse #LinuxUnplugged #SaveTheWeb #NoMore404 #TechCommunity #DigitalArchiving #LinkRot #PrivacyFirst #BookmarkManager #Bookmark

  15. "Disproportionately, the datasets that are no longer accessible through the portal come from the Department of Energy, the National Oceanic and Atmospheric Administration, the Department of the Interior, NASA, and the Environmental Protection Agency. But determining what is actually gone and what has simply moved or is backed up elsewhere by the government is a manual task, and it's too early to say for sure what is gone and what may have been renamed or updated with a newer version.

    This is because data.gov doesn’t always host the data that it is indexing. Sometimes the data is hosted directly on data.gov, but other times it links to an individual agency’s website, where the data is actually hosted. This means archiving and analyzing data.gov is not straightforward.

    “Some of [the entries link to] actual data,” Cushman told 404 Media. “And some of them link to a landing page [where the data is hosted]. And the question is—when things are disappearing, is it the data it points to that is gone? Or is it just the index to it that’s gone?”"

    404media.co/archivists-work-to

    #USA #Trump #DigitalArchiving #DataGov #OpenData #OpenGovernment

  16. What will you bitflip today?


    by @beet_keeper

    I want to let you into a secret: I enjoy corruption. Corrupting digital objects leads to undefined behavior (C++’s definition is fun). And flipping bits in objects can tell us something both about the fragility, and robustness of our digital files and the applications that work with them.

    I had a pull-request for bitflip accepted the other day. Bitflip is by Antoine Grondin and is a simple utility for flipping bits in digital files. I wrote in my COPTR entry for it that it reminds me of shotGun by Manfred Thaller. The utility is exceptionally easy to use (and of course update and maintain written in Golang) and has some nice features for flipping individual bits or a uniform percentage of bits across a digital file.

    My pull-request was a simple one updating Goreleaser and its GitHub workflow to provide binaries for Windows and FreeBSD. I only needed to use Windows for a short amount of time thankfully, but it’s an environment I believe is prevalent for a lot of digital preservationists in corporate IT environments.

    Bitflip is a useful utility to improve your testing of digital preservation systems, or simply for outreach, but let’s have a quick look at it in action.

    #Archive30 #Archives #Art #Binary #bitflip #bitrot #Code #Coding #digipres #digital #DigitalArchiving #digitalLiteracy #DigitalPreservation #FileFormats #GenerativeArt #GlitchArt #outreach #Ravensburger #SomethingFun #Vagabond

  17. Client-side file format identification and reporting pipeline with Siegfried and Demystify Lite


    by @beet_keeper

    With thanks to the sponsorship of Archives New Zealand and Richard Lehane for his great coding expertise and his collaboration; Demystify Lite has a new feature — Siegfried!!

    Richard recently posted about this work on LinkedIn but lets look at this effort in more detail below.

    #Archives #Coding #digipres #DigitalArchiving #DigitalPreservation #DROID #FileFormat #Golang #siegfried #SoftwareDevelopment