#wtfpdf — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #wtfpdf, aggregated by home.social.
-
#archivtagAT #archivtag2025 Andreas Rauber zeigt ein Beispiel von einem PDF, auch HTML hat oder als Virtual Machine gespeichert werden kann, die dann erweiterte Funktionen haben. Was passiert, wenn man so ein PDF normalisiert oder migriert wird? #wtfPDF
-
CW: Uspol Repost
This is in French, but the link is to a 404 media story. Every now and then it’s #wtfpdf FTW. #digitalforensics #digitaldiplomatics
From: @BertrandCaron
https://digipres.club/@BertrandCaron/113923145282409719 -
ICYMI - are "octal escape sequences" in #PDF strings really a preservation risk, as claimed by the authors of the recent "The Phantom 👻 of a PDF File" blog post?
Some quick tests I did with eight different PDF processing tools suggest they're not, and #JHOVE's inability to handle them really seems to be the exception here #wtfPDF #fileformatfriday
https://www.bitsgalore.org/2024/11/14/escape-from-the-phantom-of-the-pdf
-
The authors of the recent "The Phantom of a PDF File" blog post argue that "octal escape sequences" in #PDF strings are a potential preservation risk.
But some quick tests with 8 different PDF tools suggest that #JHOVE is really the only tool that can't handle them!
Details in my new blog post "Escape from the phantom of the PDF" #wtfPDF 👻 :
https://www.bitsgalore.org/2024/11/14/escape-from-the-phantom-of-the-pdf
-
Update on the "Phantom of the #PDF" blog of a few weeks ago (link: https://digitalpreservation.fi/en/2024-phantom-pdf-file).
I did a little test of authors' claim that "#JHOVE probably is not the only software that will get confused" by octal escape sequences* in metadata strings
So I read the file with 8 different PDF tools/libraries:
https://github.com/openpreserve/jhove/issues/927#issuecomment-2465947326
Turns out JHOVE actually *is* the only software that gets confused by this #wtfPDF!
*) The authors describe this as "dual encodings", but see Peter Wyatt's comment!
-
Here's a sneak peek at a #PDF Quality Assessment tool I'm working on for digitisation batches , mostly based on #PyMuPDF, #pillow and #Schematron:
https://github.com/KBNLresearch/pdfquad
(Wouldn't recommend this for production yet, as it's not completely finished, and I'm still changing some things around.)
-
OMG, I played this game https://www.dpconline.org/blog/wdpd/blog-fff-game-wdpd2024 and my file format fling was PDF!?! #WTFPDF @wtfpdf #digipres #wdpd2024