home.social

#pymupdf — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #pymupdf, aggregated by home.social.

  1. “Annotation censoring everywhere”

    #Censor, the #PDF document #redaction tool for the #GNOME desktop, got a minor update to v0.7.1. Find it at @flathub: flathub.org/apps/page.codeberg and @Codeberg: codeberg.org/censor/Censor

    Censor’s #Flatpak package comes with the latest version of the upstream library #PyMuPDF (1.27.2.3). This solves an issue with annotations that could not be redacted in some files. Thanks to Julian Smith from #Artifex for fixing it upstream. In case you do not use the Flatpak package, you may need to wait for your distribution maintainers to update PyMuPDF’s version.

    Hindi has been added to the languages you may now speak with Censor. Thanks to the translators!

    #censorship #Codeberg #Flathub #GNOME #linux #python #MuPDF
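
For readers not on the Flatpak, a quick way to see which PyMuPDF a system Python provides is to compare the installed version against the one named in the post. This is only a sketch: the post does not say that 1.27.2.3 is the first release carrying the fix, so the default threshold below is an assumption, and the helper names are made up for illustration.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn a dotted version such as '1.27.2.3' into a comparable tuple."""
    return tuple(int(part) for part in text.split(".") if part.isdigit())


def pymupdf_at_least(minimum="1.27.2.3"):
    """Return True if the installed PyMuPDF is at least `minimum`.

    The default threshold is the version mentioned in the post; whether
    the fix first appeared in exactly that release is an assumption.
    """
    try:
        installed = version("PyMuPDF")
    except PackageNotFoundError:
        # PyMuPDF is not installed in this environment at all.
        return False
    return parse_version(installed) >= parse_version(minimum)


if __name__ == "__main__":
    print("PyMuPDF new enough:", pymupdf_at_least())
```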

  6. “Zoom in, zoom out, redact your points!”

    #Censor v0.7.0 ships improved zooming with pinch-to-zoom gesture on touchpads and touchscreens and various security bug fixes. Find it at @flathub: flathub.org/apps/page.codeberg and @Codeberg: codeberg.org/censor/Censor

    You may now also safely redact #PDF documents with cropped, scaled, or rotated pages. The upstream #PyMuPDF library has some issues handling these documents. Thus, I implemented manual transformation and drawing of the redaction rectangles, and verified proper redaction with a specially created sample document, submitted to the sample collection of #pypdf: github.com/py-pdf/sample-files

    Additionally, point- or line-like elements are now properly redacted. Sanitization during post-processing now keeps entries in form fields (note: this changes previous behavior).

    Thanks to the translators, Censor is now available in 13 languages including Croatian and Russian.

    #censorship #redaction #Codeberg #Flatpak #Flathub #GNOME #linux #python #MuPDF
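
To give an idea of what "manual transformation" of a redaction rectangle can involve, here is a minimal sketch for the rotation case only. It is not Censor's actual code. It assumes a top-left origin with the y axis pointing down, clockwise page rotation in multiples of 90 degrees, and `width` and `height` given for the unrotated page; cropping and scaling are left out. The function name `unrotate_rect` is hypothetical.

```python
def unrotate_rect(rect, rotation, width, height):
    """Map a rectangle drawn on the displayed (rotated) page back to
    unrotated page coordinates.

    rect      -- (x0, y0, x1, y1) in displayed coordinates
    rotation  -- clockwise page rotation in degrees (multiple of 90)
    width, height -- size of the *unrotated* page
    """
    x0, y0, x1, y1 = rect
    corners = [(x0, y0), (x1, y1)]
    rotation %= 360

    if rotation == 0:
        mapped = corners
    elif rotation == 90:
        # Displayed (X, Y) came from unrotated (Y, height - X).
        mapped = [(y, height - x) for x, y in corners]
    elif rotation == 180:
        mapped = [(width - x, height - y) for x, y in corners]
    elif rotation == 270:
        # Displayed (X, Y) came from unrotated (width - Y, X).
        mapped = [(width - y, x) for x, y in corners]
    else:
        raise ValueError("page rotation must be a multiple of 90 degrees")

    # Re-normalize so that (x0, y0) is the top-left corner again.
    xs = [p[0] for p in mapped]
    ys = [p[1] for p in mapped]
    return (min(xs), min(ys), max(xs), max(ys))


if __name__ == "__main__":
    # A 200 x 300 page shown rotated by 90 degrees.
    print(unrotate_rect((10, 20, 50, 60), 90, 200, 300))
```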

  11. “Secure #redaction by design and through extensive #testing”

    #Censor 0.6.0 comes with many more #security improvements, motivated by extensive testing on more than 1,000 #PDF document samples. You may now also securely redact links, form fields, and widgets. In rare cases, when partial image redaction fails, the more secure full image removal is used instead.

    Even more importantly, Censor now warns you when unsuccessful redaction is detected during post-processing. This reduces the impact of known issues with insecure redaction.

    Polish is the 11th language you may speak with Censor. Thanks to its translators (among them, @mondstern)!

    Thanks a lot also to #pypdf, #qpdf, #pikepdf, #Ghostscript, #MuPDF, #PyMuPDF, and #poppler contributors for the great resource of PDF document samples!

    Find it at @flathub: flathub.org/apps/page.codeberg and @Codeberg: codeberg.org/censor/Censor

    #Censorship #Codeberg #Flathub #GNOME #Linux #Python
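
One way such a post-processing check could work for text is sketched below: after redaction, extract the words again and warn if any still overlap a redacted rectangle. This is an illustration, not Censor's implementation. The input is assumed to have the tuple layout that PyMuPDF's `page.get_text("words")` returns, `(x0, y0, x1, y1, word, block_no, line_no, word_no)`; the function names are hypothetical, and links, widgets, and images would need separate checks.

```python
import warnings


def rects_intersect(a, b):
    """True if two rectangles (x0, y0, x1, y1) overlap with non-zero area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def find_leftover_words(words, redaction_rects):
    """Return the words whose bounding box still overlaps a redacted area."""
    return [
        w[4]
        for w in words
        if any(rects_intersect(w[:4], r) for r in redaction_rects)
    ]


def warn_if_unredacted(words, redaction_rects):
    """Emit a warning and return False if text survived inside a redaction."""
    leftovers = find_leftover_words(words, redaction_rects)
    if leftovers:
        warnings.warn(
            f"{len(leftovers)} word(s) still extractable inside redacted areas"
        )
    return not leftovers
```

With PyMuPDF available, the `words` argument could come from re-opening the saved file and calling `page.get_text("words")` on each page.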

  16. From vet AI for cows to imperial gloss: hardcore MLOps on free GPUs

    In early 2026, news feeds carried alarming reports from Siberia: mass outbreaks of dangerous diseases in cattle led to the forced slaughter of thousands of animals. For many farmers, this meant losing their business and their livelihood. We asked ourselves: could accessible Computer Vision become the first line of defense? A tool that lets a farmer in a remote area perform an initial screening (triage) of an animal with an ordinary smartphone and call a veterinarian in time, without waiting for an epidemic to start. That is how the AI-Vet-Scanner project (our space on Hugging Face), which detects signs of disease from a photograph, was born.

    habr.com/ru/articles/1013214/

    #MLOps #Kaggle #Computer_Vision #OpenCV #PyMuPDF #Hugging_Face #dataset #parsing #memory_optimization #SDXL_LoRA

  20. That said and celebrated ;), there are things that #Censor does not yet redact well.

    The upstream library #MuPDF (with its #Python bindings in #PyMuPDF) supports by default only redaction of text, vector graphics, and images. Testing on a variety of PDF files (thanks to #pypdf, #qpdf, #ghostscript, and their issue reporters, as well as @pdfarranger for their hint) revealed that some vector graphics are not properly redacted; an upstream issue has been reported for that.

    Also, form fields (widgets), signatures and links may be incompletely redacted.

    You can find an updated list of “What is redacted? What not?” here: codeberg.org/censor/Censor/iss

    #pdf #redaction #security

  25. “Better safe than sorry”

    For release 0.5.0 of #Censor, a lot of work went into improving the security of PDF redaction.

    PDF documents are tricky, and irrevocably removing elements from them is even trickier. With this release, before a redacted document is saved, garbage is now properly collected and documents are sanitized, which means that metadata, page thumbnails, etc. are removed.

    Also, vector graphics are now removed with a stricter option when they overlap with redaction rectangles. On top of that, I added redaction of PDF annotations.

    The user interface was refreshed, with undo and redo buttons in the toolbar and an improved document-saving experience. Drawing rectangles is now indicated by a crosshair cursor.

    Thanks to the translators, you may now also speak Czech with Censor!

    Get it from @flathub: flathub.org/apps/page.codeberg, or contribute on @Codeberg: codeberg.org/censor/Censor

    #censorship #redaction #PDF #Codeberg #Flatpak #Flathub #GNOME #python #MuPDF #PyMuPDF #linux
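
The redact, sanitize, and save-with-garbage-collection sequence described in this post can be sketched with PyMuPDF roughly as follows. This is a minimal illustration under stated assumptions, not Censor's code: the function name, the `rects_by_page` argument, and the particular option values are chosen for the example, and the `graphics` argument of `apply_redactions` needs a reasonably recent PyMuPDF. PyMuPDF is imported inside the function so the module can be loaded even where it is not installed.

```python
# Save options used below: garbage=4 is PyMuPDF's most thorough
# garbage-collection level; deflate compresses the remaining streams.
SAVE_OPTIONS = {"garbage": 4, "deflate": True}


def redact_and_save(src_path, dst_path, rects_by_page):
    """Redact rectangles, sanitize the document, and save a cleaned copy.

    rects_by_page -- dict mapping a page number to a list of
                     (x0, y0, x1, y1) rectangles (hypothetical layout)
    dst_path      -- should differ from src_path in this sketch
    """
    import pymupdf  # PyMuPDF; older releases are imported as `fitz`

    doc = pymupdf.open(src_path)
    try:
        for page_number, rects in rects_by_page.items():
            page = doc[page_number]
            for rect in rects:
                # Mark the area; nothing is removed until apply_redactions().
                page.add_redact_annot(pymupdf.Rect(rect), fill=(0, 0, 0))
            # images=2: blank out the covered pixels of overlapping images.
            # graphics=2: remove vector graphics that touch a rectangle.
            page.apply_redactions(images=2, graphics=2)

        # Sanitize with scrub()'s default settings, which remove metadata
        # and thumbnails, among other things.
        doc.scrub()
        doc.save(dst_path, **SAVE_OPTIONS)
    finally:
        doc.close()
```

A call might look like `redact_and_save("in.pdf", "out.pdf", {0: [(72, 72, 200, 90)]})`, where the file names and coordinates are placeholders.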

  30. “A historic moment for Censor”

    #Censor – the PDF redaction tool for the @gnome desktop – now comes with a new edit history. It allows you to undo and redo redactions using the right-click context menu or keyboard shortcuts. Also, a bug that prevented repeated saving to the same file path was fixed.

    Get the new version from @flathub: flathub.org/apps/page.codeberg, and find it on @Codeberg: codeberg.org/censor/Censor/rel

    You may now speak Chinese, Dutch, English, Estonian, Finnish, French, German, Italian, and Vietnamese with Censor (thanks a lot to the translators!). If your language is missing from this list, I invite you to contribute at Codeberg Translate: translate.codeberg.org/engage/

    #censorship #redaction #PDF #Codeberg #Flatpak #Flathub #GNOME #python #MuPDF #PyMuPDF #Linux

  35. Censor, a new document redaction tool, is here!

    It lets you draw black rectangles on PDF documents and permanently remove the text and images below them. Find it on @Codeberg: codeberg.org/censor/Censor, get it from @flathub: flathub.org/apps/page.codeberg, or translate it on Codeberg Translate: translate.codeberg.org/engage/!

    It is a free and open-source graphical user interface (GUI) for #Linux and the #GNOME desktop, and uses the #MuPDF library with its #python bindings from the #PyMuPDF module.

    #censorship #redaction #PDF #Codeberg #Flatpak #Flathub

  40. After struggling to get #python #PyMuPDF to work, and with the deadline close, I switched to a combination of other tools.

    First, I used the #linux #pdftohtml command, which is much faster than PyMuPDF and packages the result much like a saved website.

    Next, with #NeoVim and #RegEx, I reformatted the #HTML file so it could be quickly processed with #NodeJs #cheerio and eventually, via #json, saved in #sqlite.

    Is it elegant and automatic? No, though it works!

    #JavaScript

  44. More notes from trying to extract and format data from PDFs using #python #PyMuPDF.

    I was trying to create a perfect chain of functions that would format all the edge cases into the final desired #HTML format. This is where I quickly realized that running every tweaked version of the functions on the 100-page PDF is quite time-consuming.

    Instead, I can run the extraction once and save the results in a #sqlite database. Then I can write #sql queries to post-process the edge cases, which also gives me a good enough way to inspect the contents of each page, compared with the previous method of printing the output to the #terminal and scrolling to the desired page. And in the end, I am one step closer to having the data in a #csv file, which is easily exported with #Dbeaver.
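
    A minimal sketch of the run-once-then-query idea described above, not the poster's actual code. The table name `pages`, its columns, and the helpers `store_pages` and `find_pages` are hypothetical; the page texts are passed in as plain strings so the example runs without a PDF.

```python
import sqlite3
from typing import Iterable, List


def store_pages(conn: sqlite3.Connection, pages: Iterable[str]) -> int:
    """Save one row per page so later tweaks are SQL queries, not re-extractions."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages ("
        "page_number INTEGER PRIMARY KEY, content TEXT NOT NULL)"
    )
    # Page numbers start at 1 to match what a PDF viewer shows.
    rows = [(number, text) for number, text in enumerate(pages, start=1)]
    conn.executemany(
        "INSERT OR REPLACE INTO pages (page_number, content) VALUES (?, ?)", rows
    )
    conn.commit()
    return len(rows)


def find_pages(conn: sqlite3.Connection, needle: str) -> List[int]:
    """Return the numbers of all pages whose text contains `needle` (case-sensitive)."""
    cursor = conn.execute(
        "SELECT page_number FROM pages WHERE instr(content, ?) > 0 "
        "ORDER BY page_number",
        (needle,),
    )
    return [row[0] for row in cursor]


conn = sqlite3.connect(":memory:")  # use a file path to keep the results
stored = store_pages(conn, ["Intro", "Table 1: totals", "Appendix"])
matches = find_pages(conn, "Table")
```

    With PyMuPDF, the `pages` argument could plausibly be a generator such as `(page.get_text("text") for page in doc)`, so the slow extraction happens once and every later experiment is a query against the database.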

  48. Currently trying to extract and format data from PDFs using #python #PyMuPDF.

    I initially used the `get_text(value)` method with the `"text"` value, only to learn that I could potentially have saved time by using the `"html"` value directly, since I have been creating pattern matchers to format the text into #HTML.

    After investigating, I found that although the `"html"` option exists, post-processing its output is more work than my initial approach.

    What fascinates me about the `get_text(value)` method is that each value packages the data differently. Whereas `"html"` puts the text in `<p><span>text</span></p>`, `"xhtml"` puts it in `<h1>text</h1>` instead.
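
    One way to compare the output formats side by side is sketched below. The helpers `extract_variants`, `first_lines`, and `preview_first_page` are hypothetical names; the first two only assume an object with a `get_text(option)` method, which is how a PyMuPDF page is called in the post above. The `import pymupdf` line assumes a recent PyMuPDF release (older ones are imported as `fitz`).

```python
from typing import Dict, List, Sequence


def extract_variants(
    page, options: Sequence[str] = ("text", "html", "xhtml")
) -> Dict[str, str]:
    """Call page.get_text() once per option and collect the raw outputs."""
    return {option: page.get_text(option) for option in options}


def first_lines(variants: Dict[str, str], count: int = 5) -> Dict[str, List[str]]:
    """Keep only the first `count` lines of each output for a quick comparison."""
    return {
        option: output.splitlines()[:count] for option, output in variants.items()
    }


def preview_first_page(path: str) -> None:
    """Print the start of each format for page 1 of the PDF at `path`."""
    import pymupdf  # assumption: recent PyMuPDF; older releases use `import fitz`

    with pymupdf.open(path) as doc:
        for option, lines in first_lines(extract_variants(doc[0])).items():
            print(f"--- {option} ---")
            print("\n".join(lines))
```

    Calling `preview_first_page("some.pdf")` on a real file shows which tags each option wraps the text in for that particular document, which may differ from page to page.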
