home.social

#ebooklib — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #ebooklib, aggregated by home.social.

  1. New blog post, in which I review and test some options for extracting unformatted text from #EPUB files in Python, using #Apache #Tika (via #Tika-python), #Textract and #EbookLib.

    Includes a link to a Git repo with demo scripts.

    bitsgalore.org/2023/03/09/extr
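
For context on what the post is about, here is a minimal sketch of extracting unformatted text from an EPUB in Python. It is not taken from the linked blog post or its repo, and it does not use Tika, Textract or EbookLib. It relies only on the standard library and on the fact that an EPUB is a ZIP container holding XHTML content documents. The function name `extract_epub_text`, the helper class and the file-extension filter are assumptions made for illustration.

```python
import zipfile
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_epub_text(path):
    """Return the plain text of every (X)HTML file in the EPUB at `path`.

    Assumption: files are read in archive order. The true reading order is
    defined by the spine in the package (OPF) document, which this sketch
    does not parse.
    """
    parts = []
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                markup = archive.read(name).decode("utf-8", errors="replace")
                collector = _TextCollector()
                collector.feed(markup)
                collector.close()
                parts.extend(collector.chunks)
    return "\n".join(parts)
```

Called as `extract_epub_text("book.epub")` (a hypothetical path), it returns a single string with a newline between text fragments. Because the spine is ignored, chapters may come out of order for some books, which is one reason to prefer a dedicated library such as those named in the post.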