home.social

#contentanalysis — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #contentanalysis, aggregated by home.social.

  1. Porting SafeText and analyzing digital content with Apache Tika

    by @beet_keeper

    Last year I wrote about pitfalls in modern journalism, especially with regards to receiving documents and information from whistleblowers without offering them adequate protection.

    The tl;dr is that you, as a whistleblower, need to protect yourself; and you, as an editor or journalist, need to protect your whistleblowers.

    Steganographic fingerprints might be one method adopted to detect someone leaking information. Steganographic characters replace common textual characters with unusual but hard to detect variants, e.g. they look the same to the human eye, or are actually invisible. Using a tool called SafeText by David Jacobson we can identify these hidden fingerprints in the content that you share.

    I firmly believe we can find clues about what is important to preserve, or learn to preserve, when we analyse the content of the digital record and not just the (file) format of the digital record.

    A file can contain many different features and these are all challenges to their future interpretation, and thus preservation.

    I wanted to use SafeText in some of my other non-Python tooling and so I decided to port the code to Golang as a composable module and binary.

    By coincidence at the time I started writing this I had also just written about revisiting tikalinkextract and so I thought I would write this small explanation about how you might combine Tika and SafeText to perform some content analysis of your own.

    Who knows, maybe we will find a conspiracy. Maybe we’ll find secret codes in our own digital records. Maybe we’ll learn something new about our records…

    Lets have a look at putting Tika and SafeText together and see where it goes.


    #ApacheTika #authenticity #Code #Coding #ContentAnalysis #Data #DigitalHumanities #digitalLiteracy #DigitalPreservation #Golang #integrity #Metadata #Paradata #SafeText #steganography
  2. Porting SafeText and analyzing digital content with Apache Tika

    by @beet_keeper

    Last year I wrote about pitfalls in modern journalism, especially with regards to receiving documents and information from whistleblowers without offering them adequate protection.

    The tl;dr is that you, as a whistleblower, need to protect yourself; and you, as an editor or journalist, need to protect your whistleblowers.

    Steganographic fingerprints might be one method adopted to detect someone leaking information. Steganographic characters replace common textual characters with unusual but hard to detect variants, e.g. they look the same to the human eye, or are actually invisible. Using a tool called SafeText by David Jacobson we can identify these hidden fingerprints in the content that you share.

    I firmly believe we can find clues about what is important to preserve, or learn to preserve, when we analyse the content of the digital record and not just the (file) format of the digital record.

    A file can contain many different features and these are all challenges to their future interpretation, and thus preservation.

    I wanted to use SafeText in some of my other non-Python tooling and so I decided to port the code to Golang as a composable module and binary.

    By coincidence at the time I started writing this I had also just written about revisiting tikalinkextract and so I thought I would write this small explanation about how you might combine Tika and SafeText to perform some content analysis of your own.

    Who knows, maybe we will find a conspiracy. Maybe we’ll find secret codes in our own digital records. Maybe we’ll learn something new about our records…

    Lets have a look at putting Tika and SafeText together and see where it goes.


    #ApacheTika #authenticity #Code #Coding #ContentAnalysis #Data #DigitalHumanities #digitalLiteracy #DigitalPreservation #Golang #integrity #Metadata #Paradata #SafeText #steganography
  3. Porting SafeText and analyzing digital content with Apache Tika

    by @beet_keeper

    Last year I wrote about pitfalls in modern journalism, especially with regards to receiving documents and information from whistleblowers without offering them adequate protection.

    The tl;dr is that you, as a whistleblower, need to protect yourself; and you, as an editor or journalist, need to protect your whistleblowers.

    Steganographic fingerprints might be one method adopted to detect someone leaking information. Steganographic characters replace common textual characters with unusual but hard to detect variants, e.g. they look the same to the human eye, or are actually invisible. Using a tool called SafeText by David Jacobson we can identify these hidden fingerprints in the content that you share.

    I firmly believe we can find clues about what is important to preserve, or learn to preserve, when we analyse the content of the digital record and not just the (file) format of the digital record.

    A file can contain many different features and these are all challenges to their future interpretation, and thus preservation.

    I wanted to use SafeText in some of my other non-Python tooling and so I decided to port the code to Golang as a composable module and binary.

    By coincidence at the time I started writing this I had also just written about revisiting tikalinkextract and so I thought I would write this small explanation about how you might combine Tika and SafeText to perform some content analysis of your own.

    Who knows, maybe we will find a conspiracy. Maybe we’ll find secret codes in our own digital records. Maybe we’ll learn something new about our records…

    Lets have a look at putting Tika and SafeText together and see where it goes.


    #ApacheTika #authenticity #Code #Coding #ContentAnalysis #Data #DigitalHumanities #digitalLiteracy #DigitalPreservation #Golang #integrity #Journalism #Metadata #Paradata #SafeText #steganography #Whistleblow #Whistleblower
  4. Porting SafeText and analyzing digital content with Apache Tika

    by @beet_keeper

    Last year I wrote about pitfalls in modern journalism, especially with regards to receiving documents and information from whistleblowers without offering them adequate protection.

    The tl;dr is that you, as a whistleblower, need to protect yourself; and you, as an editor or journalist, need to protect your whistleblowers.

    Steganographic fingerprints might be one method adopted to detect someone leaking information. Steganographic characters replace common textual characters with unusual but hard to detect variants, e.g. they look the same to the human eye, or are actually invisible. Using a tool called SafeText by David Jacobson we can identify these hidden fingerprints in the content that you share.

    I firmly believe we can find clues about what is important to preserve, or learn to preserve, when we analyse the content of the digital record and not just the (file) format of the digital record.

    A file can contain many different features and these are all challenges to their future interpretation, and thus preservation.

    I wanted to use SafeText in some of my other non-Python tooling and so I decided to port the code to Golang as a composable module and binary.

    By coincidence at the time I started writing this I had also just written about revisiting tikalinkextract and so I thought I would write this small explanation about how you might combine Tika and SafeText to perform some content analysis of your own.

    Who knows, maybe we will find a conspiracy. Maybe we’ll find secret codes in our own digital records. Maybe we’ll learn something new about our records…

    Lets have a look at putting Tika and SafeText together and see where it goes.

    Continue reading “Porting SafeText and analyzing digital content with Apache Tika”


    #ApacheTika #authenticity #Code #Coding #ContentAnalysis #Data #DigitalHumanities #digitalLiteracy #DigitalPreservation #Golang #integrity #Journalism #Metadata #Paradata #SafeText #steganography #Whistleblow #Whistleblower
  5. Nghĩa vụ dự án đầu tay: So sánh quảng cáo sản phẩm làm sạch ở các nước để phân tích hành vi tiêu dùng & văn hóa. Tác giả cần phản hồi về phương pháp phân tích nội dung + mã hóa dữ liệu từ ChatGPT. Dự án được dùng xây dựng portfolio ứng tuyển ngành marketing. Cần cộng đồng hỗ trợ tránh "tê liệt phân tích"! #Marketing #NghiênCứu #DựÁnĐầuTay #ConsumerBehavior #ContentAnalysis #MarketingResearch #VietnamBusiness #ThịTrườngTiêuDùng #StartupProject

    reddit.com/r/SideProject/comme

  6. New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
    We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
    the-main-thread.com/p/quarkus-

    #Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

  7. New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
    We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
    the-main-thread.com/p/quarkus-

    #Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

  8. New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
    We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
    the-main-thread.com/p/quarkus-

    #Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

  9. New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
    We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
    the-main-thread.com/p/quarkus-

    #Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

  10. New tutorial: Discover hidden themes in your writing with Quarkus + DeepLearning4j.
    We scrape, embed, and cluster Substack articles. Showing how Java can power AI content analysis.
    the-main-thread.com/p/quarkus-

    #Java #Quarkus #AI #DeepLearning4j #ContentAnalysis

  11. I’m truly grateful my article "Too Cute to Be a Crime? AI-Generated Lolita Aesthetics and the Legal Limits of Synthetic Girlhood on TikTok" has been published in the International Journal for Crime, Law and AI.

    I hope that this will be a small step towards understanding how technology reshapes law and culture.
    academia.edu/130116050/Too_Cut

    #AIResearch #DigitalCulture #TikTok #LolitaAesthetics #SyntheticMedia #LawAndTech #MediaStudies #AIandLaw #ContentAnalysis #Lolita #AI #MediaScholar #SocialMedia

  12. I am looking for fellow social science scholars on wind power.

    I am preparing an article about public participation in environmental assessment in Flanders and the Netherlands.

    The dataset of publicly available comments is rich enough for additional publications, though that requires teamwork.

    #energyjustice #risk #ambiguity #publicparticipation #environmentaljustice #uncertainty #sociology #contentanalysis #groundedtheory #acceptance #windpower #windturbine #renewableenergytechnology

  13. I am looking for fellow social science scholars on wind power.

    I am preparing an article about public participation in environmental assessment in Flanders and the Netherlands.

    The dataset of publicly available comments is rich enough for additional publications, though that requires teamwork.

    #energyjustice #risk #ambiguity #publicparticipation #environmentaljustice #uncertainty #sociology #contentanalysis #groundedtheory #acceptance #windpower #windturbine #renewableenergytechnology

  14. I am looking for fellow social science scholars on wind power.

    I am preparing an article about public participation in environmental assessment in Flanders and the Netherlands.

    The dataset of publicly available comments is rich enough for additional publications, though that requires teamwork.

    #energyjustice #risk #ambiguity #publicparticipation #environmentaljustice #uncertainty #sociology #contentanalysis #groundedtheory #acceptance #windpower #windturbine #renewableenergytechnology

  15. Which tools do you use for quantitative content analysis or annotation tasks which is simpler than MAXQDA or Inception?

    I am looking for a tool which is able to display long text like a newspaper article in a nice, readable way, some metadata about the text and a very basic scheme of categories.

    #commscholars #annotation #contentanalysis

  16. • Perfect for quick #ContentAnalysis
    • Streamlined #MLOps pipeline

    - Nova Pro:
    • Advanced #DeepLearning capabilities
    #LargeLanguageModel with 300,000 token support
    • Excels in #FinTech document analysis
    • Optimal #AI performance balance

  17. Pleased to say one of our most popular workshops is running again in Oct '24.
    buff.ly/3xPQnxO
    Join us for a day discussing the principles of #qualitative data analysis and experimenting with some of the practical tasks involved. This one usually fills up quickly so get in quick. Small groups and two tutors make this a focused, friendly and thought-provoking day. #QualitativeDataAnalysis #QDA #QualitativeResearch #ThematicAnalysis #ContentAnalysis #DiscourseAnalysis #GroundedTheory

  18. Pleased to say one of our most popular workshops is running again in Oct '24.
    buff.ly/3xPQnxO
    Join us for a day discussing the principles of #qualitative data analysis and experimenting with some of the practical tasks involved. This one usually fills up quickly so get in quick. Small groups and two tutors make this a focused, friendly and thought-provoking day. #QualitativeDataAnalysis #QDA #QualitativeResearch #ThematicAnalysis #ContentAnalysis #DiscourseAnalysis #GroundedTheory

  19. Pleased to say one of our most popular workshops is running again in Oct '24.
    buff.ly/3xPQnxO
    Join us for a day discussing the principles of #qualitative data analysis and experimenting with some of the practical tasks involved. This one usually fills up quickly so get in quick. Small groups and two tutors make this a focused, friendly and thought-provoking day. #QualitativeDataAnalysis #QDA #QualitativeResearch #ThematicAnalysis #ContentAnalysis #DiscourseAnalysis #GroundedTheory

  20. Pleased to say one of our most popular workshops is running again in Oct '24.
    buff.ly/3xPQnxO
    Join us for a day discussing the principles of #qualitative data analysis and experimenting with some of the practical tasks involved. This one usually fills up quickly so get in quick. Small groups and two tutors make this a focused, friendly and thought-provoking day. #QualitativeDataAnalysis #QDA #QualitativeResearch #ThematicAnalysis #ContentAnalysis #DiscourseAnalysis #GroundedTheory

  21. Pleased to say one of our most popular workshops is running again in Oct '24.
    buff.ly/3xPQnxO
    Join us for a day discussing the principles of #qualitative data analysis and experimenting with some of the practical tasks involved. This one usually fills up quickly so get in quick. Small groups and two tutors make this a focused, friendly and thought-provoking day. #QualitativeDataAnalysis #QDA #QualitativeResearch #ThematicAnalysis #ContentAnalysis #DiscourseAnalysis #GroundedTheory

  22. The topic for today's lecture is content analysis. To give students a taste of what that work is like, I print out a collection of pro- and anti-gun-control memes for them to code according to criteria they develop.

    #AcademicMastodon #ContentAnalysis

  23. To follow up on the #Adobe #ContentAnalysis #AI boosts, if anyone wants to turn off the setting:

    1. Log in to Adobe.com
    2. Click your photo in the top right, select View Account
    3. Select Account and Security > Privacy and personal data in the top menu
    4. Toggle the switch to off under Content analysis (2nd option)

    Hope this helps!

  24. For those trying to figure out how to turn off Adobe "content analysis":

    Log in at account.adobe.com/privacy. As of this writing, Content Analysis is the second section on the page. #adobe #privacy #MachineLearning #ContentAnalysis

  25. Who loves content analysis? Got any tips?

    "Content analysis is a research technique for making replicable and valid inferences from texts to the contexts of their use." (p. 24)

    #ContentAnalysis #Reseach #media #commodon @communicationscholars #MediaContent #fujifilm Xe3 27mm 2.8

  26. Anyone good at #contentanalysis in my networks, ideally in games psychology or hci.

  27. #introduction Hello! I am an Associate Professor in Management and Project Management. I am interested in #complexsystems #systemsthinking #complexadaptivesystems #sensemaking #criticalrealism #qualitativeresearch #methodology and #management #projectmanagement #entrepreneurship #noise #decisionmaking #conflictresolution #negotiation I have two book chapters with #springer numerous others with #igi a tutorial on #netbeans #java and a tutorial on #contentanalysis in a research methodology book. I teach Principles of Management, Project Management, Contemporary Issues in Management, Managing Complex Projects, Applied Research Methods, Qualitative Techniques, and Critical Review of Literature.

  28. #introduction Hello! I am an Associate Professor in Management and Project Management. I am interested in #complexsystems #systemsthinking #complexadaptivesystems #sensemaking #criticalrealism #qualitativeresearch #methodology and #management #projectmanagement #entrepreneurship #noise #decisionmaking #conflictresolution #negotiation I have two book chapters with #springer numerous others with #igi a tutorial on #netbeans #java and a tutorial on #contentanalysis in a research methodology book. I teach Principles of Management, Project Management, Contemporary Issues in Management, Managing Complex Projects, Applied Research Methods, Qualitative Techniques, and Critical Review of Literature.

  29. #introduction Hello! I am an Associate Professor in Management and Project Management. I am interested in #complexsystems #systemsthinking #complexadaptivesystems #sensemaking #criticalrealism #qualitativeresearch #methodology and #management #projectmanagement #entrepreneurship #noise #decisionmaking #conflictresolution #negotiation I have two book chapters with #springer numerous others with #igi a tutorial on #netbeans #java and a tutorial on #contentanalysis in a research methodology book. I teach Principles of Management, Project Management, Contemporary Issues in Management, Managing Complex Projects, Applied Research Methods, Qualitative Techniques, and Critical Review of Literature.