Sven Lieber
-
Hey #library folks 👋 ,
do you want to cluster your book editions with the well-known Work-set algorithm from #OCLC, but you don't find a suitable reusable tool?
I recently faced this issue while working on the #BELTRANS project at KBR (Royal Library of Belgium). All I found were many research papers describing the clustering and a few implementations that required me to install 2010-style Java software stacks.
So I decided to write an easily reusable small #Python script that follows the ideas of the Work-set algorithm: clustering based on descriptive keys. Nothing more, nothing less.
Check my blog post for more information and have a look at the script.
➡️ blog post: https://doi.org/10.59350/4hd4r-1tk44
➡️ script: https://doi.org/10.5281/zenodo.10011416
-
Last week I have been to the 1st Conference on Research Data Infrastructure #CoRDI2023 organized by @NFDI
My main takeaways
👨💻 It’s mostly about human problems that need to be fixed (connecting communities, data governance, etc)
🖥 Knowledge Graphs and especially #Wikibase gain attention in the context of research (data) infrastructures
👩⚕️ There is an increasing need for sustainably funded data management careers
Check out my full trip report at https://sven-lieber.org/en/2023/09/17/cordi-2023/
Our contribution about the MetaBelgica platform:
📄 Abstract: https://doi.org/10.52825/cordi.v1i.381
👨🏫 Presentation: https://doi.org/10.5281/zenodo.8337301
🌐 Website: https://www.kbr.be/en/projects/metabelgica/
-
Great initiative about research data management #RDM in Germany presented at #CoRDI2023. Reminds me of the #UGent #datastewards, but of course much larger: instead of one data steward per faculty, one steward (organisation) for each federal state (yes that's partial Germany on the map , more German states will join)
-
Curious how the free publishing service #CEUR-WS is making its data #FAIR? Or how the #Wikimedia foundation is planning to democratize functions?
Check out the previous summer-break editions of the FAIR Data Digest newsletter in the archive while I am preparing next week's edition.
-
Are you working with research data? I think you should learn about ERICs and how these infrastructures promote #FAIR data!
I covered them in last week's edition of the #FAIRDataDigest
➡️ https://fair-data-digest.org/archive/6
⏳ tomorrow's edition will cover #CLARINERIC #CLARIN (10am in your inbox if you are a free subscriber) -
New newsletter edition is out
➡️https://fair-data-digest.org/archive/3 !
Many #FAIR topics:
💻 filling data gaps via #ISNI in #BELTRANS,
📅 Ethical, Legal and Social Aspects (#ELSA) of #DataScience at a workshop from the FAIR Data Spaces project https://www.nfdi.de/fair-data-spaces/?lang=en and
📹 #LawAsCode from Interoperable Europe
-
The new FAIR Data Digest newsletter edition is out!
🖥️ Handling author pseudonyms in #BELTRANS,
🎓 listening to fundamental #KnowledgeGraph research at the Data Science Institute of the University of Hasselt and
📚 learning about the history of Wikidata
➡️ https://fair-data-digest.org -
I'm excited to present our librarian-in-the-loop workflow to increase #dataQuality in roughly one hour at #swib22 #RDF #KBR #LinkedData Join the live stream at https://swib.org/swib22/ Slides available at https://doi.org/10.5281/zenodo.7372985