home.social

#cheminformatics — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #cheminformatics, aggregated by home.social.

  1. new blog post: "One Million IUPAC names #5: a new approach and 400k names" doi.org/10.59350/gqtbx-jta57 #openaccess #iupac #cheminformatics #blueObelisk

    "Europe PMC, however, also provides the JATS XML files as download on their FTP site. Already that august 2025 I had a prototype and knew it would change the game. These gzipped XML files are about 150 to 250 MB. Unzipped, about 1 GB each."

    Replies to this post show up in the blog.

  2. Did you know that the Beilstein-Institut has always been a digital innovator in chemistry - from the Beilstein Database in the ‘80s to the digital infrastructure and standards in chemistry projects supported in the current Beilstein ChemInfo Labs 🧪💻?

    ➡️ beilstein-institut.de/en/proje

    Join us in making chemical data accessible, findable, and reusable for everyone.

    #BeilsteinChemInfoLabs #OpenScience #Cheminformatics #FAIRData #STRENDA #MIRAGE #Beilstein75

  3. a macro to generate other macros was the best idea ever: now confirmed

    (with the last version of emacs-indigo for pretty-much-memory-safe molecule manipulation inside emacs!)

    github.com/gicrisf/emacs-indigo

  4. a macro to generate other macros was the best idea ever: now confirmed

    (with the last version of emacs-indigo for pretty-much-memory-safe molecule manipulation inside emacs!)

    github.com/gicrisf/emacs-indigo

    #emacs #lisp #chemistry #cheminformatics #metaprogramming #programming #code #coding #fp

  5. a macro to generate other macros was the best idea ever: now confirmed

    (with the last version of emacs-indigo for pretty-much-memory-safe molecule manipulation inside emacs!)

    github.com/gicrisf/emacs-indigo

    #emacs #lisp #chemistry #cheminformatics #metaprogramming #programming #code #coding #fp

  6. a macro to generate other macros was the best idea ever: now confirmed

    (with the last version of emacs-indigo for pretty-much-memory-safe molecule manipulation inside emacs!)

    github.com/gicrisf/emacs-indigo

    #emacs #lisp #chemistry #cheminformatics #metaprogramming #programming #code #coding #fp

  7. a macro to generate other macros was the best idea ever: now confirmed

    (with the last version of emacs-indigo for pretty-much-memory-safe molecule manipulation inside emacs!)

    github.com/gicrisf/emacs-indigo

    #emacs #lisp #chemistry #cheminformatics #metaprogramming #programming #code #coding #fp

  8. 🧩 Could RNA and proteins share the same “shape language” for binding drugs?

    🔗 Eigenvalue Ratios Reveal Shared Binding Pocket Shapes in RNA and Protein Structures. Computational and Structural Biotechnology Journal (CSBJ). DOI: doi.org/10.34133/csbj.0022

    📚 CSBJ - A Science Partner Journal: spj.science.org/journal/csbj

    #DrugDiscovery #StructuralBiology #ComputationalBiology #RNAResearch #ProteinScience #Bioinformatics #MolecularModeling #Genomics #Proteomics #Cheminformatics #LigandBinding

  9. The Coordinated Action “Diagnostic, Therapeutic and Vaccine Viral Targets” of #ANRS MIE is organising a #webinar on #AI for molecular discovery. This webinar will explore how AI and #computational #modelling are advancing #drugdesign and protein research.
    Speakers will present approaches for molecular design, prediction of protein variant effects and dynamics, and structural modelling of protein–protein interactions. The session will also take a critical perspective, addressing current limitations in #cheminformatics and practical considerations for researchers.

    🚨 TOMORROW🚨
    ⚠️ AI for molecular discovery: From drug design to protein dynamics⚠️
    April 15th 2026
    12:30 - 14:00
    Wednesday April 15th 2026, from 12:30 to 14:00 — online (Zoom)

    Programme :

    1️⃣ "AI for drug design" — Dragos Horvath, Strasbourg University
    2️⃣ "Computational approaches for protein variant effect and motion prediction" — Elodie Laine, Sorbonne University
    3️⃣ "Structural modelling and binding affinity prediction of the Human PDZ-PBM interactome" — Victor Reys, Utrecht University

    ➡️ Registration : services.hosting.augure.com/Re

  10. #SWAT4HCLS is over.

    Today, I am at the #KNCV CTC meeting, where we are launching the #Cheminformatics and #DrugDiscovery working group, as part of the CTC Section

  11. OPSIN 2.9.0 has been released: chembl.blogspot.com/2026/03/op

    OPSIN is an IUPAC name parser that returns SMILES

    "The release notes describe a mixture of minor bug fixes and improvements:"

  12. The Coordinated Action “Diagnostic, Therapeutic and Vaccine Viral Targets” of #ANRS MIE is organising a #webinar on #AI for molecular discovery. This webinar will explore how AI and #computational #modelling are advancing #drugdesign and protein research.
    Speakers will present approaches for molecular design, prediction of protein variant effects and dynamics, and structural modelling of protein–protein interactions. The session will also take a critical perspective, addressing current limitations in #cheminformatics and practical considerations for researchers.

    🚨 MARK YOUR CALENDAR 🚨
    ⚠️ AI for molecular discovery: From drug design to protein dynamics⚠️
    April 15th 2026
    12:30 - 14:00
    Wednesday April 15th 2026, from 12:30 to 14:00 — online (Zoom)

    Programme :

    1️⃣ "AI for drug design" — Dragos Horvath, Strasbourg University
    2️⃣ "Computational approaches for protein variant effect and motion prediction" — Elodie Laine, Sorbonne University
    3️⃣ "Structural modelling and binding affinity prediction of the Human PDZ-PBM interactome" — Victor Reys, Utrecht University

    ➡️ Registration : services.hosting.augure.com/Re

  13. #OpenBabel is dead, long live #RDKit!

    github.com/RMeli/spyrmsd/issue

    On a more serius note, it would be cool to have a cheminformatics library that actually works. Don't get me wrong, RDKit is very cool - but you can feel all the underlying problems it has when using it.

    #Cheminformatics

  14. 🤖 Can AI help us predict and prevent off-target effects in PROTAC drug design?

    🔗 Predicting PROTAC off-target effects via warhead involvement levels in drug–target interactions using graph attention neural networks. Computational and Structural Biotechnology Journal, DOI: doi.org/10.1016/j.csbj.2025.10

    📚 CSBJ: csbj.org/

    #DrugDevelopment #StructuralBiology #ChemicalBiology #PROTAC #DrugDiscovery #Cheminformatics #Bioinformatics #PrecisionMedicine #TargetedProteinDegradation

  15. ✨ Is it time to rethink allosteric site prediction using internal protein nanoenvironments?

    🔗 Protein allosteric site identification using machine learning and per amino acid residue reported internal protein nanoenvironment descriptors. Computational and Structural Biotechnology Journal, DOI: doi.org/10.1016/j.csbj.2024.10

    📚 CSBJ: csbj.org/

    #DrugDiscovery #ComputationalBiology #StructuralBiology #ProteinEngineering #Biophysics #Cheminformatics

  16. Fwd: "Unleashing 4 million IUPAC names into the wild" chembl.blogspot.com/2025/08/un

    "In support of Egon Willighagen's 'One Million IUPAC Names' project, we have just released more than 4 million IUPAC names text-mined from patents. Here are the details as listed on Zenodo:"

    #cheminformatics #chemistry #iupac #openscience

  17. ha, I managed to convert a CXSMILES in @wikidata via the MDL V2000 molfile and the #inchi webtool into an #InChI (!B) and InChIKey :) wikidata.org/wiki/Q66421202#P1

    #cheminformatics #polymers

  18. The outcomes of this workshop will directly shape the next phase of development, guiding the creation of a scientifically rigorous and user-focused search tool and Magres database (version 2), while also strengthening collaboration between CCP-NC, CCDC, and PSDI.

    #PSDI #CCPNC #CCDC #Workshop #Database #Metadata #cheminformatics #Data #MaterialScience

  19. The outcomes of this workshop will directly shape the next phase of development, guiding the creation of a scientifically rigorous and user-focused search tool and Magres database (version 2), while also strengthening collaboration between CCP-NC, CCDC, and PSDI.

    #PSDI #CCPNC #CCDC #Workshop #Database #Metadata #cheminformatics #Data #MaterialScience

  20. The outcomes of this workshop will directly shape the next phase of development, guiding the creation of a scientifically rigorous and user-focused search tool and Magres database (version 2), while also strengthening collaboration between CCP-NC, CCDC, and PSDI.

    #PSDI #CCPNC #CCDC #Workshop #Database #Metadata #cheminformatics #Data #MaterialScience

  21. interesting progress: "STOUT V2.0: SMILES to IUPAC name conversion using transformer models" doi.org/10.1186/s13321-024-009

    "The model was trained on a dataset of nearly 1 billion SMILES and corresponding IUPAC names, generated using OpenEye's Lexichem software. This new version achieves very high accuracy on test and benchmark datasets, demonstrating improved capability in producing longer IUPAC names with fewer errors overall."

    #chemistry #cheminformatics #iupac

  22. A really interesting preprint caught my attention from Connor Coley’s group at MIT. ShEPhERD diffusing shape, electrostatics, and pharmacophores for bioisosteric drug design macinchem.org/2024/11/23/sheph #cheminformatics #RDKit

  23. my lightning talk from the #RDKitUGM2023 is now on YouTube - all about making your work that uses datasets derived from ChEMBL more reproducible

    📺 Video: youtu.be/PY-xaoRoSOY?list=PLug

    📜 Slides: bit.ly/cth-rdkit-ugm-2023

    🤖 Code/Docs: github.com/cthoyt/chembl-downl

    Get started with: pip install chembl-downloader

    #cheminformatics #chemoinformatics #chembl #pubchem

  24. I'm making a tiny change to the Balsa grammar to reject strings with terminating branches (e.g., "CCC(C)" and "CCC(C).O"). They should never be needed and just complicate canonicalization.

    Working paper at: chemrxiv.org/engage/chemrxiv/a