home.social

#snakemake — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #snakemake, aggregated by home.social.

  1. RE: fediscience.org/@snakemake/116

    This little bit of "performance improvements" lowered the number of file system access events considerably! #Snakemake triggers many such events to keep track of metadata. This is important, but may cause some delays due to file system overhead, particularly on parallel and/or network file systems. The feature to outsource parts of this to SQLite was implemented during the #SnakemakeHackathon2026 . I hope I can test the improvements next Monday!

    #HPC #ReproducibleComputing

  6. RE: fediscience.org/@snakemake/116

    Software provenance with #Snakemake: Using the reporter plugin for nanopublications, we can now get slightly improved nanopublications like this one: w3id.org/np/RAmgzfta63xx0wWc_z (press the little blue arrow on the right to see the full details). Automatically captured for this workflow: w3id.org/np/RAjHDlPDghZzc9ZvQ3 - again expressed as a nanopub declaration. 😉

    It now supports capturing the "classic" software declarations for #Conda and Snakemake wrappers.

    There is more work to do. Let's see when and if I get to it.

    #reproducibleComputing #softwareprovenance #nanopub

  7. The #SnakemakeHackathon2026 has ended; we are still preparing our preprint release. But our host has prepared a note on their homepage: go.tum.de/946236 🥳

    #Snakemake #ReproducibleComputing

  8. RE: fediscience.org/@snakemake/116

    This is a big step forward: the SLURM plugin for Snakemake now supports so-called job arrays. These are cluster jobs with roughly equal resource requirements in terms of memory and compute.

    The change in itself was big: the purpose of a workflow system is to make use of the vast resources of an HPC cluster. Hence, jobs are submitted to run concurrently. For a job array, however, we have to "wait" for all eligible jobs to be ready, and only then submit.

    To preserve the concurrent execution of other jobs that are ready to run, a thread pool has been introduced. In itself, I do not see job arrays as such a big feature: the LSF system profited much more from arrays than the rather lean SLURM implementation does.

    BUT: the new code base will ease further development towards pooling many shared-memory tasks (applications which do not support parallel execution across nodes, or are confined to one machine because they "only" support threading). Until then, there is more work to do.
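The batching logic described above can be sketched in a few lines of Python. This is purely illustrative: the function names, the grouping key, and the submit callbacks are assumptions made for the sketch, not the plugin's actual code.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def group_for_arrays(jobs):
    """Group jobs by (roughly) equal resource requirements.

    Jobs sharing a key could be submitted together as one job array;
    singletons are submitted individually.
    """
    groups = defaultdict(list)
    for job in jobs:
        groups[(job["mem_mb"], job["cpus"])].append(job)
    return dict(groups)


def submit_all(jobs, submit_array, submit_single):
    """Submit array candidates together while keeping other submissions concurrent."""
    groups = group_for_arrays(jobs)
    # A thread pool keeps individual submissions concurrent while
    # array candidates are collected and submitted as one unit.
    with ThreadPoolExecutor() as pool:
        futures = []
        for members in groups.values():
            if len(members) > 1:
                futures.append(pool.submit(submit_array, members))
            else:
                futures.append(pool.submit(submit_single, members[0]))
        return [f.result() for f in futures]
```

The grouping key here is deliberately naive; a real executor would also have to respect per-rule settings, partitions, and time limits.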

    #HPC #SLURM #Snakemake #SnakemakeHackathon2026 #ReproducibleComputing #OpenScience

  9. RE: fediscience.org/@snakemake/116

    What a week at the #SnakemakeHackathon2026 !

    What a wonderful week with wonderful people!

    We were pretty productive, and this #Snakemake release is just the peak of it. The list of features, bug fixes, performance improvements and additional documentation is so long that our little announcement robot cannot display it all. Even here on FediScience with its 1500-character limit!

    #ReproducibleComputing #OpenScience

  10. What do you see here? This is an example knowledge graph describing a #Snakemake analysis workflow. You see the workflow description, a linked data set and a linked report.

    All this work was done to boost support for #HPC users conducting their workflows on HPC systems (you can run Snakemake on other platforms, too).

    My to-do list:
    - an assertion template for workflows: ✅
    - another for reports: ✅ (simple datasets are already in the #nanopub verse)
    - a plugin to gather software metadata and publish as a nanopub ❌ (half done: #SnakemakeHackathon2026 )

    Kudos to @nanopub / @tkuhn and @johanneskoester - without them this pursuit would (have been) futile! And my feeling is that @fbartusch will play an important role in any further development ...

    #OpenScience #ReproducibleComputing

  11. Currently looking at tools to replace #snakemake for my usage (command runner + workflow manager).

    well, i can't say that i'm really surprised, but this snakemake dependency is **heavy**...

    and after trying the other options listed here (x-axis) a bit, i still think #gnumake fits my usage best given its size, i just need to survive the syntax... 😬

  16. I want to reach out: I have a pending release for the SLURM executor (github.com/snakemake/snakemake ). It implements better error feedback (in case of hardware failures and otherwise). It needs some thorough checking, and I cannot provoke many hardware failures myself. Is anyone working on an older cluster? 😉

    Feedback (as issues) would be appreciated. It would also be nice if you told me here that it is working.

    #Snakemake #HPC #SLURM #ReproducibleComputing

  17. I finished the basic #tutorial for #snakemake and it really fits my vibe right now, hehe
    Also learned a bit more about #conda along the way. I'll take that

    snakemake.readthedocs.io/en/st

  18. Anyone here managing their experiments/workflows with gnu #make ? Any tips ?

    I was a #snakemake user, but I switched to #makefile recently because of the increasing complexity/bloat of Snakemake, and I don't need the majority of its features... (plus, colleagues were not using/familiar with Snakemake)

    The make language is certainly less user-friendly than Snakemake's, but I'm still able to do what I need/want (just with more boilerplate).
    I had to write small Makefile functions to keep some of my sanity...
    (BTW: `.RECIPEPREFIX` lets you redefine the recipe prefix instead of the annoying tab! [1])
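For readers who have not seen the trick from [1]: a minimal Makefile sketch. The `.RECIPEPREFIX` variable is standard GNU Make (since 3.82); the rule and file names are a made-up example.

```makefile
# Redefine the recipe prefix from the default tab to '>'
.RECIPEPREFIX = >

results/out.txt: data/in.txt
> mkdir -p results
> sort $< > $@
```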

    From what I understood, GNU Make can be extended with #guile [2]; maybe that could help? (but it adds another dependency, though...)

    TL;DR: I just want a simple/easy/lightweight/expressive workflow manager... 😔

    [1] gnu.org/software/make/manual/h
    [2] gnu.org/software/make/manual/h

  19. The last blog post I wrote was about Life Science Support on HPC clusters. Honestly? It was more of a rant. Not a good blog post.

    So, someone suggested I delete it, which I did. It took me a long time to recover. Now I have re-written this blog post. I think it is better; I weighed every phrase. It still has some rant flavour.

    Here it is: blogs.fediscience.org/rupture-

    The upcoming articles will have more of an example character. But I still will not be able to update on a regular basis.

    #Bioinformatics #HPC #ReproducibleComputing #Snakemake #Nextflow

  20. @Dutch_Reproducibility_Network

    In fact, I am a #Snakemake co-maintainer and teacher. I was not aware of WorkflowHub - and that was an omission on my part. We actually support and favour this kind of registration: snakemake.readthedocs.io/en/st

    In my original post, I also neglected to mention the integration of WorkflowHub with #RO-Crate and, in turn, the integration of RO-Crates with nanopubs. I am actively working on better support for #nanopub and RO-Crates with @fbartusch. The question of how I teach that stands: the #HPC world (at least my bubble) is not really supportive of #ReproducibleComputing . All #OpenScience shenanigans are frowned upon. And PIs in my vicinity are still on this level: phdcomics.com/comics/archive.p - so, how do we educate the educators?

  21. RE: fediscience.org/@snakemake/115

    Now, this is huge!

    Thanks to a contribution from Cade Mirchandani (Santa Cruz, CA), whom I met at this year's #SnakemakeHackathon , users can now supply a partition profile. So, instead of wrangling #SLURM partition information into a workflow profile (indicated with --workflow-profile), we can now have a global file containing this information.

    I added a time conversion function, so that the SLURM time format is obeyed, too.
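A hedged sketch of such a conversion (the function name and the minutes-based input are assumptions for illustration; Snakemake's standard `runtime` resource is given in minutes, while SLURM's `--time` accepts, among other forms, `D-HH:MM:SS`):

```python
def minutes_to_slurm_time(minutes: int) -> str:
    """Convert a runtime in minutes to SLURM's D-HH:MM:SS time format."""
    days, rest = divmod(minutes, 24 * 60)
    hours, mins = divmod(rest, 60)
    return f"{days}-{hours:02d}:{mins:02d}:00"
```

For example, `minutes_to_slurm_time(90)` returns `"0-01:30:00"`.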

    There are several other development needs before we continue in this direction (e.g. parsing SLURM partition information directly). But the task of summing this up for non-users, e.g. administrators, is due too.

    In any case, I think this merits a new major version.

    #Snakemake #HPC #ReproducibleComputing

  22. This took a while. After the new version of the Snakemake paper (a rolling paper on F1000) came out, the DOI now is "working" 🥳 :

    doi.org/10.12688/f1000research

    From my point of view, it particularly describes working with various #HPC batch systems. And: development has not ceased. If you want to follow our announcement bot for updates: @snakemake

    #Snakemake #ReproducibleComputing #DataAnalysis #OpenScience #WorkflowManagement

  23. Just submitted a talk for FOSDEM (I was invited). They asked me to attach an icon image for the talk, so I drew one. The compute racks are difficult to identify as such, but this is as far as my aquarelle skills go.

    #Snakemake #ReproducibleComputing #HPC

  24. This spring, we had a wonderful time at CERN shaping the future of the Snakemake Workflow Management System during the Snakemake Hackathon. Next spring we will meet in Munich!

    If you want to take part in the Snakemake development, you can still register here: indico.cern.ch/e/snakemake-mee

    #Snakemake #HPC #Bioinformatics #physics #dataanalysis #ReproducibleComputing #OpenScience

  25. Where will I be in early March 2026?

    In Stuttgart! At the deRSE conference. I intend to submit a couple of work items dealing with my favourite workflow management system. And the call for contributions is open: mastodon.social/@de_rse/115270

    To give you an idea of what I have in mind, a few hashtags:

    #Snakemake #nanopub #fairdatamanagement #Fair #ReproducibleComputing

  26. Remember that I have been posting about the #SnakemakeHackathon2025 ?

    I never really finished that series. But now we have two late contributions, by Ward Deboutte and @johanneskoester : one describing the polishing of the multiple-extension handling of #Snakemake for named inputs (zenodo.org/records/17121446) and one stabilizing the JSON validator (zenodo.org/records/17121551).

    Cool!

    #ReproducibleComputing #OpenScience

  27. Interested in HPC compliant data analysis workflows?

    I am offering an NHR (German association for HPC resources) course on building and using Snakemake workflows on HPC clusters in Mainz, Germany! Two days: 9 & 10 December 2025, on-site.

    To find out more and perhaps enrol, visit the course page: indico.zdv.uni-mainz.de/event/

    #Snakemake #HPC #ReproducibleResearch #ReproducibleComputing

  28. The #isc25 is over and I have half-recovered from the weekend, too. Time to continue my thread summing up the #SnakemakeHackathon2025 !

    To me, an important contribution came from Michael Jahn of the Charpentier Lab: a complete re-design of the workflow catalogue. Have a look: snakemake.github.io/snakemake- - the findability of ready-to-use workflows has greatly improved! Also, the description of how to contribute is now easy to find.

    A detailed description has been published in the #researchequals collection researchequals.com/collections under doi.org/10.5281/zenodo.1557464

    #Snakemake #ReproducibleComputing #ReproducibleResearch #OpenScience

  29. Returning from the #isc25 , I will continue this thread with something applicable everywhere, not just on #HPC clusters:

    Workflow runs can crash, for a number of possible reasons. Snakemake offers a `--rerun-incomplete` flag (or `--ri` for short) which lets a user resume a workflow.

    This contribution from Filipe G. Vieira describes a small fix to stabilize the feature: not only are incomplete files removed after a crash, it is now ensured that their metadata are deleted too, before resuming: zenodo.org/records/15490098

    #Snakemake #SnakemakeHackathon2025 #ReproducibleComputing #OpenScience

  30. Today tooting from the #ISC25 - the International Supercomputing Conference. What better opportunity to brag about something I've done to facilitate using GPUs with Snakemake?

    Here is my contribution, simpler job configuration for GPU jobs:

    doi.org/10.5281/zenodo.1555179

    Not alone, though: without the valuable input of @dryak I would have overlooked something crucial.

    And when we talk about reproducible AI, my take is that we ought to consider workflow managers, too: something which records what you have done, with little effort.

    #SnakemakeHackathon2025 #Snakemake #ReproducibleComputing #OpenScience

  31. This morning, I am travelling to the #isc25 and hit a minor bug on #researchequals. Hence, no updates in the collection.

    But still a few to describe without adding the latest contributions:

    For instance, this one (zenodo.org/records/15490064) by Filipe G. Vieira: a helper function to extract checksums from files to compare with checksums Snakemake was already able to calculate. Really handy!

    #Snakemake #ReproducibleComputing #OpenScience

  32. Does `snakemake --config config=in.yml -n --export-cwl out.cwl` work? I don't get any output (Snakemake 9.1.3).

    #snakemake #cwl #workflow

  33. Time for a re-#introduction !

    I'm a #scicomm enthusiast and board member of #Fediscience. My background is in #Biophysics; I did a Postdoc in #GeneticEpidemiology, took an industry detour, and have now been working in #HPC for some years.

    Interested in #HPC, #bioinformatics, #OpenScience, #workflows (#snakemake), #RDM, #scientificsoftware and #sciencecommunication

    My blog can be found here: blogs.fediscience.org and my more political me can be found at @[email protected].

  34. I'm excited to announce a new episode of the #DeveloperStories Podcast! 🎉 This week we talk to a prominent leader in the biosciences community, Johannes Koester, creator of #Snakemake, #Bioconda, and several others, solving problems by building tools! 🐍

    Here are a few ways to listen!

    👉 Spotify: open.spotify.com/episode/2cTVZ
    👉 Apple Podcasts: podcasts.apple.com/us/podcast/
    👉 Home Page (with show notes)! rseng.github.io/devstories/202

  35. Time for an #introduction ! It's my 3rd or 4th day. Thanks for having me here. 😀

    My background: PhD in #Biophysics, PostDoc in #GeneticEpidemiology, industry detour, now working in #HPC for ~8y.

    Interested in #HPC, #bioinformatics, #OpenScience, #workflows (#snakemake), #RDM, #scientificsoftware and #sciencecommunication .

    Former science blogger (German), now looking for a new platform, since the current one (scienceblogs.de) is closing down.