www-from-model/content/publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048/_index.md
2026-04-22 12:30:13 +00:00

3.3 KiB

title persons topics params
A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps
orcid:0000-0003-3456-2493
simon-eickhoff
michael-hanke
research-software-engineering
neuroimaging
research-data-management
distributed-systems
high-throughput-computing
graphRootNodePID pid doi date title description kind author topic
xyzrins:publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048 xyzrins:publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048 10.1162/imag_a_00074 2024-01 A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps Neuroimaging research faces a crisis of reproducibility. With massive sample sizes and greater data complexity, this problem becomes more acute. Software that operates on imaging data defined using the Brain Imaging Data Structure (BIDS)—the BIDS App—has provided a substantial advance. However, even using BIDS Apps, a full audit trail of data processing is a necessary prerequisite for fully reproducible research. Obtaining a faithful record of the audit trail is challenging—especially for large datasets. Recently, the FAIRly big framework was introduced as a way to facilitate reproducible processing of large-scale data by leveraging DataLad—a version control system for data management. However, the current implementation of this framework was more of a proof of concept, and could not be immediately reused by other investigators for different use cases. Here, we introduce the BIDS App Bootstrap (BABS), a user-friendly and generalizable Python package for reproducible image processing at scale. BABS facilitates the reproducible application of BIDS Apps to large-scale datasets. Leveraging DataLad and the FAIRly big framework, BABS tracks the full audit trail of data processing in a scalable way by automatically preparing all scripts necessary for data processing and version tracking on high performance computing (HPC) systems. Currently, BABS supports jobs submissions and audits on Sun Grid Engine (SGE) and Slurm HPCs with a parsimonious set of programs. To demonstrate its scalability, we applied BABS to data from the Healthy Brain Network (HBN; n = 2,565). Taken together, BABS allows reproducible and scalable image processing and is broadly extensible via an open-source development model. bibo:AcademicArticle
pid given_name family_name
orcid:0000-0003-3456-2493 Yaroslav Halchenko
pid given_name family_name
xyzrins:persons/simon-eickhoff Simon Eickhoff
pid given_name family_name
xyzrins:persons/michael-hanke Michael Hanke
pid display_label
xyzrins:topics/research-software-engineering Research software engineering (RSE)
pid display_label
xyzrins:topics/neuroimaging Neuroimaging
pid display_label
xyzrins:topics/research-data-management Research data management (RDM)
pid display_label
xyzrins:topics/distributed-systems Distributed systems
pid display_label
xyzrins:topics/high-throughput-computing High-throughput computing