www-from-model/content/publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048/_index.md
2026-04-22 12:30:13 +00:00

65 lines
No EOL
3.3 KiB
Markdown

---
title: A reproducible and generalizable software workflow for analysis of large-scale neuroimaging
data collections using BIDS Apps
persons:
- orcid:0000-0003-3456-2493
- simon-eickhoff
- michael-hanke
topics:
- research-software-engineering
- neuroimaging
- research-data-management
- distributed-systems
- high-throughput-computing
params:
graphRootNodePID: xyzrins:publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048
pid: xyzrins:publications/a2c0e912-ae8e-48b4-aac5-d023afb4a048
doi: 10.1162/imag_a_00074
date: 2024-01
title: A reproducible and generalizable software workflow for analysis of large-scale
neuroimaging data collections using BIDS Apps
description: "Neuroimaging research faces a crisis of reproducibility. With massive\
\ sample sizes and greater data complexity, this problem becomes more acute. Software\
\ that operates on imaging data defined using the Brain Imaging Data Structure (BIDS)\u2014\
the BIDS App\u2014has provided a substantial advance. However, even using BIDS Apps,\
\ a full audit trail of data processing is a necessary prerequisite for fully reproducible\
\ research. Obtaining a faithful record of the audit trail is challenging\u2014\
especially for large datasets. Recently, the FAIRly big framework was introduced\
\ as a way to facilitate reproducible processing of large-scale data by leveraging\
\ DataLad\u2014a version control system for data management. However, the current\
\ implementation of this framework was more of a proof of concept, and could not\
\ be immediately reused by other investigators for different use cases. Here, we\
\ introduce the BIDS App Bootstrap (BABS), a user-friendly and generalizable Python\
\ package for reproducible image processing at scale. BABS facilitates the reproducible\
\ application of BIDS Apps to large-scale datasets. Leveraging DataLad and the FAIRly\
\ big framework, BABS tracks the full audit trail of data processing in a scalable\
\ way by automatically preparing all scripts necessary for data processing and version\
\ tracking on high performance computing (HPC) systems. Currently, BABS supports\
\ jobs submissions and audits on Sun Grid Engine (SGE) and Slurm HPCs with a parsimonious\
\ set of programs. To demonstrate its scalability, we applied BABS to data from\
\ the Healthy Brain Network (HBN; n = 2,565). Taken together, BABS allows reproducible\
\ and scalable image processing and is broadly extensible via an open-source development\
\ model."
kind: bibo:AcademicArticle
author:
- pid: orcid:0000-0003-3456-2493
given_name: Yaroslav
family_name: Halchenko
- pid: xyzrins:persons/simon-eickhoff
given_name: Simon
family_name: Eickhoff
- pid: xyzrins:persons/michael-hanke
given_name: Michael
family_name: Hanke
topic:
- pid: xyzrins:topics/research-software-engineering
display_label: Research software engineering (RSE)
- pid: xyzrins:topics/neuroimaging
display_label: Neuroimaging
- pid: xyzrins:topics/research-data-management
display_label: Research data management (RDM)
- pid: xyzrins:topics/distributed-systems
display_label: Distributed systems
- pid: xyzrins:topics/high-throughput-computing
display_label: High-throughput computing
---