Convert BIDS datasets to flat-data #22
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The conversion of the penguins dataset led to the script and helper metadata here: https://hub.datalad.org/edu/penguins/src/branch/main/code
This could be generalized more, to allow the same script to be used for BIDS datasets.
Looking at the datasets that we have internally, there are a handful that have all three of these files:
dataset_description.jsonparticipants.tsvparticipants.jsonIt makes sense to me to start here since they would have the best way to verify the column headings in participants.tsv file. Of these datasets, here are some common column headers of the
participants.tsvfiles:(Age is sometimes reported in months, sometimes in years.)
Looking at
dataset_description.jsonfiles, these are common dataset-level properties:I
Here's the structure that I can determine so far:
From #17 (comment):
I think the following properties from the
dataset_description.jsonfiles would be good to include in this first data annotation round:Acknowledgements: could go into the existingcommentsfieldAuthors: a new multivalued slot onDatasetwith rangePerson?DatasetDOI: a new slot onDatasetwith rangeIssuedIdentifier? orstringas perdoi:description: >-Associated Digital Object Identifier (DOI; ISO 26324; seehttps://doi.org). The value must be just the DOI without the URLproject. So just `10.1038/s41597-022-01163-2` and not`https://doi.org/10.1038/s41597-022-01163-2`.range: stringFunding: maybe a new funding field. Could be range string to simplify it, or same as:funding:description: >-Grant that provides resources for a project.range: Grantmultivalued: trueHowToAcknowledge: could go into the existingcommentsfieldLicense: my feeling is this would be a property of a distribution, so not immediately applicable if we don't annotate that yetName: maps directly to existing name field onDatasetReferencesAndLinks: perhaps it would be useful to have a genericsee_alsomultivalued field to house these items? These could technically also be included as curation comments, although i feel seeAlso fits particularly well. Maybe the difference is the type, i.e. string vs uri.Internal discussions say: