The BRAIN Initiative data-sharing ecosystem: Characteristics, challenges, benefits, and opportunities

  1. Sudhanvan Iyer
  2. Kathryn Maxson Jones  Is a corresponding author
  3. Jill O Robinson
  4. Nicole R Provenza
  5. Dominique Duncan
  6. Gabriel Lázaro-Muñoz
  7. Amy L McGuire
  8. Sameer A Sheth
  9. Mary A Majumder
  1. Center for Medical Ethics and Health Policy, Baylor College of Medicine, United States
  2. Department of History, Purdue University, United States
  3. Department of Neurosurgery, Baylor College of Medicine, United States
  4. Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, United States
  5. Center for Bioethics, Harvard Medical School, United States
  6. Department of Psychiatry, Massachusetts General Hospital, United States
1 figure and 1 table

Figures

BRAIN Initiative data archive ecosystem connections.

Summary of the main BRAIN Initiative data archive ecosystem connections currently enabling data upload, access (ability to download), analysis (which includes short-term data storage), and mirroring (identical copies of data across archives) among entities. Potential paths of data flow are represented by arrows, with the directions of flow (upload, download, or both) indicated by the directions of the arrows. All seven archives (light blue circles) are intended for long-term data storage. For the archives marked with an asterisk (*), data generated by the BRAIN Initiative Cell Census Network (BICCN) also are indexed by the Brain Cell Data Center (BCDC) at the Allen Institute in Seattle, WA. Clockwise from top: (i) Brain Image Library (BIL): From the data producer’s Home institution (hereafter, ‘Home’) (dark blue rectangle), data can be uploaded to or downloaded from BIL. BIL data can be uploaded to, analyzed, and downloaded from the Bridges-2 supercomputer (orange circle). (ii) Data Archive for the BRAIN Initiative (DABI): From Home, data can be uploaded to or downloaded from DABI. (iii) NeuroElectroMagnetic data Archive and tools Resource (NEMAR): From Home, data can be downloaded from NEMAR. From NEMAR or Home, data also may be uploaded to the Neuroscience Gateway (NSG), although analyzed data may only be downloaded back to Home (and not back to NEMAR). All NEMAR data are mirrored (double solid line with no arrows) in OpenNeuro, and data also can only be uploaded to NEMAR through OpenNeuro. (iv) OpenNeuro: Data can be uploaded to or downloaded from OpenNeuro from Home. From OpenNeuro or Home, users also may upload and download data to BrainLife, a cloud-computing analysis environment. Through the data mirror, all OpenNeuro datasets are accessible on NEMAR. (v) Brain Observatory Storage Service and Database (BossDB): From Home, data can be uploaded to or downloaded from BossDB. (vi) Distributed Archives for Neurophysiology Data Integration (DANDI): From Home, data can be uploaded to and downloaded from DANDI. From Home, data analyzed in DANDI Hub also may be downloaded directly, but uploads to this analysis environment must always take place through DANDI. (vii) Neuroscience Multi-Omic Data Archive (NeMO): This archive is the primary repository for genomic, transcriptomic, and epigenetic data generated by the BICCN. From Home, data can be uploaded to or downloaded from NeMO. NeMO also streamlines user access to Terra and gEAR. By way of the BCDC, users also may link directly to Terra workspaces associated with NeMO projects. All descriptions current as of June 5, 2024.

Tables

Table 1
BRAIN Initiative data archive ecosystem key statistics and related information.

Sources: For Host Institution, Initial Funding Date, Total NIH Support, Administering ICO, and Funding ICO, information was taken from the NIH RePORTER entry for the most recent award for each archive, as of June 6, 2024 (https://reporter.nih.gov/) Total NIH Support for each archive was taken from the History section of RePORTER. A separate line of NIH funding, totaling $1,707,134, with NIMH listed as the awardee, has supported the development of OpenNeuroPET. Other information in the table was derived from the International Neuroscience Coordinating Facility (INCF) Infrastructure Portfolio (https://www.incf.org/infrastructure-portfolio) and archive websites. Where an archive landing page or its data portal did not offer a public ‘datasets‘ (or ‘dandisets’, in the case of DANDI) count, we used an available alternative such as the number of ‘projects‘ for which data are available. *Data current as of June 6, 2024. ICO = NIH Institutes, Centers, and Offices.

ArchiveHost InstitutionInitial Funding DateTotal NIH Support (USD)*Administering ICOFunding ICOData Types HostedData Formats SupportedPublicly Available Datasets*
Brain Image Library (BIL)Carnegie-Mellon University2017$6,767,625NIMHNIMHConfocal microscopyDICOM or NIfTI; for derived data, any format as long as primary data are in the required format8418
Brain Observatory Storage Service and Database (BossDB)Johns Hopkins University2018$5,133,219NIMHNIMHElectron microscopy; x-ray microtomographyPNG; APNG; JPG; BMP; GIF; PSD50
Data Archive for the BRAIN Initiative (DABI)University of Southern California2018$6,160,745NIMHNINDSFocus on invasive neurophysiology; houses all imaging and brain signal dataEDF, Biosemi (.bdf), BrainVision (.eeg,.vhdr,.vmrk), EEGLAB (.set,.fdt), Blackrock NeuroPort (.nev,.nsX), Intan (.rhd,.rhs), MATLAB files (.mat,.m); iEEG-BIDS, NWB, or DABI110
Distributed Archives for Neurophysiology Data Integration (DANDI)Massachusetts Institute of Technology2019$8,309,010NIMHNIMHCellular neurophysiology, neuroimaging, and microscopyBIDS; NWB640
NeuroElectroMagnetic data Archive and tools Resource (NEMAR)University of California, San Diego2019$4,464,874NIMHNIMHEEG, MEGBIDS297
Neuroscience Multi-Omic Data Archive (NeMO)University of Maryland, Baltimore2017$9,347,683NIMHNIMHMulti-omicsFASTQ; TSV; BAM; BIGWIG; MEX; LIST; BED; QBED; CSV; H5; BIGBED; LOOM; MTX; ASPX49
OpenNeuroStanford University2018$7,254,848NIMHNIMHMRI, PET, MEG, EEG, iEEGBIDS1076

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Sudhanvan Iyer
  2. Kathryn Maxson Jones
  3. Jill O Robinson
  4. Nicole R Provenza
  5. Dominique Duncan
  6. Gabriel Lázaro-Muñoz
  7. Amy L McGuire
  8. Sameer A Sheth
  9. Mary A Majumder
(2024)
The BRAIN Initiative data-sharing ecosystem: Characteristics, challenges, benefits, and opportunities
eLife 13:e94000.
https://doi.org/10.7554/eLife.94000