Start a new topic

What is Raw Data?

Raw data is defined as the data that would typically be provided to investigators from a core facility. Thus, it may not be the initial data produced by the instrumentation but may have had additional processing (e.g., .fastq files rather than .bcl files produced from next generation sequencing assays).

Below are a few quick tips about raw data:

  • Raw data should include appropriate sample labels. 
  • Investigators are responsible for determine which raw data files will have the most utility for secondary analysis, including running additional workflows and analyses. 
  • When human subjects are involved, investigators should ensure that all raw data selected for sharing are in alignment with the obtained participant consent.


To maximize interoperability, raw data will need to be transformed into harmonized data. More information on raw vs harmonized data and the data harmonization process can be found within the Data Harmonization section of the Data Management Plan

Login or Signup to post a comment