The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data among researchers, launched recently with a visit from Vice President Joe Biden to the system’s operations center at the University of Chicago.

An initiative of the National Cancer Institute (NCI), the GDC will be a core component of the National Cancer Moonshot and President Obama’s Precision Medicine Initiative (PMI). (For more information on cancer moonshot initiatives, visit “Clinical Labs and Diagnostics in Cancer Moonshot Initiatives“ and “Inside Track: Moonshot Diagnostics.”) The data system is funded by $70 million allocated to NCI to lead efforts in cancer genomics as part of PMI for oncology. The GDC will centralize, standardize, and make accessible data from such large-scale NCI programs such as the Cancer Genome Atlas (TCGA) and its pediatric equivalent, TherapeuticallyApplicableResearchtoGenerateEffectiveTreatments(TARGET).

Together, TCGA and TARGET represent some of the largest and most comprehensive cancer genomics datasets in the world, currently comprising more than two petabytes of data. The GDC will accept submissions of cancer genomic and clinical data from researchers around the world who wish to share their data broadly. Contributing researchers will be able to use the state-of-the-art analytic methods of the GDC, allowing them to compare their findings with other data in the GDC.

Data in the GDC, representing thousands of cancer patients and tumors, will be harmonized using standardized software algorithms so that they are accessible and broadly useful to any cancer researcher. The storage of raw genomic data in the GDC will also allow it to be reanalyzed as computational methods and genome annotations improve.

In an era of heightened concern about data security and authorized access, the GDC has important safeguards in place to ensure secure data storage and downloading.

Douglas Lowy, MD, National Cancer Institute.

Douglas Lowy, MD, National Cancer Institute.

“With the GDC, NCI has made a major commitment to maintaining long-term storage of cancer genomic data and providing researchers with free access to these data,” says Douglas Lowy, MD, acting director at NCI. “Importantly, the explanatory power of data in the GDC will grow over time as data from more patients are included, and ultimately the GDC will accelerate our efforts in precision medicine.”

The GDC is being built and managed by the University of Chicago Center for Data Intensive Science in collaboration with the Ontario Institute for Cancer Research, all under an NCI contract with Leidos Biomedical Research, Frederick, Md.

“Of particular significance, the GDC will also house data from a number of newer NCI programs that will sequence the DNA of patients enrolled in NCI clinical trials,” says Louis M. Staudt, MD, PhD, director of NCI’s Center for Cancer Genomics. “These datasets will lead to a much deeper understanding of which therapies are most effective for individual cancer patients. With each new addition, the GDC will evolve into a smarter, more comprehensive knowledge system that will foster important discoveries in cancer research and increase the success of cancer treatment for patients.”

The hope is that the GDC will form the basis for a comprehensive knowledge system for cancer. GDC researchers will be able to integrate genetic and clinical data, such as cancer imaging and histological data, with information on the molecular profiles of tumors as well as treatment response. From this perspective, the GDC would become an important resource for generating potentially actionable and life-changing information that ultimately could be used by doctors and their patients.

NCI is part of the National Institutes of Health. For more information, visit NCI.