Project GENIE Releases Aggregated Cancer Genomic Data Set | Healthcare Informatics Magazine | Health IT | Information Technology Skip to content Skip to navigation

Project GENIE Releases Aggregated Cancer Genomic Data Set

January 5, 2017
by David Raths
| Reprints
Project includes nearly 19,000 de-identified genomic records
Click To View Gallery

In November 2015 the American Association for Cancer Research (AACR) launched a registry initiative to aggregate participants’ clinical-grade genomic sequencing data to improve patient treatment decisions and catalyze clinical and translational research. Now AACR has announced the first public release of cancer genomic data aggregated through the initiative.

The data set for AACR’s Project Genomics Evidence Neoplasia Information Exchange (GENIE) includes nearly 19,000 de-identified genomic records collected from patients who were treated at eight international institutions, making it among the largest fully public cancer genomic data sets released to date.

The release includes data for 59 major cancer types, including data on nearly 3,000 patients with lung cancer, more than 2,000 patients with breast cancer, and more than 2,000 patients with colorectal cancer. The genomic data and a limited amount of linked clinical data for each patient can be accessed via the AACR website or downloaded directly from Sage Bionetworks.

“We are excited to make publicly available this very large set of clinical-grade, next-generation sequencing data obtained during routine patient care,” said Charles Sawyers, M.D., Project GENIE Steering Committee chairperson, in a prepared statement.

AACR said there are many ways in which the data can be exploited to benefit patients in the future, including through the validation of gene signatures of drug response or prognosis; the ability to identify new patient populations for drugs previously approved by the U.S. Food and Drug Administration; the expansion of patient populations that will benefit from existing drugs; and the identification of new drug targets and biomarkers.

Sawyers, who is also chairperson of the Human Oncology and Pathogenesis Program at Memorial Sloan Kettering Cancer Center in New York and a Howard Hughes Medical Institute investigator, added, “These data were generated as part of routine patient care and without AACR Project GENIE they would likely never have been shared with the global cancer research community. We are committed to sharing not only the real-world data within the AACR Project GENIE registry but also our best practices, from tips about assembling an international consortium to the best variant analysis pipeline, because only by working together will information flow freely and patients benefit rapidly.”

AACR stressed that the newly released data are fully de-identified in compliance with the Health Insurance Portability and Accountability Act (HIPAA). They are derived from patients whose tumors were genetically sequenced as part of their care at one of the eight international institutions that participated in the first phase of AACR Project GENIE.

The eight institutions participating in AACR Project GENIE phase 1 are:

• Dana-Farber Cancer Institute, Boston;

• Gustave Roussy Cancer Campus, Paris-Villejuif-France;

• The Netherlands Cancer Institute, Amsterdam, on behalf of the Center for Personalized Cancer Treatment, Utrecht, The Netherlands;

• Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore;

•  Memorial Sloan Kettering Cancer Center, New York;

• Princess Margaret Cancer Centre, Toronto;

•  University of Texas MD Anderson Cancer Center, Houston; and,

•  Vanderbilt-Ingram Cancer Center, Nashville, Tennessee.

 To expand the AACR Project GENIE registry, the consortium is accepting applications for new participating centers starting Jan. 5, which is a year sooner than originally anticipated. Any nonprofit institution that meets certain criteria can submit an application to become a project participant.

Get the latest information on Health IT and attend other valuable sessions at this two-day Summit providing healthcare leaders with educational content, insightful debate and dialogue on the future of healthcare and technology.

Learn More



Arizona ACO Pilots Blockchain Platform to Improve Clinical Outcomes, Reduce Costs

Arizona Care Network, a Phoenix-based accountable care organization (ACO), plans to pilot a blockchain technology platform developed by Solve.Care with the aim of improving clinical outcomes, relieving healthcare’s administrative burdens, and reducing waste within the system.

Protenus February Breach Report: Number of Incidents Remain Steady

The number of healthcare data breach incidents continues to remain steady, and in February’s Breach Barometer report from Protenus, it was revealed that last month, a ransomware attack was responsible for the largest single incident.

Prior Authorization Burdens Hindering Patient Care, AMA Survey Finds

Approximately 64 percent of physicians in a recent American Medical Association (AMA) survey said they wait at least one business day before getting a response from a health plan regarding a prior authorization (PA) decision.

Intermountain Healthcare to Build Global DNA Registry with AncestryDNA, 23andMe Data

Intermountain Healthcare is building a new global DNA registry based on medical histories from people around the world, using existing genetic test results and electronic health histories.

NH-ISAC Accelerates Cyber Threat Sharing for Healthcare Industry

The National Health Information Sharing and Analysis Center (NH-ISAC) is partnering with Anomali, a provider of threat management solutions, to enable seamless, secure threat sharing within the healthcare community.

Enterprise Telemedicine Strategies Gaining Steam, Survey Finds

Healthcare providers are increasingly leveraging a centrally-managed (enterprise) approach to telemedicine, according to the results of the REACH Health 2018 U.S. Telemedicine Industry Benchmark survey.