Skip to main content


BD2K funded 13 Centers of Excellence, which are large-scale projects developing new approaches, methods, software tools, and related resources, and are also providing training to advance Big Data science in the context of their biomedical area of focus. The Centers are located all across the United States and function with the other BD2K grantees as a consortium and collaborate with one another for the purpose of furthering every aspect of the field of biomedical data science research.

Description of individual centers

Big Data for Discovery Science (BDDS)
Researchers at the BDDS focus on proteomics, genomics, and images of cells and brains collected from patients and subjects across the globe. They enable detection of patterns, trends, and relationships among these data for the efficient large-scale analysis of biomedical data.

BD2K-LINCS Data Coordination and Integration Center (BD2K-LINCS DCIC)
The BD2K-LINCS DCIC conducts data science research focused on perturbation-response data obtained from experiments with human cells and tissues, and provides access to and analysis of this data by the broader biomedical research community.

Center for Big Data in Translational Genomics (BDTG)
The BDTG creates data models and analysis tools to analyze massive datasets of genomic information to uncover the contribution of gene variants to disease with an initial focus on cancer.

Center for Causal Modeling and Discovery of Biomedical Knowledge from Big Data (CCD)
Center for Causal Modeling & Discovery of Biomedical Knowledge from Big Data (CCD) develops computational methods known as causal discovery algorithms that can be used to discover causal relationships from a combination of observational data, experimental data, and prior knowledge.

Center for Expanded Data Annotation and Retrieval (CEDAR)
Center for Expanded Data Annotation and Retrieval (CEDAR) is building new web-based technology to make it easier for biomedical scientists to author detailed metadata that describe their experiments completely, adhere to appropriate community-based standards, and incorporate controlled terms that facilitate interoperability with other online data sets.

Center for Mobility Data Integration to Insight (The Mobilize Center)
The Mobilize Center is analyzing movement data from over 6 million individuals using a smartphone app, revealing new insights about physical activity levels around the world and the factors predictive of these activity levels.

BD2K Centers Resources

The BD2K Centers have developed a wide array of tools and resources. A list of these resources is maintained by the BD2K Centers Coordination Center (BD2KCCC). The BD2KCCC helps to promote collaboration among the Centers and across the BD2K program, and coordinates BD2K Centers Consortium activities.

Resources from each BD2K Center were highlighted in a special issue of the Summer 2017 Biomedical Computational review. Learn more about some of the BD2K Centers exciting accomplishments.

Tools and resources are also available on the individual Centers resource pages:

  • BDDS
  • CCD
  • The Mobilize Center
  • CPCP
  • MD2K
  • HeartBD2K
  • KnowEnG

This page last reviewed on December 9, 2019