BD2K Centers | NIH Common Fund

Overview

BD2K funded 13 Centers of Excellence, which are large-scale projects developing new approaches, methods, software tools, and related resources, and are also providing training to advance Big Data science in the context of their biomedical area of focus. The Centers are located all across the United States and function with the other BD2K grantees as a consortium and collaborate with one another for the purpose of furthering every aspect of the field of biomedical data science research.

Description of individual centers

Big Data for Discovery Science (BDDS)
Researchers at the BDDS focus on proteomics, genomics, and images of cells and brains collected from patients and subjects across the globe. They enable detection of patterns, trends, and relationships among these data for the efficient large-scale analysis of biomedical data.

BD2K-LINCS Data Coordination and Integration Center (BD2K-LINCS DCIC)
The BD2K-LINCS DCIC conducts data science research focused on perturbation-response data obtained from experiments with human cells and tissues, and provides access to and analysis of this data by the broader biomedical research community.

Center for Big Data in Translational Genomics (BDTG)
The BDTG creates data models and analysis tools to analyze massive datasets of genomic information to uncover the contribution of gene variants to disease with an initial focus on cancer.

Center for Causal Modeling and Discovery of Biomedical Knowledge from Big Data (CCD)
Center for Causal Modeling & Discovery of Biomedical Knowledge from Big Data (CCD) develops computational methods known as causal discovery algorithms that can be used to discover causal relationships from a combination of observational data, experimental data, and prior knowledge.

Center for Expanded Data Annotation and Retrieval (CEDAR)
Center for Expanded Data Annotation and Retrieval (CEDAR) is building new web-based technology to make it easier for biomedical scientists to author detailed metadata that describe their experiments completely, adhere to appropriate community-based standards, and incorporate controlled terms that facilitate interoperability with other online data sets.

Center for Mobility Data Integration to Insight (The Mobilize Center)
The Mobilize Center is analyzing movement data from over 6 million individuals using a smartphone app, revealing new insights about physical activity levels around the world and the factors predictive of these activity levels.

Center for Predictive Computational Phenotyping (CPCP)
The CPCP aims to accelerate the impact of predictive modeling on clinical practice by developing computational and statistical methods and software for a range of computational phenotyping tasks, including extracting relevant phenotypes from complex data sources and predicting clinically important phenotypes before they are exhibited.

Center of Excellence for Mobile Sensor Data-to-Knowledge (MD2K)
  Researchers at MD2K develop tools to make it easier to gather, analyze, and interpret data from mobile and wearable sensors to reliably quantify physical, biological, behavioral, social, and environmental factors that contribute to health and disease risk.

ENIGMA Center for Worldwide Medicine, Imaging, and Genomics (ENIGMA)
The ENIGMA Center develops computational methods for integration, clustering, and learning from complex biodata types to help identify factors that either resist or promote brain disease, or assist in the diagnosis and prognosis, as well as new mechanisms and drug targets for mental health care.

Heart BD2K, a Community Effort to Translate Protein Data to Knowledge: An Integrated Platform (Heart BD2K)
The goal of the Heart BD2K Center is to democratize data research to include non-computational scientists and individuals and to apply innovative global community-driven data integration and modeling methods to address challenges involved in the study of protein structure, function, and networks with a focus on cardiovascular research.

KnowEng, a Scalable Knowledge Engine for Large-Scale Genomic Data (KnowEng)
The KnowEng Center built a computational Knowledge Engine that uses data mining and machine learning techniques to obtain and combine gene function and gene interaction information from disparate genomic data sources.

Patient-Centered Information Commons: Standardized Unification of Research Elements (PIC-SURE)
Investigators at the PIC-SURE Center develop systems to combine genetic, environmental, imaging, behavioral, and clinical data on individual patients from multiple sources into integrated sets to enable more accurate classification of individual disease or disease risk;and facilitate greater precision in patient disease prevention and treatment strategies.

BD2K Centers Resources

The BD2K Centers have developed a wide array of tools and resources. A list of these resources is maintained by the BD2K Centers Coordination Center (BD2KCCC). The BD2KCCC helps to promote collaboration among the Centers and across the BD2K program, and coordinates BD2K Centers Consortium activities.

Resources from each BD2K Center were highlighted in a special issue of the Summer 2017 Biomedical Computational review. Learn more about some of the BD2K Centers exciting accomplishments.

Tools and resources are also available on the individual Centers resource pages:

BDDS
The Center for Big Data for Discovery Science offers a variety of toolsfor data management and processing, genetic association studies, statistical analysis, and utilities for existing frameworks.
BD2K-LINCS DCIC
The BD2K-LINCS DCIC develops web-based tools and data standards for integrative data access, visualization, and analysis across the distributed LINCS and BD2K sites and other relevant data sources. The DCIC engages in training and outreach activities by delivering educational materials to the research community through MOOCs, mentoring, seminars, and symposiums.
CCD
The Center for Causal Discovery provides in-person and on-line educational resources, and offers a software suite for causal discovery from large and complex biomedical data sets.
CEDAR
The CEDAR provides a combination of tools and training to aid the community in producing optimal metadata.
The Mobilize Center
The Mobilize Center engages the community in mobility big data efforts and promotes the use of big data analytics in biomedical computational research through a number of resources including software, training opportunities, data sources, and publications.
CPCP
The Center for Predictive Computational Phenotyping provides a variety of resourcesresources including computational and statistical methods and software for computational phenotyping, and training in biomedical big data analysis to scientists and clinicians.
MD2K
The Mobile Sensor Data-to-Knowledge Center provides software and training resources through the mHealthHub, a virtual forum where the research community interacts, learns, shares, and innovates on mobile health tools.
ENIGMA
The Enigma Center offers EnigmaVis, a software that provides online interactive visualization of data sets from ENIGMA. The ENIGMA center also provides training and tutorials for the software.
HeartBD2K
The HeartBD2K Center offers a collection of tools and platforms to enable community-driven data integration and modeling methods in the study of protein structure, function, and networks with a focus on cardiovascular research.
KnowEnG
The KnowEnG Center provides a cloud-based cyberinfrastructure for genomic analysis, and offers an education and training component that fosters learning through on-site programs and online interactive educational tools and resources. KnowEnG engages in community outreach by providing under-represented minority undergraduate students with training in Bioinformatics and Big Data through the Fisk-UIUC KnowEnG R25 program.
PIC-SURE
The PIC-SURE Center offers a variety of products that enable integration of patient data sets, and provides training to develop the next generation of big data scientists.