Skip to main content

Data Commons Pilot Phase Consortium

As part of the DCPPC, the awardees will design innovative solutions that meet the needs of the computational, data, and scientific key capabilities of the NIH Data Commons Pilot Phase. These key capabilities include:

  1. Guidelines and metrics for making data Findable, Accessible, Interoperable, and Reusable (FAIR)
  2. An approach to Global Unique Identifiers (GUIDs)
  3. Application Program Interfaces (APIs) based on open standards
  4. Architecture independent of a specific cloud platform or provider
  5. Workspaces to find and interact with data and associated tools
  6. Research ethics, privacy, and security (including authentication and authorization)
  7. Indexing and search functionality
  8. Use cases that demonstrate how the NIH Data Commons Pilot could advance biomedical research
  9. Coordination, training, and outreach

Along with NIH staff, contracting staff, and external program consultants, the following awardees will form the NIH Data Commons Pilot Phase Consortium (DCPPC).

Awards Made Under Research Opportunity AnData Stewards nouncement (ROA) RM-17-026

The following awards were made using a special type of funding mechanism called Other Transactions (OT). The OT mechanism gives NIH considerable flexibility in making and managing awards. This will be particularly important for the DCPPC to stay nimble as it approaches the complex task involved in the development of the NIH Data Commons Pilot Phase under the ever-changing conditions of data science and biomedical science.
Project TitleAwardeesInstitution
University of Maryland NIH Data Commons Facilitation CenterCrabtree, JonathanUniversity of Maryland
Felix, VictorUniversity of Maryland
Giglio, Michelle GwinnUniversity of Maryland
O'Connor, TimothyUniversity of Maryland
Sansone, Susanna-AssuntaUniversity of Oxford, Oxford e-Research Centre
Schriml, LynnUniversity of Maryland
White, OwenUniversity of Maryland
Development and Implementation Plan for Community Supported FAIR Guidelines and MetricsDumontier, MichelMaastricht University
Ma'ayan, AviIcahn School of Medicine at Mount Sinai
Klenk, JuergenDeloitte Consulting LLP
Sansone, Susanna-AssuntaUniversity of Oxford, Oxford e-Research Centre
Schurer, StephanUniversity of Miami
Patient-centric information commons under FAIR principals (PIC-FAIR)Avillach, PaulHarvard Medical School
Kohane, IsaacHarvard Medical School
Kreda, DavidHarvard Medical School
Manrai, Arjun K.Harvard Medical School
Patel, Chirag J.Harvard Medical School
Susan, ChurchillHarvard Medical School
Versmee, LauraHarvard Medical School
The Commons Alliance: A partnership to catalyze the creation of an NIH Data CommonsGrossman, Robert L.University of Chicago
Haussler, DavidUniversity of California, Santa Cruz
O’Connor, BrianUniversity of California, Santa Cruz
Paten, BenedictUniversity of California, Santa Cruz
Philippakis, AnthonyBroad Institute
Tang, Yajing (Phillis)University of Chicago
Yung, ChristinaUniversity of Chicago
A Commons Platform for Promoting Continuous FAIRnessChard, KyleUniversity of Chicago: Globus
Foster, IanUniversity of Chicago: Globus
Kesselman, CarlUniversity of Southern California
Madduri, RaviUniversity of Chicago: Globus
Tools and workflows for mining genomic data on many cloudsBrown, C. TitusUniversity of California, Davis
Zaranek, Alexander WaitCuroverse Innovations, Inc.
A collaboration for the NIH Data CommonsAhalt, StanleyRenaissance Computing Institute: RENCI
Boyles, RebeccaRTI International
Castillo, ClarisRenaissance Computing Institute: RENCI
Conway, MichaelRenaissance Computing Institute: RENCI
Cox, StevenRenaissance Computing Institute: RENCI
Dumontier, MichelMaastricht University
Haendel, MelissaOregon Health Science University
Mungall, ChristopherLawrence Berkeley National Laboratory
Oprea, TudorUniversity of New Mexico Health Sciences Center
Peter, RobinsonThe Jackson Laboratory
FAIR data to drive CURESDavis-Dusenbery, BrandiSeven Bridges Genomics Inc
Nielsen, FionaRepositive
Pyarajan, SaijuUS Department of Veterans Affairs
de Waard, AnitaElsevier
CALIFORNIA: Cloud-agnostic Architecture to Locate Indexed FAIR objects and safely reuse them in new integrated analysesOhno-Machado, LucilaUniversity of California, San Diego
Jiang, XiaoqianUniversity of California, San Diego
Sansone, Susanna-AssuntaUniversity of Oxford, Oxford e-Research Centre
Xu, HuaUniversity of Texas Health Science Center at Houston
Towards a FAIR Digital Ecosystem in the CloudClark, TimothyHarvard University
Crosas, MercèHarvard University
Fenner, MartinDataCite


Data Stewards

The stewards for each of the following data sets received supplemental funding to existing grants to participate as members of the DCPPC. These data sets will serve as test cases to develop the capabilities of the NIH Data Commons Pilot Phase.
Data SetPrincipal Investigator(s)Institution(s)
Genotype Tissue Expression (GTEx)Ardlie, KristinBroad Institute Inc.
Getz, GadBroad Institute Inc.
Alliance of Genome Resources (AGR)Judith BlakeThe Jackson Library
Carol BultThe Jackson Library
J. Michael CherryStanford University
Suzanna LewisLawrence Berkeley National Laboratory
Mary ShimoyamaMedical College of Wisconsin
Paul ThomasUniversity of Southern California
Trans-Omics for Precision Medicine (TOPMed)Abecasis, GoncaloUniversity of Michigan
Blackwell, TomUniversity of Michigan
Borecki, IngridUniversity of Washington
Laurie, CatherineUniversity of Washington
Psaty, BruceUniversity of Washington
Weir, BruceUniversity of Washington

Learn more about each of the Test Case Data Sets

Cloud Service Providers

Researchers and staff from the NIH Data Commons Pilot Phase and the STRIDES initiative will work with cloud service providers (CSPs) to learn how to provide a sustainable cloud infrastructure to support NIH-derived data sets including the Data Commons Pilot Phase test case datasets.

Assessment and Logistical Support

The MITRE Corporation, operator of the HHS federally funded research and development center, will provide a broad range of support services for the NIH Data Commons Pilot Phase including the following:

  • Assessment services to analyze usage, costs, business models, and long-term data viability
  • Communications, coordination, training, and other logistical support

This contract is administered by the Center for Information Technology, and is a trans-NIH initiative funded by multiple Institutes, Centers, and Offices.

This page last reviewed on February 1, 2024