The Matchmaker Exchange
Transcription
The Matchmaker Exchange
The Matchmaker Exchange: Connecting Matchmakers to Accelerate Gene Discovery Heidi Rehm, PhD, FACMG Associate Professor of Pathology, Brigham & Women’s Hospital and Harvard Medical School Director, Partners Laboratory for Molecular Medicine Clinical Director, Broad Institute Clinical Research Sequencing Platform On behalf of the MME Working Group Goal: To identify causal genes for rare disease by matching phenotypes and genotypes across a federated network of genomic databases. Patient #1 Clinical Geneticist #1 Patient #2 Clinical Geneticist #2 Notification of Match Phenotypic Data Feature 1 Feature 2 Feature 3 Feature 4 Feature 5 Genotypic Data Gene A Gene B Gene C Gene D Gene E Gene F Genomic Matchmaker Genotypic Data Gene D Gene G Gene H Phenotypic Data Feature 1 Feature 3 Feature 4 Feature 5 Feature 6 Courtesy of Joel Krier The state of affairs as of the initial meeting in October 2013, ASHG. Gene Matcher PEER DECIPHER Genome Connect Monarch LOVD Multiple Matchmaker disconnected Exchange projects Café Variome Undiag. Diseases Program ClinGen Gene GEM.app Phenome Central Yenta Making a Match Linking multiple databases is not without its challenges… The Real Story Gene Matcher Phenome Central Exchanging Data API v 1.0 0.1 DECIPHER Courtesy Ben Hutton, DECIPHER Gene Gene Matcher Diseases PEER Gene and Phenotype (HPO) DECIPHER Variants Disease and Variants Genome Connect Model Organisms Monarch LOVD Multiple Matchmaker disconnected Exchange projects Café Variome Gene Undiag. and Phenotype (HPO) Variant and Disease ClinGen Disease and VCFs Variants and Phenotype Diseases Program Gene GEM.app Phenome Central Yenta Gene and Phenotype (HPO) Steps to Build the MME • Defined MME API – aided by GA4GH Data WG • Created consent policy – aided by GA4GH REWG (see Appendix) • Determined requirements to be a MME service (see Appendix) • Developed user agreement to use the MME (see Appendix) • Security requirements - work in progress with GA4GH Security WG PhenomeCentral www.phenomecentral.org Dr. Mike Brudno U of T Step 1: Add Patient via PhenoTips Courtesy of Mike Brudno API: Query by Example Request: a (real) patient profile, with phenotype and/or gene/genotype data Response: a list of “similar” patient profiles Note: Matching algorithms under refinement Step 2: See Patients Similar to Yours Courtesy of Mike Brudno Step 3: Contact Submitter of Other Dataset Courtesy of Mike Brudno MME Pilot GeneMatcher • Designed to connect clinicians/researchers with the same candidate gene (e.g. from a patient, animal models, etc.) • Only a human gene symbol or ID is required. • Can also match on OMIM phenotype numbers. • Matching on phenotypic features is in development. As of 6/1/2015: 2178 genes submitted by 486 submitters from 38 countries 307 Matches! Courtesy Ada Hamosh, François Schiettecatte, Nara Sobreira Growth in Number of Genes & Matches in GeneMatcher 2500 Number 2000 1500 1000 500 0 Courtesy Ada Hamosh, François Schiettecatte, Nara Sobreira Genes Matches 16 MME Pilot • Test set of 50 solved cases into both databases – All successfully identified • Tested using 45 unsolved cases with flagged candidates in PhenomeCentral – 10 gene matches • 6 false positives (discordant phenotypes) • 2 unresolved • 2 potentially significant Currently Connected MME Services Special Issue of Human Mutation o Guest Editors: Ada Hamosh, Kym Boycott, Heidi Rehm o Published October (timed with MME session at ASHG) o 16 papers • MME Overview • MME API • Mathematics of Matching Algorithms • Papers by MME services (PhenomeCentral, DECIPHER, GeneMatcher, Café Variome, Monarch, UDN, GENESIS/GEM.app, GenomeConnect, Patient Matching) • Cases solved through matching (3 papers) How to Participate Users can either connect a database to the MME via the API or Choose an existing MME service Guiding Use by the Community These tables help define the database location, content and approaches for each MME service to help guide the user in choosing a MME service for case deposition. Phenotypes Matchmaker Exchange Site Genotype Chromosome Structured Non-Human Variants VCF Files Gene Name coordinates Phenotypes Models Diagnosis term Diagnosis code PhenomeCentral (Canada) √ √ √ GeneMatcher (USA) √ √ √ DECIPHER (United Kingdom) √ Matchmaker Exchange Site √ √ Parameters Used for Matching Gene Diagnosis PhenomeCentral √ √ GeneMatcher √ √ DECIPHER √ Candidates Phenotype Match Score Features √ √ √ √ √ √ √ √ √ √ √ √ Evidence Flagged for Gene Gene Candidate Candidates s √ √ √ √ www.matchmakerexchange.org Future Directions • Support for hypothesis-free matching in the absence of an identified candidate gene • Designate more case data as freely accessible for any type of analysis (defining phenotypes, meta-data analysis, algorithm training, etc) • Patient-initiated matching Acknowledgements S Balasubramanian Mike Bamshad Sergio Beltran Agullo Jonathan Berg Kym Boycott Anthony Brookes Catherine Brownstein Michael Brudno Han Brunner Orion Buske Deanna Church Raymond Dalgliesh Victor de la Torre Andrew Devereau Johan den Dunnen Sergiu Dumitriu Helen Firth Paul Flicek Jan Friedman Richard Gibbs Marta Girdea Robert Green Ada Hamosh Melissa Haendel Ingrid Holm Matt Hurles Ben Hutton Ekta Khurana Sebastian Kohler Joel Krier Owen Lancaster Melissa Landrum Farrah Ladha Paul Lasko Rick Lifton Daniel MacArthur Alex MacKenzie Danielle Azzariti Aleksander Milosavljevic Chris Mungall Debbie Nickerson Woong-Yan Park Justin Paschall Anthony Philippakis Heidi Rehm Peter Robinson Gary Saunders Francois Schiettecatte Rolf Sijmons Nara Sobreira Jawahar Swaminathan Morris Swertz Peter Taschner Sharon Terry Rachel Thompson Stephan Zuchner Appendix Launching the MME 1. Goals of MME Defined – At the 2013 ASHG Conference, representatives of genomic discovery platforms met and agreed on a common goal to develop a federated network of databases to support novel gene discovery and inform the clinical interpretation of genomic variation. 2. Version 0.1 MME API Developed – In 2014, members of PhenomeCentral and GeneMatcher collaborated to develop a common API. The group engaged technical input from the wider MME services and the Data Working Group of the GA4GH to refine the API and conform to evolving genomic standards. The API was posted on GitHub for community exchange of ideas. 3. MME Core Policies Developed – Over the span of one year, core policies to support the MME were developed and finalized in January 2015 at the Miami MME meeting. These policies are described above and include a tiered consent framework, a memorandum of understanding for MME services, a user agreement and a steering committee to oversee the program. 4. Matchmaker Exchange Website Launched – a website was launched to begin engagement of the outside community. The website can be found at www.matchmakerexchange.org. 5. Matching Algorithm Principles Defined – It was agreed that each matchmaker service would support its own matching algorithms, in part to deal with the practical aspects of different data schemas, database content and notification management on behalf of existing users. However, matchmaker services continue to collaborate and share practices using GitHub as a forum for exchange of ideas and algorithms. 6. API Test Phase – Beginning in 2014, three matchmaker services (PhenomeCentral, GeneMatcher and DECIPHER) implemented the API in three point-to-point launches in which the entire content of each database was queried against the content of the other databases to evaluate the code-base for the API and matching algorithms. API keys are exchanged using a standard Apache-based protocol that promotes data security. Problems were documented on a GitHub issues tracker and resolved. 7. MME Test Dataset Developed – To improve the ease with which new matchmaker services can be incorporated into the MME, a test dataset has been developed and resides on GitHub to allow evaluation of API and matching algorithm implementations. 8. User Interface to Support Queries – User interfaces at individual MME services are being built to support user-initiated queries of content across the MME. The status of user interfaces for each MME service can be found on the MME website. API v1 Data Fields ID (Mandatory) - The internal identifier (obfuscated or not) that can be used by the originating system to reference the patient data. Label (Optional) - A name/identifier assigned by the user which can be used to reference the patient in a recognizable manner (in an email for example); it should not contain any personally identifiable information. Query type (Optional) – Accepted values: once or periodic Submitter (Mandatory) - Consists of contact information of the person that submitted the search Gender (Optional) Age of onset (Optional) Mode of inheritance (Optional) Disorders (Optional) - A list of OMIM (MIM:######) or OrphaNet (ORPHA#####) identifiers, can be empty Features – It is mandatory to have at least clinical features or gene(s ); having both is preferred – HPO terms for clinical features – gene name(s) MME Informed Consent Policy The MME worked closely with the GA4GH's Regulatory and Ethics Working Group and Consent Task Team on developing a proposal for informed consent for data sharing in the context of genomic matchmaking within MME. We have distinguished two levels of matchmaking and different consent requirements based on the data shared and the probability of re-identifying the patient: Level 1 - No additional consent required - This level of matchmaking involves a data requester querying on a broad phenotype description or disease name using standardized terms or codes (Human Phenotype Ontology (HPO), OMIM, Orphanet) and/or candidate gene names +/- variant type. This level of sharing is consistent with current clinical practice with low risk of possible reidentification and therefore specific patient consent for this activity is not required. Level 2 - Consent required - This level of matchmaking involves a data requester querying on a unique or sensitive phenotype description and/or sequence level and related information, such as defined variants and/or genomic datasets. This level of sharing requires consent from the patient. If the patient had previously consented to data being shared in an open or registered access database whose declared purpose involves data sharing for purposes consistent with those of this matchmaking, no additional consent is required. The MME service in which data is deposited is responsible for ensuring patient data used in matchmaking is consented appropriately. MME Service Requirements To become a MME service, each new site must achieve the following: 1. Require users to deposit case data to undertake a federated query across the MME service providers 2. Establish a minimum of two point-to-point API connections to other MME services 3. Contain content that is considered by the MME steering committee to be useful for matching, including the flagging of, or ability to prioritize, candidate genes 4. Successfully implement matching algorithms using test data 5. During user queries, enable dual notification of data requester (i.e. the querier) and prior data depositor (i.e. the queried) including sharing the identities and contact information for each 6. For each database to which a MME service is connected by an API, the connected database’s disclaimers should be posted on the MME service’s website and displayed with query results. Disclaimers can be found on GitHub (https://github.com/MatchmakerExchange/mmeapis/tree/master/disclaimers) 7. Store queries sent and received between MME sites only for the purpose of auditing, defining query statistics, and following up queries to understand rates of validated gene discovery 8. Attest to database security requirements as defined by the GA4GH Security WG (forthcoming) 9. Advance the goals of the MME project through active participation in meetings and conference calls including defining a representative for the MME steering committee MME End User Agreement To use the MME, each data querier agrees to the following: 1. To make no attempt to identify individual patients in any MME database 2. To enable all cases submitted for querying to be stored in the query-initiating database for future matching 3. To obtain permission from the source of the matching data before publishing or presenting the results of queries 4. To acknowledge the MME, and the specific MME service that supported any discoveries in publications, as appropriate