Chromium System Brochure
Transcription
Chromium System Brochure
Introducing the Chromium™ System One system, one workflow, powerful new sequencing applications 10xGenomics.com 10x Genomics Changing the Definition of Sequencing™ The Chromium™ System One system, one workflow, powerful new sequencing applications GemCode™ technology powers an innovative system that transforms the capability of existing short-read sequencers. With millions of uniquely addressable partitions, the Chromium System unlocks critical genomic information. Genome Sequencing Targeted Sequencing The Chromium Genome provides long range information on a genome-wide scale, including variant calling, phasing and extensive characterization of genomic structure. Unlock critical genetic information for variants in heritable disorders, and discover key alterations in cancer. The Chromium Exome provides long range information, enabling phasing, structural variant detection and copy number determination. Low complexity and repetitive regions previously missed with short-read sequencing are now accessible. Single Cell Sequencing De Novo Assembly With Chromium Single Cell 3’, perform deep profiling of complex cell populations with high-throughput digital gene expression on a cell-by-cell basis. Trace expression profiles to individual cells to ensure biologically relevant signals are not masked by bulk average measurements. Discover the true genome with the Supernova™ Assembler and open the door to low-cost, everyday diploid assemblies. Unlock samplespecific sequence, probe diploid genome structure, and remove the need for a reference sequence of any kind. The Chromium™ System Unlock critical long range genomic and cell-by-cell gene expression information. 1 2 10x Genomics Changing the Definition of Sequencing™ Chromium™ Genome The most comprehensive genome Long range genomes at population scale. Call and phase the full spectrum of variants and unlock previously inaccessible regions from a single library. Resolve the genome into multi-megabase phase blocks While human genomes are diploid, analysis based on short reads collapses variants into a single haploid call set, masking critical genetic relationships. Chromium Genome phases the full spectrum of variants (SNVs, indels, and large-scale structural rearrangements) into ultra long multi-megabase phase blocks, enabling a full understanding of diploid genome sequence. 50 Mb Phase Block Chr 6 Massive partitioning and complete genomic information from a single library with 1ng input The Chromium Genome performs massive partitioning and barcoding of 1ng input DNA. The system produces uniform, high-complexity libraries, compatible with Illumina’s HiSeq X Ten systems, enabling high quality variant calling from minimal sequencing depth. Hap 1 STANDARD BARCODING Chromium Genome Partitions 384 >1,000,000 Barcode Pool 384 4,000,000 100ng+ 1ng Input DNA ™ Hap 2 70 kb Deletion Uncover previously inaccessible parts of the genome and hidden variants in critical genes Multi-megabase diploid assemblies eliminate the need for a reference Short reads cannot be mapped into repetitive regions, causing regions with no coverage (Child: BWA Aligner). The Lariat™ Aligner uses Linked-Read information to map reads that could not normally be mapped (Child Lariat Aligner), enabling variant calling and phasing in regions normally inaccessible to short reads. Father: NA12891 Child: NA12878 Mother: NA12892. The Supernova™ Assembler utilizes Linked-Reads to de novo construct multi-megabase diploid assemblies, preserving phasing information for small variants, structural rearrangements and novel sequence. Chr15 RPS17 Exon 3 Father Lariat Aligner Child Lariat Aligner Mother Lariat Aligner Child BWA Aligner G G G A G Child G A A G G C Mother A A G G C G Father 3 MAPQ>30 A Detection of a SNP in RPS17. The Lariat Aligner uses barcode information to correctly map reads and call variants in a highly repetitive region that standard aligners miss due to low mapping quality. A region in chromosome 3 is represented in the diploid assembly of NA12878 by two >10Mb phased scaffolds within a larger 17Mb overall scaffold segment. A zoom into the figure shows that variants between the haplotypes are conserved in the assembly such as the 8kb deletion indicated, where traditional assemblies compress the two haplotypes into a single path. 4 10x Genomics Changing the Definition of Sequencing™ Chromium™ Exome In collaboration with Reach beyond the exome The Chromium Exome with optimized baits powered by Agilent’s marketleading SureSelect® technology enables access to low complexity and duplicated regions, megabase-scale phasing, structural and copy number variation while improving the quality of SNP calls. Resolve compound heterozygosity and copy number change Establish cis or trans relationships between variants without trio sequencing. TRPV1 SHPK CTNS P2RX5-TAX1BP3 Coverage G>A Hap1 57kb Deletion Enable phasing of thousands of genes and detection of structural and copy number variation The optimized SureSelect baits are designed specifically for the Chromium Exome to improve gene phasing by closing gaps, and recovering hard-to-map loci in the genome. Hap2 Exome Baits Linked-Reads from Exome Baits Identify structural variants Detect gene fusions, duplications, deletions and translocations from a single exome run. Optimized SureSelect Baits 13 Mb EML4 Linked-Reads from Chromium Exome ALK Coverage BC Overlap Map previously inaccessible regions and improve variant calling performance The barcode-aware Lariat™ Aligner maps reads in previously inaccessible regions, identifies new variants and eliminates false positives due to incorrect mapping. Linked Reads 7,693 bp RPS17 Optimized SureSelect Baits NA12878 Lariat Aligner NA12878 BWA Aligner 1 MAPQ>30 5 6 20 29 EML4-ALK 6 10x Genomics Changing the Definition of Sequencing™ Chromium™ Single Cell 3’ High-throughput single cell RNA sequencing Profiling 1,000s to 10,000s of cells per experiment increases sensitivity and accuracy for the detection of rare cell types. Single cell resolution is the key to uncovering the true complexity of heterogeneous populations. Chromium Single Cell 3’ provides scalable transcriptional profiling of 1,000s to 10,000s of individual cells. Efficient biochemistry captures thousands of genes per cell Differential expression identifies cell cycle phase Number of distinct genes detected per cell as a function of sequencing coverage for HEK293T cells. Graph shows mean (black line) and range (dark gray) over 16 independent experiments. Proliferating HEK293T cells were profiled and scored for expression of markers associated with each major cell cycle phase. Cells from all phases were identified at expected proportions. Single cell partitioning with low multiplet rates Mouse transcript counts 60,000 Human:Mouse Human only Mouse only 0 Human transcript counts 60,000 Approximately 1,400 human (HEK293T) and mouse (NIH3T3) cells were mixed at a 1:1 ratio prior to profiling. 99.4% of the resulting cell-containing GEMs gave rise to sequencing reads mapping to only one species. This implies a total doublet rate of 1.0%. 7 High-throughput Up to 8 channels processed in parallel 1,000 to 6,000+ cells per channel 10 minute run time per chip No cell size restrictions ~50% cell processing efficiency t-SNE projection of gene expression profiles from 68,000 unsorted human peripheral blood mononuclear cells (PBMCs). Colors indicate the closest match in a reference panel of sorted cell types. Multiple key applications Immunology Neurobiology Cancer Biology Stem Cell Research Embryology 8 10x Genomics Changing the Definition of Sequencing™ Make the Discovery with GemCode™ Technology GemCode Technology with millions of uniquely addressable partitions enables single molecule resolution and single cell sequencing GemCode™ Technology Process ™ 1 2 3 Gel bead Disposable microfluidics Definitive results Mega diversity reagent libraries. Highly automated reaction assembly. Single cell analysis on a massive scale. Up to 100 million barcoded oligos per bead. Millions of “effective” pipette steps per minute. Creating the GEMs Functionalized with defined DNA barcodes. Deformable polymeric beads dissolve on demand. Droplet Monodisperse water-in-oil droplets. Implementing the GEMs Discovering with GEMs Single molecule resolution. On a single instrument Touch screen operation. ~10 minute run time. Massively scalable to millions of partitions. Together make a GEM Gelbead in Emulsion. Uniquely addressable reactions for molecular barcoding on a massive scale. >90% of all droplet reactions receive a single barcoded gel bead. Gel bead in emulsion (GEM) Barcode and reagent delivery with sample partitioning on a massive scale. 9 10 10x Genomics Changing the Definition of Sequencing™ Partitioning & Molecular Barcoding at Scale Partition sample across 100,000s to 1,000,000s of partitions, each containing a unique barcode DNA Linked-Reads Barcoded Fragment Embedded Barcode Genome is replicated and barcoded via a low-level enzymatic replication. Enzyme GEMs Collect Pool Lines represent Linked-Reads. Dots represent reads. Color indicates barcode. Reads with same barcode originated from molecules encapsulated in the same partition. Barcoded Primer Library DNA or Cells Gel beads loaded with barcoded oligonucleotides are first mixed with sample and enzyme mixture, then combined with an oilsurfactant solution at a microfluidic ‘double-cross’ junction. Oil Gel bead-containing droplets (GEMs) flow to a reservoir and are harvested. Gel beads are dissolved and molecular barcoding of the sample is initiated. The barcoded products are pooled from each droplet. Standard library construction adds the remaining sequencing adapters and sample indices. Single Cell Digital Gene Expression Barcoded cDNA Cell Cell Embedded Barcode Cell Cell mRNA is transcribed by a reverse transcriptase that creates barcoded cDNA. Cell 1... Cell 5,000 Gene 1 Gene 2... Gene 2,000 Gene 1 Gene 2... Gene 2,000 A barcode identifies transcripts originating from a single cell, which are then counted. 11 12 10x Genomics Changing the Definition of Sequencing™ The Chromium™ Software Suite Linking data, developers and discovery Long Range Genomes and Exomes Single Cell Transcriptomics Long Ranger Analysis Pipelines Cell Ranger™ Analysis Pipelines Leverage molecular barcoding to generate a powerful data type called Linked-Reads. Novel algorithms in the Long Ranger pipelines yield several classes of biological insights from Linked-Reads: Turn-key analysis for single cell gene expression sequencing data. Combining robust transcriptome alignment with large scale cellular barcoding, the Cell Ranger pipelines rapidly generate expression profiles to shed light on: ™ •Barcode-aware alignment •Megabase-scale haplotype phasing •Structural variant detection and phasing •Greater accuracy in SNP and indel calling •Complex primary cell populations •Cellular heterogeneity in cancer tumors •Cell cycle behavior Building upon widely accepted aligners and variant callers such as BWA, GATK, and Freebayes, the Long Ranger pipelines leverage the Chromium System’s molecular barcoding to enable phasing and structural variant calling. De Novo Assembly Open Source Informatics Supernova Assembler Martian™ Technology The Chromium System opens the door to low-cost, everyday diploid genome assemblies. The Supernova Assembler leverages the unique properties of Linked-Read data to reconstruct the genome under study, without the need for a reference genome. Chromium Software is built using a powerful, open source informatics platform called Martian. The Martian language features an elegant syntax for defining complex pipelines. A lightweight runtime featuring a micro-container design is performanceoptimized for high-throughput NGS analysis, and includes powerful data management features. ™ The Supernova Assembler exploits Linked-Read data to reconstruct a diploid genome, unlocking sample-specific sequence and removing the need for a reference sequence of any kind. 13 Starting with pre-built or user-supplied transcriptomes, the Cell Ranger pipelines can process cellular barcodes to produce gene expression profiles across tens of thousands of cells at once. The Martian language is complemented by a type-checking compiler, visual IDE, and debugging and profiling tools. The Martian runtime provides built-in support for pipeline composability, parallelization, and parameter sweeping. 14 10x Genomics Changing the Definition of Sequencing™ The Loupe haplotype view displays multi-megabase phase blocks and structural variant calls. Above, an example of compound heterozygosity is shown where the variants involved are fully contained within large, contiguous phase blocks. The Loupe™ family of visualization applications brings clarity to the novel biological insights made possible by the Chromium System. The Loupe structural variant view lists the calls and candidates produced by the Long Ranger pipelines, and uses changes and gaps in color intensity to represent barcode-based evidence of structural rearrangements. For genomes and exomes: fully haplotype-enabled genome browsing and structural variant visualization. For single cell transcriptomics: dimensionality reduction, clustering, and isolation of cell types and phases. Loupe applications feature fluid, modern user interfaces, run on Windows and Mac, and work with files produced by the Long Ranger and Cell Ranger pipelines. The Loupe single cell analysis view features an array of techniques for dimensionality reduction and clustering which can be applied to gain insight into a variety of single cell experiment types. 15 16 10x Genomics 10x Genomics, Inc. About us 10x Genomics meets the critical need for long range, structural and cellular information, with an innovative system that transforms the capability of existing short-read sequencers. Our Chromium™ System supports comprehensive genomics and high-throughput single cell transcriptomics. It enables researchers to discover previously inaccessible genomic information at unprecedented scale, including phased structural variants, phased single nucleotide variants, and dynamic gene expression of individual cells—while leveraging their existing sequencing systems and workflows. 10xGenomics.com Contact Us Pleasanton, CA 10x Genomics, Inc. 7068 Koll Center Parkway, Suite 401 Pleasanton, CA 94566 17 +1 925 401 7300 +1 800 709 1208 [email protected] San Francisco, CA 10x Genomics, Inc. 160 Spear Street, Suite 1130 San Francisco, CA 94105 +1 925 401 7300 +1 800 709 1208 [email protected] © 2016 10x Genomics, Inc. All rights reserved. Duplication and/or reproduction of all or any portion of this document without the express written consent of 10x Genomics, Inc., is strictly forbidden. Nothing contained herein shall constitute any warranty, express or implied, as to the performance of any products described herein. Any and all warranties applicable to any products are set forth in the applicable terms and conditions of sale accompanying the purchase of such product. 10x Genomics provides no warranty and hereby disclaims any and all warranties as to the use of any third party products or protocols described herein. The use of products described herein is subject to certain restrictions as set forth in the applicable terms and conditions of sale accompanying the purchase of such product. “10x”, “10x Genomics”, “Changing the Definition of Sequencing”, “Chromium”, “GemCode”, “Loupe”, “Long Ranger”, “Cell Ranger”, “Lariat” and “Supernova” are trademarks of 10x Genomics, Inc. All other trademarks are the property of their respective owners. All products and services described herein are intended FOR RESEARCH USE ONLY and NOT FOR USE IN DIAGNOSTIC PROCEDURES. Changing the Definition of Sequencing™ 10xGenomics.com