Rosetta software package
Transcription
Rosetta software package
4/10/2011 Rosetta .. • Breif introduction to Rosetta Ph.D. Thomas M. Frimurer • Rosetta docking example Novo Nordisk Foundation Center for Potein Reseach Center for Basic Metabilic Research Rosetta software package • Rosetta consists of multiple modules: – protein folding, comparative modeling, ligand docking, protein design, antibody/antigen interactions, etc. • Rosetta is developed in a consortium of twelve laboratories by • Breif introduction to Rosetta • Rosetta docking example around 50 developers • Rosetta is free for academic – user guide and tutorials are available • PyRosetta is a python interface – allows integration with Pymol • FoldIt is the better video game for you and your kids • Rosetta@home uses your computer for our RosettaCon 2009, Leavenworth, WA, USA research www.rosettacommons.org Conformational sampling and scoring of models in Rosetta 1. Rosetta Combines Conformational Sampling ~12,000,000 sequences >Q8TDV5 GP119_HUMAN Glucose-dependent.... MESSFSFGVILAVLASLIIATNTLVAVAVLLLIHKNDGVSL CFTLNLAVADTLIGVAISGLLTDQLSSPSRPTQKTLCSLRM AFVTSSAAASVLTVMLITFDRYLAIKQPFRYLKIMSGFVAG ACIAGLWLVSYLIGFLPLGIPMFQQTAYKGQCSFFAVFHPH FVLTLSCVGFFPAMLLFVFFYCDMLKIASMHSQQIRKMEHA GAMAGGYRSPRTPSDFKALRTVSVLIGSFALSWTPFLITGI ..... 1. Scoring Function for Structure Determination ~70,000 structures ? 1 4/10/2011 Conformational sampling and scoring of models in Rosetta 1. Rosetta Combines Conformational Sampling 1. 2. Exchanging the backbone conformation of 9 and 3 amino acids peptide fragments, collected from the pdb of homologous strethes. Metropolis Monte Carlo Enew < Eold accept Enew ≥ Eold accept with probability e-[ (Enew - Eold)/T ] Sampling strategies for backbone degrees of freedom >Q8TDV5 GP119_HUMAN Homo sapiens MESSFSFGVILAVLASLIIATNTLVAVAVLLLIHKNDGVSLCFTLNLAVADTLIGVAISGLLTDQLSSPSR LAVLASLIIA PTQKTLCSLRMAFVTSSAAASVLTVMLITFDRYLAIKQPFRYLKIMSGFVAGACIAGLWLVSYLIGFLPLG VLTLSCVGF IPMFQQTAYKGQCSFFAVFHPHFVLTLSCVGFFPAMLLFVFFYCDMLKIASMHSQQIRKMEHAGAMAGGYR SPRTPSDFKALRTVSVLIGSFALSWTPFLITGI..... Approximate local interactions using the distribution of conformations seen for similar sequences in known protein structures .. 2. Scoring Function for Structure Determination 1. 2. For each sequence window, select fragments that represent the conformations sampled during folding Low resolution: Reduced atom representation (centroid) Simple energy function which Aggressively search conformational space High resolution: Full atom more sophisticated energy function. “Local” search of conformational (and sequence) space Majority of conformational sampling protocols in Rosetta use Metropolis Monte Carlo follow by gradient based minmization • Monte Carlo simulated annealing assembly of fragments • Simplified protein representation • One centroid per amino acid side chain identify the ”best” structure 1) Remove very low contact order structures 2) Select broadest minima using cluster analysis 3) Select lowest energy structures with full atom potential The free energy minimum corresponds (usually) to the native protein fold • High resolution potential energy function • full atom representation Figure form Kistian W. Kaufmann et. al. 2010 Figure form Kistian W. Kaufmann et. al. 2010 Computational strategy .. ab initio protein folding Approximate local interactions using the distribution of conformations seen for similar sequences in known protein structures .. Filter conformational ensample: Cluster models that maintain the same overall fold e.g. Cα rmsd < 5 Å • Statistically-derived potential function • Steric overlap (vdw interactions) • Residue environment (solvation) • Pairwise interactions (electrostatics) • Strand pairing (hydrogen bonding) • Compactness (solvation) Folding Units .. While not every protein fold is present in the protein databank, all possible conformations of small peptides are .. Its depth is obscured because of the simplified energy approximation Rosetta begins with an extended peptide chain. Insertions of backbone ”fragments” rapidly folds the protein Statistically-derived potential function Cluster analysis .. Select broadest minima Sample conformational space using Monte Carlo simulations Figure form Kistian W. Kaufmann et. al. 2010 Refine models .. Simons K. T.; Kooperberg C.; Huang E.; Baker D. (1997) J. Mol. Biol. 268, 209–225 Rohl C. A.; Strauss C. E.; Misura K. M.; Baker D. (2004) Methods Enzymol. 383, 66–93. Simons K. T.; Ruczinski I.; Kooperberg C.; Fox B. A.; Bystroff C.; Baker D. (1999) Proteins 34, 82–95. Bradley P.; Misura K. M.; Baker D. (2005) Science 309, 1868–1871. Full atomic potential energy function .. Figure form Kistian W. Kaufmann et. al. 2010 2 4/10/2011 The resulting model undergo atomic-detail refinement • http://rosettadesigngroup.com • http://robetta.bakerlab.org Figure form Kistian W. Kaufmann et. al. 2010 http://rosettadesigngroup.com http://robetta.bakerlab.org GPR119 AR-231453 interaction ? • Breif introduction to Rosetta AR-231453 • Rosetta docking example Mutational mapping of the AR-231453 ligand binding site in the GPR119 receptor 3 4/10/2011 Rosetta docking protocol Modelling the GPR119 receptor Table 1: Sequence alignment of β2-adrenergic GPR with GPR119 ID=22% pP=7.2 2rh1_a Q8TDV5_GP119_HUMAN 2rh1_a 2rh1_a Q8TDV5_GP119_HUMAN 2rh1_a 2rh1_a Q8TDV5_GP119_HUMAN 2rh1_a 1 1 .E..#....I##.L#.L#I#...####.###.#.+...V..#F...LA#AD.##G#A#..#....#..........#C.#...#....#.AS#.T###I.#DRY#AI..PF+Y.. DEVWVVGMGIVMSLIVLAIVFGNVLVITAIAKFERLQTVTNYFITSLACADLVMGLAVVPFGAAHILMKMWTFGNFWCEFWTSIDVLCVTASIETLCVIAVDRYFAITSPFKYQS MESSFSFGVILAVLASLIIATNTLVAVAVLLLIHKNDGVSLCFTLNLAVADTLIGVAISGLLTDQLSSPSRPTQKTLCSLRMAFVTSSAAASVLTVMLITFDRYLAIKQPFRYLK 1. 2. 1. Resulting models sorted with respect to energy 2. Best scoring model had 2.7 Å to TM domain of b2ar template ##....A.##I#.#W#VS.L..FLP#.###........#...A....C.FF....#####...#..##P########...#..#...#......................#.-#K 116 LLTKNKARVIILMVWIVSGLTSFLPIQMHWYRATHQEAINCYAEETCCDFFT-NQAYAIASSIVSFYVPLVIMVFVYSRVFQEAKRQLNIFE----------------FCLKEHK 116 IMSGFVAGACIAGLWLVSYLIGFLPLGIPM--------FQQTAYKGQCSFFAVFHPHFVLTLSCVGFFPAMLLFVFFYCDMLKIASMHSQQIRKMEHAGAMAGGYRSPRTPSDFK AL+T#.###G.F.L.W.PF#I..IV.V#....##...#..#L.##G#.NS.#NPLIY#....-#R#.#..###........................................ 214 ALKTLGIIMGTFTLCWLPFFIVNIVHVIQDN-LIRKEVYILLNWIGYVNSGFNPLIYC-RSPDFRIAFQELLCL--------------------------------------223 ALRTVSVLIGSFALSWTPFLITGIVQVACQECHLYLVLERYLWLLGVGNSLLNPLIYAYWQKEVRLQLYHMALGVKKVLTSFLLFLSARNCGPERPRESSCHIVTISSSEFDG 3. 4. 5. 2. Lack of sequnece similarity in loop regions .. 6. Rosetta docking protocol Generate initial homology model Produce 1000 models using Rosetta relaxed 1000 diffeent loop conformations were generated on best model 1. Kinematic closure algorithm 2. Disulphide bridge constrain between Cys in EXL2B and Cys III:01 1. Week sequence similarity to templates .. 3. Quality of homology model is questionable .. Generate GPR119 Homology model (b2ar template) 1000 GPR119 models were produced using Rosetta relax 7. 14 low energy ligand (AR-231453) conformations were generated A total of 2000 docking trajectories of randomly picked ligand conformations were performed on each of the 1000 loop models Of the 2.000.000 combinatorial solutions – top 5 % were selected based on total energy and receptor-ligand interaction energy 5 best docking poses were relaxed 5000 times to optimize loop structure and receptor ligand packing ”Low resolution” docking mode Refine best docking poses Generate ensamle of low energy conformations of AR-231453 In between helices lack agreement with experimental mapping 2000 docking trajectories performed for each of the 1000 loop models 2.000.000 decoys Cluster, filter and extract top10 best models lack agreement with experimental mapping Pose 1 Pose 2 Pose 3 Pose 5 Pose 4 A total og 5000 docking trajectories were performed for each of the 5 best docking poses to relax and refine the receptor and ligand packing Build and refine 100 loop conformations for each of the 10 best models A total of 1000 models Docking pose I: Refinement Docking pose I: Refinement Pose 1 4 4/10/2011 Docking pose II: Refinement Docking pose II: Refinement Pose 2 Docking pose III: Refinement Docking pose III: Refinement Pose 3 Docking pose III: Refinement Docking pose IV: Refinement Pose 4 5 4/10/2011 Docking pose V: Refinement Docking pose V: Refinement Pose 5 Refined GPR119 docking decoys Docking pose III Docking pose IV Refined GPR119 docking decoys Docking pose III Docking pose IV Proposed binding pose of AR231453 agonist to the GPR119 receptor • http://www.rosettacommons.org • http://robetta.bakerlab.org • http://rosettadesigngroup.com Binding pocket of AR231453 in GPR119. Amino acids investigation by mutagenesis is coloured according to potency shift. 6
Similar documents
Intel International Science and Engineering Fair 2012
Engineering Fair 2012 in Pittsburgh. This city is well known for its national sports teams and the steel industry of the mid 1900s, but over the years the primary industries have shifted to high te...
More information