A rational approach to a HIV-1 vaccine design - CBS
Transcription
A rational approach to a HIV-1 vaccine design - CBS
A rational approach to a HIV-1 vaccine design Course 27685 Immunological Bioinformatics Gustav Ahlberg, Joakim Ro-Poulsen and Snezana Djurisic Center For Biological Sequence Analysis CBS , Technical University of Denmark DTU Aim: To do a rational selection of peptides with a high potential as vaccine candidates against HIV-1 in Guinea-Bissau focusing on HLA-E. Methodology: The selection of peptides is tailored towards the prevalent HLADRB1 and HLA-E molecules present in the population in Guinea-Bissau using bioinformatic tools. The process involved the following: Step 1: Selecting MHC Class I/II alleles from the population. Step 2: Predicting and selecting the peptides in the HIV 1 gag-protein binding to the chosen alleles. Step 3: Evaluating peptides by visualizing and reviewing mutation frequency. Background: The human immunodeficiency virus (HIV) has been shown to abrogate and/or down-regulate most MHC class I molecules as an evasion strategy that targets the antiviral activity of the immune system[9,10]. Interestingly, the non-classical HLA molecule HLA-E seems not to be affected by the viral infection[10]. HLA-E is a conserved class Ib molecule characterized by a limited polymorphism (2 coding variants are identified to date). Its primary purpose is to bind small leader peptides of most HLA class I molecules after which the HLA-E molecule is stabilized and migrates to the cell surface where it interacts with receptors from natural killer (NK) cells. Recent evidence reveals that several peptides other than MHC class I molecules encode peptides that can bind to HLA-E including peptides derived from the HIV gag protein[8,10]. In addition, HLAE restricted CD8+ T cells have been reported against various bacterial and viral infections[11,12]. HLA-E restricted T cell responses may be of interest for vaccine development since HLA-E displays very limited polymorphism, and is not down-regulated by HIV-infection. Persistent HLA-E restricted CD8+ T cell responses require assistance from CD4 + T cells and HLA class II epitopes should also be evaluated in the context of HIV-1 vaccine design. Step 1b: HLA Sequence Logos. HLADRB1*1001 HLADRB1*1304 HLADRB1*0901 HLA-E*0101 and E*0103 No information available No information available No information available Figure 1b. Three selected logos predicted via MHC Motif Viewer (upper figures) showing the bias for peptides binding to the HLA-DRB1 molecule. Position 1, 4, 6 and 9 (the abscissa) have a high degree of information content[1,4], the most pronounced degree found at position 1 for all three alleles. These are anchor positions that to a high extend determines the binding of a peptide. In the depicted logo plots, the amino acids are colored according to their properties: Acidic = red (DE); Basic = blue (HKR); Hydrophobic = black (ACFILMPVW); Neutral = green (GNQSTY). No information could be obtained on HLA-E with either the MHC Motif Viewer or Syfpeithi MHC database. Step 2a: Predicting the binding peptides. Step 1a: Identification of the prevalent HLA-DRB1 alleles in a Guinea-Bissau population. Figure 2a. NetMHCpanI In-House server, predicting epitopes for HLA- Figure 1a. The HLA-DRB1 locus has one of the highest sequence variation within the human genome. Allelic frequencies in a Guinea Bissau population were identified using the Allele Frequency Database[3]. Prevalent alleles were chosen to complement eachother at anchor positions (se figure 1b), so that the alleles did not have the same binding preferences. The alleles chosen were HLA-DRB1*09:01, HLA-DRB1*13:04, HLA-DRB1*10:01 (shown in colour). Among these the unusual HLA-DRB1*1304 allele showed to be highly abundant covering 19.6% of the analyzed group. All together the phenotypic population coverage was calculated to be 61%. Allel frequences for HLA-E could not be identified using the Allele frequency Database. Step 3a: Visualizing the protein with epitopes and amino properties. (a) Cartoon (b) Surface E0101[6]. NetMHCpan identified two potential epitopes in the gag protein as binders to HLA-E*01:01, one as a weak binder (WB) and one as strong binder (SB). The two epitopes (with sequence positions) were: Weak binder: 163 FSPEVIPMF Strong binder: 274 RMYSPTSIL Step 2b: Epitope selection of peptides tailored to bind to HLA-E*0101 and HLA-DRB1*1001, DRB1*0901, DRB1*1304. Figure 2b. Seven 15mer peptides selected for further analysis. Four of these were also predicted as binders for the nanomers found with MHCpanI (163, 272 and 273) [7]. X represents binder and 0 represents a non binder. Step 3b: Selected peptides are cross-reactive towards other subtypes. (c) Hydrophobicity (d) Electrostatic potential Aligning multiple gag protein sequences with blastp (AAD39400, BAF32552, CAJ01233, ADN38206 amongst 50 others) revealed a few insimilarities in the binding peptides[5]. These differences were shown to have little to no difference in the predictions of netMHCpan and netMHCIIpan. Studying the prediction revealed that most substitions (but one, which were T-V) were of similar amino acid types and only one insertion had been made throughout the sequence. Results Figure 3a. Structure of the HIV-1 gag protein and the predicted epitopes 268 GLNKIVRMYSPTSIL (blue), 131 YPIVQNIQGQMVHQA (purple), 164 SPEVIPMFSALSEGA (orange) and 56 CRQILGQLQPSLQTG (pink). PyMol models are based on the CPHmodel predicted from AAC82593[2,5]. The CPHmodel were predicted with 95.7% identity. There were no clear link between the epitopes, hydrophobicity and electrostatic potential. The resulting analysis revealed two potential 9mer epitopes found in the HIV protein known as Gag. A BLASTp of 40 HIV 1 gag protein sequences, that were arbitrary picked from NCBI database, also indicated that the binding properties of the epitopes were conserved. We found the two HLA-E binding epitopes to be located closely in the protein structure, while the class II were more spread. There were no clear correlation between hydrophobity or electrostatic potential and the epitopes. The gag protein can be considered conserved compared to other proteins in the HIV virus, a virus that displays a high genetic variability. The protein is believed to play a crucial role in the HIV assembly process and is therefore an attractive target in the development of antiviral drugs. References: [1]http:// www.syfpeithi.de/Scripts/MHCServer.dll/FindYourMotif.htm [8] Nattermann J. et al, HIV-1 infection leads to increased HLA-E expression resulting in impaired function of natural killer cells, Antivir Ther, 2005 [2]http://www.cbs.dtu.dk/services/CPHmodels-2.0/ [9] Schaefer, M.R., et al., HIV-1 Nef targets MHC-I and CD4 for degradation via a final common beta-COP-dependent pathway in T cells. PLoS Pathog, 2008. 4(8): p. e1000131 [3]http://www.allelefrequencies.net/ [10] Pietra, G., et al., The emerging role of HLA-E-restricted CD8+ T lymphocytes in the adaptive immune response to pathogens and tumors. J Biomed Biotechnol. 2010: p. 907092. [4]http://www.cbs.dtu.dk/biotools/MHCMotifViewer/MHC_Fight.html [11] Salerno-Goncalves, R., et al., Identification of a human HLA-E-restricted CD8+ T cell subset in volunteers immunized with Salmonella enterica serovar Typhi strain Ty21a typhoid vaccine. J Immunol, 2004. 173(9): p. 5852-62. [5]http://www.ncbi.nlm.nih.gov/ [12] Romagnani, C., et al., HLA-E-restricted recognition of human cytomegalovirus by a subset of cytolytic T lymphocytes. Hum Immunol, 2004. 65(5): p. 437-45. [6]http://www.cbs.dtu.dk/services/NetMHCpan/ [7]http://www.cbs.dtu.dk/services/NetMHCIIpan/