Biochemisches Praktikum II - Institute of Biochemistry

Transcription

Biochemisches Praktikum II
3-wöchiges Praktikum
im Hauptstudium der Biochemie
WS 2009/10
Institut für Biochemie
Biozentrum
Max-von-Laue-Straße 9
60438 Frankfurt am Main
Tel.: 069-798-29475
Inhaltsverzeichnis
1
Allgemeines------------------------------------------------------------------------------- 2
2
Zeitplan ----------------------------------------------------------------------------------- 4
3
Hilfe zur Erstellung eines Praktikumprotokolls ---------------------------------- 5
4
Bioinformatik ---------------------------------------------------------------------------- 9
5
Anleitung zum Ansetzen von Puffern ----------------------------------------------40
6
Der Peptidtransporter TAP ----------------------------------------------------------48
7
6.1
Membranpräparation aus Sf9-Insektenzellen (Versuch 1 A) ------------------------------------- 51
6.2
Markierung von Peptiden mit Iodoacetamidofluorescein (Versuch 1B) ------------------------ 58
6.3
Bestimmung der Bindungsaffinität (KD-Wert) und der Bindungskinetik eines Peptids
zu TAP (Versuch 1C) -------------------------------------------------------------------------------------- 61
6.4
Untersuchung der Nukleotidbindung von TAP (Versuch 1D) ------------------------------------ 65
6.5
Peptidtransportassay (Versuch 1E)--------------------------------------------------------------------- 70
Funktionelle Charakterisierung der Nukleotidbindedomäne von MDL1 --70
7.1
Überexpression der MDL1-NBD in E. coli ------------------------------------------------------------ 74
7.2
Aufreinigung der MDL1-NBD bzw. MDL1(E599Q)-NBD von E. coli--------------------------- 77
7.3
ATPase Aktivitätstest -------------------------------------------------------------------------------------- 83
7.4
Fluoreszenzlöschung zur Bestimmung der Geschwindigkeitskonstante der
MDL1(E599Q)-NBD Dimerisierung -------------------------------------------------------------------- 86
Allgemeines
2
1 Allgemeines
Das Biochemische Praktikum im Rahmen des BC-II dauert 3 Wochen und läuft für
alle Gruppen jeweils von Montag bis Donnerstag ab 9.00 Uhr s.t. Freitag ist jeweils
„Riedberg-Tag“ und ist für Vorlesungen reserviert.
Für die Gruppen A1 – A6 findet das Praktikum vom 11.01. bis 28.01.10 und für die
Gruppen B1 – B6 vom 1.02. bis 18.02.10 im Institut für Biochemie statt.
Vorbereitungs- und Kontrollseminare finden nach Bedarf statt und werden zeitlich
günstig in das Praktikum eingepasst.
Von den Praktikumsteilnehmern wird erwartet, dass sie sich an hand der Skripte auf
der Biochemie homepage gut auf das Praktikum vorbereiten.
Nach dem Ende des Praktikums liefert jede Gruppe (A1,.....B1,....) ein ausgedrucktes
Gruppenprotokoll getrennt nach Teilversuchen bei den Versuchsbetreuern (TAP 1A–
1E und MDL1 2A-2D) ab (Deadline: 15.03.10). Jedes Protokoll ist mit einem
Deckblatt zu versehen, auf dem der Teilversuch, die Gruppe und die Teilnehmer
vermerkt sind. Die korrigierten Protokolle werden am 26.03.10 persönlich von den
Studenten abgeholt. Endgültige Abgabe der verbesserten Endversion beim Betreuer:
19.04.10. Bei ungenügender Ausarbeitung muss der Praktikumsteil wiederholt
werden. Erst die verbesserte Endversion wird vom Betreuer abgezeichnet und dann
bei Dr. Abele archiviert.
Der Protokollaufbau richtet sich nach dem einer Publikation: a. Einleitung, b. Material
und Methoden, c. Ergebnisse, d. Diskussion. Der Teil Material und Methoden darf
sehr kurz sein oder weggelassen werden, muss aber im Falle von Abweichungen
vom Skript dringend detailliert geschrieben werden.
Am letzten Praktikumstag findet eine Präsentation der Ergebnisse statt. Die
Powerpoint-Präsentation umfasst eine Einleitung, in der der Versuch dargestellt wird.
Anschließend werden die Ergebnisse vorgestellt und kritisch diskutiert. Bei
Versuchen, bei denen unterschiedliche Eigenschaften der Proteine untersucht
wurden, werden die Ergebnisse aller Gruppen dargestellt.
Allgemeines
3
Die Klausur des gesamten BC-II Praktikums (PEI, GSH, BC) findet am Freitag, dem
26.02.10 um 9:30 h im Biozentrum Gebäude N100 Raum 0.15 statt. Die Ergebnisse
werden spätestens am 8.03.10 im Foyer des Instituts für Biochemie ausgehängt und
auf unserer homepage veröffentlicht. Zum Bestehen der Klausur sind 50% der
Punkte notwendig.
Praktikumsleitung
Prof. Robert Tampé
PD. Dr. Rupert Abele
Betreuer
Christoph Baldauf
Dr. Katharina Ceh
Irina Bankert
Nina Kreißig
Agnes Molnar
Gerhard Spatz-Kümbel
Franz Tumulka
Zeitplan
4
2 Zeitplan
Das Praktikum besteht aus zwei großen Blöcken. Zuerst findet am Montag eine
allgemeine Einleitung statt. Des weiteren wird das Internet als Werkzeug für
biochemische Studien vorgestellt. Am Dienstag werden grundlegende Methoden im
Laboralltag
durchgeführt.
Daran
schließen
sich
Experimente
mit
dem
Peptidtransporter TAP an (Versuch 1). In der zweiten Hälfte des Praktikums werden
Sie biochemische Studien mit der löslichen Domäne des ABC-Transporters MDL1
durchführen (Versuch 2). Am letzten Tag findet ein Seminar statt, in dem jede
Gruppe ein Versuch aus dem Praktikum darstellt und noch offene Fragen diskutiert
werden.
Montag
9 h Einführung
Dienstag
9 h Gr. 1–6
Mittwoch
9 h Gr. 1-6
8:30 h Gr. 5/6 V.1E
V. 1A
9:00 h Gr. 1/2 V.1C
Bioinformatik
1. Woche
Donnerstag
Gr. 3/4 V.1D*
9:30 h Gr. 1–6
Bioinformatik
13 h Gr. 1-3
13 h Gr. 1-3
Medium ansetzen
Puffer ansetzen
Gele gießen
Gr. 4-6
Medium ansetzen
Gr. 4-6
Puffer ansetzen
Gele gießen
2. Woche
8:30 h Gr. 3/4 V.1B
8:30 h Gr. 5/6 V.1B
8:30 h Gr. 1/2 V.1B
9:00 h Gr. 1/2 V.1E
9:00 h Gr. 1/2 V.1D
9:00 h Gr. 3/4 V.1E
Gr. 5/6 V.1D*
Gr. 3/4 V.1C
Gr. 5/6 V.1C
9 h Gr. 1-6
V. 2A-2
17:00 h Gr. 1-6
V. 2A-1
9 h Gr. 1-6
V. 2B
9 h Gr. 1-6
V. 2B
9h
13 h
Gr. 1/2 V. 2D
Abschlussseminar**
Gr. 3/4 V. 2C
3. Woche
Gr. 1 V. 2A/B
Gr. 2 V. 2C
13 h
Gr. 1/2 V. 2C
13 h
Gr. 3 V. 2D
Gr. 3/4 V. 2D
Gr. 5/6 V. 2D
Gr. 4 V. 1A/B
Gr. 5/6 V. 2C
Gr. 5 V. 1C/D
Gr. 6 V. 1E
* Vers. 1D trifft sich im Praktikum
** Beim Abschlussseminar werden die Ergebnisse aller Gruppen vorgestellt, falls in den
Versuchen unterschiedliche Fragestellungen untersucht wurden (betrifft Versuch 1C-D)
Hilfe zur Erstellung eines Praktikumprotokolls
5
3 Hilfe zur Erstellung eines Praktikumprotokolls
Das Abfassen eines Praktikumprotokolls beginnt mit dem Aufschreiben aller Versuchsdetails wie Versuchsdaten und Versuchsbeobachtungen in einem Laborjournal.
Protokollieren Sie alles sehr sorgfältig, vertrauen Sie nicht auf Ihr Gedächtnis! Da
das Laborjounal ein Dokument ist, ist es empfehlenswert, ein fest gebundenes Heft
zu benutzen statt eines Ringbuchs oder einer losen Blattsammlung und die Seiten
alle durchzunummerieren. So können Sie es auf Verlangen vorzeigen oder abgeben.
Alle Eintragungen in das Laborjournal können stichpunktartig knapp gehalten sein,
sofern sie nur eindeutig und vollständig sind. Im Laborjournal – aber auch nur hier
und sonst nirgends – ist auch Laborjargon (LJ) zulässig. Das krasse Gegenteil gilt für
das später niederzuschreibende, eigentliche Protokoll!
Bei der Erstellung des endgültigen Protokolls sind bestimmte formale und inhaltliche
Standards einzuhalten.
Form und Inhalt des Protokolls
Allgemeines
Das Protokoll sollte mit einem Deckblatt beginnen, das Ihnen zur Verfügung gestellt
wird. Es trägt den Namen des Praktikums und der Universität, sowie den des(r) Verfassers(in) und einige zeitliche Angaben.
Strukturelle Übersichtlichkeit und sprachlicher Stil helfen dem Leser des Protokolls,
den Gang ihrer Experimente und Gedankengänge nachzuvollziehen. Unübersichtlichkeit und unnütze kleine Fehler lenken vom Inhalt ab. Deshalb fügen Sie Leerzeilen und Abstände ein, wo es sinnvoll ist. Schreiben Sie Überschriften fett, gegebenenfalls in verschiedenen Schriftgrößen.
Formulieren Sie wissenschaftlich kurz und prägnant und vermeiden Sie umständliche
Darstellungen in Wort und Länge der Sätze.
Gliederung
Jedes Protokoll soll aus den folgenden Abschnitten bestehen:
▪
Einleitung
Hier findet sich die Antwort auf die Frage: Welches Problem
wurde untersucht, welche Frage studiert?
▪
Material, Methoden, Versuchsdurchführung
6
Hier findet sich die Antwort auf
die Frage: Wie wurde die Frage, das Problem bearbeitet?
▪
Ergebnisse Hier wird die Frage beantwortet: Welche Resultate wurden
erzielt?
▪
Diskussion
Hier findet sich die Antwort auf die Frage: Was bedeuten diese
Resultate?
A.
▪
Literaturverzeichnis
▪
Anhang
(wenn nötig)
Einleitung
Halten Sie die Einleitung so kurz wie möglich, aber so informativ wie notwendig. Zur Einleitung gehören auch eventuelle Formelschemata, also z.B. wie
das zu isolierende Enzym ein Edukt in das Produkt umwandelt. Die Einleitung
endet immer mit der kurzen Beschreibung der Aufgabenstellung!
B.
Material, Methoden, Versuchsdurchführung
Unter Material sind aufzählungsartig unter Angabe des vollen Namens, einer
eventuellen Abkürzung, der Bezugsfirma und des Reinheitsgrades die verwendeten Chemikalien, Biochemikalien und andere Hilfsmittel wie z.B. Platten für
die Dünnschichtchromatographie anzugeben.
Der Teil Methoden wird als einziger im Präsens geschrieben. Methoden müssen nachvollziehbar und vollständig (reproduzierbar) erläutert werden. Änderungen oder Varianten müssen klar herausgestellt werden.
In dieses Kapitel gehören auch statistische Auswertungs- und Fit-Methoden und die
verwendete Software.
Die Versuchsdurchführung wird in der Vergangenheit geschrieben und beschreibt im Detail den tatsächlichen Ablauf der Experimente. Halten Sie sich
eng an Ihr Laborjournal und beschreiben Sie, was Sie tatsächlich gemacht
haben (nicht etwa, was man machen könnte). Denken Sie auch im Sinne Ihrer
Nachfolger an wichtige Details wie: Welchen Zentrifugentyp habe ich über welche Zeit mit welchem Rotor bei welcher Drehzahl benutzt. Sehr hilfreich ist
hier auch die Angabe der g-Zahl!
C.
7
Ergebnisse
Der Ergebnisteil wird ebenfalls in der Vergangenheit geschrieben. Stellen Sie
all Ihre Ergebnisse in Text und, soweit möglich und notwendig, auch in Abbildungen und Tabellen dar. Der Leser muss mühelos Text und Inhalt nachvollziehen können. Untergliedern Sie gegebenenfalls den Ergebnisteil. Klarheit
der Darstellung ist ungemein wichtig. Durch unklare Aussagen überlassen Sie
die Interpretation dem Leser! Stellen Sie im Text einen Bezug her zu Ihren
Abbildungen und Tabellen: Dabei wechseln Sie beim Schreiben in die Gegenwart!
Abbildungen und Tabellen werden grundsätzlich nummeriert und die Legende,
die selbsterklärend sein soll, mit einer Überschrift versehen.
Grundsätzlich müssen die Achsen von Diagrammen vernünftig skaliert, beschriftet und mit einer Einheit versehen werden. Unterschiedliche Symbole
müssen erklärt werden.
D.
Diskussion
Die Diskussion kann, wenn Sie es für sinnvoll halten, mit dem Ergebnisteil
zusammengefasst werden.
Interpretieren und diskutieren Sie Ihre Ergebnisse jeweils unter Verweis auf
die entsprechende Textstelle (Abb., Tab.). Lassen Sie sich von der Frage
leiten: Wo können aufgrund der gewählten Methodik Fehler in der Durchführung auftreten? Dabei ist Denkarbeit erforderlich! Nicht immer können „komische Werte“ einfach auf fehlerhafte Geräte, fehlerhaftes Material oder
mögliche Fehler beim Pipettieren geschoben werden!
E.
Literaturverzeichnis
Sobald Sie im Text Informationen aus der Literatur verwenden, müssen Sie
die entsprechende Literatur zitieren. Im Text nennen Sie dabei nur die Autoren
und die Jahreszahl (z.B. Tampé 2005). Bei mehr als zwei Autoren schreiben
Sie „et al.“ hinter den Erstautor (z.B. Tampe et al. 2006).
Die verwendeten Literaturzitate müssen unter Angabe der erforderlichen Details im Literaturverzeichnis aufgeführt werden.
Beispiel:
8
BEISMANN-DRIEMEYER, S. und TAMPE, R. (2004): Funktion
der Antigen-Transportmaschinerie TAP im zellulären Immunsystem – Angew.
Chemie 116, 4104-4122.
Das Literaturverzeichnis wird alphabetisch nach dem Erstautor geordnet!
F.
Anhang
In manchen Fällen folgt noch ein Anhang, der meistens aus längeren Tabellen
mit Messdaten oder größeren Abbildungen besteht.
Schluss
Versuchen Sie beim Schreiben Ihres Protokolls vollständige und klare Sätze zu formulieren. Lesen Sie den Text nach Fertigstellung auf Plausibilität, Aufbau und Formulierung sowie auf Rechtschreibfehler nochmals durch und/oder bitten Sie jemand
um Hilfe!
Bemerkung
Wenn es sinnvoll erscheint, sind Abweichungen von diesen Richtlinien
erwünscht, Kreativität wird nicht bestraft!
Bioinformatik
9
4 Bioinformatik
Die Bioinformatik ist mittlerweile ein wichtiger Teil der biochemischen Forschung. Mit Hilfe
der Bioinformatik kann gezielt Literatur gesucht werden und DNA- und Proteinsequenzdaten
analysiert und verglichen werden.
In diesem Teil des Praktikums sollen Sie eine Übersicht über die wichtigsten Server und
Programme bekommen, die für die tägliche Arbeit hilfreich sind.
Wichtige Internetadressen für Biochemiker:
www.ncbi.nlm.nih.gov (National Center for Biotechnology informatiom)
www.rcsb.org/pdb (Brookhaven Protein data bank)
www.ebi.ac.uk/services (European Bioinformatics Institute)
www.embl-heidelberg.de/predictprotein/predictprotein.html (The Predict Protein Server)
www.expasy.org (ExPASy Proteomics server)
http://portal15.isiknowledge.com (web of science)
http://tools.neb.com/NEBcutter2/index.php (webcutter)
http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi (Primer Design)
http://www.genebee.msu.ru/clustal/ (ClustalW/phylogenetischer Baum)
Aufgabe
Literatur:
1. Suchen Sie die Veröffentlichung über TAP von P. Cresswell, die in Nature
veröffentlicht wurde.
2. Wieviele Veröffentlichungen über TAP erschienen in „Journal of Biologcal
Chemistry“?
3. Wie
oft
wurde
folgende
Veröffentlichung
zitiert:
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE
UNITED STATES OF AMERICA 98 (13): 7241-7246 JUN 19 2001
4. Suchen Sie verwandte Publikationen.
DNA-Sequenz:
1. Suchen Sie die DNA-Sequenz von humanem TAP1.
2. Bestimmen Sie die Schnittstelle mit dem Restriktionsenzym DraIII.
3. Translatieren Sie die Sequenz und schneiden Sie die Proteinsequenz zurecht, so dass
ein Protein mit 748 Aminosäuren entsteht (N-Terminus: MASSRCPAPR; CTerminus: QAPADAPE)
Protein-Sequenz:
1. Bestimmen Sie den pI, Molekulargewicht, Extinktionskoeffizienten und die
Membrantopologie von TAP1.
2. Finden Sie in ABC-Transporter konservierte Sequenzen.
3. Finden Sie zu TAP1 homologe Proteine aus Säugetieren, Fischen und Vögel.
4. Erstellen Sie einen Sequenzvergleich.
5. Erstellen Sie einen phylogenetischen Baum.
Bioinformatik
10
Anhang
BIOINFORMATICS
As computational techniques become more and more important in all areas of biology, a
knowledge of what is called "bioinformatics" is becoming essential for biochemists and
molecular biologists.
In these practicals you will get a very general introduction to a small area of bioinformatics the use of literature, DNA and protein databases. Many scientists work with gene sequences at
some stages in their research, and the database of DNA sequences is growing rapidly.
The entire genomes of more than 100 bacteria and archea have been completely sequenced in
the past five years, including Haemophilis influenza (1.5 million bases). The first eukaryotic
genome (Saccharomyces cerevisiae: 12 million bases) was completed in 1996, followed by
Caenorhabditis elegans and Drososophila melanogaster. The complete sequence of human
chromosome 21 was reported in May 2000, and a rough draft of the entire genome was
released in 2002.
The vast amount of data generated in these projects are collected in databases - the most
commonly used DNA databases are those maintained by GenBank (USA) and the EMBL
(European Molecular Biology Laboratories), and most proteins sequences are found in Swissprot. These databases are updated daily with new sequences, and the DNA databases contain
more than 500 million bases.
Each sequence is assigned a name a unique accession number which allows the sequence file
to be retrieved, and analysed. Upon sequencing a new gene, the scientist usually first
compares it to all sequences in the database - a "hit" or a similarity with a gene of known
function can provide some clues as to the biological function of the new gene.
One can also search for protein patterns - for example the presence of phosphorylation sites,
or protein folding motifs. In these practicals, you will learn how to retrieve and manipulate
specific DNA sequences, including determining the restriction enzyme digestion pattern, and
also how to compare protein sequences.
Bioinformatik
11
LITERATURE AND DNA DATABASES
In today's practical, you will look at literature databases, which include titles and abstracts for
most scientific papers published, and you will also investigate methods of retrieving DNA and
protein sequences.
This page contains links to most of the sites you need, and also some sequences you will be
required to analyse. You may find it useful to bookmark some of the sites you visit, to make
it easier to return.
Many of the links in this page are to sites maintained at universities, or at research
institutions. They allow access to databases, and numerous software. You will begin by
looking at the site at the PubMed site at the National Institute for Health (NIH) at Bethesda
(US). PubMed allows online access to the Medline database, which contains over 12 million
citations including all the scientiific papers published in 4,600 biomedical journals. This is an
amazing research resource, and allows you to access publications in any area of interest. The
information provided includes the papers authors, the journal citation, and an abstract of the
work. As online publication of journals continues, it is now sometimes possible to access the
entire text of the paper, with associated figures. PubMed will be invaluable when you are
writing essays and carrying out your project research work, particularly in 4th year.
Start by visiting the PubMed site. Remember, the links in this manual are designed to open in
a new Netscape window so you can read the manual and follow the links at the same time. To
see both windows, you will need to change the size of the window containing the manual. Do
this by clicking the mouse on the box structure at the bottom right hand side of the window,
and moving it up and to the left. When you have the size you like, you can move the entire
window by clicking and holding on the top bar. You can move between windows at any time
by clicking on the one you wish to have active (in front). Now open the PubMed site at
http://www.ncbi.nih.gov/entrez/query.fcgi
You can now carry out a simple search - for example, find all the papers published by Paul C.
Engel. The syntax used for author searches is first type the surname, followed by a space and
initials (Engel PC).
Each blue highlight is a link to the abstract information for the papers - follow a few. Note
that some of these (for example for papers in Biochem J) have a link to the journal. Try
following these. Where UCD library has a subscription to the journal, these links will bring
you directly to the published paper. This service is improving every year, and you may soon
never have to visit a library again! (The papers are also available in "PDF" format, which
looks just like document you would photocopy from the journal). Another useful link from
the abstract is to Books. This will highlight all the words in the abstract that are defined in a
linked source text book (usually Molecular Biology of the Cell). This can be very useful if
you don't understand some of the terms.
Try the same search for Stephen G. Mayhew.
Bioinformatik
12
Note that this a useful search for authors with unusual surnames, or who use middle initials. A
search for G. Butler yields lots of papers I didn't write! Also, you have retrieved more than
100 papers for both Prof Engel and Mayhew - how do you find a specific paper? Return to the
PubMed site http://www.ncbi.nih.gov/entrez/query.fcgi and try entering more keywords on
the same line - eg to find papers by PC Engel on glutamate dehydrogenase, enter "Engel
glutamate dehydrogenase". The programme will search for the presence of all three words.
Still a lot of papers!
To tighten your search a little, use the search fields command. This will restrict the search to
specific terms, for example authors, or the name of a journal. You can find a list of all the
fields and their codes used in PubMed in the Entrez help manual here. The search fields
command is placed inside square brackets following the search term. Some of the most useful
are author [AU] and title of journal [TA]. You can link search terms together using "AND" or
"NOT". For example, to find papers by Paul C Engel on glutamate dehydrogenases published
in the Journal of Biological Chemistry (abbreviated as J Biol Chem), enter Engel PC [AU]
AND glutamate dehydrogenase AND J Biol Chem [TA].
Try this search, and then also find all the papers by Paul Engel that are NOT about glutamate
dehydrogenase!
You can also use the search fields to specify a range, using a colon (:). For example 1995
[PDAT]: 1999 [PDAT] will search for papers published between 1995 and 1999.
In your essay and research report next year, you will probably be searching for information on
a specific topic, rather than an author. It is important to choose your search terms carefully, or
you will find too many papers. For example, look for information about "DNA replication".
You will see there are more than 30,000 papers.
If you know what you're looking for, you can restrict these to specific authors or years using
the search fields described in the lecture. You can however also use the Preview/Index menu
just under the search field. This allows you to enter a search topic, preview the number of hits,
enter an additional phrase, preview the number of hits and so on.
Carry out your search again for "DNA replication" and use the Preview/Index menu to add
more terms, such as restricting the publication type.or the journal. The Index button (in the
Preview/Index menu) tells you what you can enter as additional search terms. For example
suppose you want to search for papers in the Journal of Biological Chemistry, but you are not
sure what the journal abbreviation is. Choose Journal name from the Add terms menu, type
"j biol" and hit the Index button. You will be shown a list of possible journals.
You can also restrict your search using the Limits menu. This should be fairly obvious. The
History menu is also clear. You also have access to a Clipboard menu. This allows you to
store the results of a search while you are working. The Details menu will show you how your
search is conducted using the correct search terms.
The results of your search will also have links on the right hand side to Realated articles and
Links. The programme searches for rare words that are common to the paper and a series of
"related" papers. This can be very useful for finding work for other groups working in your
research area. For example, find papers by SG Mayhew on the organism Megasphaera
elsdenii and then see who else is working in this area. Links will connect you to online
molecular biology books with information relevant to the research papers.
Bioinformatik
13
When you are comfortable using PubMed, use it to answer the questions below.
1. How many molecular biology papers have been published by individuals with the
same surname and first initial as you?
2. Use the "search field" syntax to build a search for papers about cloned animals
published in the journal Science in 2000.
3. Use the "search field" syntax to expand the last search for review articles about
cloning animals published in the past five years.
4. Use the Preview/Index or Limits menu to search for papers about gene therapy in
cystic fibrosis in 1995.
5. In 1994, the Danish resercher Jan Fallingborg was searching the Internet for papers
relating to his own research interest, selenium. He was surprised to find along with
one of his own papers from 1989 an apparent plagiarism by a Polish scientist
published in 1992. Find both papers. (Hint: Use the See related articles link).
6. This wasn't the first attempt by the Polish first author - see if you can find another one
about lymphocytes and uterine cancer.
CITATION INDEX
Another method for carrying out literature searches is to use the Science Citation Index,
which maintains a database of all citations to published papers in the scientific lierature. This
is extremely useful if you are interested in work following from a particular publication. You
simply locate the paper you are interested in the citation database, and determine where and
when it has been cited. You will use the Science Citation Index available at the Web of
Science. The site can be slow at times - have patience.
Example
The human genome sequence was reported in two publications in early 2001 - one from the
publicly funded consortium (in Nature) and one from the mainly privately funded consortium
(in Science).
•
How often have these papers been cited in the intervening time?
To use the citation index, you need to know the name of the first author on the paper, and
preferably the journal title and year published also. Sometimes other authors will work also,
but it is more accurate if you use the first one. The papers we are interested in are:
Lander ES et al (2001) Nature 409:860-931
Venter JC et al (2001) Science 291:1304
•
•
•
Go to the Web of Science site, and use the full search option.
Carry out a cited ref search. Enter the cited author (the first author of the paper begin with Lander), the cited work (the journal name - there is a link here showing
you all the generally used abbreviations for authors) and the cited year (the year the
original paer was published).
Press lookup to begin your search.
Bioinformatik
14
You will see that Eric Lander published several papers in Nature in 2001. How many times
was each one cited? Follow the link to the paper we are intered in, and see the list of
publications that cited it (this should be pretty obvious). What is the most recent paper that
cited this work?
•
Repeat the search for the Venter paper.
Exercises
1. Dr D. Margaret Worrall (Worrall DM) published a review in Biochem J in 1994. How
many times has it been cited?
2. How many papers did ES Lander publish in 2000? What is the title of the one that was
most cited?
3. How many papers in the Science Citation Index deal with both "pizza" and "fish"?
(Use the General Search option rather than the Cited Ref Search). Can you find one
of these working on fish fingers? Has it been cited? Are you surprised?
SEQUENCE RETRIEVAL
As you may noticed, the search engine you have been using has a pulldown menu that allows
you to search several databases. These include:
Entrez
This allows you to search many databases simultaneously - for example PubMed and the
biological databases listed below.
Protein Database
The Protein database contains sequence data from the translated coding regions from DNA
sequences in GenBank, EMBL and DDBJ as well as protein sequences submitted to PIR,
SWISSPROT, PRF, Protein Data Bank (PDB) (sequences from solved structures).
Nucleotide Database
The Nucleotide database contains sequence data from GenBank, EMBL, and DDBJ, the
members of the tripartite, international collaboration of sequence databases. EMBL is the
European Molecular Biology Laboratory (EMBL) at Hinxton Hall, UK, DDBJ is the DNA
Database of Japan (DDBJ) in Mishima, Japan. Sequence data is also incorporated from the
Genome Sequence Data Base (GSDB), Santa Fe, NM. Patent sequences are incorporated
through arrangements with the U.S. Patent and Trademark Office (US PTO), and via the
collaborating international databases from other international patent offices.
Structure Database
The Structure database or Molecular Modeling Database (MMDB) contains experimental data
from crystallographic and NMR structure determinations. The data for MMDB are obtained
from the Protein Data Bank (PDB). The NCBI has cross-linked structural data to
bibliographic information, to the sequence databases, and to the NCBI taxonomy.You can use
the NCBI 3D structure viewer, Cn3D, for easy interactive visualization of molecular
structures from Entrez though we will not look at molecular structures during this course.
Bioinformatik
15
Genome Database
The Genomes database provides views for a variety of genomes, complete chromosomes,
contiged sequence maps, and integrated genetic and physical maps.
There are also a number of more minor databases, with a bit more restricted appeal. You may
find some of these useful for specific research problems, or for help with your studies. I've
briefly described some below.
Books
This contains links to several online Biomedical books, which are useful for explaining terms
or concepts. A lot of Genetics textbooks are now available.
3D Domains
A collection of protein domain structures.
GEO
A collection of gene expression data from microarray experiments.
OMIM (Online Mendelian Inheritance in Man).
This database is a catalog of human genes and genetic disorders collected at Johns Hopkins
and elsewhere, and developed for the World Wide Web by NCBI.
Taxonomy
The NCBI Taxonomy database contains the names for the organisms whose sequences have
been made public by the collaborating sequence database EMBL, DDBJ, and NCBI/GenBank
or by one of the other public databases that are indexed in Entrez. Currently sequence data are
available for only about 50,000 of the about 2-10,000 million species supposed to exist on
earth.
In these practicals you will predominately access the first two listed, but you should
investigate some of the others. Searching these databases is very similar to the literature
searches you have already performed, and in fact one of the greatest strengths of the Entrez
search engine is that most records are linked to other records, even between databases.
Assume you are working on a gene called ACE2 from Saccharomyces cerevisiae , and wish to
retrieve the DNA and protein sequence. Go to the general NCBI at
http://www.ncbi.nlm.nih.gov/. For now, let's concentrate on finding the DNA sequence of
ACE2 so choose the nucleotide database from the pulldown menu. This search will find
more than 50 documents unless you lare careful about choosing your search criteria.
You will notice that the S. cerevisiae ACE2 gene was sequenced several times, and so has a
lot of entries in the database. This is not unusual - many genes are isolated several times in
several labs, and as the entire yeast genome has been sequenced, each gene is present at least
once! This page shows you that each entry has a specific accession number - Z73303,
U53881, X91258, M55619 and so on. There is a small amount of information provided some of these entries have chunks of sequence for chromosome XII, including ACE2. For
now, display the M55619 sequence in the default format. You should see a page that looks
soemthing like the one shown below, and you should bookmark this page.
Bioinformatik
16
This page gives you a lot of information about ACE2, including listing the protein sequence
(in the one letter code) and the DNA sequence.
•
•
•
What can you tell about the function of ACE2?
What organism was it cloned from?
Where was the research paper published? Note that there is a link to Medline, which
allows you to see the abstract of the article.
Bioinformatik
17
Sequence features include regions like "cds" (coding DNA sequence). This tells you the
coding region of DNA - the region which was translated to give the protein sequence.
•
At what position does the coding sequence begin and end?
Some of the sequence entries contained more than one gene. Have a look at accession number
X91258. (You can carry out the search again, or use your browser back button to find it).
Display this sequence in Graphics format (from a link near the top of the page). You should
see a page like the one below.
Explore the links on this page to see the other information available about this region.
The Limits and Preview/Index menus (and the Search Fields commands) are much more
useful when searching the nucleotide and protein data bases than with the literature databases.
You can find a list of all the search fields and their codes used in the sequence databases in
the Entrez help manual here.
Example: Use Limits to find all the human sequences (excluding ESTs and STSs) added to
the database since 1998:
Bioinformatik
•
•
•
•
•
•
•
18
Select the nucleotide database from the pull down menu.
Type "human" into the query box.
Select Limits
Choose "Organism" from the Limited to... pull down menu.
Check the "Exclude ESTs" and "Exclude STSs" check boxes.
Enter the date range in the modification in the Modification date box. (Format:
YYYY/MM/DD)
Hit Go
Now try to find only mitochondrial sequences from Saccharomyces cerevisiae that were
published in the past 2 years.
The Preview/Index menu is even more powerful for complex searches. This method provides
a menu driven approach that is identical to using Search fields commands. You can watch
your search being built in the query field.
Example: Search the protein database to find all the flavoproteins from bacteria reported in
the Journal of Biochemistry.
•
•
•
•
•
•
•
•
•
•
•
•
Select the protein database from the pull down menu.
Select Preview/Index
Select "Organism" in the pull down menu.
Type "bacter" (no quotes) in the query box, and hit the Index button.
Choose "bacteria" and hit the Add button. (Note the query field now says "bacteria
[Organism]")
Select "Text word" in the pull down menu.
Type "flavoprot" (no quotes) in the query box, and hit the Index button.
Choose the relevant key word (flavoprotein) and click Add. (Note the structure of
your query).
Select "Journal" in the pull down menu.
Type "J Biol" (no quotes) and hit the Index button.
Choose the correct journal and click Add.
Click Go.
Your query should look like: "bacteria"[Organism] AND "flavoprotein"[Text Word] AND "j
biol chem"[Journal Name]"
Now use the Preview/Index menu or enter a search using Search fields terms to find in the
protein database, an electron-tranferring flavoprotein from Megasphaera elsdenii isolated by
Prof Stephen G. Mayhew.
•
How many subunits does the protein have?
Exercises
1. How many alcohol dehydrogenase are there in the PROTEIN database?
2. Find the accession number(s) for the pyruvate decarboxylase gene from
Kluyveromyces marxianus.
3. Use the "search field" syntax or the Preview/Index menu to find all the nucleotide
sequences from chicken entered or modified between 1/1/98 and 31/12/98 with a
length between 1000 and 2000 bp. (Remember to use a colon to specify a range).
Bioinformatik
19
4. How many nucleotide sequences (excluding ESTs) from the chloroplasts of tobacco
are in the databases?
5. Apparently 'the King" branched out from rock-and-roll to DNA sequence analysis
long after his reported death. Find the accession numbers of DNA sequences reported
by Elvis Aaron Presley.
Required reading: the Entrez Entrez help manual
http://www.ncbi.nlm.nih.gov/entrez/query/static/help/helpdoc.html.
Proceed to the next section.
Bioinformatik
20
SEQUENCE ANALYSIS
In the last practical you mastered retrieval of sequences - now it is time for some simple
analysis.
For many cloning purposes, you need a restriction map of your DNA sequence. You can
generate one using the publicly available programme Webcutter. First, you need to select the
DNA sequence - we will use the ACE2 gene you are already familiar with. Return to a page
containing the ACE2 sequence (you may need to search for it again, if you didn't bookmark it
- PubMed is at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed).
Highlight the DNA sequence only. To do this, point the arrow to the end of the DNA
sequence, depress the mouse button, and keep it depressed while you highlight the sequence.
Let go of the mouse - the selected region should be obvious. Copy this region using the Copy
command in the Edit menu. Now, go to NEBCutter (at
http://tools.neb.com/NEBcutter2/index.php). Click the mouse at the position indicated for
pasting your sequence, and choose the Paste command from the File menu. Remember that
once you copy something it will remain on the clipboard (in the computer's memory) until you
copy something new. You can therefore paste the same thing several times. Choose "submit".
Explore the menus that allow you enter specific restriction enzymes, or view all sites. You
can see that this site also translates Open reading Frames, and shows you the position of the
restriction site relative to the ORFs.
•
•
How many times does BglII cut the ACE2 sequence?
How many times does HincII cut? What are the sizes of the resulting fragments?
Exercises
•
•
You wish to monitor the expression of theACE2 gene at the mRNA level. You need to
isolate a DNA probe from within the coding region. Find a restriction enzyme with a
recognition site of at least 6 bases which cuts only twice, releasing a fragment of at
least 1 kilobase.
You need to clone a DNA fragment into the cloning vector pUC18. Find the sequence,
and determine which restriction enzymes cut this plasmid only ONCE. Use this
information to identify the multiple cloning site.
TRANSLATION
The sequences you have examined so far have been annotated in the database - the coding
region has been determined, and the gene identified. Assuming you have cloned and
sequenced a new gene of interest, what can you determine about its function using computer
analysis? The first step is to determine if it is a coding sequence, and what protein it encodes.
This is relatively simple (what would you look for?), but tedious with long sequences. Several
computer programmes are available to carry out "conceptual translation" to search for
potential coding sequences (or Open Reading Frames [ORFs]). Copy the DNA sequence that
is present at the Sequence 1 link on the first page of the Bioinformatics course. One of the
most useful sites is OrfFinder at NCBI. Paste the sequence at the appropriate point. Press
OrfFind when you are ready. Remember that each DNA sequence has six possible reading
frames - three on the top strand, and three on the bottom. Only one strand will be transcribed,
Bioinformatik
21
but when sequencing a gene you rarely know which it is! Reading frames are usually
numbered 1, 2, and 3 for the top strand, and -1, -2, -3 for the bottom strand. There are several
programmes available on the Web for translation of simple sequences, though most are
designed to tranlate one frame at a time. Alternatives are available here , here and here.
Examine some of the outputs.
•
•
•
•
•
How would you define an ORF?
How many ORFs are there in this sequence, and in what reading frames?
Which is the longest ORF?
Could all the ORFs be translated at the same time?
Print out sequence 1, and mark the position of the start and stop sites for the ORFs in
frame +3 and frame -3 (as found by ORFinder).
In the next practical you will some more tests which could give you information about the
gene you have cloned. Please note that these translation programmes can only be used with
sequences that contain no introns, such as bacterial genes, genes from lower eukaryotes or
cDNAs. Do you know why?
POLYMERASE CHAIN REACTION
In your practicals last year you discussed methods for designing oligonucleotides for PCR
amplification. There are numerous programmes available to aid in primer design, and one of
the best is Primer3, and can be found at http://www.broad.mit.edu/cgibin/primer/primer3_www.cgi.
Exercises
•
•
Use the NCBI nucleotide search engine to find a sequence encoding the chimpanzee
alpha-like zeta-1-globin gene. Find a restriction enzyme that will cut outside the
regions flanking the first exon, and determine the size of this fragment.
Design a pair of PCR primers that will amplify the entire exon 1 region yet lie within
this restriction fragment.
Continue to the next section.
Bioinformatik
22
SEQUENCE COMPARISON
In today's practical you will get an introduction to comparing DNA and protein sequences.
Detection of a similarity between sequences can give some information about function of an
unknown protein - for example if it shares similarities with alcohol dehydrogenases from
other species, then it may be another alcohol dehydrogenase. Several proteins may also share
specific patterns relating to function - for example DNA binding regions, or similar catalytic
sites. Comparing several proteins from the same family (e.g. alcohol dehdrogenases) can help
identify which amino acids in the protein are important for function - those which are
conserved (i.e. present in all, or most members of the family) are likely to be most important.
DATABASE SEARCHING
One of the most powerful tools in bioinformatics is the ability to search databases of all
known sequences with an input, or query, sequence. This allows you to take a newly derived
sequence from your gene of interest, and search for any other similar sequences - a "match" or
a "hit" may give you valuable information about the biological function of your gene.
There are various publicly maintained databases, such as DNA sequences, protein sequences,
3-D structure, and protein motifs and patterns. If you are looking for something which
resembles a newly isolated clone, it is usually easier to search with the protein sequence
(why?).
There are many sites and programmes for carrying out database searches. We will use the one
at NCBI. First go to the National Centre for Biotechnology Information page. This time
follow the Blast button. There are a number of programmes you can use.
Nucleotide
BLAST
Input a nucleotide query sequence and search a
nucleotide sequence database. The most commonly
used program is blastn (standard BLAST). Other
options are MEGABLAST,which is a rapid metod
developed for comparing groups of similar sequences
and a version of blastn that is optimised for searching
for short conserved sequences. Discontiguous
megablast is used for sequences that have a very low
level of identity.
Protein
BLAST
Input an amino acid query sequence and search a
protein sequence database.The most commonly used
program is blastp. Other options are PSI- and PHIBLAST (discussed later), and searching for short
conserved sequences.
blastx: Converts a nucleotide query sequence into
Translating protein sequences in all 6 reading frames. The
translated protein products are then compared against
BLAST
the NCBI protein databases. tblastn: Takes a protein
Bioinformatik
23
query sequence and compares it against an NCBI
nucleotide database which has been translated in all six
reading frames. tblastx: Converts a nucleotide query
sequence into protein sequences in all 6 reading frames
and then compares this to an NCBI nucleotide database
which has been translated in all six reading frames.
Follow the Stand nucleotide-nucleotide BLAST (blastn) for the moment. At the top of the
page is the box for entering your sequence. From the pull down menu you can select one of
several NCBI databases to compare your query sequences against. The databases available
will depend on which programme you are using - for example with blastn you can only
search a nucleotide database. Some of those available are:
Nucleotides
Database
Description
nr
All
GenBank+EMBL+DDBJ+PDB
sequences (but no EST, STS, GSS,
or HTGS sequences). Removes
most repetition but not all. This is
the most commonly used database.
refseq
curated references for transcripts,
proteins and genomic regions, plus
computationally derived
nucleotide sequences and proteins
est
Database of
GenBank+EMBL+DDBJ EST
(expressed sequence tag)
Divisions. These can be further
limited to est-human, est-mouse
or est-others.
gss
Genome Survey Sequence,
includes single-pass genomic data,
exon-trapped sequences, and Alu
PCR sequences.
htgs
High Throughput Genomic
Sequences.
pat
Nucleotide sequences derived
from the Patent division of
GenBank.
pdb
Sequences derived from the 3dimensional structure of proteins.
month
All new or revised
GenBank+EMBL+DDBJ+PDB
sequences released in the last 30
days.
Bioinformatik
24
alu
Select Alu repeats from
REPBASE, suitable for masking
Alu repeats from query sequences.
It is available at
ftp://ncbi.nlm.nih.gov/pub/jmc/alu.
See "Alu alert" by Claverie and
Makalowski, Nature vol. 371,
page 752 (1994).
Searches Complete Genomes,
Complete Chromosome, or contigs
chromosome
form the NCBI Reference
Sequence project.
Proteins
Database Description
nr
refseq
month
All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Curated references for transcripts,
proteins and genomic regions, plus
computationally derived nucleotide
sequences and proteins
All new or revised GenBank CDS
translation+PDB+SwissProt+PIR
released in the last 30 days.
The last major release of the SWISSPROT protein sequence database (no
swissprot updates). These are uploaded to the
NCBI system when they are received
from EMBL.
patents
Protein sequences derived from the
Patent division of GenBank.
pdb
Sequences derived from the 3
dimensional structure Brookhaven
Protein Data Bank.
env_nr
Sequences from environmental samples
Genomes: You can restrict your searches to specific genomes (eg chicken, cow), or sets of
genomes (eg fungi) by choosing the special Blast pages available under the "Genomes" link.
The BLAST web pages accept input sequences in three formats; FASTA sequence format,
NCBI Accession numbers, or GIs. The most commonly used format is FASTA, and all the
sequences provided for these practicals are in this format. It is easy to generate the correct
format using any text editor. A sequence in FASTA format begins with a single-line
description, followed by lines of sequence data. The description line is distinguished from the
sequence data by a greater than (">") symbol in the first column. It is recommended that all
lines of text be shorter than 80 characters in length. An example sequence in FASTA format
is:
Bioinformatik
25
>envelope protein
ELRLRYCAPAGFALLKCNDADYDGFKTNCSNVSVVHCTNLMNTTVTTGLLLNGSYS
ENRT
QIWQKHRTSNDSALILLNKHYNLTVTCKRPGNKTVLPVTIMAGLVFHSQKYNLRLRQ
AWC
HFPSNWKGAWKEVKEEIVNLPKERYRGTNDPKRIFFQRQWGDPETANLWFNCHGEF
FYCK
MDWFLNYLNNLTVDADHNECKNTSGTKSGNKRAPGPCVQRTYVACHIRSVIIWLETI
SKK
TYAPPREGHLECTSTVTGMTVELNYIPKNRTNVTLSPQIESIWAAELDRYKLVEITPIG
F
SIMILARITY SCORING
A very detailed tutorial and course on database searching can be found at
http://www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-1.html and
http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html You should
make sure you read and follow these (outside the time set aside for paracticals). Some of the
text below is taken from these documents.
The score assigned to a database search match depends on the similarity between the query
sequence and those in the database, as discussed in the lectures. Identical amino acids are easy
to detect, but similar substitutions are more difficult. Several methods of evaluating similar
amino acids have been developed, based on evolutionary models, structural properties,
chemical properties such as charge, polarity, and shape, and combinations.
The PAM or Dayhoff Family of Matrices
Margaret Dayhoff pioneered the evolutionary approach in the 1970's. She made an extensive
study of the frequencies in which amino acids substituted for each other during evolution. The
studies involved carefully aligning all of the proteins in several families of proteins and
constructing phylogenetic trees for each family. She then inferred the ancestral sequence, and
counted all the replacements on every leg of the tree. This information was used to construct
the PAM ("point accepted mutation") matrix, where 1 PAM corresponds to an average change
in 1% of all amino acid positions. The most widely used PAM matrix is the PAM 250, which
is is appropriate for searching for sequence that have diverged by 250 PAMs or 250 mutations
per 100 amino acids of sequence. Because of back mutations and silent mutations this
corresponds to sequences that are about 20 percent identical.
The BLOSUM Family of Matrices
The BLOSUM matrices were developed by Henikoff and Henikoff. In the PAM method,
entire sequences are compared, including closely related and distantly related regions. To
derive BLOSUM, conserved regions only (blocks) of distantly related sequences were
compared. No gaps were allowed, and no extrapolations were performed. The BLOSUM
matrices are differentiated by number (like BLOSUM 62). The number refers to the minimum
percent identity of the blocks used to construct the matrix; greater numbers are lesser
distances.
Bioinformatik
26
Which matrix to use?
The Blosum and PAM matrices are the most widely used amino acids similarity matrices for
database searching and sequence alignment. Both perform well, but the BLOSUM usually
perform better. This likely reflects the fact that the Blosum matrices are based on the
replacement patterns found in more highly conserved regions of the sequences. This appears
to be an advantage because these more highly conserved regions are those discovered in
database searches and they serve as anchor points in alignments involving complete
sequences. It is reasonable to expect that the replacements that occur in highly conserved
regions will be more restricted than those that occur in highly variable regions of the
sequence. This is supported by the different pattern of positive and negative scores in the two
families of matrices. Some of the difference is also likely to be because the Blosum matrices
are based on much more data than the PAM matrices.
The PAM matrices still perform relatively well despite the small amount of data underlying
them. The most likely reasons for this are the care used in constructing the alignments and
phylogenetic trees used in counting replacements and the fact that they are explicitly based on
a simple model of evolution.
In database searching using a PAM or Blosum table corresponding to an inappropriate degree
of divergence can cause you to fail to discover homologous sequences that are present in the
database. Therefore, a thorough database search will involve using at least 2 and most likely 3
different matrices. Using different matrices usually better than using different programs and
search algorithms.
EXAMPLES
Example 1
First you will attempt to identify the protein sequence present in Sequence 3 using a blastp
search of all protein sequences available.
Use the cut and paste facilities to place the sequence in the correct place, choose the
appropriate search (blastp) and database (nr).
Hit the search button. Your search will be queued for a few minutes, though a link to a map
indicating conserved "domains" or "motifs" present will be shown almost immediately.
Follow the instructions on the screen to format the results of the database search. The time
taken to complete this will depend on how busy the site is (and is usually much faster in the
morning).
Note the size of the database searched! Look at the rest of the output carefully. The results are
first presented as an "image map" - a diagrammatic representation of the matches, with the
colour of each "hit" indicating the level of similarity. Each line represents a "hit", and if you
move the mouse over the line you can see the name of the protein. This is followed by a list of
the proteins hit in order of similarity, with a score and probability that the sequence
similarities are significant. This is followed by a series of alignments between the query
proteins and the hits - the alignment is in the centre, where a + indicates a similar, but not
identical amino acid. Note that some of the amino acids in the query sequence have been
replaced with "XXXXXXX". This is "filtered" - strings of repeated amino acids are removed
to remove false hits - for example a serine rich protein might align with other serine rich
Bioinformatik
27
proteins, but they are not necessarily functionally related. Use your lecture notes to interpret
the significance of the scores.
Some of the sequences have extra links attached, such as the letter"L" in a blue box. This is a
connection to "LocusLink", a database with information on human and mouse loci.
•
What conclusions can you draw about the potential identity and function of sequence
3?
Example 2
Retrieve the protein sequence of Ace2 you used in the last practical class, and search for
homologues in the nr database.
•
•
What sequence is most closely related to Ace2? (Excluding itself!)
Can you see regions which might indicate domains?
Example 3
The examples above are relatively straightforward, and it is easy to identify similar
sequences. If you are searching with a query from Homo sapiens or another higher eukaryote
you may have more difficulty interpreting the output. For example, try searching for
sequences with similarities to the human erythropoietin gene. A portion of this sequence is
available here. Examine this file carefully - it is a partial genomic sequence, encompassing
exons 4 and 5 of the gene. Use the nucleotide sequence to search the nr database.
Examine the output carefully. The top few hits will be almost identical to the query (starting
at base 1). Following the links to see where they originated from. One of these hits is derived
from a large scale sequencing project - of chromosome 7. Follow the link to this site, and see
how much more information is available. It may help to display the results in graphic format.
If you look at the image map, you will see that some of the alignments have a gap. These
sequences are usually drived from mRNA, so they do not contain the introns. Follow the links
to check this out. Are the intron positions conserved in non-human sequences?
Now try searching the est database. This can be very useful for identifying potential
homologues - for example members of a protein family. Until the human genome sequence is
complete it is a useful source for human genes, and remember the database also contains
sequences from other organisms. Can you detect any potential homologues of erythropoietin?
Example 4
The sequence available at Sequence 4 was obtained by sequencing fragments from the
genome of a pathogenic yeast. Try to identify open reading frames that are similar to other
known genes using a blastn search.
As you can see it is difficult to identify, mainly because the sequence supplied is short, and
there is no absolute match in the database. Now try translating the sequence in all six reading
frames (blastx), and searching the nr database. (Remember to check the list of programmes
for the most appropriate one!)
Bioinformatik
•
28
What is the most similar sequence in the database?
Example 5
What would the output be like if your test sequence contained more than one Open Reading
Frame? Retrieve the ucleotide sequence with the accession number AF167163. Use the DNA
sequence in a BlastX search.
•
•
How many open reading frames are there, and what is the most similar sequence ("top
hit") for each?
Which ORF belongs to a family with representatives in many species?
Options for Advanced Blast
For most uses, the default parameters in the Blast searches are sufficient. However,
sometimes you may fail to find a significant match although one exists. There are various
parameters you can change in the two panels below where you paste your sequence. We have
already described the effect of filtering out low complexity sequences. You can also however
filter out highly repetitive sequences from human queries, such as the SINE and LINE
elements. If present in your sequence they were greatly interfere with the output. Other filter
options include mask lower case - if a sequence is entered in upper case and lower case
letters, only the upper case sequence is included in the search. You can use the Limit by
Entrez query option to limit your search to a subset of the database, using a standard Entrez
search command. For example "dehydrogenase NOT Saccharomyces [Organism]" would
limit the search to dehyrogenases in all organisms exept Saccharomyces. You can change the
stringency of the search, by altering the Expect value. This has a default value of 10 -that is,
10 matches are expected to be found merely by chance. Lowering this value will increase the
stringency leading to fewer chance matches reported. You can change the word size, and
when searching the protein database you can change the matrix used (for Example PAM or
BLOSUM). The Format options allow you to control what your output looks like.
EXERCISES
1. A student in your lab (!) is attempting to clone a human gene encoding alcohol
dehydrogenase. She has obtained the following DNA sequence from the clone:
CCATTATCTT CTCCCCGCAC CCGCGTGCAA. Has she cloned the correct gene?
2. The figure supplied in the lab shows a sequence entry from the book "The Lost
World" by Michael Crichton. From what kind of organism is he claiming to have
isolated the sequence? What do you think the sequence is actually from? (hint: search
with the protein sequence).
3. The sequence provided at sequence 5 is from an unusual allele formed by the
integration of a transposon within a yeast gene. Identify the original gene, and the
transposon. Can you identify were in the gene the insertion event occurred?
Other suggested reading: Blast tutorial
Continue to Sequence Alignment pages.
Bioinformatik
29
PAIRWISE ALIGNMENTS
Sometimes it is more useful to compare two sequences directly, rather than searching the
entire database. This method can be used to compare very closely related sequences and
identify divergent regions, or to compare more distantly related sequences to identify strongly
conserved regions.
Example 1
Gene duplication (and enome duplication) is a comon evolutionary mechanism. It has been
postulated that the yeast genome duplicated about 100 million years ago. Although most of
the duplicates have been lost, some remain and have acquired different functions. The Ace2
protein you studied in the last practical is a member of one of these pairs.
•
Find the S. cerevisiae protein that is most closely related to Ace2p using a database
search (you already did this in the last practical).
Copy the Ace2 protein sequence and the sequence of the homologue into a text editor.
(Discuss with your demonstrators which is the most suitable to use). Make sure the sequence
is in Fasta format and use a suitable header (see last practical).
There are many Web sites that provide alignment programmes. Try the version of Blast
designed for comparing two sequences at the NCBI site (Blast 2 sequences). Other pairwise
alignment programs are available here here, and here. You will need to paste each sequence
separately from the text editor into the appropriate box.
•
Choose the correct program for comapring protein sequences. Turn the "filter" option
off, to avoid substitution of repeated sequences with "X". In proteins that are less
closely related you would leave the filter on.
The results will show you a graphical indication of the similarity between the two sequences,
followed by an alignment. In the graph the first sequence is plotted on the X axis and the
second on the Y - a diagonal line indicates similarity. In the diagram beneath, the red regions
are the positions of gaps introduced to improve the alignment. In the alignment, the sequence
in the centre indicates an exact match between amino-acids, and a "+" indicaties a conserved
substitution. Note that the scores are similiar to those used in the database searches. You can
change the matrix used for comparison (as discussed in the lectures).
•
•
How similar are the two proteins?
Are some regions more similar than others?
Example 2
The sequence available at the mutant link contains a single base change relative to the wild
type sequence. Is this likely to make any difference to the function of the protein?
There are a number of ways you can address this question.
•
Find the wild type sequence (see Blast searches from the last practical). Can you see
where the base pair change is? Is it within the coding sequence?
Bioinformatik
•
•
30
Compare the mutant DNA sequence to the wild type protein sequence.
o Copy the mutant DNA sequence and the wild type protein sequence to a text
editor, in Fasta format. Make sure you copy the relevant protein sequence.
(Check your Blast results if you're unsure).
o Go to the alignment pages at Baylor College of Medicine and choose the
LAP2 programme (compares DNA to protein).
o Can you see the result of the mutation? Will it change the protein sequence?
Check the Genetic Code if you're unsure.
You could also translate the mutant as described in a previous practical and then
compare it to the wild type sequence.
MULTIPLE ALIGNMENT
The pairwise alignment shown above can be very useful, but if you are attempting to
determine the important amino-acids (for an example in an active site), the proteins shown
above are too similar. It is often more help to compare several proteins from the same family,
including those from distantly related organisms, such as humans and yeast. One would
expect very little conservation between distantly related species, so any identical or conserved
amino- acids could indicate functional importance. The similaities between proteins can also
indicate the relation between organism, and the results of a multiple alignment analysis are
often used to draw phylogenetic trees.The most commonly used multiple sequence alignment
programme is ClustalW. Some hints for using the Web version of this programme are
available at http://www2.ebi.ac.uk/clustalw/help.html. Make sure you read this - in
particular, it explains the options you have to change the default parameters.
Example 1
Four protein sequences are available at the Tyrosine kinase site. These are from very
different organisms, including yeast and man. Copy these sequences, and go to the Multiple
Alignment page (alternative options here and here). The default settings are fine. You can
change "color" to "yes" if you wish. Paste the sequences in the box provided, and click "Run
CLUSTALW". Print out one example from the class.
•
Can you identify any regions that may be important for kinase function?
Example 2
The sequence of seven members of the IF-1 protein family discussed in the last practical are
available here. The first five sequences are from bacteria or chloroplasts. Carry out a multiple
alignment to see the degree of similarity between these proteins. The last three sequences are
from nuclear eukaryotic genes. Include these in a new alignment.
•
If you were to carry out a mutagenesis experiment to assess the importance of specific
residues to protein function, which residues would you choose?
PROSITE
Bioinformatik
31
Alignments such as those above, combined with experimentation, has led to the identification
of several motifs with particular function within protein sequences. These motifs have been
gathered in a database, and were used to predict conserved domains during the BLAST
searches you performed. You can also search the database Prosite. Use this site to look for
any motif present in the tyrosine kinase sequences.
Go to the Scan Prosite page, and enter the accession number for one of the tyrosine kinase
sequences above (Q62270) at the appropriate point. Click the button to exclude patterns with
a high probability of occurrenceand submit. You can get similar results (often easier to
interpret) using the Conserved Domain Database at NCBI.
•
•
•
What motifs are present in this sequence? What does this tell you about the functional
regions of the protein?
Is there any connection between these motifs, and the conserved regions detected
using the multiple alignment?
Try the same analysis with the Ace2 protein from S. cerevisiae. How much
information do you now have about the potential function of this protein?
EXERCISE
•
1. The molecular biogist who obtained the sequence for Crichton hid a message in the
protein data, revealing his identity. You can find this if you align the dinosaur protein
sequence with its closest homologue (Look for gaps in the sequence). Both sequences
are available at Dino sequences link. Choose your alignment programme carefully!
GENOME LINKS
•
In recent years a large amount of information has been gathered from genome sequencing
projects. Some is collated in specialised databases, and you should examine some of these
today. These include information on the human genome sequencing project at NCBI,
available here. The curated databases contained within this site include LocusLink and
OMIM (Online Medelian inheritance in Man). Use these resources to see what you can
discover about the gene responsible for cystic fibrosis - for example, what chromosome is it
located on, how and when was it cloned, is there any association between specific mutations
and phenotype. A User's Guide to the Human Genome available on the Nature web site
is also an extremely interesting resource for general use. You will need to register for a (free)
user name and password.
Another good example of an organism-specific database in the Saccharomyces Genome
Database. One of the features this site provides is the facility to carry out a BLAST search of
specific Saccharomyces sequences, without using the NCBI server. This is much faster! This
site also provides access to some as-yet unpublished data. For example, the Whitehead
Institute for Biomedical Research has analysed the genomic sequence of several other
Saccharomyces species - S. paradoxus, S. mikatae, and S. bayanus. The data is not completely
available, but there are alignments showing comparisons of specific proteins from all four
species. Follow the link to Homologs and more (under Homologies and comparisons). Now
Bioinformatik
use Synteny viewer to find homologues of the Ace2 protein in the other yeasts. ("Synteny"
means gene order).
1. Is Ace2p conserved in all the yeast species shown? Which protein is most distantly
related to Ace2p from S. cerevisiae? (Note: orthologues are a pair of genes, one in
each species, that are descended from a single gene in the last common ancestor of
those two species.)
2. Is the order of genes surrounding Ace2p conserved in the other yeast species?
32
Bioinformatik
33
Practical 5: Prediction of membrane proteins and subcellular localization of
proteins
Part I: Prediction of membrane proteins
20-30% of all proteins in a cell are membrane proteins, i.e. receptors, channels, pores,
tranporters. Integral membrane proteins span the lipid bilayer at least once. The identification
of potential membrane-spanning segments in a newly identified proteins can be used as a tool
to predict whether this protein might be membrane protein. The prediction of transmembrane
segments is easier than structure prediction of globular proteins, because
−
Amino acids within the transmembrane segment are likely to be hydrophobic (A, V,
L, I, F, W, M),
−
an α-helical arrangement favored in non-polar medium
−
Given the average membrane thickness of 30Å, α-helical transmembrane segments
contain ~18-24 residues
Originally, Hydropathy Plots were used to predict membrane proteins. The algorithm used
by this prediction program calculates an average hydropathy value for each position in the
given sequence. For each position, the algorithm considers the hydropathy index of the
surrounding amino acids (within the given window size, centered around that position), and
then calculates their average. This average value is the one plotted on the graph for that
position.
Hydropathy
•
•
•
Molecules can be hydrophobic (water-fearing) or hydrophilic (water-loving). The
hydropathy of a molecule is one factor in determining its structure. In the case of
proteins, structure has been correlated to function.
The hydropathy plot displays the hydrophobic and hydrophilic tendencies of an amino
acid sequence. A hydropathy scale is used, which has assigned a hydropathy index to
each amino acid (see below), based on its relative hydrophobicity (positive value) or
hydrophilicity (negative value).
The graph plots the average hydropathy of a moving segment of a fixed window size
(which the user enters), moving along the entire sequence. On the plot, a positive peak
indicates a probability that the corresponding polypeptide fragment is hydrophobic (a
negative peak indicating a probable hydrophilic segment).
Hydropathy indexes (Kyte, 1982):
A
Alanine
Ala
1.8
Bioinformatik
R
N
D
C
Q
E
G
H
I
L
K
M
F
P
S
T
W
Y
V
B
Z
Arginine
Asparagine
Aspartic Acid
Cysteine
Glutamine
Glutamic Acid
Glycine
Histadine
Isoleucine
Leucine
Lysine
Methionine
Phenylalanine
Proline
Serine
Threonine
Tryptophan
Tyrosine
Valine
Asparagine or Aspartic Acid
Glutamine or Glutamic Acid
34
Arg
Asn
Asp
Cys
Gln
Glu
Gly
His
Ile
Leu
Lys
Met
Phe
Pro
Ser
Thr
Trp
Tyr
Val
Asn or Asp
Gln or Glu
-4.5
-3.5
-3.5
2.5
-3.5
-3.5
-0.4
-3.2
4.5
3.8
-3.9
1.9
2.8
-1.6
-0.8
-0.7
-0.9
-1.3
4.2
-3.5
-3.5
A commonly used hydrophathy prediction method was developed by Kyte and Doolittle
(Reference: Kyte, Jack, and Russel F. Doolittle. "A Simple Method for Displaying the
Hydropathic Character of a Protein." Journal of Molecular Biology 1982; (157) 105-132.)
Example: Obtain the Fasta format of the protein sequence for the human serotonin
transporter (accession number P31645) from the NCBI database
(http://www.ncbi.nlm.nih.gov/). Open web site
http://fasta.bioch.virginia.edu/fasta/grease.htm. Paste in the sequence and start the plot using
the default window size of 9.
The plot produced by this program represents the average hydropathy along the amino acid
sequence. The plot may help predict whether or not the protein segment has enough
hydrophobicity to either interact with or reside in a membrane.
Window size. Recommended window sizes are odd integers between 5 and 29. The default
window is 9. The window size sets the number of positions that are averaged for each point
on the plot. That means that larger window sizes make smoother plots. If your plot doesn't
have any distinct peaks, you might try a smaller window size. If your plot has many peaks
very close to each other, you might try a larger window size. The window size should be odd
so that the window is not centered between two sequence positions.
Bioinformatik
35
Can you predict how many transmembrane segments the serotonin transporter might have?
Try increasing the window size to obtain a smoother plot and make it easier to identify true
transmembrane regions.
Newer method for the prediction of transmembrane helices
Several other methods have been developed to more accurately predict transmembrane helices
using algorithms that do not just take hydrophobicity of protein segments into account.
Analysis of transmembrane segments in several membrane proteins revealed that these
segments have a more complex structure. There are three major regions in a transmembrane
helix:
−
the helix core (the central part of the helix), which contains mainly
hydrophobic residues and interacts with the fatty acid side chains of the
phospholipids in the membrane
−
the cap region (on either side of the core), which often contains often polar
aromatic residues, and interacts with the phosphate head groups of the
membrane phospholipids
−
the loops on either side outside the membrane, which often contain charged
residues,
A transmembrane helix can be oriented either way, meaning that the N-terminal loop of a
transmembrane helix can be located in the cytoplasm (inside-out orientation) or in the
extracellular space (outside-in orientation). The actual arrangement in the native protein is
referred to as the topology of the transmembrane helix. The topology is determined by the
distribution of charged residues in the loops. Cytoplasmic loops contain more positively
charged residues than extracellular loops, this has been recognized by Gunnar von Heijne and
his co-workers and is known as the “positive-inside rule”.
The knowledge of the structure of transmembrane segments, as well as the statistical
evaluation of known transmembrane helices and the conservation of such helices within a
protein family have been used to develop programs for membrane protein prediction.
A selection of these programs is described below.
Bioinformatik
36
Try and predict the number and position and orientation (topology) of transmembrane
segments in the serotonin transporter with each of the programs below. What is the
localization of the N-terminus and the C-terminus, i.e. cytoplasmic (inside) or extracellular
(outside)? Which prediction(s) do you think is/are more likely to be correct, i.e. might reflect
the actual number and arrangement of transmembrane segments in this protein? Give reasons
for your choice.
Notes:
1. For you practical write-up, it is best to copy/paste the serotonin transporter sequence
into a Word file and then indicate for each prediction the exact position of each
transmembrane segment (electronically by using lines etc. or by hand after printing
out your document) and then compare.
2. The structure of the serotonin transporter has not been solved yet, so we actually don’t
know which prediction is correct. We will have to wait and see…
A) The TMpred program (http://www.ch.embnet.org/software/TMPRED_form.html) makes
a prediction of membrane-spanning regions and their orientation. The algorithm is based on
the statistical analysis of TMbase, a database of naturally occuring transmembrane proteins.
Copy/Paste your sequence (plain text, no spaces, no headers etc.) into the form provided, do
not change any settings, run the prediction. In the output file you will find results for different
stages of the prediction. Under 3.) the final models for your protein are given, with the most
likely one being listed first. For each potential transmembrane segment the amino acid
position are given, as well as the predicted orientation of the helix, i.e. inside-out (i-o) or
outside-in (o-i).
B) The TMHMM program (http://www.cbs.dtu.dk/services/TMHMM-2.0/ ) uses a pattern
recognition methods for transmembrane segment and topology prediction, i.e. this program
takes into account the amino acid distribution pattern in the different regions of a
transmembrane helix (see above).
Copy/Paste your sequence (in FASTA format!) into the form provided, do not change any
settings, run the prediction. The output will give you a list of all transmembrane segments
found and the most likely location of termini and loops as well as a graphical plot of the
result. Use the HELP link for more details on the interpretation of the results.
Bioinformatik
37
[Note: submission to this server is limited, thus some students may not be able to use this
prediction programme during the practical. If possible try again at on another day]
C) The TMAP program (http://www.mbb.ki.se/tmap/ ) is optimized for prediction of
transmembrane helices from multiple sequence alignments, based on the assumption, that a
transmembrane segment can be more reliably predicted when it is conserved within a protein
family.
To use this program you need to perform a multiple sequence alignment (see previous
practical) of the protein in question, i.e. the serotonin transporter, with related proteins, i.e.
other neurotransmitter transporters. Generate the multiple sequence alignment using the
following steps:
−
Copy/paste the neurotransmitter transporter sequences here into the submission form
of the multiple sequence alignment tool ClustalW
(http://www.ebi.ac.uk/clustalw/index.html). Change the following settings:
Select output format “gcg MSF”
Select output order “input”
Run the sequence alignment.
−
Copy/paste the multiple sequence alignment output from ClustalW (everything from
“PileUp” to end of the sequence alignment) into the TMAP
(http://www.mbb.ki.se/tmap/) submission form. Run prediction.
−
The output file will give you the topology, either “Nin” (means N-terminus in
cytoplasm) or “Nout” (means N-terminus extracellularly), then the position of
transmembrane helices for the entire alignment (i.e. PREDICTED
TRANSMEMBRANE SEGMENTS FOR ALIGNMENT), and below that the amino
acid position for each protein in the alignment (i.e. PREDICTED
TRANSMEMBRANE SEGMENTS FOR PROTEIN, followed by a short description
and the accession number). The serotonin transporter sequence should be the first one
(check the accession number).
−
Try to run the same prediction program in the “single sequence mode”
(http://130.237.130.31/tmap/single.html) with the serotonin transporter sequence only
as the input. Compare the results with the “multiple sequence alignment mode”.
Bioinformatik
38
Exercise:
Which of the following proteins are integral membrane proteins? If yes, how many
transmembrane helices can you identify? Give amino acid residue positions. Use the TMpred
programme: http://www.ch.embnet.org/software/TMPRED_form.html. (Note: Make sure you
paste the sequence in plain text, i.e. no spaces, no headers etc.)
5HT1A receptor
P08908
GAPDH
P04797
Aquaporin 2
P41181
Tryptophan hydroxylase 1
P17752
Part II: Prediction of subcellular localization of a protein
In order to learn something about the possible function of a protein, in particular of a newly
identified protein it is useful to predict the subcellular localization of this protein. The
prediction of cellular localization relies largely on the present of cleavable signal sequences
for example in secreted proteins or protein targeted to the mitochondrium and on the
identification of so-called “sorting signals”, i.e. sequence motifs, that determine the targeting
to or retention in particular organelles of a cell. Using the PSORT program
(http://www.psort.org/, follow the link for PSORT II and then click on PSORT II prediction),
try to predict the cellular localization of the serotonin transporter and other proteins listed
below (for sequences click here). To learn more about the individual subprograms simple
click on to the links to obtain background information, such as prediction algorithm,
particular motifs etc.
Note: Make sure you paste in the protein sequences as plain text (no spaces, headers etc.).
Serotonin transporter
P31645
Bioinformatik
39
DNA topoisomerase
Q02880
Calreticulin
P42918
TIM44
O35094
Where in the cell is the most likely localization of each of the proteins above? Which motifs
can you identify? Does the predicted localization seem logical when considering the function
of each protein (look-up the documentation for each protein in the NCBI database, search
with the given accession numbers)?
Anleitung zum Ansetzen von Puffern
40
5 Anleitung zum Ansetzen von Puffern
Theoretischer Teil
Sehr kleine Mengen einer starken Säure oder einer starken Base reichen aus, die
Konzentration der Wasserstoffionen in Wasser im schwach sauren, neutralen oder
schwach alkalischen Gebiet erheblich zu verändern. Der Zusatz eines Tropfens konzentrieter Säure zu einem Volumen reinen Wassers macht dieses deutlich sauer,
beim Zufügen eines Tropfens konzentrierter Alkalilauge wird die Lösung deutlich
basisch. Gleichwohl gibt es Lösungen, deren Wasserstoffionenkonzentration sich bei
Zusatz auch größerer Mengen starker Säure oder starker Base nur recht wenig
ändert. Man bezeichnet solche Lösungen als gepuffert.
Pufferlösungen
Im Stoffwechsel laufen zahlreiche Reaktionen ab, bei denen H+-Ionen freigesetzt
oder verbraucht werden. Andererseits ist ein konstanter pH-Wert im Zytoplasma oder
in bestimmten Körperflüssigkeiten, wie z.B. dem Blut, lebenswichtig. Schon leichte
Abweichungen können zu schweren Krankheitssymptomen führen.
Stoffwechsel bedingte pH-Änderungen müssen also abgepuffert werden. Im Blut wird
diese Funktion von verschiedenen Puffersystemen wahrgenommen.
Allgemein werden Lösungen, deren pH-Wert sich bei Zugabe von Säure oder Lauge
nur wenig verändert, als Pufferlösungen bezeichnet. Sie enthalten ein konjugiertes
Säure-Base-Paar, wobei die Säure OH¯-Ionen neutralisiert, die Base H+-Ionen.
Pufferlösungen lassen sich herstellen, indem man
▪
schwache Säuren mit ihrem Salz (z.B. Acetatpuffer aus Essigsäure und
Natriumacetat) bzw.
▪
schwache Basen mit ihrem Salz (z.B. Ammoniumpuffer aus Ammoniumchlorid
und Ammoniak) mischt.
Sind beide Spezies in genügend großer Konzentration vorhanden, reagiert der
Acetatpuffer so:
H3O+
+
CH3COO¯
CH3COOH
Pufferung bei Zugabe starker Säure
+
H2O
41
Die zugegebene starke Säure reagiert vollständig zu HOAc. Dass der pH-Wert sich
dennoch geringfügig ändert, hat damit zu tun, dass die gebildete Essigsäure wieder
zu einem geringen Anteil dissoziieren kann.
OH¯
+
CH3COOH
CH3COO¯
+
H2O
Pufferung bei Zugabe starker Base
Die Pufferwirkung zeigt sich in der Titrationskurve schwacher Säuren und Basen
durch das Plateau im Bereich des pK-Wertes, ist also der Grund für den geringen
Anstieg der Titrationskurve der Essigsäure bei pH 4 bis 6.
Ähnlich verläuft die Abpufferung starker Säuren und starker Laugen durch den
Ammoniumpuffer:
H3O+
+
NH3
NH4+
+
H2O
Pufferung bei Zugabe starker Säure
OH¯
NH4+
+
NH3
+
H2O
Pufferung bei Zugabe starker Base
In dieser Form wird der Ammoniumpuffer allerdings nur in Ausnahmefällen benutzt.
Dagegen sind Derivate des Ammoniaks wie z.B. als Triethanolammoniumpuffer und
Triethylammoniumpuffer häufig in Gebrauch (s. unten).
Puffergleichung
Das Verhalten einer Pufferlösung lässt sich auch quantitativ beschreiben. Ausgehend
vom Massenwirkungsgesetz für die Dissoziation einer Säure ergibt sich für den
Acetatpuffer:
KS
=
[H+ ] ∗ [CH COO− ]
3
[CH COOH]
3
Durch Umformung erhält man:
[H+ ] = K S ∗
[CH COOH]
3
[CH COO− ]
3
42
Durch Logarithmieren:
pH
=
pK S − log
[CH COOH]
3
[CH COO− ]
3
Die Gleichung zeigt, dass der pH-Wert der Pufferlösung vom pKS-Wert der Essigsäure und vom Verhältnis Essigsäure/Acetat bestimmt wird. In allgemeiner Form ist
die Puffergleichung als Henderson-Hasselbalch-Gleichung bekannt.
pH
=
pK S
[ A- ]
+ log
[HA]
Liegt zum Beispiel in einem Acetat-Puffer Essigsäure in einer Konzentration von 0,1
mol/L und Acetat mit 0,5 mol/L vor, so lässt sich mit Hilfe der Puffergleichung der pHWert bestimmen:
pH
=
4,8 + log
0,5
= 5,5
0,1
Liegen Säure und konjugierte Base in gleicher Konzentration vor, so ist [HA]/[A¯] = 1
und die Puffergleichung wird zu pH = pKS. Die Pufferlösung kann bei diesem pHWert Zugaben von Säuren und Basen gleich gut abpuffern Liegt die Säure in zehnfach höherer Konzentration vor, so gilt pH = pKS – 1. Der Puffer wirkt dann effektiv
gegen die Zugabe von Basen, kann aber nur noch geringe Mengen Säure abpuffern.
Als Faustregel gilt deshalb für den optimalen Einsatzbereich von Pufferlösungen:
pH = pKS ± 1
Da sich der Pufferbereich in der Titrationskurve deutlich abzeichnet, lässt sich anhand von Titrationskurven der pKS-Wert schwacher Säuren oder Basen ermitteln
(geringste Steigung der Kurve).
Beispiel 1: Phosphatpuffer
Phosphorsäure dissoziiert über drei Stufen:
H3PO4
+
H2O
H2PO4¯
+
H3O+
pKS1 = 2,0
H2PO4¯
+
H2O
HPO42¯
+
H3O+
pKS2 = 7,2
HPO42¯
+
H2O
PO43¯
43
+
H3O+
pKS3 = 12,3
Die pK-Werte machen deutlich, dass die Protonenabgabe nach jeder Stufe schwieriger wird. H3PO4 ist eine mittelstarke Säure, während H2PO4¯ eine schwache und
HPO42¯ eine sehr schwache Säure ist. Entsprechend sind die Plateaus der Titrationskurve verteilt.
Zu Beginn der Titration liegt der pH bei etwa 1,5. Die Lösung enthält fast nur die Spezies H3PO4 und H2PO4¯. Durch die Zugabe der Natronlauge werden sukzessive die
H3PO4-Moleküle in H2PO4¯-Ionen überführt, bis beim ersten Äquivalenzpunkt (ÄP1)
fast nur noch H2PO4¯-Ionen vorliegen. Beim zweiten Äquivalenzpunkt (ÄP2) liegen
nur noch HPO42¯-Ionen vor und beim dritten (ÄP3) nur noch PO43¯, wobei der dritte
Äquivalenzpunkt keine markante Steigung aufweist, weil er in den pH-Bereich des
Titrationsmittels Natronlauge fällt (0,1 M NaOH hat einen pH-Wert von 13). Das Plateau im Bereich des pKS2 basiert auf der Pufferung durch das konjugierte SäureBase-Paar H2PO4¯/HPO42¯. In der Praxis wird der Phosphatpuffer häufig verwendet,
um neutrale pH-Bereiche einzustellen, denn nach pH = pKS ± 1 liegt der Pufferbereich zwischen pH 6,2 und 8,2.
44
Beispiel 2: Kohlensäurepuffer
Kohlendioxid (CO2) und Wasser sind die mengenmäßig wichtigsten Endprodukte des
Stoffwechsels. In Wasser gelöstes Kohlendioxid reagiert zur zweiprotonigen Kohlensäure (1), die in einer Nachfolgereaktion dissoziiert (2).
(1)
(2)
CO2
+
H2CO3
H2O
+
H2CO3
H2O
pK = 3,1
+
HCO3¯
+
H3O
HCO3¯
+
H3O+
pKS1 = 3,3
Gesamtreaktion:
CO2
+
2 H2O
pK = 6,4
Das Gleichgewicht der ersten Reaktion liegt auf der linken Seite (pK = 3,1). Löst man
also Kohlendioxid in Wasser, so liegt es vorwiegend als CO2 und nur zu einem geringen Teil als H2CO3 vor.
Reaktion (1) und (2) lassen sich zusammenziehen und die pK-Werte addieren. Das
Gleichgewicht der Gesamtreaktion liegt noch stärker auf der linken Seite: gelöstes
Kohlendioxid reagiert als schwache Säure. Die konjugierte Base ist das Hydrogencarbonat.
Kohlendioxid und Hydrogencarbonat bilden ein Puffersystem mit einem pH-Optimum
bei pH = 6,4.
[HCO3 - ]
pH = 6,4 + log
[CO2 ]
Das Kohlendioxid-Hydrogencarbonat-Puffersystem ist das bedeutendste Puffersystem des Blutes, das auf einen pH von 7,4 gepuffert ist. Das Konzentrationsverhältnis der Puffersubstanzen bei pH 7,4 lässt sich somit berechnen:
10
(pH −pK S )
[HCO3 - ]
10
=
= 10
=
[CO2 ]
1
1
Da die Reaktionskonstante aber temperaturabhängig ist, der pKS der Gesamtreaktion
im Blut bei 37°C 6,1 statt 6,4 bei 25°C beträgt, wird das Konzentrationsverhältnis
dann zu:
1,3
10
45
=
[HCO3 - ]
20
=
[CO2 ]
1
Es liegt also ein Überschuss an Hydrogencarbonat vor, so dass das Puffersystem
vor allem gegen H3O+-Ionen wirkt. Da CO2 ein Gas ist, steht das gelöste CO2 im
Gleich-gewicht mit dem CO2 der Luft in der Lunge. Hier zeigt sich eine Besonderheit
dieses Puffersystems. Es kann nämlich nicht nur H3O+-Ionen am Entstehungsort
abpuffern, sondern auch das Reaktionsprodukt aus dem Gleichgewicht entfernen:
H3O+
+
HCO3¯
→
CO2 ⁭
+
2 H2O
Der senkrechte Pfeil deutet an, dass das Kohlendioxid die Lösung verlässt und in die
Gasphase geht (das System verlässt). Man spricht deshalb auch von einem offenen
Puffersystem.
Praktischer Teil
1.
1 M Tris-hydroxymethyl-aminomethan (TRIS) pH 8,0
(HO-CH2-)3C-NH3+ +
H3O+
(HO-CH2-)3C-NH2 +
FgTRIS
121,14
pK
8,1
Menge:
0,5 L
Einwage:
? g Tris
pH einstellen mit HCl
2.a
Dinatriumhydrogenphosphat/NaCl/KCl
Zunächst muss eine 1 M KCl-Lösung hergestellt werden.
FgKCl
74,55
Menge:
100 mL
Einwage:
? g KCl
Na2HPO4
7,1 g
Molarität:
?M
NaCl
40,91 g
Molarität:
?M
H2O
KCl (1 M)
46
6,625 mL
Molarität:
?M
Auffüllen auf 500 mL mit Wasser.
2.b
FgNaCl
58,44
FgNa2HPO4
141,96
Natriumdihydrogenphosphat/NaCl/KCl
NaH2PO4 x H2O
1,42 g
Molarität:
?M
NaCl
8,182 g
Molarität:
?M
KCl (1 M)
1,325 mL
Molarität:
?M
Auffüllen auf 100 mL mit Wasser.
FgNaH2PO4 x H2O
2.c
137,99
10x Phosphate Buffered Saline (PBS) pH 7,0
Der 10x PBS-Puffer wird hergestellt, indem ein vorgelegtes Volumen von x mL
Puffer Nr. 5 mit Puffer Nr. 6 auf pH 7,4 eingestellt wird.
3.
0,5 M Ethylendiamintetraessigsäure (EDTA) pH 8,0
(HOOC-CH2-)2N-CH2-CH2-N(-CH2-COOH)2
EDTA
+
Mg2+
FgEDTA
292,24
Menge:
0,2 L
Einwage:
? g EDTA
Mg2+EDTA
+
2 H+
pH einstellen mit NaOH
4.
1 M Imidazol pH 7,4
Imidazol
+
H3O+
Imidazol-H+
+
H2O
FgImidazol
68,08
pK
6,95
Menge:
0,5 L
Einwage:
? g Imidazol
5.
50 mM HEPES pH 7,4
4-(2-Hydroxyethyl)-piperazin-1-ethansulfonsäure
FgHEPES
238,3
pK
7,5
Menge:
0,1 L
Einwage:
? g Hepes
47
Der Peptidtransporter TAP
48
6 Der Peptidtransporter TAP
Der Peptidtransporter TAP (transporter associated with antigen processing) spielt eine
zentrale Rolle in der adaptiven Immunantwort (Abbildung 1). TAP transportiert Peptide, die
vornehmlich durch proteasomale Degradation von endogenen Proteinen erzeugt wurden, vom
Zytosol ins Endoplasmatische Rekikulum. Dort binden diese Peptide an MHC (major
histocompatibility complex) I Moleküle. Peptidbeladene MHC I Moleküle werden an die
Zelloberfläche transportiert, um dort cytotoxischen T-Zellen die antigene Fracht zu
präsentieren. Erkennt eine T-Zelle ein MHC I gebundenes Peptid so führt dies zur
Eliminierung dieser Virus infizierten oder entarteten Zelle.
Abbildung 1: MHC I abhängige Antigenpräsentation. Bei der klassischen MHC I Antigenpräsentation
werden endogene Proteine vornehmlich durch das Proteasom degradiert. Die Peptide werden mit Hilfe von TAP
ins Lumen des ER transportiert und dort auf MHC I Moleküle geladen. Anschließend werden die MHC IPeptidkomplexe an der Zelloberfläche cytotoxischen T-Zellen präsentiert. Die Faltung, Assemblierung und
Beladung der MHC I Moleküle wird durch mehrere Chaperone gesteuert.
TAP gehört der Familie der ABC-Transporter an. TAP bildet einen Heterodimer bestehend
aus TAP1 und TAP2 (Abbildung 2). Jede Untereinheit ist aus einer N-terminalen
Transmembrandomäne (TMD) und einer C-terminalen Nukleotidbindungsdomäne, die sich
im Zytosol befindet, aufgebaut. Die TMD von TAP1 besteht aus 10 und von TAP2 aus 9
Transmembranhelizes.
Die
TMD
bildet
sowohl
die
Translokationspore
und
die
49
Peptidbindungsregion. Die NBD bindet über konservierte Sequenzmotive ATP und treibt
durch die ATP-Hydrolyse den Peptidtransport an.
TAP2
TAP1
N-Domäne
Kerndomäne
Kerndomäne
N-Domäne ER Lumen
N
Transmebrandomäne
1 2 3 4 5 6
6 5 4 3 2 1
Zytosol
N
Peptidbindungsregion
Nukleotidbindungsdomäne
C
C
Abbildung 2: Topologiemodell des TAP-Komplexes. TAP besteht aus den zwei Halbtransportern TAP1 und
TAP2, die je aus einer TMD und einer NBD aufgebaut sind. Die sechs C-terminalen Transmembranhelices
bilden die Kerndomäne. Die äußeren Helices sind nicht essentiell für die Transportfunktion, sondern sind an der
Rekrutierung von Tapasin beteiligt. Die Peptidbindungsregion ist gelb dargestellt.
Eine Vorraussetzung für den Peptidtransport ist die Bindung des Peptids an TAP. Im
Gegensatz zum Peptidtransport, der die ATP-Hydrolyse zwingend erfordert, ist die
Peptidbindung ATP-unabhängig. TAP bindet vorzugsweise Peptide mit einer Länge von 8 –
16 Aminosäuren, wobei die höchste Transporteffizienz für 8 – 12 Aminosäuren lange Peptide
erzielt wird. Allerdings werden auch Peptide mit einer Länge bis zu 40 Aminosäuren so wie
sterisch anspruchsvolle Peptide, die Modifikationen an den Seitenketten tragen, von TAP
erkannt und transportiert.
Mit Hilfe von kombinatorischen Peptidbibliotheken wurde die Peptidspezifität von TAP im
Detail untersucht. Demnach sind für die Erkennung der Peptide die drei N-terminalen und die
C-terminale Aminosäuren von Bedeutung. Da die Spezifität des C-terminalen Restes sowohl
für das Immunproteasom, TAP und auch MHC I übereinstimmt, geht man von einer
Koevolution dieser drei Faktoren der Antigenpräsentation aus. Die unterschiedliche Spezifität
von MHC I und TAP bzgl. des N-Terminus des Peptids führt dazu, dass im ER-Lumen ein
Trimmen durch Exopeptidasen erfolgt. Der mittlere Bereich des Peptides, welcher
vornehmlich an der T-Zellrezeptorbindung beteiligt ist, kann höchst divergent sein in Bezug
auf MHC I und TAP-Bindung. Dadurch wird gewährleistet, dass eine kleine Menge von
50
Peptidtransportern und MHC I Molekülen eine sehr große Peptiddiversität darstellen kann,
um den Körper vor einem viralen Angriff zu schützen.
Im Rahmen dieses Praktikums sollen die enzymatischen Eigenschaften von TAP, wie z.B. die
Kinetik der Peptidbindung, die ATP-Bindung und der ATP-getriebene Transport
charakterisiert werden. Dazu müssen TAP-haltige Membranen aus Insektenzellen isoliert
werden und die Peptide für die kinetischen Untersuchungen mit Fluorophoren markiert
werden.
51
6.1 Membranpräparation aus Sf9-Insektenzellen (Versuch 1 A)
Humanes TAP, bestehend aus TAP1 und TAP2, wird mit Hilfe des Baculovirus, der die Gene
für TAP1 und TAP2 trägt, in Sf9 Insektenzellen exprimiert. Bei dem Baculovirus handelt es
sich um ein DNA-Virus, der der Familie der Baculoviridae angehört. Dieses Virus befällt
Endothelzellen des Mitteldarms von bestimmten Insekten wie Spodoptera frugiperda. Das
Baculovirusgenom besteht aus einer zirkulären, doppelsträngigen, 130 kb langen DNA. Die
Gene für TAP1 und TAP2 befinden sich auf dem Baculovirusgenom hinter dem Polyhedrinund P10-Promotor. Beides sind starke Promotoren, die schon kurz nach der Infektion
angeschaltet werden. Ca. 24 h nach der Infektion ist TAP im Western Blot nachweisbar. Die
Ernte der Zellen erfolgt ca. 48 h nach der Infektion. Längere Expressionszeiträume sind
ungünstig, da proteolytische Degradationsprodukte von TAP auftreten. Die Insektenzellen
werden bei einer Zelldichte von 1.8 * 106 Zellen pro ml mit einer MOI 3-5 (muliplicity of
infection, gibt das Verhältnis von Virus zu Zellen an) infiziert. Die Zellen werden nach 48h
geerntet, einmal mit PBS gewaschen und das Zellpellet bei -20 °C gelagert.
Um biochemische Versuche mit TAP durchführen zu können, werden TAP-haltige
Membranen hergestellt. Dabei werden die Zellen mechanisch aufgeschlossen, nicht
aufgeschlossene Zellen, Zelltrümmer und Nuclei werden durch Zentrifugation bei niedrigen
g-Zahlen
entfernt.
Anschließend
werden
die
Membranen
Ultrazentrifugationsschritt aufkonzentriert und bei -80 °C gelagert.
Material
20 mM Tris/HCl, pH 7,4
1 M DTT
2,5 M Sucroselösung
1×PBS (Phosphate buffered saline)
137 mM NaCl
2,7 mM KCl
10 mM Na2HPO4
2 mM KH2PO4
pH 7,0
durch
eine
52
Protease-Inhibitoren
250 mM Benzamidin in H2O
100 mM PMSF (Phenylmethylsulfonylfluorid) in Isopropanol
Protokoll
Alle Schritte werden auf Eis oder bei 4°C (Zentrifugationen) durchgeführt.
Das Pellet einer 300 ml Sf9-Insektenzellkultur wird auf Eis aufgetaut und in 20 ml Tris-Puffer
aufgenommen, 1% (v/v) Proteaseinhibitoren und 1 mM DTT werden zugegeben. Die Zellen
werden in einem Dounce-Homogenisator durch Hoch-und Runterziehen (40×) des Pistills
(tight) aufgeschlossen. Die Suspension wird mit 2,5 M Sucroselösung zu einer
Endkonzentration von 250 mM Sucrose versetzt, noch 3x gedouncet und in ein 50 ml
Falconröhrchen überführt. Nicht aufgeschlossene Zellen und Zelltrümmer werden mittels
Zentrifugation pelletiert (4 min bei 200×g und 8 min bei 700×g). Der Überstand wird in ein
UZ-Röhrchen überführt und 20 min bei 20.000 x g zentrifugiert. Der Überstand wird
verworfen und das Pellet in 2 ml 1×PBS resuspendiert. 50-100 µl Aliquots werden in
flüssigem Stickstoff schockgefroren und bei -80 °C gelagert.
53
Protein-Bestimmung: BCA-Assay (Versuch 1A)
Die Konzentration des Gesamtproteins wird mit Hilfe eines Bicinchoninsäure (BCA) Assays
bestimmt. Der sensitive, kolorimetrische Nachweis von Proteinen bei diesem Assay resultiert
2+
aus der Reduktion von Cu
+
zu Cu durch die Peptidbindung und bestimmte Aminosäuren.
BCA bildet mit dem reduzierten Kupfer einen Farbkomplex, dessen Absorption bei 584 nm
gemessen wird.
Der BCA-Test wird nach Anleitung des Herstellers (Fa. Pierce) durchgeführt. Die
Standardkurve mit 7 Punkten wird wie folgt zusammenpipettiert:
Verdünnung
PBS
BSA-Quelle
BSA Endkonzentration µg/ml
A
1000 µl 53 µl stock [2 mg/ml]
100 µg/ml
B
100 µl 300 µl Verdünnung A
75 µg/ml
C
375 µl 375 µl Verdünnung A
50 µg/ml
D
375 µl 375 µl Verdünnung C
25 µg/ml
E
375 µl 375 µl Verdünnung D
12,5 µg/ml
F
375 µl 375 µl Verdünnung E
6,25 µg/ml
G
375 µl
-
0 µg/ml
Die zu messenden Proben werden in verschiedenen Konzentrationen vermessen: 1:10, 1:100,
1:200, 1:500 und 1:1000 verdünnt in PBS. Die Anzahl der einzelnen Messungen des BCATestes wird berechnet und die Färbelösung laut folgender Formel angesetzt (pro Messung
werden 150 µl Färbelösung benötigt). Alle Proben und Standardpunkte werden in Duplikaten
vermessen.
Ansatz der Färbelösung
25 Teile Komponente A
24 Teile Komponente B
1 Teil Komponente C
Gesamt 50 Teile
Aufgrund von Pipettierfehlern wird empfohlen, 10 % mehr Färbelösung als benötigt
anzusetzen.
150 µl angesetzte Färbelösung und 150 µl verdünnte Probe werden in einer Mikrotiterplatte
gemischt. Die Platte wird mittels Parafilm vor Verdunstung geschützt und für 30 min bei 37
°C inkubiert. Anschließend werden die Absorptionen am Fluostar bei einer Wellenlänge von
584 nm ermittelt und die Proteinkonzentration berechnet.
54
Peptid-Bindungsassay mittels Fluorescein-markiertem Peptid (Versuch 1A)
Der Peptid-Bindungsassay mittels Fluorescein-markiertem Peptid dient der Quantifizierung
von TAP in den Membranen. Aufgrund der Tatsache, dass in Membranen gegen einen großen
Hintergrund von Proteinen gearbeitet wird, bietet dieser Assay die Möglichkeit, die Menge an
TAP über die Menge an TAP-spezifisch gebundenem Peptid zu bestimmen. Die Spezifität der
Bindung wird durch einen 400fachen Überschuss an unmarkiertem Kompetitor Peptid
bestimmt.
Material
Peptide
Fluoreszenz Peptid (RRYC(F)KSTEL)
Kompetitor Peptid (RRYQKSTEL)
137 mM NaCl
2,7 mM KCl
10 mM Na2HPO4
2 mM KH2PO4
pH 7,0
Lyse-Puffer
1xPBS
pH 7.0
SDS
1% (w/v)
Platte (Fluoplate)
96-well schwarz, für Fluoreszenz Messungen
C4F
30 µM
R9LQK
4.2 mM
55
Protokoll
1. TAP – Bestimmung mittels Zentrifugationsassay (3fach-Bestimmung)
Tabelle 1
Bindung
Kompetitor
100 µl
100 µl
2.5 µl
2.5 µl
Kompetitor Peptid [R9LQK, 4.2 mM]
-
7 µl
PBS
47.5 µl
TAP1/2
C4F Peptid [30 µM]
1.
40.5µl
zu 150 µl TAP-haltigen Membranen werden 500 µl Bindungs-Puffer gegeben.
2. Wie in Tabelle 1 dargestellt, alle Komponenten bis auf TAP-haltige Membranen (wie
in 1 verdünnt) zusammenpipettieren (Die Messung erfolgt in Triplikaten).
3. Durch Zugabe der TAP-haltigen Membranen die Bindung starten.
4. 15 min auf Eis inkubieren.
5. Bindungsansatz durch Zugabe von 1 ml eiskaltem PBS verdünnen und anschließend
sofort bei 14.000 rpm und 4°C für 8 min abzentrifugieren.
6. Pellets in 100 µl eiskaltem PBS resuspendieren und anschließend nochmals mit 900 µl
PBS verdünnen (zügig arbeiten).
7. Zentrifugation bei 14.000 rpm bei 4°C für 8 min.
8. Pellets in 300 µl Lyse-Puffer bei Raumtemperatur resuspendieren und 10 min
inkubieren.
9. Jeweils 250 µl der lysierten TAP-haltigen Membranen bzw. 250 µl der Standardreihe
in ein Well der Fluoreszenz-Platte überführen und die Fluoreszenz am ELISA-Reader
vermessen ( λex/em = 485/520 nm)
56
2. Erstellen der Standardkurve
Tabelle 2
C4F Endkonz.
Lysis-Puffer
C4F Peptid (300 nM)
0 nM
600 µl
0
2 nM
596 µl
4 µl
4 nM
592 µl
8 µl
8 nM
584 µl
16 µl
Auswertung
Bestimmung der TAP-Konzentration:
Anhand der Standardkurve soll die Menge an TAP (Mw TAP1/2: 150 kDa)
bestimmt werden
a. Konzentration in mol/l
b. in Gewichtsprozenten der Gesamtproteinmenge
10 nM
580 µl
20 µl
57
58
6.2 Markierung von Peptiden mit Iodoacetamidofluorescein (Versuch 1B)
Zur biochemischen Charakterisierung von TAP werden Fluorescein-markierte Peptide
verwendet. Zur Markierung werden die Peptide, welche ein Cystein enthalten, mit
Iodoacetamidofluorescein über einen nucleophilen Angriff markiert. Anschließend werden die
Peptide über eine „reversed phase high performance liquid chromatography“ (RP-HPLC)
aufgereinigt, um freien Farbstoff und nicht markiertes Peptid abzutrennen. Die Flüssigkeit
wird durch eine Gefriertrocknung (Lyophilisierung) entfernt und die Qualität des
aufgereinigten Produkts mittels analytischer HPLC und MALDI-TOF-MS (matrix assisted
laser desorption ionisation time of flight mass spectrometry) verifiziert.
Material
10x PBS pH 6.5
1,37 M NaCl
27 mM KCl
100 mM Na2HPO4
20 mM KH2PO4
PBS/DMF (Dimethylformamid)
PBS pH 6.5
20% (v/v) DMF
Iodoacetamidofluoreszein (Mw 515 Da)
50 mM in 100% DMF lösen
Peptid (RRYCKSTEL; Mw 1155 Da)
0.5 mg Peptid in PBS/DMF lösen
Konzentration von 3.5 mM einstellen
Tricinpuffer
100 mM Tricine pH 9.0
RP-HPLC
HPLC Anlage der Firma Jasco
Säule: Vydac-218TP510-C18 Protein & Peptid Säule (4,6 x 250 mm)
59
Reinigungsprogramm (präparativ)
Eluent A: H20/0.2% TFA (trifluoro acetic acid)
Eluent B: Acetonitril/0.2% TFA
Flussrate: 1 ml/min
0 – 0,5 min 100% A
0,5 – 30,5 min linearer Gradient 15 – 44% B
30,5 – 32,5 min 100% B (1,5 ml/min)
32,5 – 36,5 min 100% A (1,5 ml/min)
Reinigungsprogramm (analytisch)
Eluent A: H20/0.2% TFA (trifluoro acetic acid)
Eluent B: Acetonitril/0.2% TFA
Flussrate: 1 ml/min
0 – 0.5 min 100% A
0.5 – 15.5 min linearer Gradient 15 – 45% B
15.5 –17.5 min 100% B
17.5 – 21.5 min 100% A (1,5 ml/min)
Alternative Reinigungsprogramme:
Eluent A: H2O/ 50 mM Ammoniumacetat pH 6.5
Eluent B: 75%/ 50 mM Ammoniumacetat pH 6.5
Flussrate: 1 ml/min
Gradient muß etabliert werden!
Protokoll
Es wird die Einwaage von Iodoactamidofluorescein so gewählt, dass ein ca. 2-facher molarer
Überschuss über das zu markierende Peptid vorliegt. Der frisch angesetzten Peptidlösung
werden 10 µl und der Iodoactamidofluoresceinlösung werden 2 µl entnommen und mit je
400 µl Wasser gemischt. Die Retentionszeiten beider Substanzen werden getrennt analysiert.
Dem Peptid wird ein 1.2-facher molarer Überschuss an Iodoacetamidofluorscein zugegeben
und für ca. 60 min im Dunkeln bei Raumtemperatur inkubiert (Peptid und Fluorophor müssen
möglichst schnell nach dem Ansetzen gemischt werden, da ansonsten Nebenreaktionen
stattfinden, die die Ausbeute stark verringern(welche?)). Danach erfolgt die Aufreinigung
mittels HPLC. Es werden 0,5 ml Fraktionen gesammelt. Die gewünschten Fraktionen werden
vereinigt, in flüssigem Stickstoff eingefroren und über Nacht gefriergetrocknet. Am nächsten
Tag wird das Peptid in 100 µl Wasser gelöst und die Konzentration über die Absorption von
Fluorescein (ε492 = 75000 cm-1 M-1 bei pH 9,0) bestimmt. Dazu werden eine 1:10 und 1:100
Verdünnung in 100 mM Tricinepuffer der Peptidlösung hergestellt (jeweils 20 µl) und die
Absorption bei 492 nm gemessen. Das Peptid wird mit Wasser auf eine Konzentration von
60
300 µM eingestellt und bei -20 °C gelagert. Zur Verifizierung werden nochmals 20 µl des
markierten Peptids mittels HPLC analysiert und die Masse wird mittels MALDI-TOF-MS an
einem Voyager-DE (PerSeptive Biosystems) Massenspektrometer bestimmt. Dazu werden 0.5
µl Peptid mit 0.5 µl gesättigter α-Cyano-4-hydroxyzimtsäure in einem Verhältnis von
33:67:0.1 Wasser/Acetonitril/TFA auf dem Probenteller gemischt: Durch Verdunstung bilden
sich Kristalle für die MALDI-TOF-MS (Die Massenbestimmung findet nicht während des
Praktikums statt).
Auswertung
Bestimmen Sie die Konzentration fluoresceinmarkierten Peptids.
Was ist die Ausbeute der Markierungsreaktion, in Prozent des eingesetzten Peptids?
61
6.3 Bestimmung der Bindungsaffinität (KD-Wert) eines Peptids zu TAP
(Versuch 1C)
Der
Peptidtransporter
TAP
transportiert
Peptide
vom
Zytosol
ins
Lumen
des
Endoplasmatischen Retikulums. Die höchste Transporteffizienz wird für Peptide mit einer
Länge von 8 – 12 Aminosäuren beobachtet. Die Spezifität der Peptide beschränkt sich auf die
drei N-terminalen und die C-terminale Aminosäurenreste. Bevor die Peptide jedoch
transportiert werden, binden sie an TAP.
In diesem Praktikumsversuch soll in einem Bindungssättigungsexperiment mit den zuvor
isolierten TAP-haltigen Membranen die Bindungsaffinität (KD-Wert) eines Peptids zu TAP
bestimmt werden. Dazu wird die Peptidbindung an TAP in Abhängigkeit von der
Peptidkonzentration untersucht. Um das Peptid detektieren zu können, wurde es mit dem
Fluorophor Fluoreszein markiert, der sich mit Hilfe eines Fluoreszenzspektometers sehr
empfindlich nachweisen lässt. Als Peptid wurde ein Nonamer (RRYQC(Fluoreszein)STEL)
ausgewählt, da dies der optimalen Peptidlänge für TAP entspricht.
Material
10x PBS pH 7.0
1,37M NaCl
27mM KCl
100mM Na2HPO4
20mM KH2PO4
Bindungspuffer
1x PBS
1mM DTT
Peptide
C4F: RRYC(Fluoreszein)KSTEL
R9LQK: RRYQKSTEL
TAP
TAP-haltige Membranen
Zum Absättigen der Filtermembranen
0,3% PEI-Lösung (Poly-ethylenimin)
Solubilisierungspuffer
1xPBS/1% SDS
62
Protokoll Bindungssättigungsexperiment:
Auf einer 96-well Platte werden die TAP-haltigen Membranen, das Fluoreszein-markierte
Peptid (C4F) und das ungelabelte Kompetitorpeptid bei 4°C zusammengegeben und mit
Bindungspuffer auf das gleiche Volumen gebracht. Die Zugabe des 400-fachen molaren
Überschusses von unmarkiertem Kompetitorpeptid (RRYQKSTEL) ermöglicht die
Unterscheidung zwischen unspezifischer Bindung (an Gefäßwände, an die Lipidschicht der
Membranen) und spezifischer Assoziation an TAP.
Durch
20
min
Inkubation
der
Proben
auf
Eis
wird
die
Einstellung
des
Bindungsgleichgewichtes abgewartet. Währenddessen werden Filterplatten mit einem
Porendurchmesser von 0,65 µm erst mit 150 µL 0,3% iger PEI-Lösung (lädt Filtermembran
positiv) für 10 min inkubiert anschließend abgesaugt. Die Wells der Filterplatte werden
einmal mit 100 µl Bindungspuffer gewaschen. Dann werden die Proben auf die vorbereiteten
Filterplatten übertragen und ungebundenes Peptid abgesaugt. Die zurückbleibenden
Membranen werden dreimal durch Zugabe und Absaugen von 100 µl kaltem Bindungspuffer
gewaschen. Diese Arbeitsschritte sind wegen der einsetzenden Dissoziation bereits
gebundener Peptide sehr zeitkritisch und werden daher mit einer Multi-Kanal-Pipette
durchgeführt. Danach wird die Filterplatte von dem Filtrationsapparat abgenommen.
Durch Zugabe von 250 µl Solubilisationspuffer (enthält 1% SDS, daher bei
Zimmertemperatur aufbewahren) wird TAP denaturiert (Achtung: Schaumbildung vermeiden)
und die gebundenen Peptide innerhalb von 5 Minuten freigesetzt. Aus den 250 µl Solubilisat
werden 200 µl entnommen und auf eine schwarze Fluoreszenzmicrotiterplatte überführt.
Um die exakte Menge an gebundenem Fluoreszein-Peptid zu quantifizieren, werden
Eichlösungen zubereitet und je 200 µl in die Fluoreszenzmicrotiterplatte pipettiert. Für alle
Proben wird mit einem Elisa-Reader (Ex/Em: 485/520 nm) die Fluoreszenz bestimmt.
Pipettierschemata:
Von allen Konzentrationen werden Triplikatbestimmungen angefertigt.
Das Pipettierschema soll sich anhand folgender Endkonzentrationen von Fluoreszenz-Peptid,
Kompetitor-Peptid (400 x Überschuss, jedoch min. 1µl von 4,2mM Stocklösung) und TAPhaltigen Membranen (40 µg Gesamtproteinmenge/“well“) erstellt werden. Das
Gesamtvolumen beträgt 50 µl.
63
Pipettierschema Bindungssättigungsexperiment
Probe
1-3
C4F
Kom-
TAP
Endkonz.
petitor
40 µg
[nM]
R9LQK
[µl]
3
10
nM
nM
C4F
nM
C4F
nM
C4F
nM
R9LQK
Bindung-
4,2 mM
spuffer
[µl]
+
10-12
13-15
C4F
+
4-6
7-9
C4F
30
-
60
-
16-18
19-21
+
22-24
25-27
100
+
28-30
31-33
200
+
34-36
37-39
-
300
40-42
+
Pipettierschema Bindungssättigungsexperiment Eichgerade:
Konzentration [nM]
0
0.5
2.5
5
7.5
10
Fluoreszein-Peptid [µl]#
-
3#
15#
30#
45#
60#
Solubilisationspuffer [µl]
600
597
585
570
555
540
# aus Stocklösung: 100 nM
64
Auswertung
1. Stellen Sie eine Eichbeziehung zwischen der Fluoreszenzintensität im Emissionsmaximum
und der C4F-Peptidkonzentration her.
2. Wie ist der Kd-Wert für das untersuchte Peptid?
65
6.4 Untersuchung der Nukleotidbindung von TAP (Versuch 1D)
Als Mitglied der ABC-Transporterfamilie besitzt TAP die Eigenschaft ATP zu hydrolysieren
um den Transport von Peptiden über die ER-Membran zu energetisieren. Eine Voraussetzung
für die ATP-Hydrolyse ist, dass TAP ATP binden kann. Sowohl TAP1 als auch TAP2
besitzen eine Nukleotidbindungsdomäne (NBD). Beide NBDs bilden zusammen zwei ATPBindestellen die durch hochkonservierte Motive charaktrisiert sind.
In diesem Versuch soll die ATP-Bindung von TAP1/2 untersucht werden. Dabei erfolgt die
Bindung von TAP an ATP-Agarose, gefolgt von spezifischer Elution mittels ATP oder
anderen Nukleotiden und Quantifizierung der Bindung mittels Immunoblotting zur Detektion
des eluierten TAP.
Material
Mikrosomen
250 μl Sf9-Membranen (Proteinkonzentration 5 mg/ml)
Solubilisierung-Puffer
20 mM HEPES pH 7.4
150 mM NaCl
1 mM KCl
5 mM MgCl2
2% NP-40
15% Glycerin
Wasch-Puffer
20 mM HEPES pH 7.4
150 mM NaCl
1 mM KCl
5 mM MgCl2
0.2% NP-40
15% Glycerin
Elutions-Puffer
Wasch-Puffer
ATP (500 nM - 5 mM ATP)
5 mM MgCl2
ATP-Agarose
66
Protokoll
Wichtig: Alle folgenden Schritte werden auf Eis durchgeführt! Sowohl die Puffer als
auch die Zentrifuge müssen kalt sein.
Solubilisierung:
Mikrosomen werden auf Eis aufgetaut, 8 min bei 4°C und 14000 rpm zentrifugiert und einmal
mit 300 µl PBS gewaschen. Mikrosomen-Pellet wird in 250 μl Solubilisierung-Puffer
resuspendiert und mindestens 30 min auf Eis inkubiert (Achtung: Luftblasen vermeiden!).
Anschließend wird der Solubilisierungs-Ansatz 30 min bei 4°C und 55.000 rpm im TLA-55
Rotor (Tisch-Ultrazentrifuge) zentrifugiert.
Vorbereiten der ATP-Agarose:
Während der Zentrifugation kann die ATP-Agarose vorbereitet werden. Es werden 5mg ATPAgarose pro Ansatz benötigt. Die Gesamtmenge an ATP-Agarose wird in ddH2O equilibriert
(1ml pro 5mg). Die Beads werden dann gleichmäßig auf die verschiedenen Ansatz-tubes
verteilt und 2 mal mit je 500µl Waschpuffer gewaschen (2 min @ 200xg, 4°C).
Vorbereiten der Nukleotide:
Es müssen 8 verschiedene Ansätze mit verschiedenen Nukleotidkozentrationen vorbereitet
werden. Das Gesamtvolumen eines Ansatzes beträgt 300 µl. Von diesen 300 µl sind 250 µl
Solubilisat – der Rest besteht aus PBS-Puffer und Nukleotid.
Es soll der IC50-wert für verschiedene Nucleotide bestimmt werden (ATP, ADP, CTP und
GTP) – jeweils eine Gruppe wird einen IC50 bestimmen.
Bitte rechnen Sie vor Pratikumsbeginn die Ansätze aus: Es wird für jedes Nucleotid eine
Stammlösung von 300mM zur Verfügung gestellt.
Folgende Konzentrationen im jeweiligen Ansatz wird benötigt:
Enkonzentration
Nukleotid [µM]
0
3
10
30
100
250 µl
250 µl
250 µl
250 µl
250 µl
300
1000
5000
250 µl
250 µl
250 µl
c (Nucleotid)
V (Nucleotid)
V (PBS)
V(Solubilisat)
V (gesamt)
jeweils 300 µl
Es können höchstens 1µl pipettiert werden und bitte nicht mehr als 2 Vorverdünnungen vom
Nucleotid machen.
67
Nachdem die Zentrifugation für das Solubilisat abgeschlossen ist, kann das Solubilisat
abgenommen werden und jeweils 250 µl auf den Nukleotidansatz gegeben werden. Diese
Ansätze werden 15 min auf Eis inkubiert.
Danach werden die Nukleotidansätze auf die vorbereiteten Beads gegeben und mindestens 1
Stunde bei 4°C rotierend inkubiert.
Die Ansätze werden danach 3 mal mit 500 µl Waschpuffer gewaschen (jeweils 2 min bei
200xg zentrifigieren). Die Beads werden dann in 50 µl 3xSDS-Ladepuffer resuspendiert und
20 min bei 65°C inkubiert. 15 µl der Sampels und 4 µl vom Marker werden auf das Gel
geladen, das Gel gefahren und TAP durch den Western-Blot sichtbar gemacht.
SDS-Polyacrylamidgelelktrophorese
Der elektrophoretischen Trennung von Proteinen unter denaturierenden Bedingungen dienen
diskontinuierliche SDS-Polyacrylamidgele. SDS ist ein anionisches Detergenz, das die
Eigenladung von Proteinen überdeckt, so dass Komplexe mit konstanter negativer Ladung pro
Masseneinheit entstehen. So können Proteine in einem Polyacrylamidgel nach ihrem
Molekulargewicht getrennt werden.
Materialien
BioRad-System SDS-Page Gießapparatur
Acrylamid/Bisacrylamid (30%)
Trenngelpuffer: 1,5 M TRIS/ HCl pH 8,8, 0,4 % SDS
Sammelgelpuffer: 0,5 M TRIS/ HCl 0,4 % SDS pH 6,8
10 % APS (Ammoniumperoxidsulfat)
TEMED
Protokoll
Zur
Vorbereitung
der
SDS-Gele
werden
die
Gießkammern
entsprechend
der
Herstellerangaben vorbereitet. Zuerst wird die Lösung des Trenngels laut Pipettierschema
angesetzt, in die beiden Gießkammern bis zu 2/3 der Gesamthöhe pipettiert und anschließend
mit 1ml Isopropanol überschichtet, um alle Luftblasen zu entfernen und eine scharfe
Abschlusskante des Trenngels zu erreichen. Nachdem das Gel vollständig auspolymerisiert ist
wird das Ethanol entfernt, die vorbereitete Sammelgellösung aufgetragen und die
Probenkämme eingefügt. Die fertigen Gele werden (einschlagen in feuchte Papiere und
Lagerung in einer kleinen Plastiktüte) über Nacht im Kühlschrank gelagert.
68
Pipettierschema für 2 SDS-Page Gele 7*8,5 cm des Biorad-Systems:
H2O
Acrylamid/Bisacrylamid
Trenngelpuffer
Sammelgelpuffer
APS 10%
TEMED
Trenngel 10%
3,95 ml
3,1 ml
3,95 ml
40 µl
20 µl
Sammelgel 5%
2,7 ml
650 µl
1,1 ml
30 µl
10 µl
Immunoblot
Material
SDS-PAGE
Nitrocellulose-Membran
Filterpapier
Transfer-Puffer
25 mM Tris/HCl pH7.5
192 mM Glycin
0.03% (w/v) SDS
20% (v/v) Methanol
10xTBS
200 mM Tris
2.5 M NaCl
Protokoll
Nitrocellulosemembran und Filterpapier werden zunächst zurecht geschnitten und in
Transferpuffer getränkt. Die untere und obere Elektrodenplatte des Semi-Dry-Blots werden
vor dem Auflegen der Filterpapiere mit dem Transferpuffer angefeuchtet. Auf den
Filterpapier wird dann die Membran und anschließend das gewünschte Gel aufgelegt. Zum
Schluss wird eine weitere Lage von Filterpapier auf das Gel aufgelegt. Die obere
Elektrodenplatte schließt den Aufbau ab. Achtung! Luftblassen zwischen den einzelnen
Schichten des Blots lassen sich gut durch vorsichtiges Rollen mit einem 50 ml Falcon
heraus drücken. Die Dauer des Blottingverfahrens beträgt 75 mA/Gel für 90 min.
Anschließend werden die Membranen 1 h, RT, in 3% Milch-TBST-Lösung abgesättigt. TAP1
wird mit dem 148.3 Hybridomüberstand und TAP2 mit dem 435.3 Hybridomüberstand (1:5 in
3% Milch-TBST-Lösung) nachgewiesen. Dabei wird die Membran mit dem Antikörper
mindestens 1 h bei RT (besser über Nacht bei 4°C) inkubiert. Anschließend wird die
Membran 3x 5 min mit TBST gewaschen und 1h mit dem Ziege-Anti-Maus-HRP-Konjugat
(Fc spezifisch), 1:20000 in 3% Milch-TBST, bei RT inkubiert. Schließlich werden die
69
Membranen 3x 5 min mit TBST gewaschen und mit ECL1- und ECL2-Lösung 1 min
inkubiert. Abschließend werden die Membranen in einem Lumi-Imager (Roche) ausgewertet
und die Daten mit dem Computerprogramm „Prism“ mit einer sigmoidalen Kurve gefittet.
Gleichung:
Y = B+
A− B
(1 + 10(
( LogIC 50 − X )* n )
)
A: maximales Signal
B: Hintergrund
IC50: Halbmaximale Verdrängung
X: Logarithmus zur Basis 10 der ATP Konzentration
N: Hill-Koeffizient
Auswertung
Bestimmen Sie den IC50 Wert für ATP aus der Menge an eluiertem TAP.
70
6.5 Peptidtransportassay (Versuch 1E)
Der in vitro Peptidtransportassay basiert auf der Glykosylierung des transportierten Peptids
im Lumen des Endoplasmatischen Reticulums (ER), sowie dessen spezifische Bindung an
Concanavalin A-Sepharose und anschließender Trennung von nicht transportiertem Peptid.
Der Transport wird in Abhängigkeit von der ATP-Konzentration (0.1-10 mM), PeptidKonzentration (0.1-2 µM), Temperatur (4-40°C), Kationen und pH (pH 5-9) untersucht.
Durch Ermittlung der Fluoreszenz im ELISA-Reader wird der Peptidtransport quantifiziert.
Material
Peptid: RRYQNSTCL (NST-f)
137 mM NaCl
2,7 mM KCl
10 mM Na2HPO4
2 mM KH2PO4
pH 7,0
Stopp-Puffer
1xPBS
10 mM EDTA (pH 8,0)
Lyse-Puffer
50 mM Tris/HCl, pH 7.5
150 mM NaCl
5 mM KCl
1 mM CaCl2
1 mM MnCl2
1% NP-40
Elutionspuffer
Lyse-Puffer
200 mM Methyl α-D-Mannopyranoside
(15 ml Lyse-Puffer versetzt mit 582.6 mg Methyl α -D-Mannopyranoside)
71
Gruppe 1 + 2: ATP-Konz. ( 0; 0,1; 0,2; 0,3; 1; 3; 6; 10 mM)
Gruppe 3:
Peptid-Konz. (0.1; 0,2; 0,5; 1; 1,5; 2µM)
Gruppe 4:
Temperatur (4, 12, 24, 32, 40 °C)
Gruppe 5:
pH (pH 5: MES-Puffer; pH 6: Na-Phosphatpuffer; pH 7: HEPES-Puffer;
pH 8: Tris-Puffer; pH 9: CHES-Puffer)
Gruppe 6:
Kationen (statt 5 mM MgCl2 im Puffer: NiCl2, CuCl2, MnCl2, CaCl2)
Protokoll
Das Pipettierschema für den Transportassay sieht folgender Weise aus:
Membranen
ATP [30mM]
MgCl2 [100 mM]
1 U/µL apyrase
1x PBS
NST-f peptid [10 µM]
Transport
20 µl
5 µl
2.5 µl
17.5 µl
5 µl
Apyrase
20 µl
2,5 µl
1 µl
21,5 µl
5 µl
Der Reaktionsansatz wird bis auf das Peptid in 1,5 ml Reaktionsgefäße vorgelegt (auf Eis!).
Je nach untersuchtem Parameter bietet es sich an, einen Mastermix anzusetzen. Alle
Reaktionsansätze werden 1min bei 32°C vorinkubiert (wichtig für die Apyrase!). Der
Transport wird durch Zugabe vom NST-f Peptid gestartet und erfolgt für 3 min bei 32 °C.
Anschließend wird durch Zugabe von 1 ml eiskaltem Stopp-Puffer der Transport gestoppt.
Nach Zentrifugation für 8 min bei 14000 rpm und 4°C wird der Überstand verworfen und das
Pellet in zunächst 100 µl Lyse-Puffer resuspendiert und mit weiteren 800 µl versetzt. Die
Reaktionsansätze werden nach einer Inkubation von 15 min bei Raumtemperatur für 8 min
bei 14000 rpm zentrifugiert. In der Zwischenzeit werden ConA-beads (50%ige Suspension
(w/v)) in ein kleines Falcon überführt und 2 x mit Lysepuffer gewaschen (2 min @ 1000 x g).
Der Überstand der lysierten Membranen wird zu 50 µl gewaschenen ConA-beads pipettiert
und 1 h bei 4°C bzw. 30 min bei RT überkopf rotiert. Die Reaktionsansätze werden für 2 min
bei 1000 rpm und 4°C zentrifugiert und der Überstand verworfen. Die ConA-beads werden
zweimal mit je 1 ml Lyse-Puffer gewaschen, mit 300 µl Elutionspuffer versetzt und 30 min
bei RT inkubiert. Die beads werden für 2 min bei 1000 rpm pelletiert, 250 µl der Elution in
eine
schwarze
Microtiterplatte
(λex/em=485/520 nm).
pipettiert
und
im
ELISA-Reader
quantifiziert
Funktionelle Charakterisierung der Nukleotidbindedomäne von MDL1
72
7 Funktionelle Charakterisierung der Nukleotidbindedomäne von
MDL1
Das mitochondriale Protein MDL1 (multidrug resistance like) aus Saccharomyces
cerevisiae (S. cerevisiae) wird der Superfamilie der ABC (ATP-binding cassette)
Transporter zugeordnet. Bei den ABC Transportern handelt es sich um
Transmembranproteine, die den aktiven Transport einer Vielzahl löslicher
Substanzen, wie z.B. Ionen, Zucker, Aminosäuren, Chemotherapeutika oder
Proteine, über Zellmembranen katalysieren.
Alle bekannten ABC Transporter haben mehrere Strukturmotive gemeinsam. Sie
üben ihre Funktion als Dimer aus und sind aus vier Domänen aufgebaut. Hierzu
zählen die zwei hydrophoben Transmembrandomänen (TMDs), welche mit 5-10 αHelices die Membran durchspannen und so den Translokationskanal bilden. Jede
TMD ist mit einer hydrophilen Nukleotidbindedomäne (NBD) verbunden, welche die
Energetisierung des Transportprozesses vermittelt. Man geht heute davon aus, dass
sich nach Dimerisierung der beiden NBDs die Konformation der TMDs verändert und
die Substrattranslokation bewirkt.
Intermembranraum
1 2 3 4 5 6
6 5 4 3 2 1
TMD
Matrix
N
mitochondriale
Signalsequenz
ATP
N
ATP
NBD
C
C
Peptide?
MDL1, das in der inneren Mitochondrienmembran als Homodimer vorliegt, besteht
aus 695 Aminosäuren und hat ein Molekulargewicht von ca. 70 kDa (Young et al.
2001). Die physiologische Funktion, wie auch die strukturelle Organisation des
MDL1-Komplexes ist bisher weitgehend unbekannt. Es wird diskutiert, dass MDL1 an
der Qualitätskontrolle in Mitochondrien beteiligt ist, in dem es den Transport von
Peptiden, die durch den proteolytischen Abbau von nicht-assemblierten
Membranproteinen in der Matrix entstehen, katalysiert (Young et al. 2001).
73
Möglicherweise vermittelt MDL1 über diesen Peptidtransport die Kommunikation
zwischen Mitochondrium und der zellulären Umgebung.
Für die Energetisierung des Transportprozesses spielen die ATP-Bindung, die
anschließende NBD-Dimerisierung und ATP-Hydrolyse eine zentrale Rolle. Für den
ATP-Hydrolysezyklus der MDL1-NBD wurde ein „processive clamp“-Modell
vorgeschlagen (Janas et al. 2003, van der Does et al. 2005). Dieses Modell besagt,
dass die Dimerisierung der NBDs durch die Bindung von zwei ATP-Molekülen
induziert wird. Im Anschluss werden die beiden ATP-Moleküle schrittweise
hydrolysiert, wobei das gebildete Phosphat direkt aus dem Dimer freigesetzt wird.
Erst wenn die Hydrolyse beider ATP-Moleküle abgeschlossen ist, folgt die
Dissoziation der beiden NBDs und die Freisetzung der beiden ADPs.
In diesem Praktikumsteil soll die Funktion der C-terminalen NBD (Aminosäure 423695) von MDL1 charakterisiert werden. Hierfür soll zunächst die klonierte His„getaggte“ MDL1(WT)-NBD (WT wildtype), wie auch die MDL1(E599Q)-NBD, in
Escherichia coli (E. coli) überexprimiert und über eine Metallchelataffinitätschromatographie gereinigt werden.
Im Anschluss soll die ATP-Hydrolyse-Rate der MDL1(WT)-NBD mit Hilfe des
malachitgreen-assays analysiert werden. Des Weiteren soll im Laufe dieses
Praktikums mit Hilfe von Fluoreszenzlöschung die Assoziation der MDL1(E599Q)NBD charakterisiert werden.
Literatur:
-
-
-
van der Does C., Presenti C., Schulze K., Dinkelaker S., Tampé R. (2006):
Kinetics of the ATP hydrolysis cycle of the nucleotide-binding domain of MDL1
studied by a novel site-specific labeling technique.
J. Biol. Chem. 281, 5694-5701
Janas E., Hofacker M., Chen M., Gompf S., van der Does C. und Tampé R.
(2003):
The ATP hydrolysis cycle of the nucleotide-binding domain of the
mitochondrial ATP-binding cassette transporter Mdl1p.
J. Biol. Chem. 278, 26862-26869
Young L., Leonhard K., Tatsuta T., Trowsdale J. und Langer T. (2001):
Role of the ABC-transporter Mdl1 in peptide export of mitochondria.
Science 291, 2135-2138
74
7.1 Überexpression der MDL1-NBD in E. coli (Versuch 2A)
Für die Überexpression werden kompetente BL21(DE3) Zellen mit dem Plasmid
pET28b-MDL1-NBD (Abbildung 1) bzw. pET28b-MDL1(E599Q)-NBD transformiert.
N-terminal des MDL1-NBD-Gens befindet sich ein T7 Promotor. Dieser kann durch
Zugabe von IPTG (Isopropyl-β-D-thiogalactopyranosid) induziert werden, wodurch es
zu einer Überexpression der MDL1-NBD bzw. MDL1(E599Q)-NBD kommt.
E. coli hat eine optimale Wachstumsrate bei 37°C und einer Schüttelgeschwindigkeit
von 180 rpm. Nach Erreichen einer optischen Dichte (OD600) von 0,5 in LBKanMedium (Selektionsmarker Kanamycin) werden die Zellen mit IPTG induziert. Die
eigentliche Proteinexpression nach Induktion erfolgt bei 30°C, da bei dieser
Temperatur eine höhere Ausbeute und weniger Abbauprodukte zu erwarten sind. Die
Zellen werden für 3 h kultiviert und anschließend durch Zentrifugation geerntet.
75
7.1.1 Versuch 2A-1
Materialien

Plasmiden
pET28b-MDL1-NBD
bzw.
pET28b-MDL1(E599Q)-NBD
transformiert in BL21(DE3) Zellen; (LBKan-Agarplatte)
3 autoklavierte 250 ml Erlenmeyerkolben
autoklaviertes LB-Medium
autoklavierte Zahnstocher
Kanamycin [30 mg/ml], sterilfiltriert
Schüttler für die Zellkultur, Temperatur 37°C und 30°C
Medienzusammensetzung

LB (Luria Betani)-Medium
5g
10 g
5g
Hefeextrakt
Trypton/Pepton Mischung
NaCl
ad 1 l H2O, autoklavieren
Das Antibiotikum Kanamycin wird in einer Endkonzentration von 30 µg/ml zu
dem abgekühlten Medium gegeben.
Protokoll
Sterile Bedingungen
⇒
Alle Arbeiten werden an der Flamme
ausgeführt!
Alle geöffneten Gefäße und Deckel sollten in
unmittelbarer Nähe der Flamme liegen!
Zahnstocher nicht mit der Hand berühren!
Am Abend werden 3 autoklavierte 250 ml Erlenmeyerkolben unter sterilen
Bedingungen mit 100 ml Kanamycin-haltigem LB-Medium gefüllt. Mit pET28bMDL1-NBD bzw. pET28b-MDL1(E599Q)-NBD transformierte BL21-Codon Plus (DE)Zellen werden mit Hilfe eines Zahnstochers (Zahnstocher mit abgeflammter Pinzette
anfassen) in dem vorbereiteten Medium angeimpft und über Nacht bei 37°C und
einer Schüttelgeschwindigkeit von 180 rpm inkubiert.
76
7.1.2 Versuch 2A-2
Materialien

autoklavierte 2 bzw. 5 l Kolben mit 1 bzw. 2 l LB-Medium (6 l Medium pro
Konstrukt)
Kanamycin [30 mg/ml], sterilfiltriert
IPTG Stammlösung [1 M], sterilfiltriert
drei 100 ml MDL1-NBD bzw. MDL1(E599Q)-NBD-Übernachtkulturen
sterile Pipetten 1 und 10 ml
Photometer, Küvetten
1 l Zentrifugenbecher
Zentrifuge
50 ml Falcons
Tris-HCl [1 M, pH 8] & NaCl [5 M]
Standardpuffer [20 mM Tris-HCl, 100 mM NaCl, pH 8]
flüssiger Stickstoff
Pufferzusammensetzung

Standardpuffer
20 mM
100 mM
Tris-HCl, pH 8
NaCl
Protokoll
Am Morgen werden 6 l LB-Medium mit den vorbereiteten MDL1(WT)-NBD bzw.
MDL1(E599Q)-NBD-Kulturen angeimpft. Hierzu wird dem autoklavierten LB-Medium
Kanamycin in einer Endkonzentration von 30 µg/ml unter sterilen Bedingungen
zugegeben. 1 l Medium wird anschließend mit jeweils 10 ml der Übernachtkultur
angeimpft. Die Kulturen werden bei 37°C und 180 rpm inkubiert. In Zeitabständen
von 20 min wird eine Probe der Kultur entnommen und im Photometer bei einer OD
von 600 nm vermessen. Die Genexpression wird bei Erreichen einer OD600 von 0,5 0,6 mit IPTG in einer Endkonzentration von 0,2 mM induziert. Nach Induktion wird
die OD600 alle 30 min überprüft. Nach 3 Stunden Inkubation bei 30°C und 180 rpm
werden die Zellen durch Zentrifugation (4.100 x g, 20 min, 4°C) geerntet. Die Pellets
(Überstand nach Zentrifugation vollständig entfernen) werden in Standardpuffer
(12 ml pro 6 l Kultur) resuspendiert, in 50 ml Falcon-Röhrchen überführt und in
flüssigem Stickstoff schockgefroren. Die Lagerung der Zellen erfolgt bei -20°C.
Auswertung
Das Wachstumsverhalten der E. coli Kulturen vor und nach Induktion ist in einem
Diagramm (OD600 gegen Zeit) festzuhalten und soll im Diskussionsteil des Protokolls
diskutiert werden.
77
7.2 Aufreinigung der MDL1-NBD bzw. MDL1(E599Q)-NBD von
E. coli (Versuch 2B)
Um mit der Nukleotidbindedomäne von MDL1 biochemische Experimente
durchführen zu können, ist es wichtig das überexprimierte Protein aus den E. coli
Zellen zu isolieren. Hierzu werden die Zellen zunächst mit Ultraschall
aufgeschlossen. Um das Protein aus dem Zelllysat zureinigen wird im Anschluss
eine Metallchelataffinitätschromatographie durchgeführt. Durch diese Anwendung
wird das gewünschte Protein von Proteinverunreinigungen befreit und kann in
nahezu reiner Form isoliert werden.
Die MDL1-NBD bzw. die MDL1(E599Q)-NBD wird über einen N-terminalen His6-Tag
an mit Ni2+-IDA (Iminodiacetat) beladene Säulen gebunden. Im Anschluss folgen
mehrere Waschschritte mit steigenden Imidazol Konzentrationen, wodurch
unspezifisch und schwächer gebundene Proteine von der spezifisch gebundenen
MDL1-NBD abgetrennt werden. Die eigentliche Elution des Proteins findet mit
250 mM Imidazol statt.
Um nachzuweisen, dass die MDL1(WT)-NBD bzw. MDL1(E599Q)-NBD in den
Eluatfraktionen enthalten ist, wird eine SDS-Polyacrylamid-Gelelektrophorese (SDSPAGE) durchgeführt. Mit Hilfe der SDS-PAGE können Proteine im elektrischen Feld
nach ihrer Größe aufgetrennt werden. Hierzu werden die Proteine mit einem SDShaltigen Auftragspuffer versetzt, welcher einerseits die Proteindenaturierung
unterstützt und andererseits die Eigenladung der Proteine überdeckt, womit eine
elektrophoretische Auftrennung analog zur Proteinmasse möglich ist. Um eine
vollständige Denaturierung der Proteine zu gewährleisten werden die Proben 5 min
bei 65°C inkubiert. In unserem Fall wird eine diskontinuierliche Gelelektrophorese
nach Laemmli (1970) durchgeführt, d.h. man macht sich den pH-Unterschied
zwischen Sammel- und Trenngel zunutze um fokussierte Proteinbanden zu erhalten.
Aufgrund der Größe der MDL1-NBD (33 kDa) wird als Trenngel ein 12%-iges SDSPolyacrylamidgel
(SDS-PAGE;
SDS
Natriumdodecylsulfat;
PolyacrylamidGelelektrophorese) benutzt, welches nach abgeschlossener Proteinauftrennung mit
Coomassie gefärbt wird. Anschließend werden die Protein enthaltenden Fraktionen
vereinigt.
Für weitere Versuche ist es wichtig, vorhandenes Imidazol mit Hilfe einer PD-10
Säule aus dem Puffer zu entfernen. Im Anschluss wird die Konzentration des
Gesamtproteins mit Hilfe des Bradford-Tests bestimmt.
78
7.2.1 Versuch 2B-1
Materialien

E. coli Zellpellet mit induziertem MDL1-NBD bzw. MDL1(E599Q)-NBD
PMSF [200 mM] in EtOH (frisch ansetzen, Lagerung auf Eis)
50 ml Becherglas
Ultraschall
Ultrazentrifuge, 40 ml Ultrazentrifugenröhrchen
Ni-IDA Säule (5 mL HiTrap, Amersham)
NiSO4 [0,1 M]
5 ml Spritzen
Tris-HCl [1M, pH 8,0]
Imidazol [1 M, pH 7,5]
NaCl [5 M]
gekühlte Puffer A [20 mM Tris-HCl, 100 mM NaCl, 20 mM Imidazol, pH 8,0]
gekühlte Puffer B [20 mM Tris-HCl, 100 mM NaCl, 250 mM Imidazol, pH 8,0]
Aekta System
Zentrikon (cut off 10 K)
PD10 Säulen
Standardpuffer (vergleiche Versuch 2A-2)
Bradfordreagenz, (Pierce, Lagerung RT, dunkel)
BSA [2 mg/ml]
Fluostar (Elisa Reader), Mikrotiterplatte (Flachboden)
Medienzusammensetzung

Puffer A zur Proteinreinigung
20 mM
100 mM
20 mM
Tris-HCl, pH 8
NaCl
Imidazol
Puffer B zur Proteinreinigung
20 mM
100 mM
250 mM
Tris-HCl, pH 8
NaCl
Imidazol
SDS-PAGE Ladepuffer (5-fach)
2,50 ml
2,50 ml
1,25 ml
0,75 g
0,25 ml
Tris-HCl [1 M, pH 8]
Glycerin [v/v 87%]
β-Mercaptoethanol [14,3 M]
SDS
Bromphenolblau [10 mg/ml]
ad 10 ml H2O, Lagerung –20°C
79
Protokoll
Ab der Proteinreinigung werden alle Arbeitsschritte auf Eis oder bei 4°C
durchgeführt! Das aufgearbeitete Protein wird immer auf Eis im Kühlraum
gelagert!
Die vorbereiteten Zellpellets werden aufgetaut (Gesamtvolumen mit kaltem
Standardpuffer auf 50 ml auffüllen, damit die Ultrazentrifugationsröhrchen
vollständigt
gefüllt
sind)
und
mit
dem
Proteaseinhibitor
PMSF
(Phenylmethylsulfonylfluorid) in einer Endkonzentration von 0,2 mM versetzt. Die
Zellen werden im Eisbad 3 x 2 min, mit jeweils 40 sec Pause (output control 5, duty
cycle 50) beschallt. Anschließend werden die Zelltrümmer und unaufgeschlossene
Zellen durch Ultrazentrifugation (45 min, 100.000 x g, 38.000 rpm Ti60, 4°C) entfernt.
Die Reinigung des Proteins aus dem Zelllysat erfolgt mittels IMAC (immobilisierte
Metallchelataffinitätschromatographie). Dazu wird eine IDA Säule vorbereitet. Die
Säule wird mit 15 ml MilliQ-H2O gewaschen, mit 5 ml 0,1 M NiSO4 5 min bei
Raumtemperatur inkubiert und anschließend mit Puffer A zur Proteinreinigung
äquilibriert. Die Reinigung erfolgt am Aekta System. Der Überstand nach
Ultrazentrifugation wird mit einem Fluss von 2 ml/min auf die Säule aufgetragen.
Durch einen Waschschritt mit einer Imidazolkonzentration von 80 mM (74% Puffer A,
26% Puffer B) und einem zweiten Waschschritt mit 100 mM Imidazol (65% Puffer A,
35% Puffer B), werden schwach gebundene Proteine von der Säule gewaschen. Die
eigentliche Elution findet mit einer Imidazolkonzentration von 250 mM (100% Puffer
B) statt. Während der Elution werden 1 ml Fraktionen aufgefangen. 30 µl jeder
Fraktion werden mit 10 µl SDS-Ladepuffer (5-fach) gemischt, 5 min bei 65°C
inkubiert und im Kühlschrank gelagert. Die Fraktionen mit dem größten Proteinanteil
werden vereinigt. Um den Puffer auszutauschen, wird das Protein auf eine PD-10
Säule aufgetragen. Die Säule wird zunächst mit 25 ml Standardpuffer äquilibriert.
Danach wird das Protein (2,5 ml) aufgetragen und die ersten fünf Fraktionen (je
500 µl) werden aufgefangen. Anschließend wird das Protein in 500 µl Schritten mit
Standardpuffer eluiert. Während dieser Zeit müssen weiterhin 500 µl Fraktionen des
Eluats aufgefangen werden (es sollten circa 20 Fraktionen gesammelt werden). Im
Anschluss wird jede einzelne Eluatfraktion 1:50 mit Bradford Reagenz
(Gesamtvolumen 50 µl) versetzt, um nachzuweisen in welchen Fraktionen das
Protein enthalten ist. Fraktionen, bei denen es zu einem Farbumschlag kommt,
werden vereinigt. 10 µl dieser Probe werden 1:1 mit SDS-Ladepuffer und 20 µl H2O
versetzt, 5 min bei 65°C inkubiert und im Kühlschrank gelagert.
80
Bradford-Test
Um die Konzentration des Gesamtproteins zu bestimmen wird ein Bradford-Test
durchgeführt. Hierfür wird eine Standardkurve mit 6 Punkten wie folgt pipettiert:
H2O
A [1 mg/ml]
B [500 µg/ml]
C [250 µg/ml]
D [125 µg/ml]
E [62,5 µg/ml]
F [0 µg/ml]
50 µl
50 µl
50 µl
50 µl
50 µl
100 µl
BSA 50 µl [2 mg/ml] 50 µl [1 mg/ml] 50 µl [500 µg/ml] 50 µl [250 µg/ml] 50 µl [125 µg/ml]
-
Zunächst wird H2O in alle beschriftete Eppendorfgefäße vorgelegt. Im Anschluss wird
in die erste Probe BSA [2 mg/ml] pipettiert (gut mischen). 50 µl dieser Lösung
werden in die nächste Verdünnungsstufe pipettiert, mit einer neuen Pipettenspitze
gemischt und wieder 50 µl in die nächste Verdünnungsstufe (usw.). 10 µl des
jeweiligen Standards werden in eine Mikrotiterplatte vorgelegt. Das Protein wird 1:10,
1:20 und 1:50 in H2O verdünnt (Gesamtvolumen ca. 50 µl). Im Anschluss werden
10 µl jeder Verdünnung in die Mikrotiterplatte pipettiert (alle Messpunkte werden in
Dreifachbestimmung durchgeführt). 300 µl Bradfordreagenz werden zügig zu allen
Proben zugegeben. Nach 10 min Inkubation bei RT werden die Proben bei 595 nm
im Fluostar vermessen.
7.2.2 Versuch 2B-2
Um überprüfen zu können, ob im Eluat MDL1(WT)-NBD bzw. MDL1(E599Q)-NBD
enthalten ist, wird eine SDS-PAGE durchgeführt.
Materialien

Acrylamid/Bisacrylamid (37,5:1)
Tris-HCl [1,5 M], SDS [0,4%] pH 8,8
Tris-HCl [0,5 M], SDS [0,4%] pH 6,8
APS (Ammonium peroxidsulfate) [10%]
TEMED [99%, p.a.]
BioRad-System SDS-Page Gießapparatur und Laufkammer
Proteinproben in SDS-Ladepuffer (5-fach)
SDS-PAGE Standard
Spannungsgeber
Laufpuffer für SDS-Page (10-fach)
Coomassie-Färbelösung
Coomassie-Entfärbelösung
81
Zusammensetzung der Puffer und Lösungen:

Laufpuffer für SDS-Page (10-fach)
250 mM
1,92 M
1%

Coomassie-Färbelösung
0,25% (w/v)
Tris-HCl
Glycin
Einstellung des pH-Wertes auf 8,3
SDS
Coomassie R Brilliant Blue
Coomassie-Entfärbelösung lösen
in
Coomassie-Entfärbelösung
50%
40%
10%
H2O
Ethanol
Eisessig
Protokoll
Gießen der 12% Gele
Zur Vorbereitung der SDS-Gele werden die Gießkammern entsprechend der
Herstellerangaben vorbereitet. Zuerst wird die 12%ige Lösung des Trenngels laut
Pipettierschema angesetzt, in die beiden Gießkammern bis zu 2/3 der Gesamthöhe
pipettiert und anschließend mit 1 ml Isopropanol überschichtet, um alle Luftblasen zu
entfernen und eine scharfe Abschlusskante des Trenngels zu erreichen. Nachdem
das Gel vollständig polymerisiert ist wird das Isopropanol entfernt, die vorbereitete
Sammelgellösung aufgetragen und die Probenkämme eingefügt. Die fertigen Gele
werden (einschlagen in feuchte Papiere und Plastikhülle) im Kühlschrank gelagert.
Pipettierschema für 2 SDS-Page Gele 7*8,5 cm des Biorad-Systems:
Acrylamid/Bisacrylamid
Tris/SDS pH 8,8 [1,5 M]
Tris/SDS pH 6,8 [0,5 M]
H2O
APS [10%]
TEMED
Sammelgel 5%
650 µl
1,1 ml
2,7 ml
30 µl
10 µl
Trenngel 12%
3,76 ml
2,82 ml
2,82 ml
40 µl
20 µl
82
SDS-PAGE
Acrylamid ist sehr giftig, auch in auspolymerisierten Gelen!
Handschuhe tragen!
Die Gelelektrophorese-Kammern werden nach Herstellerangabe zusammengebaut
und mit Laufpuffer (1-fach) gefüllt. Je 10 µl der denaturierten Proteinproben werden
anschließend in die SDS-PAGE Gele aufgetragen. Als Standard werden 5 µl eines
Proteinmarkers mit auf das Gel aufgetragen. Mit dessen Hilfe kann MDL1-NBD bzw.
MDL1(E599Q)-NBD aufgrund des Molekulargewichtes identifiziert werden. Das
Einlaufen der Lösung in das Sammelgel erfolgt bei 120 V. Nachdem die Lauffront die
Kante zwischen Sammel- und Trenngel erreicht hat erfolgt die eigentliche Trennung
der Proteine bei 200 V. Nach der Gelelektrophorese werden die Gele in Coomassie
für 20 min bei Raumtemperatur und leichtem Schütteln gefärbt. Anschließend wird
das nicht gebundene Coomassie durch Entfärbelösung weggewaschen, so dass die
Proteinbanden sichtbar werden.
Auswertung

Chromatogramm der Proteinreinigung
In welchen Fraktionen nach PD-10 Säule befindet sich das Protein?
Beschriftetes Coomassiegel
Auswertung des Bradfordassays (Standardkurve, Berechnung der
Proteinkonzentration in µM)
83
7.3 ATPase Aktivitätstest (Versuch 2C)
In diesem Versuchsteil soll die ATPase Aktivität der isolierten Nukleotidbindedomäne
von MDL1 bestimmt werden. Hierfür wird das Wildtyp-Protein verwendet. Ziel dieses
Versuchsteils ist es, die ATPase Aktivität in Abhängigkeit von der ATP Konzentration
zu messen, um somit den KM und kcat Wert bestimmen zu können.
Hierfür wird der von Morbach et al. 1993 eingeführte ATPase-Aktivitätstest
verwendet. Bei diesem Test wird das anorganische Phosphat, das bei der ATPHydrolyse entsteht, durch eine Reaktion mit Malachitgrün kolorimetrisch
nachgewiesen wird. Somit dient das frei gesetzte anorganische Phosphat in dieser
Methode als Maßeinheit für die Aktivität der MDL1(WT)-NBD.
Materialien

aufgereinigte MDL1(WT)-NBD
ATP [x mM Na-ATP pH 6,5; Konzentration s. Beschriftung]
MgCl2 [200 mM]
Tris-HCl pH 8,0 [1 M]
NaCl [2 M]
H2SO4 [20 mM]
K2HPO4 [0,4 mM]
PCR-Maschine
Malachitgrünlösung
Mikrotiterplatte (Flachboden)
Fluostar (Elisa Reader)
Zusammensetzung der Färbelösung
Malachitgrünlösung:
2 ml einer konzentrierten Malachitgrünlösung
(60 ml H2SO4 werden langsam in 300 ml H2O bei
Raumtemperatur
gemischt,
dann
werden
0,44 g
Malachitgrün zugegeben.)
+ 40 µl Tween 20 [11%]
+ 500 µl NH4Mo7 [7,5% in H2O]
Protokoll
Die Proteinendkonzentration im Assay soll 10 µM betragen. Die ATPase Aktivität der
MDL1(WT)-NBD wird bei folgenden ATP-Endkonzentrationen gemessen:
0,125, 0,25, 0,5, 1 und 2 mM ATP
Zunächst werden die ATP-Stammlösungen (je 50 µl) vorbereitet indem ATP und
MgCl2 in Standardpuffer (vergleiche Versuch 2A-2) gelöst werden.
84
ATP-Stammlösungen (je 50 µl):
Mix 1
Mix 2
Mix 3
Mix 4
Mix 5
(Endkonz. 0,125 mM ATP, 15 mM MgCl2)
(Endkonz. 1 mM ATP, 15 mM MgCl2)
(Endkonz. 2 mM ATP, 15 mM MgCl2)
0,625 mM ATP, 75 mM MgCl2
5 mM ATP, 75 mM MgCl2
10 mM ATP, 75 mM MgCl2
Da die ATPase Aktivität der NBD bei physiologischen Temperaturen untersucht
werden soll, wird der ATPase-Test für 10 min bei 25°C durchgeführt. Die exakte
Temperatur wird durch den Gebrauch einer PCR-Maschine gewährleistet. Für jede
ATP-Konzentration (0,125 – 2 mM) wird jeweils eine Dreifachbestimmung mit Protein
durchgeführt, um Fehler zu minimieren. Als Kontrolle (ATP-Autohydrolyse) wird eine
Zweifachbestimmung mit der jeweiligen ATP-Stammlösung ohne Protein gemessen,
d.h. pro ATP-Konzentration werden 5 PCR-Tubes vorbereitet, beschriftet und auf Eis
gestellt. In 15 PCR-Tubes (3 Tubes pro ATP-Konzentration) werden 20 µl einer
12,5 µM-Proteinlösung vorgelegt (nach ATP-Zugabe ist die Proteinendkonz. 10 µM).
In 10 PCR-Tubes (2 Tubes pro ATP-Konzentration) werden 20 µl Puffer pipettiert. Im
Anschluss werden 5 µl Mix 1 in die 5 zugehörigen PCR-Tubes gegeben. Die Proben
werden im 20 s Abstand in die PCR-Maschine gestellt, so dass die ATP-Hydrolyse
gestartet wird. Nach 10 min bei 25°C werden (jeweils 20 s versetzt) 175 µl
Stopplösung (20 mM H2SO4) in die 5 PCR-Tubes gegeben und diese auf Eis gestellt.
Der Versuch wird danach auch für die restlichen vier ATP-Konzentrationen (Mix 2-5)
durchgeführt.
Anschließend wird ein Standard pipettiert, um die später erhaltenen
Absorptionswerte in eine Phosphatkonzentration umzurechnen. Hierzu wird eine
0,4 mM K2HPO4-Stammlösung folgendermaßen in Stopplösung verdünnt:
Stoffmenge (nmol Pi)
0
0,4
0,8
2
3,2
4,8
6
Stopplösung (µl)
Phosphatlösung (µl)
200
199
198
195
192
188
185
0
1
2
5
8
12
15
Es werden jeweils 175 µl der Standards, der inkubierten Protein-ATP-Lösungen und
der ATP-Kontrollen auf eine beschriftete Mikrotiterplatte pipettiert. Anschließend
werden 50 µl einer frisch präparierten Malachitgrünlösung zugegeben und die Platte
für 15 min bei Raumtemperatur inkubiert. Es muss darauf geachtet werden, dass
möglichst wenige Luftblasen entstehen (Blasen mit Hilfe einer Nadel entfernen). Die
Mikrotiterplatte wird bei 620 nm im Elisareader (Fluostar) ausgelesen.
85
Auswertung

Standardkurve des ATPase Experimentes

Für die Auswertung müssen nmol anorganisches Phosphat pro nmol NBD und
pro min [nmol Pi * nmol NBD-1* min-1] gegen die ATP-Konzentration
[mM ATP] aufgetragen werden.

Bestimmung der Michaelis-Menten-Konstanten und des kcat
monoexponentieller Fit
v=
vmax [S ]
[S ] + K M
7.4 Dimerisierung der EQ-Mutante (Versuch 2D)
Um zu zeigen, dass im ATP-Hydrolysezyklus von Mdl1p eine Dimerisierung der
NBDs stattfindet, wird die EQ-Mutante (Mdl1p-E599Q-NBD) verwendet. Bei dieser
Mutante wurde ein katalytisch essentieller Glutaminsäurerest zu Glutamin mutiert, so
dass die ATP-Hydrolyse stark vermindert ist, obwohl ATP weiterhin gebunden wird.
Da das ATP nicht hydrolysiert werden kann, bleibt der sonst sehr kurzlebige
Dimerisierungszustand beider NBDs in dieser Mutante länger bestehen. In diesem
Versuchsteil werden wir den Dimerisierungszustand mittels analytischer Gelfiltration
sichtbar machen.
Materialien

Aufgereinigte Mdl1p(E599Q)-NBD [10-30 µM]
ATP-Stammlsg. [100 mM Na-ATP pH 6.5)
Superdex 200-Säule
Tris/HCl pH 8,0 [1 M]
NaCl-Stammlsg. [5 M]
Ettan-System
Protokoll
Für die Gelfiltration wird eine Gelfiltrationssäule (Superdex 200: Säulenvolumen 2,4
ml, Totvolumen 0,9 ml) mit filtriertem Standardpuffer (20 mM Tris, 100 mM NaCl) bei
einem Fluss von 50 µl/min am Ettan-System equilibriert. Anschließend werden
50
µl einer 10-30 µM EQ-Mutante injiziert. (Der Lauf dauert ca.1 Stunde.) Um den
dimeren Zustand zu induzieren, werden 50 µl der EQ-Mutante (10-30 µM) mit 2 mM
ATP für 5 Minuten auf Eis inkubiert und am Ettan-System injiziert.
86
7.5 Fluoreszenzlöschung zur Bestimmung der
Geschwindigkeitskonstante der MDL1(E599Q)-NBD
Dimerisierung (Versuch 2E)
Beim Fluoreszenz-Resonanz-Energie-Transfer (FRET) wird strahlungslos Energie
von einem angeregten Donormolekül auf einen Akzeptor übertragen. FRET ist ein
Prozess, der nicht über Emission und Reabsorption von Photonen verläuft, sondern
durch Dipol-Dipol-Interaktionen zwischen Donor und Akzeptor zustande kommt. Die
Rate des Energietransfers (kT) hängt vom Abstand (r) zwischen Donor und Akzeptor,
der spektralen Überlappung (J) des Emissionsspektrum des Donors mit dem
Absorptionsspektrum des Akzeptors und der relativen Orientierung (κ2) von Donorund Akzeptor-Dipol ab. Der Abstand zwischen Donor und Akzeptor, bei welchem die
Effizienz des Energietransfers bei 50% liegt, nennt man den Förster-Abstand (R0).
Der Förster-Abstand liegt typischer Weise zwischen 20 und 90 Å.
6
1 ⎛ R0 ⎞
kT ( r ) =
⎜ ⎟ = Transferrate
τD ⎝ r ⎠
τD = Zerfallsrate des Donors ohne Akzeptor
r = Abstand zwischen Donor und Akzeptor
[
]
1
R0 = 0,211 κ 2 n −4 QD J (λ ) 6
R0 = Förster-Abstand, Einheit Å
κ2 = Dipolorientierungsfaktor
n = Brechungsindex
QD = Fluoreszenz-Quantenausbeute des Donors ohne Akzeptor
J(λ) = Integral der spektralen Überlappung
Da der Energietransfer abstandsabhängig ist und über Entfernungen hinweg
stattfindet, die in der Dimension von biologischen Makromolekülen liegen, wird diese
Methode benutzt, um verschiedene biologische Phänomene zu beobachten, bei
denen sich der molekulare Abstand verändert. So kann FRET dazu benutzt werden,
die Struktur und Konformation von Proteinen zu analysieren, AssoziationsReaktionen zwischen Protein-Untereinheiten zu beobachten oder den Abstand
zwischen zwei Positionen in einem Makromolekül zu messen. Bei AssoziationsReaktionen dient das Auftreten von FRET als Nachweis dafür, dass Donor und
Akzeptor durch die Assoziationsreaktion in nahe Nachbarschaft gebracht worden
sind. Hierbei kommt es nicht darauf an, einen bestimmten Donor-Akzeptor-Abstand
zu messen. Ob es sich bei dem durchzuführenden Versuch um ein Homo-FRET
Experiment oder um eine andere Art der Fluoreszenzlöschung handelt, kann nicht
mit Bestimmtheit gesagt werden.
Um die Geschwindigkeitskonstante der MDL1(E599Q)-NBD-Dimerisierung zu
ermitteln, wird ein ATTO-Fluorophor mit Hilfe seiner drei NTA-Gruppen an den His-
87
Tag der MDL1(E599Q)-NBD gebunden. Nach Dimerisierung durch ATP Zugabe
werden
zeitaufgelöste
Fluoreszenzlöschungs-Experimente
am
Fluoreszenzspektrometer durchgeführt. In diesem Instrument wird mit einer XenonLampe Licht erzeugt, welches nach Durchtritt durch einen Monochromator mit der
gewünschten Wellenlänge auf die Messprobe fällt. Mit einer Blende kann die
Spaltbreite verändert werden, um damit die Fluoreszenzintensität zu variieren. Wir
verwenden eine Spaltbreite von 10 nm. Die Fluoreszenzmessung erfolgt im rechten
Winkel zum „anregenden Licht“ durch eine Absolutmessung in willkürlichen
Fluoreszenzeinheiten mit einem Photomultiplier. Da die ATP-Hydrolyse und somit
auch die Dissoziation der MDL1(E599Q)-NBD stark verlangsamt ist, wird diese NBDMutante für die Bestimmung des kon-Werts verwendet.
Materialien

aufgereinigte MDL1(E599Q)-NBD
ATTO-Fluorophor
ATP [x mM Na-ATP pH 6,5; Konzentration s. Beschriftung]
Quarzküvette
Fluoresenzspektrometer
Protokoll
ATTO-Markierung
In unserem Fall wird ein ATTO-Fluorophor für die Messungen verwendet. Dieses
besteht aus drei NTA-Gruppen, welche an den His-Tag der NBD binden, und einem
Fluorophor, das gleichzeitig als Donor und Akzeptor fungieren kann. Dies ist möglich,
da sich Absorptions- und Emissionsspektrum überlappen.
Zunächst wird die MDL1(E599Q)-NBD mit dem Fluorophor ATTO markiert. Dazu wird
die ATTO Konzentration gemessen (ε = 120000 cm-1 M-1).
Absorption = ε l c
ε = Absorptionskoeffizient [cm-1 M-1]
l = Küvettenlänge [cm]
c = Konzentration [M]
88
Die MDL1(E599Q)-NBD [10 µM] wird mit ATTO im molaren 1:1 Verhältnis gemischt
(Endvolumen ca. 50 µl) und für 10 – 15 min auf Eis inkubiert.
Die ATTO-markierte MDL1(E599Q)-NBD (Konzentrationen s. unten, Volumen 150 µl)
wird anschließend in eine Fluoreszenzküvette gegeben und die Fluoreszenzintensität
im Fluoreszenzspektrometer in einem zeitaufgelösten Experiment bei 20°C
aufgezeichnet. Nachdem sich das Signal stabilisiert hat, wird die Dimerisierung der
NBDs mit 2 mM ATP (Endkonz.) induziert und die zeitabhängige Abnahme der
Fluoreszenzintensität (Exitation 563 nm, Emission 595 nm) für circa 15 min
beobachtet. Es werden folgende Endkonzentrationen an MDL1(E599Q)-NBD
getestet:
0,175 µM
0,250 µM
0,375 µM
0,500 µM
0,750 µM
Das Fluoreszenzsignal nach ATP-Zugabe (t0) wird für eine graphische Darstellung
normalisiert, so dass es zum Zeitpunkt t0 1 (oder 100%) entspricht.
Fnorm =
F(t)
F max
Das normalisierte Fluoreszenzsignal wird im Anschluss auf die MDL1(E599Q)-NBDMonomerkonzentration umgerechnet. Die erhaltenen Werte werden reziprok gegen
die Zeit aufgetragen und mittels linearer Regression gefittet. Die Steigung der
erhaltenen Gerade ergibt die Geschwindigkeitskonstante kon [min-1 * µM-1] für die
Dimerisierung der MDL1(E599Q)-NBDs.
Auswertung

Spektren der Dimerisierung

Berechnung der kon [min-1 * µM-1]