Lactose tolerance

Transcription

Lactose tolerance
STUDENT’S GUIDE
Case Study
Lactose tolerance
Version 1.1
Steve Cross,
Bronwyn Terrill
and colleagues
Wellcome Trust Sanger Institute
Hinxton
Case Stu
Case Stud
lactose tolerance
Lactose tolerance
Most humans lose the ability to digest lactose and become lactose
intolerant by the time they are about ten years old. In some populations,
however, people still produce lactase, the enzyme that digests lactose,
in adulthood. The production of lactase gives an evolutionary advantage
to people who have access to cow’s milk.
At least two genetic changes that allow the lactase gene to be
permanently switched on occurred independently in geographically
isolated populations. Which changes in the human genome allow some
people to digest lactose in adulthood? How often and when have such
changes occurred in human history?
Lactase non-persistence (lactose
intolerance) varies worldwide.
Outline of the activity
91–100%
31–40%
The majority of people living today are unable to digest lactose in adulthood.
In this activity you’ll analyse DNA data from different human populations
to work out which variations in the human genome have enabled some
adult humans to consume dairy products without problems.
81–90%
21–30%
71–80%
11–20%
61–70%
1–10%
51–60%
0%
41–50%
No data
You will also discover which variants are associated with lactose tolerance
(or lactase persistence) in different populations. A statistical test (Chisquared) will be used to work out whether specific genetic changes have
significant effects on lactase persistence.
Copyright © Steve Cross, Bronwyn Terrill et al 2011
2
www.dnadarwin.org
lactose tolerance
Lactose tolerance and intolerance
IMAGE FROM: Wellcome Images.
Lactose, the sugar found in dairy products and milk, is metabolised by the
enzyme lactase (full name: lactase-phlorizin hydrolase or LPH). Lactase
splits apart the two sugars (galactose and glucose) that make up lactose. The
gene for lactase is known as LCT, and it is mostly expressed in cells in the
epithelium of the small intestine. Although nearly everyone can produce
the enzyme at birth, the majority of people outside Northern Europe lose
the ability to produce a lot of it at some point during the first years of life.
This can be referred to as lactase nonpersistance, adult hypolactasia or,
more commonly, lactose intolerance.
When a lactose-intolerant adult ingests a large amount of lactose, it passes
into their gut without being broken down. The bacteria in the gut then
react to the raised lactose levels by shifting their metabolisms to run on
this sugar. Unfortunately this process produces a lot of gas, which in turn
causes cramping, pain, flatulence and diarrhoea.
Lactose intolerance actually comes in three forms:
1. Primary lactose intolerance. As children are weaned they lose the
ability to metabolise milk, so they become lactose-intolerant adults.
2. Secondary lactose intolerance. This occurs in adults from lactose
tolerant populations who, due to damage caused by diseases of the
stomach or intestines, can no longer properly produce lactase.
3. Congenital lactose intolerance. This is a condition where children are
born unable to metabolise lactose. Before the 20th century it would
normally have resulted in death, but it can now be diagnosed and
treated by moving the infant to a lactose-free diet.
The first medically-documented cases of lactose intolerance — or ‘deficiency’
as it was known at the time — were considered to be the ‘abnormal’ result
of a single-gene, autosomal recessive condition. However, by the 1970s, it
was recognised that most people in the world had this ‘condition’. In Asia,
Africa and the Americas the majority of the native adult population do not
produce lactase as adults. Therefore, studies began to consider how lactose
tolerance or ‘lactase persistence’ ­— being the unusual condition — had
occurred and how it was inherited. Populations that do produce lactase
into adulthood tend to be those with a long history of drinking fresh milk,
including those in Northern Europe and some isolated parts of Africa.
Lactase persistence behaves as a dominant trait because half levels of
lactase activity still enable a significant amount of lactose to be digested.
LACTASE
Lactose
Copyright © Steve Cross, Bronwyn Terrill et al 2011
Glucose
3
Galactose
www.dnadarwin.org
lactose tolerance
Regulating genes
The difference between most lactose-intolerant and lactose-tolerant adults
isn’t that they have different genes — it’s that their genes are controlled
differently. Lactase-persistent people have a lactase gene that is not
switched off after childhood.
The body has three main ways of regulating the effect of a gene:
1. Regulation at the level of gene expression: turning the gene on and
off or raising or lowering the amount of gene product;
2. Regulation at the level of RNA: controlling levels of protein production
by modifying or removing RNA products so that they can’t be translated
into proteins;
3. Regulation at the level of protein: modifying the protein to make more
or less active versions, or degrading the protein to limit its activity.
In Exercise 1, you’ll be looking at the first type of regulation. Studies of
large families have shown there is no difference between the LCT genes
of people with or without lactase persistence. However, the same studies
focused their attention on a 47 kilobase (a kilobase, or kb, is a thousand
bases) region containing a possible control sequence for lactase production
very close to the LCT gene on chromosome 2.
In Exercise 2, you will study variants, known as Single Nucleotide
Polymorphisms (or SNPs, pronounced ‘Snips’) from this area (called 2q21)
on chromosome 2.
Lactose-reduced milk is produced
by treating milk with lactase.
In this product, an immobilised
enzyme has been used to treat the
milk before packing, hence the label
‘no added enzymes’.
Copyright © Steve Cross, Bronwyn Terrill et al 2011
4
www.dnadarwin.org
lactose tolerance
Exercise 1
Genotypes associated with lactose
tolerance and intolerance
A key research paper, published in 2002, identified genetic variants in
northern European populations associated with lactase persistence and
intolerance (Enattah et al, 2002). The extract from the original paper below
reports the results of two variants: one at –13910 bases ‘upstream’ of the LCT
gene and one at –22018 bases ‘upstream’ of the LCT gene.
We analyzed... variants in Finnish DNA samples isolated from a total of
196 biopsy specimens with biochemically determined disaccharidase (LPH)
[lactase] activity.
All 59 samples showing primary lactase deficiency were homozygous with
respect to the C allele of C/T–13910, 6 were heterozygous with respect to the
G/A–22018 variant and the remaining 53 were homozygous with respect to
the G allele.
Of the 137 cases showing lactase persistence, none were homozygous with
respect to the alleles C and G, at C/T–13910 and G/A–22018 respectively; 74
were homozygous for alleles T and A, with 63 being heterozygous at both
positions.
Write the data described in this paragraph in the tables below:
C/T –13910
Condition vs genotypes
CC
CT
TT
Totals
GG
GA
AA
Totals
Lactase deficiency / lactose
intolerant
Lactase persistence
Total samples
G/A –22018
Condition vs genotypes
Lactase deficiency / lactose
intolerant
Lactase persistence
Total samples
Copyright © Steve Cross, Bronwyn Terrill et al 2011
5
www.dnadarwin.org
lactose tolerance
Questions
a. Which of the positions (C/T–13910 or G/A–22018) is 100% associated
with lactase persistence?
b. From this data at this position, which genotype(s) are associated with
lactose tolerance/lactase persistence?
c. From the data at this position, which genotype(s) are associated with
lactose intolerance/lactase non-persistence?
d. How associated is the other variant (C/T–13910 or G/A–22018) with
lactase persistence? (number of non-consistent findings divided by the
total number of samples x 100, subtracted from 100%).
Copyright © Steve Cross, Bronwyn Terrill et al 2011
6
www.dnadarwin.org
lactose tolerance
Exercise 2
Other lactose persistence mutations
Although the 100% associated variant you identified in Exercise 1 is thought
to cause lactase persistence in Europeans, this variant is only present at
low levels in other lactase-persistent populations.
A Nature Genetics paper (Tishkoff et al, 2007) sought DNA variations that
could help to explain lactase persistence in East African populations. There,
lactase persistence was most common in Sudanese populations which
raised livestock, and least common in Tanzanian hunter-gatherers. The
researchers sequenced 3.3 kb and 1.8 kb in the same area on chromosome 2
(2q21) that yielded the European variant.
Below are the results from three different African populations at two
different positions in the genome: at 5 bases and 100 bases further
upstream from the LCT gene than the European variant. In this study, a
lactose tolerance blood test (a method of indirectly measuring levels of LCT
actiivty) was used to determine whether a person was LCT persistent, LCT
intermediate or LCT non-persistent.
The Chi-Squared test
The Chi-Squared test can be used here to test the following Null hypothesis.
Null hypothesis: There is no significant effect of genotype (at this position) on
lactase persistence.
Alternative hypothesis: The genotype (at this position) has a significant effect
on lactase persistence.
Kenyans
Tanzanians
Position
Phenotype observed
CC
CG
GG
–14010
LCT persistence
12
48
38
LCT non-persistence
2
10
40
LCT intermediate
3
12
25
Phenotype observed
CC
CG
GG
Position
–14010
Afro-Asiatic Kenyans
LCT persistence
17
44
36
LCT non-persistence
5
18
57
LCT intermediate
5
21
28
GG
GT
TT
Position
Phenotype observed
–13951
LCT persistence
2
7
25
LCT non-persistence
0
1
12
LCT intermediate
0
0
14
Copyright © Steve Cross, Bronwyn Terrill et al 2011
7
www.dnadarwin.org
lactose tolerance
Population name: Kenyan
Category
Observed (O)
Expected* (E)
(see below)
Position of allele: –14010
O–E
(O–E)2
(O–E)2
E
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Category
Size of population
(ignoring alleles)
Prevalence of allele
Expected
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
Copyright © Steve Cross, Bronwyn Terrill et al 2011
8
www.dnadarwin.org
lactose tolerance
Population name: Tanzanian
Category
Observed (O)
Expected* (E)
(see below)
Position of allele: –14010
O–E
(O–E)2
(O–E)2
E
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Category
Size of population
(ignoring alleles)
Prevalence of allele
Expected
Genotype 1 (CC)
LCT persistent
Genotype 2 (CG)
LCT persistent
Genotype 3 (GG)
LCT persistent
Genotype 1 (CC)
LCT non-persistent
Genotype 2 (CG)
LCT non-persistent
Genotype 3 (GG)
LCT non-persistent
Genotype 1 (CC)
Intermediate
Genotype 2 (CG)
Intermediate
Genotype 3 (GG)
Intermediate
Copyright © Steve Cross, Bronwyn Terrill et al 2011
9
www.dnadarwin.org
lactose tolerance
Population name: Afro-Asiatic Kenyans
Category
Observed (O)
Expected* (E)
(see below)
Position of allele: –13915
O–E
(O–E)2
(O–E)2
E
Genotype 1 (GG)
LCT persistent
Genotype 2 (GT)
LCT persistent
Genotype 3 (TT)
LCT persistent
Genotype 1 (GG)
LCT non-persistent
Genotype 2 (GT)
LCT non-persistent
Genotype 3 (TT)
LCT non-persistent
Genotype 1 (GG)
Intermediate
Genotype 2 (GT)
Intermediate
Genotype 3 (TT)
Intermediate
TOTALS
* To calculate the Expected values, you will need to count the number of number of samples with a single genotype
found across the entire population. This figure, divided by the total sample size will give you the prevalence of the
genotype. Multiplying the prevalence of each form of the allele by the number of samples in each condition group,
will give you the Expected values.
Category
Size of population
(ignoring alleles)
Prevalence of allele
Expected
Genotype 1 (GG)
LCT persistent
Genotype 2 (GT)
LCT persistent
Genotype 3 (TT)
LCT persistent
Genotype 1 (GG)
LCT non-persistent
Genotype 2 (GT)
LCT non-persistent
Genotype 3 (TT)
LCT non-persistent
Genotype 1 (GG)
Intermediate
Genotype 2 (GT)
Intermediate
Genotype 3 (TT)
Intermediate
Copyright © Steve Cross, Bronwyn Terrill et al 2011
10
www.dnadarwin.org
lactose tolerance
(O–E)2
The total of E across all four groups is the chi-squared value ( χ 2).
Using this information, calculate the probability of the Null hypothesis
being true using this lookup table, where p is the probability.
p
0.9
0.75
0.25
0.1
0.05
0.025
0.01
0.005
χ 2
1.06
1.92
5.39
7.78
9.49
11.14
13.27
14.86
Questions
For each of the three populations:
e.
f.
g.
h.
i.
j.
What is the χ 2 value?
What is the probablity that the Null hypothesis is true?
Does this mean that the result is statistically significant?
Were any of the results for these positions statistically significant?
If so, which one(s)?
If all of the lactase-persistent individuals in the world have the same
genetic change that allows them to drink milk, there is a possibility
that the change has only happened once, and that all these people share
a common, milk-drinking ancestor. If, however, there are a number of
changes that give rise to milk-drinking, it is likely that it has evolved
more than once. What can you conclude from these results about the
origins of lactase persistence?
What other data would you need to explore lactase persistence further?
Copyright © Steve Cross, Bronwyn Terrill et al 2011
11
www.dnadarwin.org