transcription

Download Report

Transcript transcription

Transcription in Prokaryotes
by Jean-Pierre Herveg, Etienne De Plaen and a lot of friends at
the Brussels Branch of the Ludwig Institute for Cancer research (Licr) and the Christian de Duve*
Institute for cellular Patholgy (ICP).
April 2006
Université Catholique de Louvain
Avenue E. Mounier, 1200 Brussels (Belgium)
Questions
what is the meaning of the sentence :” The sequence of a prokaryotic protein
is colinear with the translated mRNA
2. which sugar replaces desoxyribose in ARN ?
3. which base anneals to uracil ?
4. what’s a polysome ?
5. what’s an operon, an operator ? Describe !
6. what is X-gal.
7. what is lacZ?
8. what is IPTG?
9. describe a promoter.
10. what is the role of sigma in prokaryotes
11. why are there a pUR278, pUR289 and a PUR288 to make fusion proteins
12. explain the control of protein production by pET-3a
1
Transcription in prokaryotes
The sequence of a prokaryotic protein is colinear with the translated mRNA;
that is, the transcript of the gene is the molecule that is translated into the polypeptide.
In prokaryotes, ther is no nucleus and then replication, transcription and translation take place in
The same compartment.
Reverse transcription is the opposite
The enzymes involved are RNA polymerase for transcription and reverse transcriptase for the opposite.
-------------------------question
what is the meaning of the sentence :” The sequence of a prokaryotic protein is colinear with
the translated mRNA
Ribose:
Is a sugar containing 5 carbon atoms and a hydroxyle in the position 2’
Uracile
Is a RNA base which is represented by a U. U anneals with A.
-------------------------------------La ribosa
Es un azúcar de 5 carbonos. Es un componente estructural de la estructura del ARN, como el ATP, GTP, CTP y TTP.
Uracilo
Es una de las 4 bases del ARN. Se representa con la letra U. En el ARN, El uracilo reemplaza a la timina.
El uracilo se aparea con la adenina.
--------------------------------------Cuestiones para el examen
1. ¿ Cuál es el azúcar y la basa específicos del ARN
2. ¿ Con cuál base se aparea el uralcilo?
There are 5 types of RNA, each encoded by its own type of gene:
mRNA -(80 % in eucaryoyes) Messenger RNA: Encodes the amino acid sequence of a polypeptide.
tRNA - (15 % in eucaryoyes) Transfer RNA: Brings amino acids to ribosomes during translation.
mRNA - (5 % in eucaryoyes) Ribosomal RNA: with ribosomal proteins, makes up the ribosomes.
rRNA possess an enzymic activity. The ribosome is the organelles that translate the mRNA into polypeptide.
snRNA - Small nuclear RNA: With proteins, forms complexes that are used in RNA processing
siRNA and miRNA: are inhibitor od gene expression
Transcription and translation take place in the same compartment.
In prokaryotes, there is simultaneous transcription and translation.
Here, in E. Coli you can see a long fiber running from top to bottom (green arrow ). This fiber is a
Segment of the E. coli chromosome (dsDNA).
This picture comes from:
http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/M/Miller_Hamkalo.html
Extending from the DNA are polysomes (red arrow),
A polysome is a backbone of messenger RNA (mRNA) to which ribosomes are
attached.
On the left picture, you can see that the size of polysomes increases from top
to bottom.
Each polysome is attached to the DNA fiber by a complex of proteins
that includes a molecule of RNA polymerase.
Thus
the DNA is transcribed by RNA polymerase molecules moving
from top to bottom, and
the growing mRNA molecules are translated by ribosomes
moving in a proximal -> distal direction.
In E. coli, then, and probably in all prokaryotes,
the transcription of DNA into mRNA and the translation of mRNA into
polypeptides (not visible here) are closely coordinated
in both time and space.
(In eukaryotes, in contrast, while all transcription takes place in the
nucleus, most (but not all) translation of mRNA occurs later in the cytosol).
Defintion:
An operon is a group of genes including a common promoter and an operator, which control the
transcription of these genes to produce a common messenger RNA (mRNA).
A promoter is a DNA sequence that enables a gene to be transcribed.
The promoter is recognized by RNA polymerase, which then initiates transcription.
An operator is a segment of DNA that recognized by a regulatory protein (inducer or or repressor).
In negative inducible operons (the lac operon), a regulatory repressor protein is normally
bound to the operator and it prevents the transcription of the genes on the operon
If an inducer molecule is present, it binds to repressor and changes its conformation
so that it is unable to bind to the operator.
This allows for the transcription of the genes controlled by the operator.
In negative repressible operons, (the trp operon) transcription of the genes on the operon
normally takes place. Repressor proteins are produced by a regulator gene but they are
unable to bind to the operator in their normal conformation. However certain molecules
called corepressors can bind to the repressor protein and change its conformation so that
it can bind to the operator. The activated repressor protein binds to the operator and
prevents transcription.
Operons can also be positively controlled. With positive control, an activator protein
stimulates transcription by binding to DNA (usually at a site other than the operator).
a negative inducible operons:
the lac operon consists of a regulatory gene and 3 structural genes (z, y, and a).
The i gene codes for a repressor. The z gene codes for ß-galactosidase, which hydrolyses lactose into
galactose and glucose. The y gene codes for permease, which increases permeability to ß-galactosides.
The a gene encodes a transacetylase.
In the presence of an inducer of the lac operon, the repressor protein binds the inducer. RNA polymerase is
thus able to bind at the promoter region, and transcription of the operon ensues.
Lactose analogues
1. IPTG is an inducer (acts like lactose, but is not metabolized by
ß-galactosidase).
A number of lactose derivatives or analogs have been described that are useful for work with the lac operon.
These compounds are mainly substituted galactosides, where the glucose moiety of lactose is replaced
by another chemical group.
Isopropyl-β-D-thio-galactoside (IPTG) is frequently used as an inducer of the lac operon
for physiological work. IPTG binds to repressor and inactivates it, but is not a substrate for β-galactosidase.
One advantage of IPTG for in vivo studies is that it cannot be metabolized by E. coli, therefore the growth
rate of cells (usually maintained with glycerol as the carbon and energy source), is not a variable
in the experiment.
In addition, IPTG is transported efficiently independent of whether the lacY gene
is functional. And since cells don't metabolize IPTG, its concentration doesn't change during the course
of an experiment.
Lactose analogues
1. X-gal (is metabolizide and gives a blue reaction product).
X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactoside) turns colonies which produce
β-galactosidase blue. IPTG is used in what is called the white andblue screening.
http://en.wikipedia.org/wiki/Lac_operon
a negative repressible operon:
The trp operon encodes the genes for the synthesis of tryptophan.
This cluster of genes is regulated by a repressor that could bind to the operator sequence.
The activity of the trp repressor for binding the operator region is enhanced when it binds tryptophan:
tryptophan is known as a corepressor.
Since the activity of the trp repressor is enhanced in the presence of tryptophan
the rate of expression of the trp operon is graded in response to the level of tryptophan in the cell (attenuation
Element).
--------------------------------Cuestiones para el examen
1. Definir la palabra opéron en las bacterias
2. ¿ Que es que X-gal.
3. ¿ Que es que lacZ?
4. ¿ Que es que IPTG?
5. describir el opéron lac, trp…
Fine regulation of the tryptophane operon.
Expression of the trp operon is also regulated by attenuation (attenuator element).
The attenuator region is involved in controlling transcription from the operon after RNA polymerase
has initiated synthesis.
---------------------------------------------to stall: calar (autocalar)
Beginning of transcription
The RNA pol is a tetrameric enzyme. The factor sigma helps to recognize the promoters.
The genetic information contained in DNA is transcribed to RNA by a transcription complex
Including DNA-directed RNA polymerase (RNA Pol). A bacterial transcription initiation complex
comprised of the core RNA Pol enzyme and a (sigma) factor binds to a promoter and,
upon initiation of RNA synthesis, releases the s factor.
http://dnaresearch.oxfordjournals.org/cgi/content/short/dsi016v1
Institution: Bolivia: PNAS Sponsored
Published online before print March 27, 2006,
PNAS | April 4, 2006 | vol. 103 | no. 14 | 5332-5337
Biological Science / Biochemistry
Insights into transcriptional regulation and {sigma} competition from an
equilibrium model of RNA
polymerase binding to DNA
Irina L. Grigorova, Naum J. Phleger, Vivek K. Mutalik, and Carol A. Gros
In bacteria transcription is initiated by RNA polymerase (RNAP) holoenzyme (Es) which is formed
when core RNAP (E) binds the transcription initiation factor s (1). Es initially binds to promoter sites
in a closed complex, which then transits to an open complex, competent for transcription.
The number of intermediates between the closed and open complex is variable and promoter-dependent;
each step may be subject to regulation in vivo (2, 3). At least for some promoters, Es binding to promoters
is thought to be reversible on the time scale of transcription initiation in vivo (3); reversibility has
also been demonstrated in vitro for several promoters (3–6). Even binding to the strong lac UV5
promoter is reversible in vitro when tested under conditions that approximate the in vivo situation (6).
Recruitment of Es to promoters in vivo is thought to depend on the intrinsic binding affinity of the
promoter and is modulated by repressors that prevent and activators that stabilize interactions between
Es and the promoter (3). Based on in vitro studies of the mechanism of activator function, it is believed
that promoters that bind Es weakly require activators to recruit Es In addition, cells contain multiple s,
which direct E to various sets of promoters specific to the s factors (1). These s s are believed to compete
with each other for binding to E (7–10). By changing the relative levels of the {sigma}s, Escherichia coli is
thought to coordinate its transcriptional program with growth conditions (11–13). This view is based
on observations indicating that (i) overexpressing one s decreases expression of genes controlled
by another s (7), (ii) mutationally altering binding constants of one s for E, alters expression by another s(14),
and (iii) physiological effectors such as ppGpp may act by altering relative binding of s s to E (8–10).
In the present work, we use an equilibrium model of RNAP binding to DNA to explore in vivo scenarios
that permit transcription regulation by activator recruitment of RNAP and Es competition.
The promoter is recognized by RNA polymerase and the s factor. RNA pol then
initiates transcription. When the transcript is around 10 nt long, the s factor
quits the complex
-----------------------------Los promotores bacterianos poseen cuatro características comunes:
(1)
lel punto de comienzo de la transcripción, denominado +1,
(2)
la secuencia de la posición –10 llamada caja de Pribnow (TATAAT),
(3)
la secuencia –35 (TGTTGACA) y
(4)
la distancia entre las secuencias –10 y –35. E punto donde se inicia la transcripción es genralmente
una purina (A ou G).
the end of transcription: termination
Two termination mechanisms are well known:
Rho-independent termination involves terminator sequences within the
RNA that signal the RNA polymerase to stop. The terminator sequence is usually a palindromic sequence
that forms a stem-loop hairpin structure that leads to the dissociation of the RNA Pol from the DNA template.
Rho-dependent termination uses a termination factor called r (rho) factor to stop RNA synthesis at specific sites.
This protein binds and runs along the mRNA towards the RNA Pol. When the r (rho) factor reaches the RNA Pol
it causes RNA Pol to dissociate from the DNA, terminating transcription.
Other termination mechanisms include where RNA Pol comes across a region with repetitious thymidine
residues in the DNA template. or where a GC-rich inverted repeat followed by 4 A residues
the inverted repeat forms a stable stem loop structure in the Rna, which causes the RNA to dissociate from
the DNA template.
Translation (genetic code)
Table of the codons and amino acids
(one letter code)
codon iniciador AUG
Stop: TAA, TAG, TGA
uuu
uuc
uua
uug
F
F
L
L
ucu
ucc
uca
ucg
S
S
S
S
uau
uac
uaa
uag
Y
Y
stop
stop
ugu
ugc
tga
ugg
C
C
stop
W
cuu
cuc
cua
cug
L
L
L
L
ccu
ccc
cca
ccg
P
P
P
P
cau
cac
caa
cag
H
H
Q
Q
cgu
cgc
cga
cgg
R
R
R
R
auu
auc
aua
atg
I
I
I
M
acu
acc
aca
acg
T
T
T
T
aau
aac
aaa
aag
N
N
K
K
agu
agc
aga
agg
S
R
R
R
guu
guc
gua
gug
V
V
V
V
gcu
gcc
gca
gcg
A
A
A
A
gau
gac
gaa
gag
D
D
E
E
ggu
ggc
gga
ggg
G
G
G
G
Acido Aspartico : Asp = D
Acido glutamico : Glu = E
Alanina : Ala = A
Arginina : Arg = R
Asparagina : Asp = N
Cisteina : Cys = C
fenilalana : Phe = F
Glicina : Gly = G
Glutamina : Gln = Q
Histidina : His = H
isoleucina :Ile = I
Leucina :Leu = L
Lisina : Lys = K
Metioniona : Met = M
Prolina : Pro = P
Serina : Ser = S
Tirosina : Tyr = Y
Treonina : Thr = T
Triptofana : Trp = W
Valina : Val = V
Free tRNAs are loaded with their specific amino-acyl by an
Aminoacyl-tRNA transferase.
They are now ready to participate to translation.
Free tRNA, are made of a short RNA sequence, in which they are anticodons
tRNA are made of a short RNA sequence, in which they are anticodons they can
Be loaded with an aùino-acyl.
In the ribosome (large subunit), two sites A and P can be loaded each with
aminoacyl-tRNA.
A peptide bond is the made between thes two aminoacyls.
Then, the tRNA molecule is expelled see the black arrow below.
It reached the place where it could be loaded with another aminoacyl.
The ribosome moves (scarlet arrows, the tRNA in the second site moves in the first one
And another aminoacyl-tRNA takes its place.
Comparaison between prokaryotes and eukaryotes:
fused proteins
pUR278
pMAL
non fused proteins
pET-3a
---------------------------question
explain the control of protein production by pET-3a
pQE (Quiagen), HIS tag
Exercises: recombinant insulin and growth hormone
Search for the insulin gene:
Database: Gene (1)
For: INS or INSULIN, both are okay.
The click on Go
You should first read the Summary…. And remember the significance of signal peptide, post
Translationnaly, disulfide bonds, glucose receptor and glucose uptake.
What’s a minus strand ?
Does this gene have an intron ?
On what chromosome stand the gene ?
What is the meaning of p in 11p5.5 ?
Now, click on NM_000207.1 (DEFINITION homo sapiens insulin (INS), mRNA.
recombinant human insulin rh-insulin
LOCUS
NM_000207
450 bp mRNA
DEFINITION Homo sapiens insulin (INS), mRNA.
1
61
121
181
241
301
361
421
gctgcatcag
gcctcctgcc
tgaaccaaca
gaggcttctt
tggagctggg
tgcagaagcg
agaactactg
agagagatgg
aagaggccat
cctgctggcg
cctgtgcggc
ctacacaccc
cgggggccct
tggcattgtg
caactagacg
aataaagccc
caagcacatc
ctgctggccc
tcacacctgg
aagacccgcc
ggtgcaggca
gaacaatgct
cagcccgcag
ttgaaccagc
linear PRI 29-JAN-2006
actgtccttc
tctggggacc
tggaagctct
gggaggcaga
gcctgcagcc
gtaccagcat
gcagcccccc
tgccatggcc
tgacccagcc
ctacctagtg
ggacctgcag
cttggccctg
ctgctccctc
acccgccgcc
ctgtggatgc
gcagcctttg
tgcggggaac
gtggggcagg
gaggggtccc
taccagctgg
tcctgcaccg
/translation="MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCERGFFYTPKTRREAEDLQVG
QVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN"
proprotein
45..374 (329 = 100%)
sig_peptide
45..116 (71/329= 0.215)
atggcc ctgtggatgc gcctcctgcc cctgctggcg ctgctggccc tctggggacc tgacccagcc gcagcc
mat_peptide 117..374 (
/product="proinsulin"
mat_peptide 117..206 (89/329= 0,27)
tttgtgaaccaaca cctgtgcggc tcacacctgg tggaagctct ctacctagtg tgcggggaac gaggcttctt
ctacacaccc aagac
/product="proinsulin peptide B"
mat_peptide 312..374 (62/329= 0.188)
ggcattgtg gaacaatgct gtaccagcat ctgctccctc taccagctgg agaactactg caac
/product="proinsulin peptide A”
In between 0.327 %
Summary:
After removal of the precursor signal peptide, proinsulin is post-translationally cleaved into two chains
(peptide A and peptide B) that are covalently linked via two disulfide bonds.
Binding of this mature form of insulin to the insulin receptor (INSR) stimulates glucose uptake.
Official Symbol: INS and Name: insulin [Homo sapiens]
Other Designations: proinsulin
Chromosome: 11; Location: 11p15.5
GeneID: 3630
The mRNA is translated in a protein which have four domains: The hydrophobic siganl peptide is cleaved in the ER
f the eukaryotic cell. The C peptide is cleaved in the blood by proteases.
At the end of this process, we are left with this structure:
The cDNA coding proinsulin is used to make protein fusion with B-gal. An atg is added at the beginnning.
The fusion protein is purified. The ß-gal beginning ic cleaved by cyanogen bromide. The c peptide is cleaved by
A protease.
121
181
241
301
361
atg +
tttg
tgaaccaaca
gaggcttctt
tggagctggg
tgcagaagcg
agaactactg
cctgtgcggc
ctacacaccc
cgggggccct
tggcattgtg
caactag
tcacacctgg
aagacccgcc
ggtgcaggca
gaacaatgct
tggaagctct
gggaggcaga
gcctgcagcc
gtaccagcat
ctacctagtg
ggacctgcag
cttggccctg
ctgctccctc
tgcggggaac
gtggggcagg
gaggggtccc
taccagctgg
human growth hormone (hGH)
Official Symbol: GH1 and Name: growth hormone 1 [Homo sapiens]
Other Aliases: GH, GH-N, GHN, hGH-N
Other Designations: pituitary growth hormone
Chromosome: 17; Location: 17q24.2
GeneID: 2688
The protein encoded by this gene is a member of the somatotropin/prolactin family of hormones
which play an important role in growth control. The gene, along with four other related genes,
is located at the growth hormone locus on chromosome 17 where they are interspersed in the
same transcriptional orientation; an arrangement which is thought to have evolved by a series
of gene duplications. The five genes share a remarkably high degree of sequence identity.
Alternative splicing generates additional isoforms of each of the five growth hormones,
leading to further diversity and potential for specialization. This particular family member is expressed
in the pituitary but not in placental tissue as is the case for the other four genes in the growth hormone
locus. Mutations in or deletions of the gene lead to growth hormone deficiency and short stature.
LOCUS
NM_022562
376 bp mRNA linear PRI 26-FEB-2006
DEFINITION Homo sapiens growth hormone 1 (GH1), transcript variant 5, mRNA.
ACCESSION NM_022562
VERSION NM_022562.2 GI:20809252
1
61
121
181
241
301
361
aggatcccaa
caatggctac
tacagcaagt
ctctactgct
cgctctgtgg
ccagtgcctc
aaattaagtt
ggcccaactc
agaggctgga
tcgacacaaa
tcaggaagga
agggcagctg
tcctggccct
gcatca
cccgaaccac
agatggcagc
ctcacacaac
catggacaag
tggcttctag
ggaagttgcc
tcagggtcct
ccccggactg
gatgacgcac
gtcgagacat
ctgcccgggt
actccagtgc
gtggacagct
ggcagatctt
tactcaagaa
tcctgcgcat
ggcatccctg
ccaccagcct
FEATURES
source
gene
Location/Qualifiers
1..376 /organism="Homo sapiens"
1..376
/gene="GH1"
/note="synonyms: GH, GHN, GH-N, hGH-N"
CDS
63..155
/gene="GH1"
/translation="MATEAGRWQPPDWADLQADLQQVRHKLTQR"
misc_feature 72^73
/gene="GH1"
/note="Region: location of alternate exons 2, 3 and 4"
STS
197..336 (STS= sequence tagged sites)
see http://cfern.bio.utk.edu/journal/Hamilton1998/THESIS_A1.htm
polyA_signal 357..362
polyA_site
376
atggcta agaggctgga agatggcagc ccccggactg ggcagatctt caagcagacc
tacagcaagt tcgacacaaa ctcacacaac gatga
cacctagctg
caagcagacc
ctacgggctg
cgtgcagtgc
tgacccctcc
tgtcctaata