Transcript Document
Global Analysis of Protein Localization in Budding Yeast
Won-Ki Huh, James V. Falvo, Luke C. Gerke, Adam S. Carroll, Russell W. Howson, Jonathan S. Weissman & Erin K. O’Shea
Presented by Ji-eun Lee and Teresa Tan
Microbiology 475
December 3, 2003
Article Question:
You are a researcher trying to determine the subcellular
localization of a protein in yeast but, much to your dismay,
you find that you are not able to assign the protein to a
localization category by GFP microscopy alone. What
experiment could you perform to determine which subcellular
compartment the protein localizes to in yeast?
Answer:
Use co-localization (with proteins of known localization
labeled with a different marker).
Background
GFP (green fluorescent protein) found in
Aequorea victoria, first discovered in 1960’s
•Mechanism: spontaneous fluorescence,
“can” structure, excitation peak at 508 nm
•Uses: in-vivo marker of protein
localization
•Advantages: patterns of protein
expression intact, no external co-factors,
may be monitored in living cells
•Disadvantages: mislocalization through
C-terminal fusion
ORF (open reading frame) may potentially encode part or all of
a protein
Fluorescence Microscopy Uses: detect structures, molecules,
proteins within the cell; fluorescent molecules (i.e., GFP)
absorb light at one λ and emit at a longer λ
Purpose of This Study
To analyze cellular compartment localization of proteins
in budding yeast, Saccharomyces cerevisiae, in order to
better understand the proteins’ functions
To utilize GFP fusion proteins and fluorescence microscopy to
analyze subcellular protein localization
To understand relationship between genetic and physical
interactions and subcellular localization to
predict/corroborate protein functions
Methods
Construct GFP-tagged library:
6,234 ORFsPCR6,029 GFP-tagged ORFs
Fluorscence Microscopy4,156 total ORFs
GFP-tagged Library Construction
Fusion protein: homologous recombination of GFP with ORF
at the C-terminus
GFP fluorescence microscopy: 4,156 ORFs
showed meaningful GFP signals
Methods Continued
Co-localization experiment: to refine the 12 original
localization categories
mRFP: monomeric red fluorescent protein strains
fused to proteins whose localization are known
(reference proteins) were mated with 700 GFP strains
whose localizations are unknown (see Figure 2c)
An additional 11 localization categories were generated to
account for the proteins whose localizations were
previously unknown (see Table 2)
Co-localization Micrographs
GFP + RFP fusion strains were mated and their
micrographs merged
12 subcellular locations initially classified (left
column)
Additional 11 localization categories defined via
co-localization (right column)
Subcellular Localization of Yeast Proteins
4,156 proteins represented 75% of yeast proteome
Localization data accounted for 70% of previously
unlocalized yeast proteins
56%: nucleus & cytoplasm; 44%: other subcell. locations
80% agreement with Saccharomyces Genome Database
(SGD) for 2,526 proteins
29 nuclear pore complex proteins identified by Mass
Spectrometric analysis, 25 were visible
Top: authors’ contribution to localization data for yeast proteome
Bottom: distribution of subcell. localizations for proteins
Nucleolar Protein Localization
Authors provided 1st direct evidence for some nucleolar
proteins that they reside in nucleolus
164 proteins in nucleolus82 overlap with SGD’s 127
which means 82 are newly defined nucleolar proteins
Mass Spec: 271 human nucleolar proteins, 166 homologues in
yeast (authors detected 164 yeast nucleolar proteins)
Expect human homologues of yeast proteins detected in
nucleolus will also be nucleolar proteins
Authors’ contribution: 82 newly defined nucleolar
proteins
Classification of the authors’ 164 identified nucleolar
proteins
Protein Localization and mRNA Co-expression
Compared transcriptional co-regulation to subcellular protein
localization on a proteome-wide scale
Utilized previous study that identified 33 transcriptional
“modules” – genes with marked co-regulation
Definition of enrichment: ratio of fraction of proteins in each
module with given subcellular localization to that fraction
in whole proteome
19 of 22 most highly expressed modules show statistically
significant enrichment
Strong correlation between co-localization and transcriptional
co-regulation, and furthermore, biological function
Sub-division of sets of co-expressed proteins
Co-localization of co-expressed proteins allows for prediction
for proteins of unknown biological function
Comparison with Genetic and Physical Interactions
Investigated relationship between co-localization and genetic
and physical interactions
Reference set: sum of all genetic and protein-protein
interactions reported in GRID database
Determined fold enrichment of interactions between
localizations
Compared distribution of subcellular localization of
interacting protein pairs to that of random protein pairs
Strong enrichment of interaction between proteins that co-localize
Degree of enrichment varies widely by compartment (cytoplasm vs.
microtubule)
Known or predicted function of proteins consistent with localization
(grey: previously thought not to localize to these
compartments)
Network of statistically significant interactions connects
functionally and physically related subcellular regions
(e.g. secretory pathway compartments)
Conclusions
Localized 4,156 proteins (75% of yeast proteome)
Provided location for 70% previously unlocalized yeast
proteins (30% of yeast proteome)
Overcame limits of fluorescence microscopy via colocalization of unknown proteins with proteins of
known location fused to another marker
Demonstrated higher frequency of co-localization for
proteins of transcriptional co-expression (“modules”)
Showed significant correlation between interaction and colocalization
Study provides most complete picture of yeast protein
localization to date (Wohlschlegel, J. & Yates, J.)
Future Studies
Organellar proteomics: nucleolar proteins involved in cell
cycle and gene regulation and the authors generated 82
newly defined nucleolar proteins, thus it would be
interesting to investigate their particular functions.
Study global dynamics of yeast proteome under different
external stimuli or growth conditions
Assay effects of deletion or mutation of a protein of
interest on global protein localization
Critiques
The authors point out that proteins may occupy the nucleolus
transiently but do not clarify why proteins leave and where
do they go? If their method of detection is not sensitive
enough, what adjustments would they make to enhance
sensitivity?
Mislocalizations may occur from C-terminal fusion of GFP to
ORF through steric hindrance or interruption of C-terminal
localization sequences
Limitations of fluorescence microscopy: kinetochore vs.
spindle pole body; membrane vs. lumen for mitochondria
or endoplasmic reticulum
Must balance validity of authors’ results (i.e., 82 newly defined
nucleolar proteins) with limitations of f.m. & 2 micrograph
scorers by conferring with localization data obtained by
other methods (i.e., mass spec & other databases)
References
Andersen, J.S. et al. Directed proteomic analysis of the human nucleolus. Curr. Biol.
12, 1-11 (2002).
Bergmann, S., Ihmels, J. & Barkai, N. Iterative signature algorithm for the analysis
of large-scale gene expression data. Phys. Rev. E 67, 031902 (2003)
Breitkreutz, B.-J., Stark, C. & Tyers, M. The GRID: the General Repository for
Interaction Datasets. Genome Biol. 4, R23 (2003)
Campbell, R.E. et al. A monomeric red fluorescent protein. Proc. Natl Acad. Sci.
USA 99, 7877-7882 (2002).
Dolinski, K. et al. Saccharomyces Genome Database
(http://www.genome.stanford.edu/Saccharomyces) (2003).
Ihmels, J. et al. Revealing modular organization in the yeast transcriptional
network. Nature Genet. 31, 370-377 (2002)
Tsien, R.Y. The green fluorescent protein. Annu. Rev. Biochem. 67, 509-544
(1998).
Wohlschlegel, J. & Yates, J. Proteomics: where's Waldo in yeast? Nature 425, 671-672
(2003).
http://notes.cc.sunysb.edu/CAS/chemistry.nsf/pages/Tonge_Group