Protein structure Classification Ole Lund, Associate professor, CBS, DTU. Why classify proteins     Number of solved structures grow rapidly Generate overview of structure types Detect similarities (evolutionary.

Download Report

Transcript Protein structure Classification Ole Lund, Associate professor, CBS, DTU. Why classify proteins     Number of solved structures grow rapidly Generate overview of structure types Detect similarities (evolutionary.

Protein structure Classification
Ole Lund,
Associate professor,
CBS, DTU.
Why classify proteins




Number of solved structures grow rapidly
Generate overview of structure types
Detect similarities (evolutionary relationships)
Set up prediction benchmarks
OL
Classification schemes

SCOP
–

CATH
–

Manual classification (A Murzin)
Semi manual classification (C orengo)
FSSP
–
Automatic classification (L Holm)
OL
Levels in SCOP
1.
2.
3.
4.
Class
Folds
Superfamilies
Families
10
648
1007
1699
Murzin et al., 1995
http://scop.mrc-lmb.cam.ac.uk/scop/
OL
Major classes in scop

Classes
–
–
–
–
–
–
All alpha proteins
Alpha and beta proteins (a/b)
Alpha and beta proteins (a+b)
Multi-domain proteins
Membrane and cell surface proteins
Small proteins
OL
All alpha: Hemoglobin (1bab)
OL
All beta: Immunoglobulin (8fab)
OL
Alpha/beta: Triosephosphate
isomerase (1hti)
OL
Alpha+beta: Lysozyme (1jsf)
OL
Folds*


Each Class may be divided into one or more folds
Proteins which have the same (>~50%) secondary
structure elements arranged the in the same order in
the protein chain and in three dimensions are
classified as having the same fold
*confusingly also called fold classes
OL
Superfamilies


Superfamilies are a subdivisions of folds
A superfamily contains proteins which are thought to
be evolutionarily related due to
–
–
–

Sequence
Function
Special structural features
Relationships between members of a superfamily
may not be readily recognizable from the sequence
alone
OL
Families




Subdivision of supefamilies
Contains members whose relationship is
readily recognizable from the sequence
(>~25% sequence identity)
Families are further subdivided in to Proteins
Proteins are divided into Species
–
The same protein may be found in several
species
OL
Families
OL
CATH



Levels
Class
Architecture
–

Topology
–

This level is unique to CATH (~30 is defined)
~Fold(/superfamily) in SCOP
Homologous Superfamily
–
~Superfamily(/family) in SCOP
http://www.biochem.ucl.ac.uk/bsm/cath_new/index.html
OL
Architecture

Same overall arrangement of secondary
structures
–
Example: The architecture :Two layer beta sheet
proteins contains different folds each with a
distinct number and connectivity of strands
OL
FSSP




Fully automated classification
Automatic update
Database contains structural alignments
Tree of protein structures
OL
FSSP classification (MHC molecules)
OL
Links

PDB
–

SCOP
–

scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.html
CATH
–

www.rcsb.org/pdb/
www.biochem.ucl.ac.uk/bsm/cath_new/index.html
FSSP
–
http://www.bioinfo.biocenter.helsinki.fi:8080/dali/index.html
OL