Protein structure Classification Ole Lund, Associate professor, CBS, DTU. Why classify proteins Number of solved structures grow rapidly Generate overview of structure types Detect similarities (evolutionary.
Download ReportTranscript Protein structure Classification Ole Lund, Associate professor, CBS, DTU. Why classify proteins Number of solved structures grow rapidly Generate overview of structure types Detect similarities (evolutionary.
Protein structure Classification Ole Lund, Associate professor, CBS, DTU. Why classify proteins Number of solved structures grow rapidly Generate overview of structure types Detect similarities (evolutionary relationships) Set up prediction benchmarks OL Classification schemes SCOP – CATH – Manual classification (A Murzin) Semi manual classification (C orengo) FSSP – Automatic classification (L Holm) OL Levels in SCOP 1. 2. 3. 4. Class Folds Superfamilies Families 10 648 1007 1699 Murzin et al., 1995 http://scop.mrc-lmb.cam.ac.uk/scop/ OL Major classes in scop Classes – – – – – – All alpha proteins Alpha and beta proteins (a/b) Alpha and beta proteins (a+b) Multi-domain proteins Membrane and cell surface proteins Small proteins OL All alpha: Hemoglobin (1bab) OL All beta: Immunoglobulin (8fab) OL Alpha/beta: Triosephosphate isomerase (1hti) OL Alpha+beta: Lysozyme (1jsf) OL Folds* Each Class may be divided into one or more folds Proteins which have the same (>~50%) secondary structure elements arranged the in the same order in the protein chain and in three dimensions are classified as having the same fold *confusingly also called fold classes OL Superfamilies Superfamilies are a subdivisions of folds A superfamily contains proteins which are thought to be evolutionarily related due to – – – Sequence Function Special structural features Relationships between members of a superfamily may not be readily recognizable from the sequence alone OL Families Subdivision of supefamilies Contains members whose relationship is readily recognizable from the sequence (>~25% sequence identity) Families are further subdivided in to Proteins Proteins are divided into Species – The same protein may be found in several species OL Families OL CATH Levels Class Architecture – Topology – This level is unique to CATH (~30 is defined) ~Fold(/superfamily) in SCOP Homologous Superfamily – ~Superfamily(/family) in SCOP http://www.biochem.ucl.ac.uk/bsm/cath_new/index.html OL Architecture Same overall arrangement of secondary structures – Example: The architecture :Two layer beta sheet proteins contains different folds each with a distinct number and connectivity of strands OL FSSP Fully automated classification Automatic update Database contains structural alignments Tree of protein structures OL FSSP classification (MHC molecules) OL Links PDB – SCOP – scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.html CATH – www.rcsb.org/pdb/ www.biochem.ucl.ac.uk/bsm/cath_new/index.html FSSP – http://www.bioinfo.biocenter.helsinki.fi:8080/dali/index.html OL