Transcript Epitope prediction algorithms
Epitope prediction algorithms
Urmila Kulkarni-Kale Bioinformatics Centre University of Pune
October 2K5
Vaccine development In Post-genomic era: Reverse Vaccinology Approach.
• Rappuoli R. (2000). Reverse vaccinology.
Curr Opin Microbiol. 3:445-450.
© Bioinformatics Centre, UoP 2
Genome Sequence Proteomics Technologies
In silico
analysis High throughput Cloning and expression In vitro and in vivo assays for Vaccine candidate identification
© Bioinformatics Centre, UoP
DNA microarrays
3
In Silico
Analysis
Peptide Multiepitope vaccines VACCINOME
October 2K5
Candidate Epitope DB Disease related protein DB
Epitope prediction 4
What Are Epitopes?
Antigenic determinants or Epitopes are the portions of the antigen molecules which are responsible for specificity of the antigens in antigen-antibody (Ag-Ab) reactions and that combine with the antigen binding site of Ab, to which they are complementary. © Bioinformatics Centre, UoP October 2K5 5
Types of Epitopes
•
Sequential / Continuous epitopes
: • recognized by Th cells • linear peptide fragments • amphipathic helical 9-12 mer •
Conformational / Discontinuous epitopes
: • recognized by both Th & B cells • non-linear discrete amino acid sequences, come together due to folding • exposed 15-22 mer © Bioinformatics Centre, UoP October 2K5 6
Properties of Epitopes
• They occur on the and are more protein.
surface flexible
of the protein than the rest of the • They have high degree of
solvent
.
exposure to the
• The amino acids making the epitope are usually
charged
and
hydrophilic.
© Bioinformatics Centre, UoP October 2K5 7
Methods to identify epitopes 1.
2.
3.
Immunochemical methods
• • • ELISA : Enzyme linked immunosorbent assay Immunoflurorescence Radioimmunoassay
X-ray crystallography
: Ag-Ab complex is crystallized and the structure is scanned for contact residues between Ag and Ab. The contact residues on the Ag are considered as the epitope.
Prediction methods
: Based on the X-ray crystal data available for Ag-Ab complexes, the propensity of an amino acid to lie in an epitope is calculated.
© Bioinformatics Centre, UoP October 2K5 8
Antigen-Antibody (Ag-Ab) complexes
• Non-obligatory heterocomplexes that are made and broken according to the environment • Involve proteins (Ag & Ab) that must also exist independently • Remarkable feature: – high affinity and strict specificity of antibodies for their antigens.
• Ab recognize the unique conformations and spatial locations on the surface of Ag • Epitopes & paratopes are relational entities © Bioinformatics Centre, UoP October 2K5 9
Antigen-Antibody complex
October 2K5 © Bioinformatics Centre, UoP 10
Ab-binding sites:
Sequential & Conformational Epitopes
!
Paratope October 2K5 Sequential Conformational © Bioinformatics Centre, UoP Ab-binding sites 11
B cell epitope prediction algorithms
: • • • • •
Hopp and Woods –1981 Welling et al –1985 Parker & Hodges - 1986 Kolaskar & Tongaonkar – 1990 Kolaskar & Urmila Kulkarni - 1999
• • • •
T cell epitope prediction algorithms Margalit, Spouge et al - 1987 Rothbard & Taylor – 1988 Stille et al –1987 Tepitope -1999
October 2K5 © Bioinformatics Centre, UoP : 12
Hopp & Woods method
• Pioneering work • Based on the fact that only the
hydrophilic
nature of amino acids is essential for an sequence to be an antigenic determinant • Local hydrophilicity values are assigned to each amino acid by the method of repetitive averaging using a window of six • Not very accurate © Bioinformatics Centre, UoP October 2K5 13
Welling’s method
• • •
Based on the % of each aa present in known epitopes compared with the % of aa in the avg. composition of a protein.
assigns an antigenicity value for each amino acid from the relative occurrence of the amino acid in an antigenic determinant site.
regions of 7 aa with relatively high antigenicity are extended to 11-13 aa depending on the antigenicity values of neighboring residues
.
© Bioinformatics Centre, UoP October 2K5 14
Parker & Hodges method
• Utilizes 3 parameters : – – –
Hydrophilicity : HPLC Accessibility : Janin’s scale Flexibility : Karplus & Schultz
• Hydrophilicity parameter was calculated using HPLC from retention co-efficients of model synthetic peptides.
• Surface profile was determined by summing the parameters for each residue of a seven-residue segment and assigning the sum to the fourth residue.
• One of the most useful prediction algorithms © Bioinformatics Centre, UoP October 2K5 15
Kolaskar & Tongaonkar’s method
• Semi-empirical method which uses physiological properties of amino acid residues • frequencies of occurrence of amino acids in experimentally known epitopes.
• Data of 169 epitopes from 34 different proteins was collected of which 156 which have less than 20 aa per determinant were used.
• Antigen: EMBOSS © Bioinformatics Centre, UoP October 2K5 16
CEP Server
• Predicts the conformational epitopes from X-ray crystals of Ag-Ab complexes.
• uses percent accessible surface area and distance as criteria October 2K5 © Bioinformatics Centre, UoP 17
An algorithm to map sequential and conformational epitopes of protein antigens of known structure October 2K5 © Bioinformatics Centre, UoP 18
October 2K5 © Bioinformatics Centre, UoP 19
CE: Beyond validation
• High accuracy: – Limited data set to evaluate the algorithm – Non-availability of true negative data sets • Prediction of false positives? – Are they really false positives?
Different Abs (HyHEL10 & D1.3) have over-lapping binding sites • Limitation: – Limited by the availability of 3D structure data of antigens © Bioinformatics Centre, UoP October 2K5 20
CE: Features
• The first algorithm for the prediction of conformational epitopes or antibody binding sites of protein antigens • Maps both: sequential & conformational epitopes • Prerequisite: 3D structure of an antigen © Bioinformatics Centre, UoP October 2K5 21
CEP: Conformational Epitope Prediction Server http://bioinfo.ernet.in/cep.htm
October 2K5 © Bioinformatics Centre, UoP 22
T-cell epitope prediction algorithms
• Considers amphipathic helix segments, tetramer and pentamer motifs (charged residues or glycine) followed by 2-3 hydrophobic residues and then a polar residue.
• Sequence motifs of immunodominant secondary structure capable of binding to MHC with high affinity.
• Virtual matrices which are used for predicting MHC polymorphism and anchor residues.
© Bioinformatics Centre, UoP October 2K5 23
• Case study: Design & development of peptide vaccine against Japanese encephalitis virus October 2K5 © Bioinformatics Centre, UoP 24
We Have Chosen JE Virus, Because
JE virus is endemic in South-east Asia including India.
JE virus causes encephalitis in children between 5-15 years of age with fatality rates between 21-44%.
Man is a "DEAD END" host.
© Bioinformatics Centre, UoP October 2K5 25
We Have Chosen JE Virus, Because
• Killed virus vaccine purified from mouse brain is used presently which requires storage at specific temperatures and hence not cost effective in tropical countries.
• Protective prophylactic immunity is induced only after administration of 2-3 doses.
• Cost of vaccination, transportation is high.
storage and © Bioinformatics Centre, UoP October 2K5 26
Predicted structure of JEVS Mutations: JEVN/
JEVS
October 2K5 © Bioinformatics Centre, UoP 27
October 2K5 © Bioinformatics Centre, UoP 28
CE of JEVN Egp
October 2K5 © Bioinformatics Centre, UoP 29
Species and Strain specific properties: TBEV/ JEVN/JEVS
• Loop1 in TBEV:
LA EE H QGGT
• Loop1 in JEVN:
H N E KR A D SS
• Loop1 in JEVS:
H N KKR A D SS
Antibodies recognising TBEV and JEVN would require exactly opposite pattern of charges in their CDR regions.
Further, modification in CDR is required to recognise strain-specific region of JEVS.
October 2K5 © Bioinformatics Centre, UoP 30
Multiple alignment of Predicted T 426 H -cell epitope in the JE_Egp with corresponding epitopes in Egps of other Flaviviruses 457 JE MVE DF DF
G G
S S
I V
GG GG
VFN VFN
S S
I I
GK GK
AV AV
H H
Q Q
V V
F F
G G
GAFRTL GAFRTL
F F
G G
G G
MS MS
WNE KUN DF
G
S
V
GG
VFT
S
V
GK
AI
H
Q
V
F
G
GAFRSL
F
G
G
MS
DF
G
S
V
GG
VFT
S
V
GK
AV
H
Q
V
F
G
GAFRSL
F
G
G
MS
SLE DF
G
S
I
GG
VFN
S
I
GK
AV
H
Q
V
F
G
GAFRTL
F
G
G
MS
DEN2 DF
G
S
L
GG
VFT
S
I
GK
AL
H
Q
V
F
G
AIYGAA
F
S
G
VS
YF DF
S
S
A
GG
FFT
S
V
GK
GI
H
T
V
F
G
SAFQGL
F
G
G
LN
TBE DF
G
S
A
GG
FLS
S
I
GK
AV
H
T
V
L
G
GAFNSI
F
G
G
VG
COMM DF S GG S GK H V G F G Multiple alignment of JE_Egp with Egps of other Flaviviruses in the YSAQVGASQ region.
151 183 JE SENHGN YSAQVGASQ AAKFTITPNAPSITLKLG MVE STSHGNYSTQIGANQAVRFTISPNAPAITAKMG WNE VESHG----KIGATQAGRFSITPSAPSYTLKLG KUN VESHGNYFTQTGAAQAGRFSITPAAPSYTLKLG SLE STSHGNYSEQIGKNQAARFTISPQAPSFTANMG DEN2 HAVGNDTG-----KHGKEIKITPQSSTTEAELT YF QENWN--------TDIKTLKFDALSGSQEVEFI
October 2K5 © Bioinformatics Centre, UoP 31
Peptide Modeling
Initial random conformation Force field: Amber Distance dependent dielectric constant 4r ij Geometry optimization: Steepest descents & Conjugate gradients Molecular dynamics at 400 K for 1ns Peptides are: SENHGNYSAQVGASQ NHGNYSAQVGASQ YSAQVGASQ YSAQVGASQAAKFT NHGNYSAQVGASQAAKFT SENHGNYSAQVGASQAAKFT 149 168
October 2K5 © Bioinformatics Centre, UoP 33
October 2K5 © Bioinformatics Centre, UoP 34
Relevant Publications & Patent
• Urmila Kulkarni-Kale, Shriram Bhosale, G. Sunitha Manjari, Ashok Kolaskar, (2004). VirGen: A comprehensive viral genome resource.
Nucleic Acids Research 32:289-292.
• Urmila Kulkarni-Kale & A. S. Kolaskar (2003). Prediction of 3D structure of envelope glycoprotein of Sri Lanka strain of Japanese encephalitis virus.
In Yi-Ping Phoebe Chen (ed.), Conferences in research and practice in information technology . 19:87-96.
• A. S. Kolaskar & Urmila Kulkarni-Kale (1999) Prediction of three dimensional structure and mapping of conformational antigenic determinants of envelope glycoprotein of Japanese encephalitis virus.
Virology . 261:31-42.
Patent: Chimeric T helper-B cell peptide as a vaccine for Flaviviruses. Dr. M. M. Gore, Dr. S.S. Dewasthaly, Prof. A.S. Kolaskar, Urmila Kulkarni-Kale Sangeeta Sawant
WO 02/053182 A1
© Bioinformatics Centre, UoP October 2K5 35
• • • • • •
Important references
Hopp, Woods, 1981,
Prediction of protein antigenic determinants from amino acid sequences
, PNAS U.S.A 78, 3824-3828 Parker, Hodges et al, 1986,
New hydrophilicity scale derived from high performance liquid chromatography peptide retention data: Correlation of predicted surface residues with antigenicity and X-ray derived accessible sites,
Biochemistry:25, 5425-32 Kolaskar, Tongaonkar, 1990,
A semi empirical method for prediction of antigenic determinants on protein antigens
, FEBS 276, 172-174 Men‚ndez-Arias, L. & Rodriguez, R. (1990),
A BASIC microcomputer program forprediction of B and T cell epitopes in proteins,
CABIOS, 6, 101-105 Peter S. Stern (1991), 163-169
Predicting antigenic sites on proteins
, TIBTECH, 9, A.S. Kolaskar and Urmila Kulkarni-Kale, 1999 -
Prediction of three dimensional structure and mapping of conformational epitopes of envelope glycoprotein of Japanese encephalitis virus
,Virology, 261, 31-42 © Bioinformatics Centre, UoP October 2K5 36