Bioinformatics Tools for
Structural Biology
Dr Jaime Prilusky
ISPC-WIS
Helsinki, June 2006
Bioinformatics Tools - Dr Jaime Prilusky - 2006
• Target Identity and Foldability
• Selection of Expression System
• Selection of Crystallization Conditions
• Data Awareness
• Data Management
http://www.weizmann.ac.il/ISPC/biotools.html
Web Services
http://www.weizmann.ac.il/ISPC/biotools.html
Some of our tools:
• BestPrimers (just best primers)
• FoldIndex (will this protein fold?)
• HyPare (detects hot-spots for mutation)
• LabTargets (browser/database for proteomics)
• OCA (browser/database for structure & function)
• RecentReferences (what people published?)
• SeqAlert (someone else works on this sequence?)
• SeqFacts (tell me something about this sequence)
• SuggestES (suggest an expression system)
• SuggestXC (suggest crystallization conditions)
• VerifyCloning (check results from DNA sequencing)
FoldIndex
Web Services
Enabled
FoldIndex© tries to answer the question: will this protein fold?
FoldIndex© predicts whether a given protein is intrinsically disordered.
Prilusky J., Felder C.E., Zeev-Ben-Mordehai T., Rydberg
E., Man O., Beckmann J.S., Silman I. and Sussman J.L.
Bioinformatics, 2005 Aug 15;21(16):3435-8
http://bip.weizmann.ac.il/fold
[Figure: natively unfolded vs. folded proteins]
Why are natively unfolded proteins unstructured under physiologic conditions?
Vladimir N. Uversky, Joel R. Gillespie, Anthony L. Fink; Proteins: Structure,
Function, and Genetics, Volume 41, Issue 3, Pages 415-427
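The separation between natively unfolded and folded proteins described in the paper cited above can be expressed as a single score. The sketch below assumes the form reported in the literature for FoldIndex, I = 2.785<H> - |<R>| - 1.151, where <H> is the mean Kyte-Doolittle hydrophobicity rescaled to the range 0..1 and <R> is the mean net charge per residue. Neither the formula nor the scale values appear on these slides, so both are assumptions here, and the function name fold_index is hypothetical. It scores the whole sequence at once rather than using a sliding window.

```python
# Kyte-Doolittle hydropathy values (assumed from the literature, not from the slides).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
# Net charge at neutral pH; histidine is treated as uncharged (an assumption).
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def fold_index(sequence):
    """Whole-sequence charge-hydropathy score (hypothetical helper)."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError("unsupported residues: " + "".join(sorted(unknown)))
    # Rescale hydropathy from [-4.5, 4.5] to [0, 1] before averaging.
    mean_h = sum((KYTE_DOOLITTLE[aa] + 4.5) / 9.0 for aa in seq) / len(seq)
    mean_r = sum(CHARGE.get(aa, 0) for aa in seq) / len(seq)
    return 2.785 * mean_h - abs(mean_r) - 1.151
```

Under these assumptions a positive score suggests a folded protein and a negative score suggests an intrinsically unfolded one.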
Web Services
Enabled
Web Services access to FoldIndex:
http://bip.weizmann.ac.il/fldbin/findex?sq=SEQ&m=xml
sq is the protein sequence in one-letter code, with no spaces.
m (mode) is either xml or efam and selects the format in which FoldIndex returns the result: with m=xml the server sends results in XML format; with m=efam the results are sent in eFamily format.
The FoldIndex home page has sample Perl scripts to retrieve and parse prediction data.
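As an illustration of the request format described above, here is a minimal Python sketch that builds the query URL from the sq and m parameters and optionally fetches the result. The endpoint is the one given on the slide and may no longer be reachable. The function names, the input validation and the timeout are assumptions made for this example and are not part of the FoldIndex service.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint as shown on the slide; availability today is not guaranteed.
FOLDINDEX_URL = "http://bip.weizmann.ac.il/fldbin/findex"
VALID_MODES = ("xml", "efam")


def build_foldindex_url(sequence, mode="xml"):
    """Return the request URL for a one-letter protein sequence (hypothetical helper)."""
    seq = "".join(sequence.split()).upper()  # the service expects no spaces
    if not seq or not seq.isalpha():
        raise ValueError("sequence must contain only one-letter residue codes")
    if mode not in VALID_MODES:
        raise ValueError("mode must be 'xml' or 'efam'")
    return FOLDINDEX_URL + "?" + urlencode({"sq": seq, "m": mode})


def fetch_foldindex(sequence, mode="xml", timeout=30):
    """Fetch the raw response body as text; parsing is left to the caller."""
    with urlopen(build_foldindex_url(sequence, mode), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

For bulk use, build_foldindex_url can be called in a loop over many sequences, ideally with a pause between requests to avoid overloading the server.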
SeqFacts©
SeqFacts© is a tool for sequence identification,
analysis, characterization and annotation. This server
will try to find relevant information related to your
sequence.
http://bip.weizmann.ac.il/sqfbin/seqfacts
SeqFacts workflow (diagram; labels in extraction order):
DNA sequence; Amino Acid; properties calculation; FoldIndex; references; RecentReferences; quality cleaning; vectors removal; calculation; ORF; RecentReferences; similarity search; specific DB search; DB similarity; SeqAlert (once)
Automatic analysis of 1226_int11f-SP6-19 (Sun Jun 8 15:55:24 2003)
------------------------------------------------------------------------------
Summary:
This analysis is based on a clean portion (4 to 644) from the 920 original bases.
The best ORF of the six frames translation has a length of 186.
A similarity search found 50 hits on 18 organisms: Oryctolagus cuniculus and
Homo sapiens are the most representatives.
Conserved domains present in the translated sequence: Arylesterase, COG3386.
Original files:
1226_int11f-SP6-19.fasta
1226_int11f-SP6-19.gcg
ORFs
Six Frames translation
Similarity Search
Conserved Domain database
Genome Drosophila
Genome Homo
Genome Mus
Genome Rattus
Fantom (Functional Annotation Of Mouse)
Taxonomy
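The report above mentions a six-frame translation and a best ORF. As a rough sketch of what such a step can look like, the Python below translates a DNA sequence in all six reading frames with the standard genetic code and picks the longest stretch that starts at a methionine and runs to the next stop codon or to the end of the read. How SeqFacts itself defines the best ORF is not stated on the slides, so this definition and the function names are assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna):
    """Map (strand, offset) to the translated protein for all six frames."""
    dna = dna.upper()
    reverse = dna.translate(COMPLEMENT)[::-1]
    return {(strand, offset): translate(seq[offset:])
            for strand, seq in (("+", dna), ("-", reverse))
            for offset in range(3)}


def best_orf(dna):
    """Return (protein, frame) for the longest M-initiated stretch between stops."""
    best = ("", None)
    for frame, protein in six_frames(dna).items():
        for segment in protein.split("*"):
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best[0]):
                best = (segment[start:], frame)
    return best
```

For example, best_orf("ATGAAATTTTAA") returns the peptide MKF found on the forward strand at offset 0.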
BestPrimers
Web Services
Enabled
BestPrimers© provides a simple interface for primer calculation, with FoldIndex© support.
http://bip.weizmann.ac.il/sqfbin/bestPrimers
VerifyCloning©
Web Services
Enabled
VerifyCloning© compares your original sequence
against the result from cloning procedures,
highlighting conflicting areas. VerifyCloning has an
automatic reverse/complement mode that will reorder
your sequences if required.
http://bip.weizmann.ac.il/vfclon
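The comparison described above, including the automatic reverse/complement mode, can be sketched in a few lines of Python. This is not the VerifyCloning implementation: it uses the standard-library difflib.SequenceMatcher as a stand-in for a proper sequence aligner, and the function names are hypothetical.

```python
from difflib import SequenceMatcher

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def verify_cloning(expected, read):
    """Compare a sequencing read with the expected sequence.

    Returns the orientation that matches best and a list of conflicts as
    (kind, start, end, expected_fragment, read_fragment) tuples, with
    start/end given as 0-based positions in the expected sequence.
    """
    expected = expected.upper()
    read = read.upper()
    candidates = {"forward": read, "reverse-complement": reverse_complement(read)}

    def similarity(name):
        return SequenceMatcher(None, expected, candidates[name], autojunk=False).ratio()

    # Pick the orientation of the read that is most similar to the expected sequence.
    orientation = max(candidates, key=similarity)
    oriented = candidates[orientation]
    matcher = SequenceMatcher(None, expected, oriented, autojunk=False)
    conflicts = [(tag, i1, i2, expected[i1:i2], oriented[j1:j2])
                 for tag, i1, i2, j1, j2 in matcher.get_opcodes()
                 if tag != "equal"]
    return orientation, conflicts
```

An empty conflict list means the read agrees with the expected sequence in the reported orientation. For long reads a dedicated alignment tool would be a better choice than difflib.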
SuggestES©
Web Services
Enabled
SuggestES© takes the protein sequence you provide
and scans a large database with protein sequences
with known results for different expression systems,
generating a suggestion based on several parameters:
Similarity: how similar is your sequence to the existing
data in the database?
Recentness: how recently was a given expression
system used?
Frequency: how frequently was a given expression
system used?
http://bip.weizmann.ac.il/suggestES
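To make the three parameters above concrete, here is a small Python sketch of one way to combine them into a ranking. The slides do not say how SuggestES weighs similarity, recentness and frequency, so the scoring rule (similarity multiplied by an exponential recency weight, summed per expression system so that frequent systems accumulate score), the half-life value, the Record fields and the example data used in the description are all assumptions for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """One database entry with a known outcome (hypothetical structure)."""
    system: str        # expression system reported for the entry
    similarity: float  # similarity of the entry to the query, from 0 to 1
    year: int          # year in which the system was used


def rank_systems(records, current_year, half_life_years=5.0):
    """Rank expression systems by summed similarity-times-recency scores."""
    scores = defaultdict(float)
    for record in records:
        age = max(0, current_year - record.year)
        recency = 0.5 ** (age / half_life_years)  # weight halves every half-life
        # Summing per system means frequently used systems gain score.
        scores[record.system] += record.similarity * recency
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

With this rule, two moderately similar recent entries for one system can outrank a single highly similar but old entry for another. The same scheme would apply unchanged to crystallization conditions.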
SuggestXC©
Web Services
Enabled
SuggestXC© takes the protein sequence you provide
and scans a large database with protein sequences
with known results for different crystallization
conditions, generating a suggestion based on several
parameters:
Similarity: how similar is your sequence to the existing
data in the database?
Recentness: how recently was a given crystallization
condition used?
Frequency: how frequently was a given crystallization
condition used?
http://bip.weizmann.ac.il/suggestXC
HyPare©
Web Services
Enabled
HyPare© is a newly designed tool based on the PARE algorithm (Predicting Association Rate Enhancement).
With HyPare you can find hotspots for association and thus engineer your proteins for faster and tighter binding.
http://bip.weizmann.ac.il/hypare
SeqAlert
SeqAlert© is a sequence alerting service that periodically compares your sequence(s) against sequences of 3D structures already determined or being determined at the PDB, and against TargetDB, the database of target sequences from worldwide structural genomics projects.
It also reports the PubMed IDs of papers published in the last 20 days that might be related to your sequence.
http://bip.weizmann.ac.il/seqalert
Proteomics LIMS
An integrated Laboratory Information Management System for proteomics, with support for complexes.
http://www.weizmann.ac.il/ISPC/
Web Services
Enabled
LabTargets (browser/database for proteomics)
supported operating systems
required software
client-server architecture
generates/exports:
• SPINE xml
• TargetDB xml
• Targets Status Graph
• Targets Detail
• Structures Gallery
How can you access our tools?
Direct Web Access
Web Services, Bulk Access
Web Services, Software Integration
Direct Web Access
OCA mirrors world distribution
http://bip.weizmann.ac.il/oca
Web Services, Bulk Access
FoldIndex
LPC CSU
OCA
Beer Sheva, Israel
Cambridge, USA
Chicago, USA
Hawaii, USA
Helsinki, Finland
Bangalore, India
Kansas City, USA
Madrid, Spain
Leipzig, Germany
Missouri, USA
Montpellier, France
San Diego, USA
Web Services, Software Integration
IBS: Integrated Bioinformatics System
@ KISTI, Daejeon, South Korea
[FoldIndex, BestPrimers, SeqFacts, SuggestES]
GUTSS: Genome Structure Selection System
@ The Burnham Institute, La Jolla CA, USA
[SeqFacts]
Raptor 3D: functional annotations to structures
@ GBF, Braunschweig, Germany
[OCA]
WIS LIMS:
@ Weizmann Institute, Rehovot, Israel
[OCA, FoldIndex, SeqFacts, BestPrimers, …]
Why have a local installation?
Some of our tools allow for a local installation
• intranet requirements
• repetitive batch analysis
• tight services integration
Web Services
Enabled
OCA
OCA© facilitates the understanding of genomics/proteomics biological data through data analysis and synthesis.
The data integration process brings together dispersed pieces of information, generating comprehensive summaries that might be just what a researcher needs for crystallizing an idea or understanding a problem.
http://bip.weizmann.ac.il/oca
Israel Structural
Proteomics Center
Dr. Shira Albeck
Rani Bravdo
Prof. Yigal Burstein
Dr. Orly Dym
Yossi Jacobovitch
Dorit Kalo
Nurit Levy
Ran Meged
Yigal Michael
Dr. Yoav Peleg
Dr. Jaime Prilusky
Prof. Gideon Schreiber
Prof. Israel Silman
Prof. Joel L Sussman
Dr. Tamar Unger
Dr Jaime Prilusky