Transcript Introduction
1
Editing Pathway/Genome Databases
Ron Caspi SRI International Bioinformatics
2
Pathway Tools in Editing Mode
● The database is separate from the user interface ● The Navigator provides one interface to the DB ● Editors provide an alternative interface to the DB SRI International Bioinformatics
3
Things to remember
● The editors are there for editing your own PGDB. Do not modify the EcoCyc or MetaCyc datasets ● Try to not alter the DB schema. Such changes may be overwritten when updating to a new version ● e.g. do not add or remove classes or slots ● Avoid Duplication! Reuse information whenever possible ● A PGDB should not describe the same biological or chemical entity more than once ● Some tools help preventing inadvertent creation of duplicate reactions/compounds SRI International Bioinformatics
4
List of Editors
● ● ● ● ● ● ● ● ● ● ● ● ● Compound Editor/Compound Structure Editors Reaction Editor Pathway Editor and Pathway Info Editor Protein/Subunit structure/Enzymatic Reaction Editors Synonym Editor Publication Editor Author/Organization Editor Gene Editor Intron Editor (Eukaryotes only) Transcription Unit Editor External database Editor Frame Editor Ontology Editor SRI International Bioinformatics
5
Activating Editing Mode
●
Type (enable/disable-editors t) at the listener pane
SRI International Bioinformatics
6
Saving Changes
● The user must save changes explicitly ● File => Save Current DB ● Save DB button ● Revert Current DB The following commands are only available for databases stored in MySQL or Oracle: ● List Unsaved Changes in Current DB ● Checkpoint Current DB Updates to File ● Restore Updates from Checkpoint File SRI International Bioinformatics
7
Other DB commands under the File menu
● Summarize databases ● Summarize current organism ● Refresh DB list ● Refresh All Current DBs ● Delete a DB ● Attempt to Reconnect to Database Server SRI International Bioinformatics
8
Invoking the Editors
New Object: Use the “New” command under top menus, or the “Create” command under the File menu (new feature) Existing Object: Right-Click on the Object Handle, select Edit, then the appropriate editor SRI International Bioinformatics
9
Compound Editor
● Create or edit a compound ● ● ● ● Specify Class Common Name and Synonyms Comments, citations Links to other DBs SRI International Bioinformatics
10
More Compound Editing
● Compound Structure Editors (Marvin, JME) http://www.chemaxon.com/marvin/index.html
● ● ● ● Export/Import to Mol files Exporting to other DBs Merging Duplicate Frame and Edit SRI International Bioinformatics
11
The Synonym Editor
● Lets you easily edit the synonyms and set the common name SRI International Bioinformatics
12 ● ● ● ● ● Enter or edit a reaction equation Set Conversion Type EC number (official?) Compartments (transport) Create new instances or classes
Reaction Editor
SRI International Bioinformatics
13
Pathway Info Editor
Class (variant class) Common Name Synonyms Evidence Codes Citations (CIT) Summary External Links Hypothetical reactions Enzymes not in use Author credits SRI International Bioinformatics
14
Evidence Codes for Pathways
●
http://bioinformatics.ai.sri.com/evidence-ontology/
● Experimental evidence codes: ● ● ● ● ● ● IDA: IEP: IPI: inferred from physical interaction TAS: IGI: IMP: inferred from direct assay inferred from expression pattern traceable author statement inferred from genetic interaction inferred from mutant phenotype Full documentation for an evidence code is displayed in the Navigator (click the code icon) SRI International Bioinformatics
15
Pathway Editor
● Graphically create and modify pathways ● ● Reaction Menu : add reactions one by one Pathway Menu : enter a linear pathway segment, add subpathways SRI International Bioinformatics
16
Connections Editor Operations
SRI International Bioinformatics
17
Pathway Editor Limitations
● Complex situations can cause ambiguity: ● link may be ignored ● dialog box for disambiguating ● pathway drawn in bizarre arrangement ● Fix: ● try removing offending link and add links in different order ● The Pathway Editor does not handle polymerization pathways easily.
SRI International Bioinformatics
18
Overview of Creating a Pathway
● Identify all the metabolites . Define any missing ones.
● Find the individual reactions in the PGDB/MetaCyc, and create new reactions if necessary.
● Compose the pathway from the individual reactions using the pathway editor ● Assign a class for the pathway ● Add a summary , citations , and evidence code ● Assign the appropriate enzymes , create complexes when appropriate.
● Curate information about enzymes and genes, including evidence codes for all enzymes.
SRI International Bioinformatics
19
Tutorial: Compounds, Reactions and Pathways
SRI International Bioinformatics
20
Import MetaCyc reactions instead of creating duplicates!
If the reaction is already present in MetaCyc, you will see a window like this one You should choose the option “ Import ”. However, you MUST write down the frame ID of the identical reaction, so you could use it later when specifying the pathway.
SRI International Bioinformatics
21
Reuse existing reactions instead of creating duplicates!
If the reaction is already present in your PGDB, you will see a window like this one You should choose the option “ Delete ”. However, pathway.
you MUST write down the frame ID of the identical reaction, so you could use it later when specifying the SRI International Bioinformatics
22
Fill Reaction frame ID’s in your handout
Reaction
ascorbate + H2O = 3-keto-L-gulonate
Frame ID
RXN0-707 3-keto-L-gulonate + ATP = 3-keto-L-gulonate 6-phosphate + ADP 3-keto-L-gulonate 6-phosphate = L-xylulose-5-phosphate + CO2 L-xylulose-5-phosphate = L-ribulose-5-phosphate L-ribulose-5-phosphate = xylulose-5-phosphate xylulose-5-phosphate = D-ribulose-5-phosphate SRI International Bioinformatics
Creating New Reactions
23 Don’t forget to include spaces as “+” and “=“: between chemical names and terms such 1.
ascorbate + H 2 O = 3-keto-L-gulonate 2.
3-keto-L-gulonate + ATP = 3-keto-L-gulonate-6-phosphate + ADP 3.
3-keto-L-gulonate-6-phosphate = L-xylulose-5-phosphate + CO 2 4.
L-xylulose-5-phosphate = L-ribulose-5-phosphate 5.
L-ribulose-5-phosphate = xylulose-5-phosphate 6.
xylulose-5-phosphate = D-ribulose-5-phosphate SRI International Bioinformatics
24
Curating Enzymes
SRI International Bioinformatics
25
Adding an Enzyme to a Reaction
● To add an enzyme to a reaction: ●
Right click the reaction, choose Edit → Create/Add enzyme.
● If the enzyme exists, specify its ID. If not, click “Search by genes or create new protein” ● The Protein Subunit Structure Editor pops up SRI International Bioinformatics
26
Specifying Enzyme Complexes
SRI International Bioinformatics
27
Specifying Multiple Subunits
SRI International Bioinformatics
28
The Enzyme Editor
General info Enzymatic Reaction(s) (1..n) Subunit(s) (0..n) SRI International Bioinformatics
29
Protein Editor
For an example, open CPLX-6621 in MetaCyc SRI International Bioinformatics
30
Enzymatic Reaction Editor
SRI International Bioinformatics
31
Protein Subunit Editor
Specify the copy number of each subunit Specify UniProt ID if available, experimental MW, and any useful info that may apply SRI International Bioinformatics
Citations
● Citation boxes ● The CITS field ● File =>Import Citations from PubMed ● Publication editor (invoke by right clicking on a citation at bottom) 32 Non PubMed citation: Enter in citation box in the form Smith06 , invoke editor by clicking out of a citation box.
SRI International Bioinformatics
33 Tutorial: Enzymes, citations and export SRI International Bioinformatics