DNA Barcoding and the Consortium for the Barcode of Life David E. Schindel, Executive Secretary National Museum of Natural History Smithsonian Institution [email protected]; http://www.barcoding.si.edu 202/633-0812; fax 202/633-2938 Data.
Download ReportTranscript DNA Barcoding and the Consortium for the Barcode of Life David E. Schindel, Executive Secretary National Museum of Natural History Smithsonian Institution [email protected]; http://www.barcoding.si.edu 202/633-0812; fax 202/633-2938 Data.
DNA Barcoding and the Consortium for the Barcode of Life
David E. Schindel, Executive Secretary
National Museum of Natural History Smithsonian Institution [email protected]
; http://www.barcoding.si.edu
202/633-0812; fax 202/633-2938 Data Analysis Working Group, DIMACS, 26 Sept 2005
Species Identification Matters
• Endangered/protected species • Agricultural pests • Invasive species • Disease vectors/pathogens • Hazards (e.g., bird strikes on airplanes) • Environmental quality indicators • Unsustainable harvesting • Fidelity of cell lines/culture collections Data Analysis Working Group, DIMACS, 26 Sept 2005
The Practice of Taxonomy
Taxonomic Decision Making Distributions of Character Variation Characters Specimens
The Uses of Taxonomy
Socioeconomic Decisions Concerns/ Regulations Specimens
Data Analysis Working Group, DIMACS, 26 Sept 2005
The Problem…
• Taxonomists are a limited resource • Taxonomic infrastructure is not widely available • Taxonomic decisions are difficult for non specialists • Therefore, the practice of taxonomy does not scale up to meet the needs of society (or ecology, ecosystem studies, etc.) Data Analysis Working Group, DIMACS, 26 Sept 2005
A DNA barcode is a short gene sequence taken from standardized portions of the genome, used to identify species
Data Analysis Working Group, DIMACS, 26 Sept 2005
Uses of DNA Barcodes
“Triage” tool for flagging potential new species: • Undescribed and cryptic species Research tool for assigning specimens to known species, including: • Life history stages, damaged specimens, gut contents, droppings Applied tool for identifying regulated species: • Disease vectors, agricultural pests, invasives • Protected species, CITES listed, trade-sensitive Data Analysis Working Group, DIMACS, 26 Sept 2005
The Mitochondrial Genome ND5 Cyt b D-Loop Small ribosomal RNA Large ribosomal RNA ND1 ND6 L-strand CO I I ND2 H-strand ND4 ND4L ND3 CO III CO I CO II ATPase subunit 8 ATPase subunit 6
Data Analysis Working Group, DIMACS, 26 Sept 2005
How much information is there in a DNA Barcode?
• Human genome: – Contains 3 billion base-pairs – Identified by 648 bp COI barcode sequence – Content-to-label ratio: 5 X 10 6 • Oxford English Dictionary, 2 nd Ed.,1989: – 20 volumes, 21,730 pages, 500,000 entries, 59 million words, 350 million print characters – Identified by 10-character ISBN – Content-to-label ratio: 4 X 10 7 Data Analysis Working Group, DIMACS, 26 Sept 2005
Current Norm: High throughput
Large capacity PCR and sequencing reactions ABI 3100 capillary automated sequencer Data Analysis Working Group, DIMACS, 26 Sept 2005
Future Norm?
• A taxonomic GPS • Link to reference database • Usable by non specialists.
Data Analysis Working Group, DIMACS, 26 Sept 2005
Data Analysis Working Group, DIMACS, 26 Sept 2005
Consortium for the Barcode of Life (CBOL)
• An international affiliation of: – 80+ Members Org’s, 35+ countries, 6 continents – Natural history museums, biodiversity organizations – Users: e.g., government agencies – Private sector biotech companies, database providers • First barcoding publications in 2002 • Cold Spring Harbor planning workshops in 2003 • Sloan Foundation grant, launch in May 2004 • Secretariat opens at Smithsonian, September 2004 • First international conference February 2005 Data Analysis Working Group, DIMACS, 26 Sept 2005
CBOL Member Organizations
(as of May 2005)
Data Analysis Working Group, DIMACS, 26 Sept 2005
CBOL’s Working Groups
• • • •
Database:
Designing/constructing the Barcode Section of GenBank
DNA:
Protocols for formalin-fixed and old museum specimens; Producing LIMS for dissemination
Data Analysis:
Beyond phenetic methods; population genetics perspective
Plants:
barcoding Identify gene region(s) for Data Analysis Working Group, DIMACS, 26 Sept 2005
CBOL’s Goals
• Create a reference barcode database • Identify high-priority taxa and societal needs • Promote/facilitate barcoding projects and ‘CBOL campaigns’ • Improve methods, address shared obstacles through WGs • Populate database from collections • More portability, less time/expense • Improve taxonomic research environment Data Analysis Working Group, DIMACS, 26 Sept 2005
Recent and Planned Activities
• Data standards, Barcode records in GenBank • Launch of FishBOL, All Birds Initiatives • International Network for Barcoding Invasive and Pest Species (INBIPS) • APEC Workshop on Invasives, Beijing • Mosquitoes and disease vectors • Plans for CITES species, endangered Vertebrates, Bushmeat Data Analysis Working Group, DIMACS, 26 Sept 2005
Barcode Section of GenBank
Specimen Metadata
Georeference Habitat Character sets Images Behavior Other genes
Other Databases
Phylogenetic Pop’n Genetics Ecological
Voucher Specimen Barcode Sequence
Trace files
Literature (link to content or citation) Species Name
Indices - Catalog of Life - GBIF/ECAT Nomenclators - Zoo Record - IPNI NameBank Publication links - New species Data Analysis Working Group, DIMACS, 26 Sept 2005