2619_sRNASlides2012short

Download Report

Transcript 2619_sRNASlides2012short

project date 20/07/2012
The sRNA Workbench
Matthew. B. Stocks
01
PROJECT AIMS
Based on original algorithms developed for the web based UEA
sRNA Tools
Moxon et. al. (2008)
Easy to use
•
Designed to be used by biologists with limited access to
bioinformatics support, perhaps on a desktop computer (but is
extendable to servers or high performance clusters)
•
Cross platform support
•
To allow multiple analysis techniques from within a single
program through the use of an MDI (Multiple Document
Interface)
•
Performance improvements, Speed, Memory
•
Still accessible through the command line
Code maintainability
02
WHAT WE ALREADY OFFER
03
HELPER TOOLS
Adapter Removal
Removes 3’ and/or 5’ adaptors from raw highthroughput sequence data
FASTA or FASTQ
Provides sRNA length distribution
Filter
Filters sRNA sequences from high-throughput data
according to user-defined criteria
Sequence Alignment
Aligns sRNA Sequences to a Reference Genome or
other long read file
DEFAULT STYLES
04
ANALYSIS TOOLS
05
miRCat
•miRCat (micro RNA Categorisation)
Moxon et. al (2008) Stocks et. al. (2012)
Identify novel miRNAs in high-throughput sequence data
Identify known miRNAs found in miRBase in result set
Griffiths-Jones (2010)
View miRNA statistics
One click viewing of the predicted miRNA as they appear on the
genome
One click viewing of miRNA pre-cursor secondary structure plot
Concurrency....
06
miRCat threading
sRNA
sRNA
sRNA
CPU
Core
CPU
Core
Thread Pool:
FIFO data structure (Queue)
First sequence added to the list (First In) is the first item
processed (First Out)
sRNA
sRNA
Database
07
PERFORMANCE TEST
08
TARGET PREDICTION
Part of a parallel BBSRC tools and resources project
A high-throughput technique known as Parallel Analysis of
RNA Ends (PARE) is used to sequence mRNA cleavage
products on a large-scale
Sequence 5' end of uncapped mRNA (Degradome)
Including transcripts targeted by sRNAs and subjected to
endonucleolytic cleavage
The sRNA and degradome data can be used to identify
interactions between sRNAs and their target mRNA
09
PARESnip
PAREsnip can be used to search for genome-wide
interactions between all sRNAs and transcripts as well as
predicting targets of small groups of miRNAs
PAREsnip outputs all potential sRNA target duplexes
evidenced through the degradome
Features:
Categorisation system
Addo-Quaye et. al (2009)
Data structures based on m-way search trees and multithreaded optimisation
~90 mins to process 100k sequences of A. thaliana data
At least as sensitive to detecting targets as other tools
designed for this type of analysis or better
Folkes et. al. (2012) NAR
10
PARESnip
Each coloured
rectangle represents
an interaction
The taller the
rectangle, the higher
the abundance
Tool tips provide
additional information
T-plots can also be
generated
Target information is given
on the right
Navigation controls allow
the user to simply click
through the plots or output
them to PDF for publication
11
OTHER TOOLS
miRProf (micro RNA Profiler)
Detect all known miRNAs found in miRBase within your
dataset
Griffiths-Jones, (2010)
Determines normalised expression levels of known miRNA
found in miRBase
Can be used to compare expression levels across multiple
samples
TA-SI prediction (Trans Acting Small Interfering RNA prediction)
Predicts phased ta-siRNA in plant datasets
Chen et. al. (2007)
Requires sRNA database and genome database
View phased sRNAs as they appear on the genome
SiLoCo...
Multi sample general loci prediction
Expression profile
DEFAULT STYLES
12
ONLINE AVAILABILITY
18
srna-workbench.cmp.uea.ac.uk
19
STATS
Since launch on 27th Jan 2011
Total Visitors: 3,925
Page Views: 14,650 (3.73 pages per visit)
•Average time on site: 4.27 mins
•Total Version 2 Downloads: 1,200
•Around 30 collated requests for the RSS feed per day
20
STATS
•Visits from 57 countries/territories
UK top visitor then USA...
21
PUBLICATIONS
Stocks, M. B.; Moxon, S.; Mapleson, D.; Woolfenden, H. C.;
Mohorianu, I.; Folkes, L.; Schwach, F.; Dalmay, T. & Moulton, V.
(2012) The UEA sRNA Workbench: A Suite of Tools for Analysing
and Visualising Next Generation Sequencing microRNA and Small
RNA Datasets. Bioinformatics
Folkes, L.; Moxon, Simon.; Woolfenden, H.C.; Stocks, M.B.; Szittya,
G.; Dalmay, T.: Moulton, V.; (2012) PAREsnip: a tool for rapid
genome-wide discovery of small RNA/target interactions
evidenced through degradome sequencing. Nucleic Acids
Research
22
ACKNOWLEDGEMENTS
Vincent Moulton
Tamas Dalmay
Simon Moxon
Frank Schwach
Irina Mohorianu
Dan Mapleson
Hugh Woolfenden
Leighton Folkes
23
ANY QUESTIONS?
24