DNA sequence analysis generates large volumes of data presenting
challenging bioinformatic and statistical problems. This tutorial
introduces Bioconductor (http://bioconductor.org) packages and work flows
for the analysis of sequence data. We learn about approaches for
efficiently manipulating sequences and alignments, and introduce common
work flows and the unique statistical challenges associated with `RNA‐
seq', variant annotation, and other experiments. The emphasis is on
exploratory analysis, and the analysis of designed experiments.The workshop emphasizes orientation within the
Bioconductor milieu; we will touch on the Biostrings, ShortRead,
GenomicRanges, edgeR,
and VariantAnnotation
, and other packages, with
short exercises to illustrate the functionality of each package.
The workshop assumes an intermediate level of familiarity with R, and basic understanding of biological and technological aspects of high‐throughput sequence analysis. Participants should come prepared with a modern wireless‐enabled laptop and web browser installed.
This workshop is for professional bioinformaticians and statisticians intending to use R / Bioconductor for analysis and comprehension of high-throughput sequence data.
Workshop materials will be available prior to the course, at http://bioconductor.org/help/course-materials/2013/
The Bioconductor project is described in detail at http://bioconductor.org.