Difference between revisions of "Error correction of Illumina reads using Celera Assembler"
From mn/bio/cees-bioinf
Line 1: | Line 1: | ||
It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset: | It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset: | ||
+ | |||
- adapter sequences | - adapter sequences | ||
+ | |||
- errors | - errors | ||
+ | |||
- duplicated reads | - duplicated reads | ||
Revision as of 14:48, 13 June 2013
It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset:
- adapter sequences
- errors
- duplicated reads
In this how-to, we will remove adapter sequences and errors from a large Illumina dataset.
WORK IN PROGRESS.