Difference between revisions of "Error correction of Illumina reads using Celera Assembler"

From mn/bio/cees-bioinf
Jump to: navigation, search
Line 1: Line 1:
 
It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset:
 
It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset:
 +
 
- adapter sequences  
 
- adapter sequences  
 +
 
- errors  
 
- errors  
 +
 
- duplicated reads
 
- duplicated reads
  

Revision as of 14:48, 13 June 2013

It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset:

- adapter sequences

- errors

- duplicated reads

In this how-to, we will remove adapter sequences and errors from a large Illumina dataset.

WORK IN PROGRESS.