Error correction of Illumina reads using Celera Assembler

Revision as of 14:48, 13 June 2013

It is usually useful to preprocess the data you have before assembly. There can be several issues with a dataset:

- adapter sequences
- errors
- duplicated reads

In this how-to, we will remove adapter sequences and errors from a large Illumina dataset.
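To make two of the issues listed above concrete, here is a minimal Python sketch of what adapter contamination and duplicated reads look like at the sequence level. It is an illustration only, not the method used in this how-to: the function names `trim_adapter` and `drop_exact_duplicates` are hypothetical, the adapter string is just an example value, and only exact matches are handled, whereas dedicated tools also tolerate mismatches and use quality values.

```python
def trim_adapter(read, adapter, min_overlap=5):
    """Cut a 3' adapter off a read (exact matching only, illustrative).

    Handles a full adapter anywhere in the read, or a partial adapter
    (a prefix of at least `min_overlap` bases) at the very end of the read.
    """
    pos = read.find(adapter)
    if pos != -1:
        # Full adapter found: keep only the bases before it.
        return read[:pos]
    # Look for the longest adapter prefix that the read ends with.
    longest = min(len(adapter), len(read)) - 1
    for k in range(longest, min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:-k]
    return read


def drop_exact_duplicates(reads):
    """Keep the first copy of each identical sequence, preserving order."""
    seen = set()
    unique = []
    for read in reads:
        if read not in seen:
            seen.add(read)
            unique.append(read)
    return unique


if __name__ == "__main__":
    adapter = "AGATCGGAAGAGC"  # example value, assumed for illustration
    reads = [
        "ACGTACGTAGATCGGAAGAGC",  # insert followed by the full adapter
        "ACGTACGTAGATCGG",        # insert followed by a partial adapter
        "TTTTGGGGCCCC",           # no adapter
        "TTTTGGGGCCCC",           # exact duplicate of the previous read
    ]
    trimmed = [trim_adapter(r, adapter) for r in reads]
    print(drop_exact_duplicates(trimmed))
```

Sequencing errors, the second item in the list, cannot be handled by simple string matching like this, because correcting them requires comparing each read against the other reads that cover the same region.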

WORK IN PROGRESS.