Difference between revisions of "Error correction of Illumina reads using Celera Assembler"

From mn/bio/cees-bioinf

Revision as of 14:49, 13 June 2013

It is usually useful to preprocess sequencing data before assembly. A dataset can have several issues:

- adapter sequences   
- sequencing errors  
- duplicated reads  

In this how-to, we will remove adapter sequences and errors from a large Illumina dataset.
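
To make the issues listed above concrete, here is a minimal Python sketch of what adapter removal and duplicate removal mean at the level of individual reads. It is a toy illustration only and is not the Celera Assembler procedure this how-to is about: the function name `trim_adapter`, the `min_overlap` parameter, and the example reads are all hypothetical, matching is exact (real trimmers tolerate mismatches and use base qualities), and the adapter string is assumed to be the common start of the Illumina TruSeq adapter.

```python
def trim_adapter(read, adapter, min_overlap=3):
    """Trim a 3' adapter from a read using exact matching only (toy example)."""
    # Case 1: the full adapter occurs inside the read; keep what precedes it.
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]
    # Case 2: the read ends with a partial adapter (a prefix of the adapter).
    # Try the longest possible overlap first, down to min_overlap bases.
    longest = min(len(adapter) - 1, len(read))
    for k in range(longest, min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:len(read) - k]
    # No adapter found: return the read unchanged.
    return read


# Assumed adapter prefix and made-up reads, for illustration only.
ADAPTER = "AGATCGGAAGAGC"
reads = [
    "ACGTACGTAGATCGGAAGAGCTTT",  # full adapter followed by extra bases
    "ACGTTTGACCAGATCG",          # partial adapter at the 3' end
    "ACGTACGT",                  # no adapter
]

trimmed = [trim_adapter(r, ADAPTER) for r in reads]

# Exact duplicates can be dropped while keeping first-seen order.
unique = list(dict.fromkeys(trimmed))

print(trimmed)
print(unique)
```

After trimming, the first and third reads become identical, which is why duplicate removal is usually considered only after adapters have been removed.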

WORK IN PROGRESS.