|
Self-correction approach (SCA) was proposed in the [http://www.ncbi.nlm.nih.gov/pubmed/24034426 ref] (''Reducing assembly complexity of microbial genomes with single-molecule sequencin'', Genome Biology 2013). |
|
Self-correction approach (SCA) was proposed in the [http://www.ncbi.nlm.nih.gov/pubmed/24034426 ref] (''Reducing assembly complexity of microbial genomes with single-molecule sequencin'', Genome Biology 2013). |
- |
We Following we used ''E. coli'' as an example.
|
+ |
Following we used ''E. coli'' as an example to show the steps
|
|
|
+ |
1. Long reads self-correction
|
|
pacBioToCA -length 500 -partitions 200 -l pacbio -t 6 -s ../pacbio.spec -fastq Filtered_four.fastq longReads=1 genomeSize=4650000 |
|
pacBioToCA -length 500 -partitions 200 -l pacbio -t 6 -s ../pacbio.spec -fastq Filtered_four.fastq longReads=1 genomeSize=4650000 |
- |
python trimFastqByQVWindow.py --qvCut=54.5 --out=trimmed.pacbio.fastq ../pacbio.fastq
|
+ |
2. Trimming corrected long reads
|
|
|
+ |
python trimFastqByQVWindow.py --qvCut=54.5 --out=trimmed.pacbio.fastq pacbio.fastq
|
- |
java convertFastqToFastaAndQual ../trimmed.pacbio.fastq ../trimmed.pacbio.fasta ../trimmed.pacbio.qual
|
+ |
3. Convert the data format from fastq to frg
|
|
|
+ |
java convertFastqToFastaAndQual trimmed.pacbio.fastq trimmed.pacbio.fasta trimmed.pacbio.qual
|
|
convert-fasta-to-v2.pl -l Pacbio -s trimmed.pacbio.fasta -q trimmed.pacbio.qual > trimmed.pacbio.frg |
|
convert-fasta-to-v2.pl -l Pacbio -s trimmed.pacbio.fasta -q trimmed.pacbio.qual > trimmed.pacbio.frg |
|
|
+ |
4. Select 25X longest corrected long reads
|
|
gatekeeper -T -F -o asm.gkpStore trimmed.pacbio.frg |
|
gatekeeper -T -F -o asm.gkpStore trimmed.pacbio.frg |
|
gatekeeper -dumpfasta 25X_Clr -longestlength 0 116250000 asm.gkpStore |
|
gatekeeper -dumpfasta 25X_Clr -longestlength 0 116250000 asm.gkpStore |
|
runCA -p asm -d asm -s ../asm.spec 25X_Clr.frg |
|
runCA -p asm -d asm -s ../asm.spec 25X_Clr.frg |