Line 1: |
|
|
+ |
The Illumina sequencing data were available at [ftp://ftp.broadinstitute.org/pub/papers/assembly/Ribeiro2012/ ALLPATHS-LG website], Please refer to [http://www.ncbi.nlm.nih.gov/pubmed/22829535 ''Finished bacterial genomes from shotgun sequence data.'' Genome Research 2012] for detail.
|
|
|
|
|
|
|
+ |
=E. coli=
|
|
|
|
|
|
|
+ |
==Website data==
|
|
|
+ |
The Illumina and pacbio data were downloaded from ALLPATHS-LG website : [ftp://ftp.broadinstitute.org/pub/papers/assembly/Ribeiro2012/data/ecoli_data_alt.tar.gz ecoli_data_alt.tar.gz] <br>
|
|
|
|
|
|
|
+ |
===Fragment library===
|
|
|
+ |
Reads length : 101bp <br>
|
|
|
+ |
Reads amount : 1186190 X2 <br>
|
|
|
+ |
Insert size : 180bp <br>
|
|
|
+ |
Coverage : 46.02X
|
|
|
+ |
===Jumping library 1===
|
|
|
+ |
Reads length : 93bp <br>
|
|
|
+ |
Reads amount : 1615702 X2 <br>
|
|
|
+ |
Insert size : 3000bp <br>
|
|
|
+ |
===Jumping library 2===
|
|
|
+ |
Reads length : 93bp <br>
|
|
|
+ |
Reads amount : 362199 X2 <br>
|
|
|
+ |
Insert size : 3000bp <br>
|
|
|
+ |
===PacBio reads===
|
|
|
+ |
Reads average length : 1514.24bp <br>
|
|
|
+ |
Reads amount : 409304 <br>
|
|
|
+ |
Coverage : 133.58X <br>
|
|
|
+ |
==Raw data==
|
|
|
+ |
The raw data of website data from Sequence Read Archive (SRA)
|
|
|
|
|
|
|
+ |
===Fragment library===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX131033 SRX131033] <br>
|
|
|
+ |
Reads length : 101bp <br>
|
|
|
+ |
Reads amount : 13457571 X2 <br>
|
|
|
+ |
Insert size : 180bp <br>
|
|
|
+ |
Coverage : 522.1X <br>
|
|
|
+ |
===Jumping library 1===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX117481 SRX117481] <br>
|
|
|
|
|
|
|
+ |
===Jumping library 2===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX145082 SRR492488] <br>
|
|
|
+ |
|
|
|
+ |
===PacBio reads===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX109917 SRX109917], [http://www.ncbi.nlm.nih.gov/sra/SRX109901 SRX109901](SRR386913, SRR387092, SRR386907, SRR387035), [http://www.ncbi.nlm.nih.gov/sra/SRX109936 SRX109936]
|
|
|
+ |
==Self-fraction data==
|
|
|
+ |
We randomly selected the same fraction as website data from fragment library of raw data by prepare.sh.
|
|
|
|
|
|
|
+ |
PrepareAllPathsInputs.pl\
|
|
|
+ |
DATA_DIR=$PWD/test.genome/data\
|
|
|
+ |
PLOIDY=1\
|
|
|
+ |
FRAG_FRAC=0.088\
|
|
|
+ |
IN_GROUPS_CSV=in_groups.csv\
|
|
|
+ |
IN_LIBS_CSV=in_libs.csv\
|
|
|
+ |
OVERWRITE=True\
|
|
|
+ |
| tee prepare.out
|
|
|
+ |
==100X fragment reads==
|
|
|
+ |
We randomly selected 100X coverage data from fragment library of raw data by prepare.sh.
|
|
|
|
|
|
|
+ |
Fraction = 100 / 522.1 = 0.192 <br>
|
|
|
|
|
|
|
+ |
PrepareAllPathsInputs.pl\
|
|
|
+ |
DATA_DIR=$PWD/test.genome/data\
|
|
|
+ |
PLOIDY=1\
|
|
|
+ |
FRAG_FRAC=0.192\
|
|
|
+ |
IN_GROUPS_CSV=in_groups.csv\
|
|
|
+ |
IN_LIBS_CSV=in_libs.csv\
|
|
|
+ |
OVERWRITE=True\
|
|
|
+ |
| tee prepare.out
|