Line 1: |
|
|
+ |
=S. pneumoniae=
|
|
|
+ |
Streptococcus pneumoniae TIGR4. The S. pneumoniae TIGR4 consists of a circular chromosome of 2,160,842 bp in length.
|
|
|
|
|
|
|
+ |
==Website data==
|
|
|
+ |
The Illumina and pacbio data were downloaded from ALLPATHS-LG website : [ftp://ftp.broadinstitute.org/pub/papers/assembly/Ribeiro2012/data/strep_data.tar.gz strep_data.tar.gz] <br>
|
|
|
|
|
|
|
+ |
===Fragment library===
|
|
|
+ |
Reads length : 101bp <br>
|
|
|
+ |
Reads amount : 1067060 X2 <br>
|
|
|
+ |
Insert size : 180bp <br>
|
|
|
+ |
Coverage : 88.89X
|
|
|
+ |
===Jumping library===
|
|
|
+ |
Reads length : 93bp <br>
|
|
|
+ |
Reads amount : 1161883 X2 <br>
|
|
|
+ |
Insert size : 3000bp <br>
|
|
|
+ |
===PacBio reads===
|
|
|
+ |
Reads average length : 1159.12bp <br>
|
|
|
+ |
Reads amount : 403745 <br>
|
|
|
+ |
Coverage : 216.58X <br>
|
|
|
+ |
==Raw data==
|
|
|
+ |
The raw data of website data from Sequence Read Archive (SRA)
|
|
|
|
|
|
|
+ |
===Fragment library===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX110128 SRX110128] <br>
|
|
|
+ |
Reads length : 101bp <br>
|
|
|
+ |
Reads amount : 5706200 X2 <br>
|
|
|
+ |
Insert size : 180bp <br>
|
|
|
+ |
Coverage : 475.33X <br>
|
|
|
+ |
===Jumping library===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX105406 SRX105406] <br>
|
|
|
+ |
===PacBio reads===
|
|
|
+ |
Accession : [http://www.ncbi.nlm.nih.gov/sra/SRX109959 SRX109959],[http://www.ncbi.nlm.nih.gov/sra/SRX109958 SRX109958]
|
|
|
+ |
==Self-fraction data==
|
|
|
+ |
We randomly selected the same fraction as website data from fragment and jumping library of raw data by prepare.sh.
|
|
|
|
|
|
|
+ |
PrepareAllPathsInputs.pl\
|
|
|
+ |
DATA_DIR=$PWD/test.genome/data\
|
|
|
+ |
PLOIDY=1\
|
|
|
+ |
FRAG_FRAC=0.187\
|
|
|
+ |
JUMP_FRAC=0.558\
|
|
|
+ |
IN_GROUPS_CSV=in_groups.csv\
|
|
|
+ |
IN_LIBS_CSV=in_libs.csv\
|
|
|
+ |
OVERWRITE=True\
|
|
|
+ |
| tee prepare.out
|
|
|
+ |
==100X fragment reads==
|
|
|
+ |
We randomly selected 100X coverage data from fragment library of raw data by prepare.sh.
|
|
|
|
|
|
|
+ |
Fragment library fraction = 100/475.12 = 0.21 <br>
|
|
|
|
|
|
|
+ |
PrepareAllPathsInputs.pl\
|
|
|
+ |
DATA_DIR=$PWD/test.genome/data\
|
|
|
+ |
PLOIDY=1\
|
|
|
+ |
FRAG_FRAC=0.21\
|
|
|
+ |
JUMP_FRAC=0.558\
|
|
|
+ |
IN_GROUPS_CSV=in_groups.csv\
|
|
|
+ |
IN_LIBS_CSV=in_libs.csv\
|
|
|
+ |
OVERWRITE=True\
|
|
|
+ |
| tee prepare.out
|
|
|
|
|
|
|
+ |
We also use another setting with all jumping library reads.
|
|
|
|
|
|
|
+ |
PrepareAllPathsInputs.pl\
|
|
|
+ |
DATA_DIR=$PWD/test.genome/data\
|
|
|
+ |
PLOIDY=1\
|
|
|
+ |
FRAG_FRAC=0.21\
|
|
|
+ |
IN_GROUPS_CSV=in_groups.csv\
|
|
|
+ |
IN_LIBS_CSV=in_libs.csv\
|
|
|
+ |
OVERWRITE=True\
|
|
|
+ |
| tee prepare.out
|