ALLPATHS-LG Data

Revision as of 10 October 2013 01:32 by admin (Comments | Contribs)

We downloaded three species sequencing data, E. coli, R. sphaeroides and S. pneumoniae from ALLPATHS-LG website, and their raw data from Sequence Read Archive (SRA)

E. coli

Escherichia coli K12 MG1655. The E. coli MG1655 consists of a circular chromosome of 4,639,675 bp in length.


mkdir -p test.genome/data
PrepareAllPathsInputs.pl\
DATA_DIR=$PWD/test.genome/data\
PLOIDY=1\
FRAG_FRAC=0.088\
IN_GROUPS_CSV=in_groups.csv\
IN_LIBS_CSV=in_libs.csv\
OVERWRITE=True\
| tee prepare.out 


R. sphaeroides

Rhodobacter sphaeroides strain 2.4.1. The R. sphaeroides 2.4.1 consists of two circular chromosomes of 3,188,609 bp and 943,016 bp, and five plasmids of 114,045 bp, 114,178 bp, 105,284 bp, 100,828 bp and 37,100 bp in length, respectively.

S. pneumoniae

Streptococcus pneumoniae TIGR4. The S. pneumoniae TIGR4 consists of a circular chromosome of 2,160,842 bp in length.