Hvolc

Haloferax volcanni DS2

Three assemblies are available from How to score genome assemblies using the Mauve system.


Sequence assembly

Assembly Description Download §
volc454 It was sequenced using 454 pyrosequencing by Roach Inc on a GS FLX Titanium instrument. 25x coverage of reads were obtained. Reads were assembled to contigs with Newbler by Roache. assembly1.fasta
volcV It was sequenced to 25x coverage using Illumina 100 nt read pairs with 500 nt inserts, and 15x coverage of 50 nt Illumina mate-pairs with 6.5 kbp insert. Both data type were generated by BGI. The assembly was constructed with velvet using the above ginve insert size estimates and default parameters. No read error ecoorection or quality trimming steps were performed. assembly2.fasta
volcIDBA It was sequenced with 80x coverage 76 nt read pairs with 300 nt inserts on an Illumina GAIIx instrument at UC Davis Genome Center, and 2x coverage of 50 nt mate-pairs with 6.5 kbp insert sequences at BGI. The reads were error corrected with REPTILE using default parameters, contigs assembled with IDBA using the custome parameters --mink 33 --maxk 78 and evertything else default, and scaffolded with SSPACE using the custom parameter -a 0.5 and everything else default. assembly3.fasta

§The files only contain the contigs which were split at the >10 Ns.

Merged File: Mauve_Contigs.fa

  • Scored with Mauve metrics:
Name NumContigs NumAssemblyBases NumMisCalled NumUnCalled NumGapsRef NumGapsAssembly TotalBasesMissed PercBasesMissed ExtraBases PercExtraBases BrokenCDS IntactCDS ContigN50 ContigN90 MaxContigLength
volc454 157 3920004 90 0 141 128 119928 2.9886 1818 0.0464 30 3908 123582 11735 217295
volcV 1394 4394403 925 30431 1848 1739 161388 4.0217 503214 11.4512 505 3433 843300 57 1354539
volcIDBA 367 3880100 1209 5884 999 991 155857 3.8839 22465 0.579 442 3496 19349 5537 99636


Contig integrator

Integrator Download
CISA CISA
minimus2 minimus2


Evaluation

  • Benchmark genome
Haloferax_volcanii_DS2.gbk
  • Evaluate by Mauve Assembly Metrics
How to score genome assemblies using the Mauve system