Table 2 Description of benchmark datasets

From: MZPAQ: a FASTQ data compression tool

| Identifier | Size (MB) | Type | Technique | Organism | Description |
|---|---|---|---|---|---|
| SRR554369 | 456 | FASTQ paired short reads | Illumina GAIIx; 50x total depth | P. aeruginosa | Small genome (6-7 MB), medium depth |
| SRR327342 | 3,881 | FASTQ paired short reads | Illumina GAII; 175x total depth | S. cerevisiae | Small genome (12 MB), high depth |
| MH0001.081026 | 1,880 | FASTQ paired short reads | Illumina GA; unknown depth | Human gut metagenome | Mixed species and unknown references |
| SRR1284073 | 1,309 | FASTQ single variable-length long reads | PacBio; 140x depth | Bacteria E. coli | Small genome (4.7 MB), higher error rate |
| SRR870667 | 22,987 | FASTQ paired short reads | Illumina GAIIx; 35x total depth | Plant T. cacao | Medium-sized genome (345 MB) |
| ERR174310 | 53,869 | FASTQ paired short reads | Illumina HiSeq 2000; 13x total depth | H. sapiens (NA12877) individual | Common instrument depth |