Skip to main content

Table 2 Description of benchmark datasets

From: MZPAQ: a FASTQ data compression tool

Identifier

Size (MB)

Type

Technique

Organism

Description

SRR 554369

456

FASTQ paired short reads

Illumina GAIIx; 50x total depth

P.aeruginosa

Small genome (6-7 MB), medium depth

SRR 327342

3,881

FASTQ paired short reads

Illumina GAII; 175x total depth;

S.cerevisiae

Small genome (12 MB), high depth.

MH0001. 081026

1,880

FASTQ paired short reads

Illumina GA; unknown depth

Human gut metagenome

Mixed species and unknown references

SRR 1284073

1,309

FASTQ single variable-length long reads

PacBio; 140x depth

Bacteria E.Coli

Small genome (4.7 MB), higher error rate.

SRR 870667

22,987

FASTQ paired short reads

Illumina GAIIx; 35x total depth

Plant T.cacao.

Medium sized genome (345 MB)

ERR 174310

53,869

FASTQ paired short reads

Illumina HiSeq 2000; 13x total depth

H.sapiens (NA12877) individual

Common instrument depth