ONT swab sequencing statistics

Swab sampling
Sequencing
Author

Simon Grimm

Published

June 20, 2025

When creating cost estimates for surveillance programs, we frequently need to provide parameters for metrics such as the number of base pairs produced in one sequencing run or read length distribution. To make these parameters more legible, here is a set of statistics based on NAO’s Oxford Nanopore-sequencing of pooled swab samples.

Total and viral read output

Based on 7 sequencing runs, performed between January 2025 and June 2025, the average read output is 2.47 gigabases and 3.1 million reads. Many reads are short, with a median read length across all reads of 166 bp.

Total Base Pairs Total Reads Mean Read Length Median Read Length
Sequencing run
NAO-ONT-20250120 246.44 MB 1.0M 252 bp 118 bp
NAO-ONT-20250127 512.08 MB 1.1M 473 bp 147 bp
NAO-ONT-20250213 264.89 MB 1.2M 217 bp 123 bp
NAO-ONT-20250220 585.21 MB 1.9M 308 bp 134 bp
NAO-ONT-20250313 3.00 GB 1.1M 2821 bp 2859 bp
NAO-ONT-20250327 9.90 GB 2.8M 3497 bp 3571 bp
NAO-ONT-20250606 2.75 GB 12.8M 215 bp 131 bp
Average 2.47 GB 3.1M 790 bp 166 bp

Actual read length distributions are fairly heterogeneous across runs, with many runs showing a bimodal distribution with peaks at 10 bp and 100 bp, and NAO-ONT-20250313 and NAO-ONT-20250327 showing a trimodal distribution, with peaks at 10 bp, 1000 bp, and 2000 bp.

More relevant for virus detection are read length distributions of viral reads. In most runs, viral reads predominantly exceed 1,000 bp in length.

Output across time

Finally, for an up-and-running biosurveillance system we would want to have fast turnaround sequencing. Hence, we’re interested to know how much sequencing output is generated in the first 12 hours of a sequencing run. We have this data available as figures generated by ONT’s MinKNOW software (Google Doc)

Roughly, we see that for most runs the rate at which base pairs are generated in the first 12 hours is 1.6 to 3.3 times higher than after 12 hours.

Run duration (h) Output at 12h (GB) Total output (GB) Share of Time Share of Data Rate 0-12h (GB/h) Rate >12h (GB/h) Early/Late ratio
Sequencing run
NAO-ONT-20250120 25.0 1.0 1.6 48.0% 62.5% 0.08 0.05 1.6
NAO-ONT-20250127 28.0 1.0 1.7 42.9% 58.8% 0.08 0.04 2.0
NAO-ONT-20250213 40.0 1.3 3.1 30.0% 41.9% 0.11 0.06 1.8
NAO-ONT-20250220 40.0 1.2 2.7 30.0% 44.4% 0.10 0.05 2.0
NAO-ONT-20250313 4.5 4.9 4.9 100.0% 100.0% 1.09
NAO-ONT-20250327 11.5 8.0 8.0 100.0% 100.0% 0.70
NAO-ONT-20250606 40.0 4.0 6.8 30.0% 58.8% 0.33 0.1 3.3