Performing a de novo assembly locally

This tutorial illustrates how to import FASTQ files into a BioNumerics database, how to obtain a quick overview of the basic statistics on read quality and read length, and finally how to perform a de novo assembly locally so without using the external calculation engine.

Sequence read sets

A sequence read set is designed to hold large sets of short reads generated by next generation sequencing (NGS). Base sequences and their associated quality scores are stored for single-end and paired-end reads, originating from various high-throughput sequencing platforms such as Illumina, Ion Torrent, PacBio, Oxford Nanopore, etc.

Download PDF file: 
Download sample data: 
Sequence read set data
This data set contains two gzipped fastq files of one paired end read data file pair coming from Staphylococcus aureus. This data was generated by Illumina MiSeq whole genome sequencing and downloaded from NCBI.