Performing a resequencing assembly

This tutorial illustrates how to import FASTQ files into a BioNumerics database, how to obtain a quick overview of the basic statistics on read quality and read length, and finally how to perform a resequencing assembly against a reference sequence.

Sequence read sets

A sequence read set is designed to hold large sets of short reads generated by next-generation sequencing (NGS). Base sequences and their associated quality scores are stored for single-end and paired-end reads originating from various high-throughput sequencing platforms, such as Illumina, Ion Torrent, PacBio, and Oxford Nanopore.
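To make the idea of "base sequences and their associated quality scores" concrete, here is a minimal Python sketch that parses FASTQ records and computes the kind of basic statistics mentioned above (read count, mean read length, mean quality). It is not how BioNumerics works internally. The function names are made up for illustration, and the sketch assumes four-line FASTQ records with Phred+33 quality encoding.

```python
from statistics import mean


def parse_fastq(lines):
    """Yield (read_id, sequence, quality) from an iterable of FASTQ lines.

    Assumes the common four-line record layout: @id, bases, +, qualities.
    """
    it = iter(lines)
    for header in it:
        header = header.strip()
        if not header:
            continue  # tolerate blank lines between records
        seq = next(it, None)
        plus = next(it, None)
        qual = next(it, None)
        if seq is None or plus is None or qual is None:
            raise ValueError("truncated FASTQ record: " + header)
        seq, qual = seq.strip(), qual.strip()
        if not header.startswith("@") or len(seq) != len(qual):
            raise ValueError("malformed FASTQ record: " + header)
        yield header[1:], seq, qual


def phred_scores(quality, offset=33):
    """Decode a quality string into integer Phred scores (Phred+33 assumed)."""
    return [ord(char) - offset for char in quality]


def summarize(records):
    """Return read count, mean read length and mean per-read quality."""
    lengths, qualities = [], []
    for _read_id, seq, qual in records:
        lengths.append(len(seq))
        qualities.append(mean(phred_scores(qual)) if qual else 0)
    if not lengths:
        return {"reads": 0, "mean_length": 0, "mean_quality": 0}
    return {
        "reads": len(lengths),
        "mean_length": mean(lengths),
        "mean_quality": mean(qualities),
    }


# Two toy reads: 'I' encodes Phred 40 and '!' encodes Phred 0.
example = [
    "@read1\n", "ACGT\n", "+\n", "IIII\n",
    "@read2\n", "ACGTAC\n", "+\n", "!!!!!!\n",
]
summary = summarize(parse_fastq(example))
```

For the two toy reads, `summary` reports 2 reads with a mean length of 5 bases and a mean quality of 20.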

Download PDF file:

Reseq_assembly.pdf

Download sample data:

Sequence read set data

This data set contains two gzipped FASTQ files that together form one paired-end read set from Staphylococcus aureus. The data were generated by Illumina MiSeq whole-genome sequencing and downloaded from NCBI.
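Before importing a paired-end data set, it can be useful to confirm that the two gzipped files really belong together. The following is a rough Python sketch of such a check, independent of BioNumerics. The file names in the usage line are hypothetical. It assumes that mates appear in the same order in both files and that their identifiers match once an optional "/1" or "/2" suffix is removed.

```python
import gzip
from itertools import zip_longest


def read_ids(path):
    """Yield the read identifier of every four-line record in a gzipped FASTQ file."""
    with gzip.open(path, "rt") as handle:
        for index, line in enumerate(handle):
            if index % 4 == 0 and line.strip():
                fields = line[1:].split()  # drop the leading '@'
                yield fields[0] if fields else ""


def strip_mate_suffix(read_id):
    """Remove a trailing '/1' or '/2' mate marker, if present."""
    if read_id.endswith("/1") or read_id.endswith("/2"):
        return read_id[:-2]
    return read_id


def count_pairs(path_r1, path_r2):
    """Count read pairs, raising ValueError if the two files are out of sync."""
    pairs = 0
    for id1, id2 in zip_longest(read_ids(path_r1), read_ids(path_r2)):
        if id1 is None or id2 is None:
            raise ValueError("files contain different numbers of reads")
        if strip_mate_suffix(id1) != strip_mate_suffix(id2):
            raise ValueError("mismatched pair: %s vs %s" % (id1, id2))
        pairs += 1
    return pairs
```

A call such as `count_pairs("sample_R1.fastq.gz", "sample_R2.fastq.gz")` (hypothetical file names) returns the number of read pairs, or raises an error when the files do not line up.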

See movie:

Performing a resequencing assembly

Sequence read sets

Calculate a genome resequencing assembly

Quality assessment of next generation sequencing runs
