Clustering a binary data set

This tutorial illustrates how to calculate a dendrogram based on a binary data set.

Characters

A character is basically a name-value pair of which the value can be binary, multi-state or continuous. Because of this very broad definition, a wide variety of data can be analyzed as character types (= an array of characters). This includes morphological and biochemical features, commercial test panels (API®, Biolog®, Vitek®, etc.), antibiotics resistance profiles, fatty acid profiles, microarrays, SNP arrays, repeat numbers in MLVA, allelic profiles in MLST, etc.

Download sample data: 
Binary character data

MS Excel file, containing presence and absence information for four genes.