Figure 2

From: Linear normalised hash function for clustering gene sequences and identifying reference sequences from multiple sequence alignments

The optimal number of clusters for the different hash ranges and different number of indices per cluster for (a) Nocardia 16S rRNA 364 sequences of 80 known species; (b) Nocardia 16S rRNA 97 sequences of 4 known species; (c) EV71 109 sequences of 11 known genogroups/subgenogroups; and (d) EV71 500 VP1 sequences of unknown genogroups/subgenogroups.