Hierarchical Clustering

protocols

Cluster genes and/or samples based on how close they are to one another. The result is a tree structure, referred to as dendrogram.

Before you begin

Gene expression data must be in a GCT or RES file.
Example file: all_aml_test.gct.

learn more:
file formats

Step 1: PreprocessDataset

Preprocess gene expression data to remove platform noise and genes that have little variation. Although researchers generally preprocess data before clustering if doing so removes relevant biological information, skip this step.

Considerations
learn more:
PreprocessDataset

Step 2: HierarchicalClustering

Run hierarchical clustering on genes and/or samples to create dendrograms for the clustered genes (*.gtr) and/or clustered samples (*.atr), as well as a file (*.cdt) that contains the original gene expression data ordered to reflect the clustering.

Considerations
learn more:
HierarchicalClustering

Step 3: HierarchicalClusteringViewer

Display a heat map of the clustered gene expression data, with dendrograms showing how the genes and/or samples were clustered.

Considerations
learn more:
HierarchicalClusteringViewer