2013年2月17日星期日

SHIPS - a non-parametric clustering algorithm


http://stat.genopole.cnrs.fr/logiciels/SHIPS

Overview

SHIPS (Spectral Hierarchical clustering for the Inference of Population Structure) is a non-parametric clustering algorithmthat clusters individuals from a population into genetically homogeneous sub-populations from genotype data. After computing a pairwise distance matrix, the algorithm progressively divides the original population in two sub-populations by the use of aspectral clustering algorithm. The process is then iterated in each of the two sub-populations and so on. This leads to the construction of a binary tree, where each node represents a group of individuals. To determine the final clusters a tree pruning procedure and an estimation of the optimal number of clusters, that is a gap statistic, are applied. In such an approach both the final clustering of the individuals and the number of clusters are estimated by the method.
The algorithm SHIPS is implemented with the software R that can be downloaded from the (CRAN web page) and is divided in several functions :
  • ships.cluster constructing the tree and providing several clustering possibilities
  • ships.gap that estimates the final number of clusters
  • ships.plotCluster that provides a graphical representation of the clustering
  • ships.plotGap that plots the criterion used to estimate the final number of clusters

SHIPS ressources

没有评论:

发表评论