top of page

K-means Clustering | Conclusions

  • K-means clusters data based on the distance of the data point to a centroid.

  • K-means requires knowing how many clusters the result will have.

  • The k centroids are randomly initialized, the points are grouped to the nearest centroid, the points are averaged and the centroid is moved to this new location. These steps are repeated until a stopping condition has been reached.

  • The features need to be normalized before using them as an input, as it affects the performance of K-Means.

  • K-Means does not perform well with non-spherical shapes, unless the data are appropriately transformed beforehand.

bottom of page