MENU

Fun & Interesting

Cluster analysis in R | Finding out Intra and Inter cluster distances and optimum number of clusters

The Outlier 33,438 lượt xem 3 years ago
Video Not Working? Fix It Now

~Hello everyone in this tutorial, I am going to cover almost all concepts involved in cluster analysis, starting from how to find an optimum number of clusters using wss plot function, silhouette method using factoextra package and also by using NbClust package, for k-means and hierarchical clustering algorithms, in hierarchical clustering I will explain about dendrograms, how to plot, export and hang them. In this tutorial, I will also show how to find the inter and intracluster distances using an R package called clv. Finally how to find the cluster means or average
0:00 Introduction
2:10 Preliminary steps
6:40 k means
7:20 Opt no of clusters
12:50 Hierarchical clustering
19:00 Cluster distances
21:27 Cluster means
Other youtube videos
Coding
1) https://www.youtube.com/watch?v=5eDqRysaico&t=749s
2) https://www.youtube.com/watch?v=PX5nSBGB5Tw&t=385s
Theory
1) https://www.youtube.com/watch?v=Se28XHI2_xE
2) https://www.youtube.com/watch?v=QXOkPvFM6NU
3) https://www.youtube.com/watch?v=4b5d3muPQmA
4) https://www.youtube.com/watch?v=7xHsRkOdVwo
Descriptive
~NbClust
1) https://www.rdocumentation.org/packages/NbClust/versions/3.0/topics/NbClust
~Finding the optimum number of clusters
1) https://uc-r.github.io/kmeans_clustering
2) https://stackoverflow.com/questions/15376075/cluster-analysis-in-r-determine-the-optimal-number-of-clusters
~Distances
1) https://www.geeksforgeeks.org/ml-intercluster-and-intracluster-distance/

Script
https://docs.google.com/document/d/19owjlv9cpVDrJEWsYUgTVdDk9_2knkBVfLYnZefGyT0/edit?usp=sharing

Comment