
All changes on the primary node’s data sets are recorded in a special capped collection called the operation log (oplog).
#The clusters code
Let us see the python code with the help of an example.A MongoDB cluster needs to have a primary node and a set of secondary nodes in order to be considered a replica set.Īt most, one primary node must exist, and its role is to receive all write operations.

Distance Measures: Using different distance measures (used to calculate distance between a data point and cluster centre) might yield different clusters.ĥ. Outliers pull the cluster towards itself, thus affecting optimal cluster formation.Ĥ. Outliers: Cluster formation is very sensitive to the presence of outliers. This means that the outcome of clustering can be different each time the algorithm is run even on the same data set.ģ. The K-means algorithm is non-deterministic. Initial Values/ Seeds: Choice of the initial cluster centres can have an impact on the final cluster formation. Number of clusters (K): The number of clusters you want to group your data points into, has to be predefined.Ģ. So, we must keep in mind the following factors when solving business problems using the K-means clustering algorithm.ġ. Important Factors We Must consider While Using K-means AlgorithmĬertain factors can impact the efficacy of the final clusters formed when using k-means clustering. Medical:Ĭluster Analysis has also been widely used in the field of biology and medical science like sequencing into gene families, human genetic clustering, building groups of genes, and clustering of organisms at species and so on.

In the areas of social networking and social media, Cluster Analysis is used to identify similar communities within larger groups. This is where clustering techniques can help. You want to just look at patterns in customer data and then try and find segments. You do not have any label in mind, such as good customer or bad customer. Cluster Analysis can help in market segmentation and positioning, and to identify test markets for new product development.Īs a manager of the online store, you would want to group the customers into different clusters, so that you can make a customised marketing campaign for each of the group. Marketing:Ĭluster Analysis can be helpful in the field of marketing. This will help the retail chain with assortment planning, planning promotional activities and store benchmarking for better performance and higher returns. Cluster analysis can help the retail chain to get desired insights on customer demographics, purchase behaviour and demand patterns across locations. Let a retail chain with so many stores across locations wants to manage stores at best and increase the sales and performance. Practical Application of Clustering Customer Insight:
