Questions tagged [clustering]
Cluster analysis is the task of partitioning data into subsets of objects according to their mutual "similarity," without using preexisting knowledge such as class labels. [Clustered-standard-errors and/or cluster-samples should be tagged as such; do NOT use the "clustering" tag for them.]
4,038
questions
1
vote
0
answers
15
views
Spatial Temporal Clustering evenly spaced over time
I have a large dataset of spatio-temporal data. It has longitude and latitude coordinates, and a date for each observation. For example:
Long
Lat
Date
50
20.43
9-19-2010
51
19.5
10-4-2010
51
19.3
...
0
votes
0
answers
11
views
Choosing number of classes in LCA
I'm an undergrad student, a little confused about goodness of fit tests. I'm trying to choose the appropriate number of classes for a latent class analysis using ...
1
vote
1
answer
26
views
Performing a PCA on data of different hierarchical levels
I (novice) plan to use a PCA on several different, related, i.e. non-orthogonal questionnaire measures. These measures have composite scores (item sums etc.) and some of them have sub-facets. Also, ...
0
votes
0
answers
15
views
Declustering impact, stationarity and discretization
I have a seasonal time series, and I am considering declustering (before any other preprocessing steps) it using runs declustering. If I observe an extremal index of 1, can I claim that my data is i.i....
2
votes
0
answers
23
views
Quantifying a Sequence of Binary Outcomes
I'm compiling a dataset on animal color patterns. Blotches flank the body of the animal (Agkistrodon contortrix) and meet along its midline (i.e., spine). At midline, the blotches can join ...
2
votes
2
answers
30
views
Does clustering actually reduce the number of rows in a dataset?
I am reading the book "grokking Machine Learning" by Luis G. Serrano and came across the following sentence:
"It seems that clustering and dimensionality reduction are nothing like each ...
0
votes
0
answers
9
views
What are the right metrics to validate the performance of a custom clustering model with three possible outcomes?
I have developed a custom clustering model on top of MiniBatchKmeans, that has three possible outcomes for each data point:
Assign the point to the correct cluster.
Assign the point to the wrong ...
0
votes
0
answers
16
views
How can I assign households to coordinates in a social space consistent with pairwise distance measures?
How can I assign households to coordinates in a social space consistent with pairwise distance measures?
I have a question which is somewhat ill-defined, about creating an interesting and useful ...
0
votes
0
answers
4
views
Fuzzy cluster analysis with binary and continue data
I am using R to conduct a fuzzy cluster analysis. I was told that the common fuzzy c means (which is similar to the k mean in the crisp cluster) is not suitable for mixed types of data.
I was ...
0
votes
0
answers
17
views
How to compare multiple point patterns of the same realm resulting from different clustering methods?
I have a dataset of 500 sampling points to which I apply multiple different clustering methods for categorization. How can I compare the resulting spatial point patterns of cluster group labels? I ...
0
votes
0
answers
21
views
Looking for a suitable way to find groups of events
I have an excel file in which I have three columns. The first one is the name of an event, the second one is the moment the event starts and the third one is the time at which an event ends. Let's say ...
0
votes
0
answers
17
views
How to cluster time-of-day event data?
Suppose I have data about patient sneezes, e.g.,:
Name Time Loudness
Alex 07:59 10
Bob 08:03 12
Charlie 17:06 9
Alex 08:09 13
...
You can see ...
5
votes
2
answers
290
views
Is domain knowledge external validation in clustering?
I have cluster results with good values on etc Silhuette Width. The cluster sizes are: 4998, 1, 1 which isn't good knowing my customers doesn't have that particular partition (it's more balanced). I ...
4
votes
2
answers
69
views
Clustering by discrete, unrelated properties?
I have a large number of objects that have unrelated properties such as
color=yellow
material=stone
...
1
vote
1
answer
17
views
Quantifying (observed) spatial clustering?
I am looking for some advice regarding spatial statistics.
I have a large dataset with multiple samples across 5 different conditions. Each sample is composed of different point types in 2D space. For ...