Questions tagged [clustering]

Cluster analysis is the task of partitioning data into subsets of objects according to their mutual "similarity," without using preexisting knowledge such as class labels. [Clustered-standard-errors and/or cluster-samples should be tagged as such; do NOT use the "clustering" tag for them.]

1 vote
0 answers
15 views

Spatial Temporal Clustering evenly spaced over time

I have a large dataset of spatio-temporal data. It has longitude and latitude coordinates, and a date for each observation. For example:

Long  Lat    Date
50    20.43  9-19-2010
51    19.5   10-4-2010
51    19.3   ...
Robertmg
  • 121
0 votes
0 answers
11 views

Choosing number of classes in LCA

I'm an undergrad student, a little confused about goodness of fit tests. I'm trying to choose the appropriate number of classes for a latent class analysis using ...
dancing_monkeys
1 vote
1 answer
26 views

Performing a PCA on data of different hierarchical levels

I am a novice and plan to run a PCA on several different but related (i.e., non-orthogonal) questionnaire measures. These measures have composite scores (item sums etc.), and some of them have sub-facets. Also, ...
Livster
  • 11
0 votes
0 answers
15 views

Declustering impact, stationarity and discretization

I have a seasonal time series, and I am considering applying runs declustering to it before any other preprocessing steps. If I observe an extremal index of 1, can I claim that my data is i.i....
Thoms
  • 1
2 votes
0 answers
23 views

Quantifying a Sequence of Binary Outcomes

I'm compiling a dataset on animal color patterns. Blotches flank the body of the animal (Agkistrodon contortrix) and meet along its midline (i.e., the spine). At the midline, the blotches can join ...
Ross Couvillon
2 votes
2 answers
30 views

Does clustering actually reduce the number of rows in a dataset?

I am reading the book "Grokking Machine Learning" by Luis G. Serrano and came across the following sentence: "It seems that clustering and dimensionality reduction are nothing like each ...
Leox
  • 129
0 votes
0 answers
9 views

What are the right metrics to validate the performance of a custom clustering model with three possible outcomes?

I have developed a custom clustering model on top of MiniBatchKMeans that has three possible outcomes for each data point: assign the point to the correct cluster; assign the point to the wrong ...
Sanjay Mythili
0 votes
0 answers
16 views

How can I assign households to coordinates in a social space consistent with pairwise distance measures?

I have a somewhat ill-defined question about creating an interesting and useful ...
andrewH
  • 3,157
0 votes
0 answers
4 views

Fuzzy cluster analysis with binary and continuous data

I am using R to conduct a fuzzy cluster analysis. I was told that the common fuzzy c-means (which is similar to k-means in crisp clustering) is not suitable for mixed data types. I was ...
CTSleo
  • 1
0 votes
0 answers
17 views

How to compare multiple point patterns of the same realm resulting from different clustering methods?

I have a dataset of 500 sampling points to which I apply multiple different clustering methods for categorization. How can I compare the resulting spatial point patterns of cluster group labels? I ...
jlklein
0 votes
0 answers
21 views

Looking for a suitable way to find groups of events

I have an Excel file with three columns. The first is the name of an event, the second is the time at which the event starts, and the third is the time at which the event ends. Let's say ...
slow_learner
0 votes
0 answers
17 views

How to cluster time-of-day event data?

Suppose I have data about patient sneezes, e.g.:

Name     Time   Loudness
Alex     07:59  10
Bob      08:03  12
Charlie  17:06  9
Alex     08:09  13
...

You can see ...
sds
  • 2,246
5 votes
2 answers
290 views

Is domain knowledge external validation in clustering?

I have cluster results with good values on, e.g., silhouette width. The cluster sizes are 4998, 1, and 1, which isn't good, since I know my customers don't partition that way (the real partition is more balanced). I ...
ExchangedVisual111
4 votes
2 answers
69 views

Clustering by discrete, unrelated properties?

I have a large number of objects that have unrelated properties such as color=yellow material=stone ...
Ivan Voras
1 vote
1 answer
17 views

Quantifying (observed) spatial clustering?

I am looking for some advice regarding spatial statistics. I have a large dataset with multiple samples across 5 different conditions. Each sample is composed of different point types in 2D space. For ...
stellaria
