Newest 'clustering' Questions - Mathematics Stack Exchange

0 votes

1 answer

19 views

Clustering for a real problem - location matters!

I am working on a clustering problem and need some help to develop an appropriate mathematical model. Here are the details of my problem: Locations: I have a set of 141 locations, each defined by ...

juasmilla

101

asked 8 hours ago

0 votes

0 answers

16 views

Clustering a sequence of Bernoulli random variables

Let $Z_1$, ..., $Z_n$ be a sequence of independent Bernoulli random variables such that for all $i\in\left\{1,..,n\right\}$ $Z_i\sim\mathcal{B}(p_i)$ where $p_i < 1/2$. Define $\ell(x_{1:n}, y_{1:n}...

Ibra

175

asked Jun 29 at 14:13

0 votes

0 answers

5 views

Splitting upon insertion in hierarchical clustering

It's my understanding that, upon the insertion of a new element, complete-link hierarchical clustering can lead the splitting of a cluster so as to maintain its "spherical compactness". Do ...

Tfovid

153

asked Jun 6 at 19:45

0 votes

0 answers

9 views

Quantifying the distance between two discrete fuzzy sets

I am looking to use fuzzy sets to represent several collections of data points. Then, given a crisp set, I'd like to determine which collection the crisp set is most similar to. Each collection is ...

Alex

1

asked May 15 at 22:24

0 votes

0 answers

30 views

Spectral Clustering: Finding the normalized minimum cut using the laplacian

I am trying to prove that finding the min $Ncut(A,B)$ for a edge weight graph $W$ with the diagonal matrix of edge degrees $D$ is equivalent to solving for $f \in \{a,b\}^n$ with the constraint that $...

bluesquare

1

asked Apr 2 at 15:54

1 vote

0 answers

18 views

Maximum number of local minima in k-means

Suppose $\mathcal{Z} = \{z_1, \dots, z_n\}$ is the set of points in $d$-dimensional Euclidean space. The aim is to partition the dataset into $(K\leq n)$ distinct clusters $R_1,\dots, R_K$ where $R_i\...

entropy

147

asked Mar 2 at 12:11

0 votes

0 answers

13 views

Metrics for document clustering with measure of synonyms

I asked this question on Data Science stack exchange, but didn't get any responses there. I have a (finite) vocabulary which is a metric space, where the metric measures how antonymous the words are. ...

user1266745

asked Feb 5 at 20:55

3 votes

1 answer

222 views

Why do randomly drawn numbers tend to repeat themselves?

I track the behavior of random numbers and I have discovered that once a number appears, it tends to reappear again shortly thereafter. For example, I've been tracking the Red Powerball in the ...

steveK

137

asked Jan 31 at 4:53

1 vote

0 answers

29 views

References for a statistics question relating to clustering

I am interested in references for the following research topic. It was mentioned to me that this may be a classically studied question, but I'm unsure what line of work of references to begin looking ...

spectrum

11

asked Jan 26 at 21:17

0 votes

0 answers

23 views

notation for clusters of 2D data points

Is there any convention about the notation to use for clusters of $2-$D data points? I have a set of clusters of $2-$D data point. I can denote each cluster with $c_i$, where $i = 1, 2, ..., n$, and $...

Ommo

349

asked Dec 14, 2023 at 14:42

1 vote

1 answer

39 views

Derivation of a function - GBM

why does the sum disapear in this derivation: derivation of loss Mean Squared Error. It comes from the following wikipedia page: https://en.wikipedia.org/wiki/Gradient_boosting. It is the last ...

F.I.

15

asked Nov 28, 2023 at 17:43

0 votes

0 answers

14 views

Eigenvectors corresponding to eigenvalue 1 in the Normalized Laplacian - Why does it represent clusters?

Consider the Normalized Laplacian associated to a similarty graph $$ L = D^{-1/2}SD^{-1/2} $$ I have two sources stating that, in the "ideal case of zero noise", the eigenvectors ...

ygh

121

asked Oct 31, 2023 at 16:26

0 votes

0 answers

16 views

minimizing Earth Mover Distance

So I have a discretized magnitude spectrum $S \in \mathbb{R}^n$ ($n$ number of bins), and a set of frequencies $f_1, f_2, ..., f_m$ (not necessarily corresponding to any of the discretized bin ...

SmoothKen

429

asked Oct 13, 2023 at 7:46

1 vote

0 answers

379 views

What is the correct formula for Within Cluster Sum of Squares

I am studying clustering with K-Means algorithm and I got stumbled in the "inertia", or "within cluster sum of squares" part. First I would appreciate if anyone could explain me ...

Artur Juan Dantas

11

asked Oct 12, 2023 at 16:20

1 vote

0 answers

96 views

Modeling a similarity measure between numbers based on predictive probability

Suppose I'm trying to predict a number $v_p \in \mathbb{R}$ and, thanks to sampling, I know that the prediction $v_p=a$ is true in $P(v_p)=P(a)$ percent of cases. In other words, $P(a)$ percent of the ...

Ben W

31

asked Aug 4, 2023 at 14:19

Stack Exchange Network

Questions tagged [clustering]

Clustering for a real problem - location matters!

Clustering a sequence of Bernoulli random variables

Splitting upon insertion in hierarchical clustering

Quantifying the distance between two discrete fuzzy sets

Spectral Clustering: Finding the normalized minimum cut using the laplacian

Maximum number of local minima in k-means

Metrics for document clustering with measure of synonyms

Why do randomly drawn numbers tend to repeat themselves?

References for a statistics question relating to clustering

notation for clusters of 2D data points

Derivation of a function - GBM

Eigenvectors corresponding to eigenvalue 1 in the Normalized Laplacian - Why does it represent clusters?

minimizing Earth Mover Distance

What is the correct formula for Within Cluster Sum of Squares

Modeling a similarity measure between numbers based on predictive probability

Hot Network Questions

Questions tagged [clustering]

Related Tags