
All Questions

1 vote
0 answers
64 views

Why does latent Dirichlet allocation (LDA) fail when dealing with large and heavy-tailed vocabularies?

I'm reading the 2019 paper Topic Modeling in Embedding Spaces, which claims that the embedded topic model improves on these limitations of LDA. But why does LDA have these limitations? Why does it fail ...
seanmachinelearning
1 vote
1 answer
49 views

In Latent Dirichlet allocation, is the following formula the probability of observing a single document, or an entire corpus?

This is the formula in question [formula image not reproduced]. Source: https://en.wikipedia.org/wiki/Latent_Dirichlet_allocation
Bob Odenkirk
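
For reference, a likely candidate is the joint distribution of the smoothed model displayed in that Wikipedia article (an assumption, since the excerpt does not reproduce the formula). The outer product over all $M$ documents is what makes it a corpus-level rather than single-document probability:

$$P(\boldsymbol{W}, \boldsymbol{Z}, \boldsymbol{\theta}, \boldsymbol{\varphi}; \alpha, \beta) = \prod_{i=1}^{M} P(\theta_i; \alpha) \prod_{k=1}^{K} P(\varphi_k; \beta) \prod_{j=1}^{N_i} P(Z_{i,j} \mid \theta_i)\, P(W_{i,j} \mid \varphi_{Z_{i,j}})$$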
0 votes
1 answer
158 views

LDA alpha equivalent in structural topic model

I'm using the R implementation of the structural topic model (the stm package). I want to reduce the number of topics that are prevalent in ...
James • 25
12 votes
0 answers
2k views

Is sparsity of topics a necessary condition for latent Dirichlet allocation (LDA) to work?

I have been playing with the hyper-parameters of the latent Dirichlet allocation (LDA) model and am wondering how sparsity of the topic priors plays a role in inference. I have not performed these ...
kedarps • 3,592
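
A minimal sketch of the regime in question, using synthetic draws only: with a symmetric Dirichlet prior, $\alpha < 1$ concentrates each document's mass on a few topics (sparse), while $\alpha > 1$ spreads it out.

    import numpy as np

    rng = np.random.default_rng(0)
    K = 10  # number of topics
    for alpha in (0.1, 1.0, 10.0):
        theta = rng.dirichlet([alpha] * K, size=1000)    # 1000 doc-topic draws
        top_mass = np.sort(theta, axis=1)[:, -1].mean()  # avg mass on the largest topic
        print(f"alpha={alpha}: mean largest-topic mass = {top_mass:.2f}")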
4 votes
1 answer
1k views

Correlation of Dirichlet distribution in Latent Dirichlet Allocation

Latent Dirichlet Allocation uses a Dirichlet prior for the topic distribution. However, this model doesn't provide correlation between topics, and for this reason the Correlated ...
Gio_cor • 68
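
One worked fact makes the motivation concrete: for $\theta \sim \operatorname{Dirichlet}(\alpha_1, \dots, \alpha_K)$ with $\alpha_0 = \sum_k \alpha_k$, the off-diagonal covariance is

$$\operatorname{Cov}(\theta_i, \theta_j) = \frac{-\alpha_i \alpha_j}{\alpha_0^2 (\alpha_0 + 1)}, \qquad i \neq j,$$

which is always negative, so a Dirichlet prior cannot encode positively correlated topics; the Correlated Topic Model swaps it for a logistic-normal prior precisely to allow this.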
4 votes
2 answers
768 views

Topic Models: Latent Dirichlet Allocation

I am trying to figure out the details of LDA and have been stuck for a while now. While reading the paper by Blei, I came across this: Latent Dirichlet allocation (LDA) is a generative ...
Clock Slave • 1,087
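
A minimal sketch of the generative process that the quoted sentence refers to (the dimensions and hyperparameters here are arbitrary choices for illustration):

    import numpy as np

    rng = np.random.default_rng(0)
    K, V, N = 3, 20, 50                      # topics, vocabulary size, document length
    phi = rng.dirichlet([0.1] * V, size=K)   # a word distribution for each topic
    theta = rng.dirichlet([1.0] * K)         # topic proportions for one document
    z = rng.choice(K, size=N, p=theta)       # a topic assignment for each word slot
    w = np.array([rng.choice(V, p=phi[k]) for k in z])  # the observed words
    print(w)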
4 votes
1 answer
367 views

Which classifier to choose for probability histogram-like features

I have a population of 500 elements. Each element is represented by a 10-dimensional feature vector whose entries sum to 1 (you can think of it as a histogram of probabilities). In ...
gabboshow • 683
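
A minimal sketch of one common approach, on invented stand-in data: map the simplex-constrained features through a log transform and fit a standard classifier (scikit-learn's LogisticRegression here; the two classes and their Dirichlet parameters are hypothetical):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    # 500 elements, 10-dimensional probability vectors, two made-up classes
    X0 = rng.dirichlet([2, 1, 1, 1, 1, 1, 1, 1, 1, 1], size=250)
    X1 = rng.dirichlet([1, 1, 1, 1, 1, 1, 1, 1, 1, 2], size=250)
    X = np.vstack([X0, X1])
    y = np.repeat([0, 1], 250)

    # the log transform moves the features off the simplex into unconstrained space
    clf = LogisticRegression(max_iter=1000).fit(np.log(X + 1e-9), y)
    print(clf.score(np.log(X + 1e-9), y))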
1 vote
1 answer
96 views

About the LDA model, I need a true expert to tell me: what are the real benefits of the Dirichlet prior? [closed]

Well, you know, the only difference between pLSI and LDA is that the latter has a Dirichlet prior; thus the number of model parameters does not increase with the size of the corpus, and this avoids the ...
lynnjohn • 191
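
The parameter-count claim in the excerpt can be made precise (following Blei et al. 2003): a $k$-topic pLSI model fits a topic mixture per training document, so it has $kV + kM$ parameters for vocabulary size $V$ and $M$ documents, growing linearly with the corpus; LDA replaces the per-document mixtures with draws from a Dirichlet, leaving $k + kV$ parameters independent of $M$.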
4 votes
1 answer
2k views

Hierarchical Dirichlet Processes in topic modeling

I think I understand the main ideas of hierarchical Dirichlet processes, but I don't understand the specifics of their application in topic modeling. Basically, the idea is that we have the following ...
r_31415 • 3,351
1 vote
1 answer
5k views

How do you estimate the $\alpha$ parameter of a latent Dirichlet allocation model?

Blei has shown that it is possible to estimate $\alpha$ in an LDA model, but I have yet to find a library (any library; C, C++, Java, ...) to do so. Usually, implementations (including Blei's) treat $\...
Kang Min Yoo
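
For what it's worth, the standard recipe for this is Minka's fixed-point iteration for the Dirichlet MLE; a minimal sketch follows (a generic estimator applied to, e.g., estimated document-topic proportions, not Blei's exact implementation):

    import numpy as np
    from scipy.special import digamma, polygamma

    def inv_digamma(y, iters=5):
        # Newton inversion of the digamma function, with Minka's initialization
        x = np.where(y >= -2.22, np.exp(y) + 0.5, -1.0 / (y - digamma(1.0)))
        for _ in range(iters):
            x = x - (digamma(x) - y) / polygamma(1, x)
        return x

    def fit_dirichlet(theta, n_iter=100):
        # theta: (N, K) array of points on the simplex
        log_p_bar = np.log(theta).mean(axis=0)
        alpha = np.ones(theta.shape[1])
        for _ in range(n_iter):
            alpha = inv_digamma(digamma(alpha.sum()) + log_p_bar)
        return alpha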
0 votes
1 answer
209 views

Can dummy variables be used to represent space in latent Dirichlet allocation?

Can dummy variables be used to represent space in latent Dirichlet allocation? I have a set of geocoded textual documents. I would like to use LDA to generate a topic model for the documents. ...
mech • 3
5 votes
1 answer
2k views

Understanding the effect of $\alpha$ in the Dirichlet distribution

When reading the topic modeling tutorial written by Blei (KDD 2011 tutorial), I was confused by a set of diagrams that aim to show the effect of $\alpha$ in the Dirichlet distribution. For example, for ...
user3269 • 5,222
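
Those diagrams follow directly from the Dirichlet density; writing it out makes the effect of $\alpha$ visible:

$$p(\theta \mid \alpha) = \frac{\Gamma\!\left(\sum_{k} \alpha_k\right)}{\prod_{k} \Gamma(\alpha_k)} \prod_{k=1}^{K} \theta_k^{\alpha_k - 1}$$

For a symmetric $\alpha < 1$ the exponents $\alpha_k - 1$ are negative, so the density diverges at the corners and edges of the simplex (sparse draws); for $\alpha > 1$ it peaks at the uniform point; $\alpha = 1$ is uniform over the simplex.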
5 votes
0 answers
1k views

Gibbs sampling for LDA -- does a small Dirichlet concentration parameter make a difference?

I'm using a Gibbs sampler for Latent Dirichlet allocation as described by Griffiths and Steyvers (http://www.ncbi.nlm.nih.gov/pmc/articles/PMC387300/). The sampling of a new topic $j$ for word $i$ is ...
Ben • 473
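
For context, the Griffiths and Steyvers full conditional being sampled is, in notation close to the paper's (with $V$ the vocabulary size and $T$ the number of topics):

$$P(z_i = j \mid \mathbf{z}_{-i}, \mathbf{w}) \propto \frac{n_{-i,j}^{(w_i)} + \beta}{n_{-i,j}^{(\cdot)} + V\beta} \cdot \frac{n_{-i,j}^{(d_i)} + \alpha}{n_{-i,\cdot}^{(d_i)} + T\alpha}$$

A small $\beta$ weakens the smoothing in the first factor, so topics whose counts for word $w_i$ differ are separated more sharply.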
2 votes
2 answers
1k views

Implementing Latent Dirichlet Allocation - notation confusion

I am trying to implement LDA using the collapsed Gibbs sampler from http://www.uoguelph.ca/~wdarling/research/papers/TM.pdf; the main algorithm is shown below. I'm a bit confused about the notation ...
user1893354 • 1,895
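
A minimal sketch of one sweep of the collapsed Gibbs sampler that such pseudocode describes; the count arrays ndk (document-topic), nkw (topic-word), and nk (topic totals) are names chosen here for illustration:

    import numpy as np

    def gibbs_sweep(docs, z, ndk, nkw, nk, alpha, beta, rng):
        # docs: list of word-id lists; z: matching list of topic-id lists
        K, V = nkw.shape
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]
                # remove the current assignment from the counts
                ndk[d, k] -= 1; nkw[k, w] -= 1; nk[k] -= 1
                # full conditional p(z = k | everything else), up to a constant
                p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + V * beta)
                k = rng.choice(K, p=p / p.sum())
                z[d][i] = k
                # add the new assignment back
                ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1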
0 votes
0 answers
179 views

Posterior in latent Dirichlet allocation

I have a question regarding LDA (latent Dirichlet allocation) - what is the correct formulation of the posterior? In http://www.cs.princeton.edu/~blei/papers/Blei2011.pdf it is $p(\beta_{1:K}, \theta_{...
user1315305 • 1,309
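
The truncated formula is presumably the posterior as Blei writes it, which in full reads (assuming the usual notation of that paper, with $K$ topics and $D$ documents):

$$p(\beta_{1:K}, \theta_{1:D}, z_{1:D} \mid w_{1:D}) = \frac{p(\beta_{1:K}, \theta_{1:D}, z_{1:D}, w_{1:D})}{p(w_{1:D})}$$

where the denominator, the marginal likelihood $p(w_{1:D})$, is the intractable part.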
