I am working with models that use Dirichlet distributions. However, I want to account for correlations between components. If this question is a duplicate, I'd also appreciate any pointers to the right direction. I have seen this question on "Distributions on the simplex with correlated components", but it didn't really answer what I am looking for.
So, say I have a Dirichlet model to estimate the colors of a jar of beans (red, blue, green, yellow) based on a sample: $n$ is the number of beans sampled, $k=4$ is the number of dimensions/components in the Dirichlet simplex (i.e., the number of colors), $x=(x_i,...,x_n)$ is a vector of $k$ dimensions of the bean colors, and $p_k=1/n\sum_{x=1}^{n}(x_i)$ is the proportion of bean colors. Suppose we have the following concentration vector $\alpha_k$ to model the proportion of bean colors, such as: $$p_k \sim Dirichlet(\alpha_k)$$
Suppose we have a very simple concentration vector, where $\alpha_k=(1,1,1,1)$, meaning that there is an equal chance for the beans to be each color (25%). However, suppose we know with a very high degree of certainty that 25% of the beans are red and 25% are blue, but we are not so certain of the proportion of green and yellow beans. (They also have an estimated proportion of 25% each, but there is a good chance they are 20% and 30%, or 35% and 15%.) In this sense, the probability functions for green and yellow are correlated: the error in one dimension affects one dimension more than others.
My question is how can correlated components be included in a Dirichlet distribution, as in the case described above? My problem is that a basic Dirichlet distribution assumes that the probability functions are independent (one bean not being one color has an equal probability of being any other color).
At first I thought that multinomial Dirichlet distributions could account for this because their they include a covariance. But I believe the resulting distributions still assume independence between components matrix, as pointed out in this video (1:30). I also found this article, "A generalization of the Dirichlet distribution" by Ongaro and Migliorati (2012) that proposes flexible Dirichlet distributions, as well as the FlexDir CRAN package. Could this generalization address my question?
Edit: One clarification as an addendum to the question. The concentration vectors of $\alpha$ can be increased to reduce the variance of the probability distribution. This would reflect higher "confidence" in the estimations. However, the problem is that this has to be done proportionally for all vector or else the mean estimations of the Dirichlet distribution will change. For example, $\alpha_k=(4,4,4,4)$ would reflect high confidence in the estimations, but for all dimensions equally. Meanwhile, $\alpha_k=(4,4,1,1)$ would make the estimates for red and blue more confidence, but it would also would change the proportions so that the mean estimated probability of green and yellow are 10% for each.