Questions tagged [bayesian-probability]
The bayesian-probability tag has no usage guidance.
86
questions
9
votes
3
answers
468
views
What does the KL being symmetric tell us about the distributions?
Suppose two probability density functions, $p$ and $q$, such that $\text{KL}(q||p) = \text{KL}(p||q) \neq 0$. Intuitively, does that tell us anything interesting about the nature of these densities?
9
votes
1
answer
313
views
Who introduced the term hyperparameter?
I am trying to find the earliest use of the term hyperparameter. Currently, it is used in machine learning but it must have had earlier uses in statistics or optimization theory. Even the multivolume ...
9
votes
1
answer
477
views
In what sense is the Bayesian posterior mean a “convex combination”?
I asked this on math.stackexchange with no response, I'm hoping someone here might have something.
Suppose I want to estimate $x \in \mathbb{R}^n$ from two signals with zero mean, normally ...
8
votes
1
answer
1k
views
Rate of convergence of Bayesian posterior
Suppose a data generating process (DGP) is parameterized by some unknown parameter $\theta_0$, say $P_{\theta_0}$, and we want to estimate the value of $\theta_0$ using Bayesian method. Let $\pi(\...
8
votes
1
answer
319
views
Base schemes and Bayesian priors
One of Grothendieck's dicta about algebraic geometry is to consider "the relative situation", where one doesn't consider the category of schemes but of schemes over a fixed base scheme.
In Bayesian ...
6
votes
1
answer
2k
views
What can be said about an infinite linear chain of conjugate prior distributions?
We can sample a discrete value from the multinomial distribution.
We can also sample the parameters of the multinomial distribution from its conjugate prior the dirichlet distribution.
Since the ...
6
votes
0
answers
202
views
Existence of stick breaking representations for random measures
The Dirichlet process has a roughly size ordered representation in terms of beta random variables, called a stick-breaking representation (Sethuraman, 1994). Similar results hold for the beta process, ...
5
votes
6
answers
2k
views
Are all probabilities conditional probabilities? [closed]
We know that $P(A\mid B) = \frac{P(A \cap B)}{P(B)}$. So $P(B) = P(A\mid B)P(A \cap B)$. Thus are all probabilities conditional probabilities? Can one make a probability more accurate by introducing a ...
5
votes
3
answers
1k
views
Probability estimates for pairwise majority votes
This is related to the rank aggregation question I asked previously.
I have items $I_1, \ldots, I_N$ and the observations of a number of pairwise trials which pit pairs $I_i$ and $I_j$ against ...
5
votes
1
answer
330
views
Bounding the sensitivity of a posterior mean to changes in a single data point
There is a real-valued random variable $R$. Define a finite set of random variables ("data points") $$X_i = R + Z_i \; \text{for } i\in\{1,\ldots,n\},$$ where $Z_i$ are identically and independently ...
4
votes
2
answers
216
views
Do these distributions have a name already?
In playing with some math finance stuff I ran into the following distribution and I was curious if someone had a name for it or has studied it or worked with it already.
To start, let $\Delta^n$ be ...
4
votes
1
answer
531
views
Gaussian process kernel parameter tuning
I am reading on gaussian processes and there are multiple resources that say how the parameters of the prior (kernel, mean) can be fitted based on data,specifically by choosing those that maximize the ...
4
votes
0
answers
228
views
Convergence of the expectation of a random variable when conditioned on its sum with another, independent but not identically distributed
Suppose that for all $n \in \mathbf{N}$, $X_n$ and $Y_n$ are independent random variables with
$$X_n \sim \mathtt{Binomial}(n,1-q),$$
and
$$Y_n \sim \mathtt{Poisson}(n(q+\epsilon_n)),$$
where $q \in (...
4
votes
0
answers
651
views
Bayesian Networks and Polytree
I am a bit puzzled by the use of polytree to infer a posterior in a Bayesian Network (BN).
BN are defined as directed acyclic graphs. A polytree is DAG whose underlying undirected graph is a tree. ...
3
votes
2
answers
633
views
Parametrising a sparse orthogonal matrix
I need to find a way to parametrise a matrix that is both sparse (to some degree) and orthogonal, i.e., I am looking for a parametrisation that describes $A \in \mathbb{R}^{n\times m}$ such that $AA^𝑇...