Skip to main content

Questions tagged [bayesian-probability]

The tag has no usage guidance.

9 votes
3 answers
468 views

What does the KL being symmetric tell us about the distributions?

Suppose two probability density functions, $p$ and $q$, such that $\text{KL}(q||p) = \text{KL}(p||q) \neq 0$. Intuitively, does that tell us anything interesting about the nature of these densities?
HesterJ's user avatar
  • 123
9 votes
1 answer
313 views

Who introduced the term hyperparameter?

I am trying to find the earliest use of the term hyperparameter. Currently, it is used in machine learning but it must have had earlier uses in statistics or optimization theory. Even the multivolume ...
ACR's user avatar
  • 790
9 votes
1 answer
477 views

In what sense is the Bayesian posterior mean a “convex combination”?

I asked this on math.stackexchange with no response, I'm hoping someone here might have something. Suppose I want to estimate $x \in \mathbb{R}^n$ from two signals with zero mean, normally ...
Ronaldo Carpio's user avatar
8 votes
1 answer
1k views

Rate of convergence of Bayesian posterior

Suppose a data generating process (DGP) is parameterized by some unknown parameter $\theta_0$, say $P_{\theta_0}$, and we want to estimate the value of $\theta_0$ using Bayesian method. Let $\pi(\...
Herr K.'s user avatar
  • 183
8 votes
1 answer
319 views

Base schemes and Bayesian priors

One of Grothendieck's dicta about algebraic geometry is to consider "the relative situation", where one doesn't consider the category of schemes but of schemes over a fixed base scheme. In Bayesian ...
Allen Knutson's user avatar
6 votes
1 answer
2k views

What can be said about an infinite linear chain of conjugate prior distributions?

We can sample a discrete value from the multinomial distribution. We can also sample the parameters of the multinomial distribution from its conjugate prior the dirichlet distribution. Since the ...
DoubleJay's user avatar
  • 2,383
6 votes
0 answers
202 views

Existence of stick breaking representations for random measures

The Dirichlet process has a roughly size ordered representation in terms of beta random variables, called a stick-breaking representation (Sethuraman, 1994). Similar results hold for the beta process, ...
Shannon S.'s user avatar
5 votes
6 answers
2k views

Are all probabilities conditional probabilities? [closed]

We know that $P(A\mid B) = \frac{P(A \cap B)}{P(B)}$. So $P(B) = P(A\mid B)P(A \cap B)$. Thus are all probabilities conditional probabilities? Can one make a probability more accurate by introducing a ...
Tony 's user avatar
  • 59
5 votes
3 answers
1k views

Probability estimates for pairwise majority votes

This is related to the rank aggregation question I asked previously. I have items $I_1, \ldots, I_N$ and the observations of a number of pairwise trials which pit pairs $I_i$ and $I_j$ against ...
David R. MacIver's user avatar
5 votes
1 answer
330 views

Bounding the sensitivity of a posterior mean to changes in a single data point

There is a real-valued random variable $R$. Define a finite set of random variables ("data points") $$X_i = R + Z_i \; \text{for } i\in\{1,\ldots,n\},$$ where $Z_i$ are identically and independently ...
Ben Golub's user avatar
  • 1,058
4 votes
2 answers
216 views

Do these distributions have a name already?

In playing with some math finance stuff I ran into the following distribution and I was curious if someone had a name for it or has studied it or worked with it already. To start, let $\Delta^n$ be ...
Jess Boling's user avatar
4 votes
1 answer
531 views

Gaussian process kernel parameter tuning

I am reading on gaussian processes and there are multiple resources that say how the parameters of the prior (kernel, mean) can be fitted based on data,specifically by choosing those that maximize the ...
john's user avatar
  • 141
4 votes
0 answers
228 views

Convergence of the expectation of a random variable when conditioned on its sum with another, independent but not identically distributed

Suppose that for all $n \in \mathbf{N}$, $X_n$ and $Y_n$ are independent random variables with $$X_n \sim \mathtt{Binomial}(n,1-q),$$ and $$Y_n \sim \mathtt{Poisson}(n(q+\epsilon_n)),$$ where $q \in (...
as1's user avatar
  • 91
4 votes
0 answers
651 views

Bayesian Networks and Polytree

I am a bit puzzled by the use of polytree to infer a posterior in a Bayesian Network (BN). BN are defined as directed acyclic graphs. A polytree is DAG whose underlying undirected graph is a tree. ...
Bremen's user avatar
  • 41
3 votes
2 answers
633 views

Parametrising a sparse orthogonal matrix

I need to find a way to parametrise a matrix that is both sparse (to some degree) and orthogonal, i.e., I am looking for a parametrisation that describes $A \in \mathbb{R}^{n\times m}$ such that $AA^𝑇...
HesterJ's user avatar
  • 123

15 30 50 per page
1
2 3 4 5 6