Questions tagged [bayesian-probability]
The bayesian-probability tag has no usage guidance.
38
questions with no upvoted or accepted answers
6
votes
0
answers
202
views
Existence of stick breaking representations for random measures
The Dirichlet process has a roughly size ordered representation in terms of beta random variables, called a stick-breaking representation (Sethuraman, 1994). Similar results hold for the beta process, ...
5
votes
1
answer
330
views
Bounding the sensitivity of a posterior mean to changes in a single data point
There is a real-valued random variable $R$. Define a finite set of random variables ("data points") $$X_i = R + Z_i \; \text{for } i\in\{1,\ldots,n\},$$ where $Z_i$ are identically and independently ...
4
votes
1
answer
531
views
Gaussian process kernel parameter tuning
I am reading on gaussian processes and there are multiple resources that say how the parameters of the prior (kernel, mean) can be fitted based on data,specifically by choosing those that maximize the ...
4
votes
0
answers
228
views
Convergence of the expectation of a random variable when conditioned on its sum with another, independent but not identically distributed
Suppose that for all $n \in \mathbf{N}$, $X_n$ and $Y_n$ are independent random variables with
$$X_n \sim \mathtt{Binomial}(n,1-q),$$
and
$$Y_n \sim \mathtt{Poisson}(n(q+\epsilon_n)),$$
where $q \in (...
4
votes
0
answers
651
views
Bayesian Networks and Polytree
I am a bit puzzled by the use of polytree to infer a posterior in a Bayesian Network (BN).
BN are defined as directed acyclic graphs. A polytree is DAG whose underlying undirected graph is a tree. ...
3
votes
0
answers
69
views
Confusion with implementation of PDE constraint Bayesiain inverse problem
Consider a PDE,
$$\partial_t u -a \nabla u - ru (1-u) = 0$$
at a given snapshot in time. The inverse problem is to find the diffusion coefficient $a \in L^{\infty}$ from a noisy measurement $$Y = \Phi(...
3
votes
0
answers
164
views
Minimizing an f-divergence and Jeffrey's Rule
My question is about f-divergences and Richard Jeffrey's (1965) rule for updating probabilities in the light of partial information.
The set-up:
Let $p: \mathcal{F} \rightarrow [0,1]$ be a ...
3
votes
0
answers
82
views
Have stick-breaking priors with non-iid atoms been considered, and if not, why not?
Roughly speaking, a stick-breaking prior is a random discrete probability measure $P$ on a measurable space $\mathcal X$ of the form
$$P=\sum_{j\ge1}w_j\delta_{\theta_j}$$
where $(w_j)_{j\ge1}$ is a ...
2
votes
1
answer
209
views
Sum of arrival times of Chinese Restaurant Process (CRP)
Suppose that a random sample $X_1, X_2, \ldots$ is drawn from a continuous spectrum of colors, or species, following a Chinese Restaurant Process distribution with parameter $|\alpha|$ (or ...
2
votes
0
answers
60
views
Concentration of posterior probability around a tiny fraction of the prior volume
In the context of approximating the evidence $Z$ in a Bayesian inference setting
$$
Z = \int d\theta \mathcal L (\theta)\pi (\theta)
$$
with $\mathcal L$ the likelihood, $\pi$ the prior, John Skilling'...
2
votes
0
answers
96
views
Is there any good reference on the Bayesian view that can be helpful for reading papers on the number theory using heuristic arguments?
Nowadays there are many papers on the number theory using heuristics.
I have read some of them.
But I have no clear understanding of the Bayesian Probability(subjective probability).
The concept of ...
2
votes
0
answers
111
views
Convergence of Bayesian posterior
Let $\Delta [0,1]$ denote the set of all probability distributions on the unit interval.
Let $\mu \in \Delta [0,1]$ denote an arbitrary prior. Importantly, $\mu$ does not necessarily admit a density ...
2
votes
1
answer
732
views
Bayesian methods in online setting
Imagine the following (very concrete) model: We have a series of random variables $x_k$ with values in $\lbrace 0, 1\rbrace$. We assume $x_k \mid p_k \sim \operatorname{Alt}(p_k),$ where $p_0 \sim R(0,...
2
votes
2
answers
522
views
Quantifying the effect of noise on the posterior variance in Gaussian processes / multivariate Gaussian vectors
Consider a real-valued Gaussian process $f$ on some compact domain $\mathcal{X}$ with mean zero and covariance function $k(x,x') \in [0,1]$ (also known as the kernel function). This question concerns ...
2
votes
0
answers
59
views
A canonical example of the non-existence of predictive probability distribution
Section 3 of Fortini et al. (2000) states that
Given $(X^\infty, \mathcal X^\infty,P)$, a predictive probability distribution of $x_n$ given $(x_1, \dots, x_{n-1})$ with respect to $P$ need not ...