Newest 'nonparametric+bayesian' Questions

0 votes

0 answers

33 views

Is bootstrapping inherently Frequentist? If so, how do we do a Bayesian non-parametric two-sample test?

I normally use frequentist statistics but I now want to use Bayesian statistics as I want to carry out a two-sample (randomised control trial) test that includes prior information. I have an existing ...

Amorphia

913

asked May 1 at 7:40

4 votes

1 answer

52 views

In what ways is Gaussian Process Regression both parametric and non-parametric?

Gaussian Process Regression is considered a "non-parametric" model. However, the term "non-parametric" is often used imprecisely to mean different things, leading to questions ...

socialscientist

847

asked Apr 16 at 19:42

0 votes

0 answers

30 views

Fisher information or Bayesian Uncertainty for non-parametric distributions

This question sounds ridiculous, so let me clarify the motivation: Fisher information & Bayesian inference uncertainty seemed very cool to me because they can effectively tell you "how ...

profPlum

411

asked Dec 27, 2023 at 18:58

0 votes

0 answers

26 views

BART with non-parametric heteroscedastic noise?

Is there a variant of BART that robustly captures noise that is both heteroscedastic and non-parametric (or has an a-priori unknown parametric form)? For example, a BART that could fit this test data: ...

Luke Gorrie

467

asked Nov 7, 2023 at 11:31

0 votes

0 answers

65 views

Bayesian analysis of non-normally distributed variable

I would like to use a Bayesian approach to compare a continuous non-normally distributed variable taking values between -1 and 1 between two populations. The measurements are not paired. Overall my ...

NicolasBourbaki

101

asked Oct 10, 2023 at 14:39

1 vote

0 answers

61 views

How can I combine Bayesian and non-parametric techniques?

I'd like to combine Bayesian and non-parametric (e.g. XGBoost) models, with the goal of getting a probability distribution over my target variable rather than a point estimate. I have a prior, and I ...

Thomas Johnson

851

asked Nov 17, 2022 at 17:08

2 votes

4 answers

235 views

Good intermediate-level textbook for undergraduate applied statistics in data science?

I will be teaching an applied statistics course for the first time and the main audience will be 2nd and 3rd year undergraduates, mostly data science majors. They will have an intro statistics course ...

Community wiki

mstar

1 vote

0 answers

61 views

trace class of prior covariance operator in Bayesian inference problem

I'm interested in certain Bayesian inference problems where the vector space $Q$ where the parameters $\theta$ live is infinite-dimensional. These show up all the time in the geophysical sciences -- ...

Daniel Shapero

143

asked Jan 6, 2022 at 19:10

1 vote

1 answer

136 views

Deciding the Number of Clusters : Standard Methods vs. Non-Parametric Methods

I was watching this video over here (https://www.youtube.com/watch?v=UBiaLq5V7mE) that discussed a Non-Parametric based Bayesian approach for deciding the number of clusters in a dataset. Essentially, ...

stats_noob

1

asked Dec 27, 2021 at 5:04

2 votes

0 answers

73 views

MCMC fitting of Dirichlet Process or Polya Tree prior to residuals in (simple linear regression)/(2-independent-samples) problem

Consider a simple location-shift semi-parametric model with two mutually-independent samples (in what follows, $F$ is a cumulative distribution function (CDF) on $\mathbb{ R }$, the $C_i$ and $T_j$ ...

David Draper

69

asked Jun 23, 2021 at 0:11

2 votes

0 answers

140 views

MCMC fitting of a Dirichlet Process or Polya Tree prior to the residuals in a (simple linear regression)/(2-independent-samples) problem

Consider a simple location-shift semi-parametric model with two mutually-independent samples (here $F$ is a cumulative distribution function (CDF) on $\mathbb{ R }$, the $C_i$ and $T_j$ are real-...

David Draper

69

asked Jun 19, 2021 at 17:39

2 votes

1 answer

854 views

KNN as a crude prototype of Gaussian Process Regression?

I've heard it said before that K-Means clustering is a prototypical method for the Expectation-Maximization algorithm. Where KM clustering returns a hard cluster assignment, EM returns soft assignments, ...

jbuddy_13

3,382

asked Jan 19, 2021 at 16:33

2 votes

0 answers

41 views

Unexpected zero on posterior density of Dirichlet process mixture

I was reading this notebook from the PyMC3 documentation about Dirichlet Process Mixtures and, on the last figure, the estimated density reaches almost zero for a particular value, despite the ...

PedroSebe

2,680

asked Oct 27, 2020 at 5:18

2 votes

0 answers

73 views

Distance for ABC - nonparametric likelihood

When fitting models using ABC, data is simulated using parameters drawn from the prior. The distance between the simulated data and the observed data is calculated, and typically if less than a ...

hugh

33

asked May 22, 2020 at 14:03

0 votes

2 answers

1k views

Is there a Bayesian Non-Parametric one-way ANOVA?

The rough idea is that I am trying to compare linguistic properties (e.g. readability) between pieces of texts from two authors essentially. For this, I thought using an ANOVA would be appropriate. ...

BeginnerByron

1

asked Mar 9, 2020 at 15:47

0 votes

0 answers

27 views

Question about possible typo in a tutorial about the stick-breaking model of the Dirichlet distribution

I am reading a tutorial on the Dirichlet distribution: http://mayagupta.org/publications/FrigyikKapilaGuptaIntroToDirichlet.pdf and I think there is a typo in Step 2 of the stick-breaking model of ...

Noppawee Apichonpongpan

593

asked Feb 25, 2020 at 18:28

1 vote

0 answers

31 views

Is data modeled by a Dirichlet process mixture exchangeable?

Consider the DPM model: $$ \begin{aligned} X_{i} | \phi_{i} & \sim F\left(x;\phi_{i}\right) \\ \phi_{1}, \phi_{2}, \cdots | P & \stackrel{iid}{\sim} P \\ P & \sim \mathrm{DP}(\alpha G_0) \end{aligned} ...

Spaceship222

241

asked Nov 25, 2019 at 4:21
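
A minimal sketch related to the DPM question above, not taken from the post: when $P$ is integrated out of a Dirichlet process, the cluster labels of $\phi_1, \phi_2, \ldots$ follow a Chinese restaurant process. The function name `crp_assignments`, the concentration value `alpha=1.0`, and the sample size are illustrative assumptions.

```python
import random

def crp_assignments(n, alpha, seed=0):
    """Sample cluster labels for n items from a Chinese restaurant process.

    Item i (0-based) joins existing cluster k with probability
    counts[k] / (i + alpha) and opens a new cluster with probability
    alpha / (i + alpha).
    """
    rng = random.Random(seed)
    counts = []   # counts[k] = number of items currently in cluster k
    labels = []   # labels[i] = cluster index assigned to item i
    for i in range(n):
        u = rng.random() * (i + alpha)
        acc = 0.0
        for k, c in enumerate(counts):
            acc += c
            if u < acc:
                counts[k] += 1
                labels.append(k)
                break
        else:
            # u fell in the final stretch of length alpha: open a new cluster
            counts.append(1)
            labels.append(len(counts) - 1)
    return labels, counts

labels, counts = crp_assignments(20, alpha=1.0)  # hypothetical settings
print("labels:", labels)
print("cluster sizes:", counts)
```

The probability of a particular partition under this scheme depends only on the cluster sizes, not on the order in which items arrive, which is the usual route to arguing exchangeability of the labels.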

1 vote

0 answers

28 views

Estimation hardness results in Bayesian inference?

Frequentist statistics has a series of fundamental hardness results that are encountered by beginning statistics students. In non-parametric statistics, a famous hardness result for the normal means ...

Arjen Robben

53

asked Sep 18, 2019 at 16:01

2 votes

0 answers

46 views

Directly applying residual bootstrap to the predictions vs. inferring the parameters?

My friend has a procedure where he does the following: Given a dataset $(x_1,y_1),\ldots,(x_n, y_n)$ Fit $f$ according to $\hat{y_i} = f(x_i) + \epsilon_i$ where $f$ is the regression function. ...

crossvalidateme

203

asked Jul 25, 2019 at 1:29

11 votes

1 answer

514 views

Do Stochastic Processes such as the Gaussian Process/Dirichlet Process have densities? If not, how can Bayes rule be applied to them?

The Dirichlet Process and Gaussian Process are often referred to as "distributions over functions" or "distributions over distributions". In that case, can I meaningfully talk about the density of a ...

snickerdoodles777

790

asked Feb 17, 2019 at 19:13

3 votes

2 answers

233 views

Simulating the Posterior Density of a Transformed Parameter

I am reviewing an example (p. 180-181, Example 11.3 and 11.4) from All of Statistics by Larry Wasserman. The example intends to illustrate that the posterior can be found analytically and can be ...

yalex314

159

asked Dec 31, 2018 at 6:31

0 votes

2 answers

72 views

Likelihood term in Bayesian inferencing versus the general definition

In general we say that the likelihood function is defined as some $L(\theta|x)$, so that it is a function over some parameters: $\theta$ given some data: $x$. That is, $\theta$ is free to vary whilst $...

tisPrimeTime

545

asked Aug 9, 2018 at 8:55

2 votes

0 answers

133 views

Smooth regression algorithms that produce zero training error

I am looking to fit three regression functions $f_1, f_2, f_3:\mathbb{R}^2 \to \mathbb{R}$. For example, let's say $X_1$ is time, $X_2$ is geographic latitude, $f_1$ is the temperature, $f_2$ is the ...

User191919

201

asked Aug 7, 2018 at 21:05

4 votes

1 answer

530 views

Is parametric Bayesian inference a special case of nonparametric Bayesian inference?

I'm thinking about univariate density estimation. Original Question In parametric inference, you assume the data are generated from a density that can be summarized by finitely-many parameters. You ...

jcz

1,425

asked Jul 29, 2018 at 20:26

4 votes

1 answer

958 views

Is there a loss function when estimating a model using MCMC?

I am trying to understand how fitting a model using MCMC works. Is there a loss function that is optimized? Or is it simply a case of more draws from the distribution amount to a more complete ...

Skander H.

12.1k

asked Jul 6, 2018 at 0:46

1 vote

0 answers

64 views

Bayesian posterior from pairwise comparison of observations

Say I have $n$ observations of group $A$ and $m$ observations of group $B$ and a function $f: A\times B \rightarrow C$ mapping a pair of observations to one of $k$ categories. I am interested in the ...

Eivind Samuelsen

26

asked Mar 27, 2018 at 9:50

1 vote

0 answers

690 views

Bayesian Wilcoxon test

I have a pre-post dataset with 2 observations per subject (proportion data, bounded between 0 and 1). I have analyzed the data with a classical dependent t-test under the NHST paradigm. However, as ...

Adrian Santos

157

asked Feb 15, 2018 at 16:41

0 votes

2 answers

266 views

Book Bayesian Nonparametrics [duplicate]

What is the best recommended book on Bayesian nonparametric approaches? Specifically something which also tackles regression problems such as Gaussian processes.

Community wiki

Wis

1 vote

0 answers

74 views

Clustering and Dirichlet process' parameter

I am reading a paper in which they describe a Bayesian model in which the prior $a_i$ is defined as a Dirichlet Process (DP). They say: "We use a DP to find the optimal $a_i$ via clustering". Later on ...

Joe Liner

11

asked Feb 3, 2018 at 15:44

8 votes

2 answers

665 views

What is a mixture of finite mixtures?

A mixture of finite mixture models seems to be an interesting Bayesian (?) approach to solving clustering with an unknown number $k$ of components. It seems though, unlike the mixture model with a ...

MachineEpsilon

3,056

asked Jan 9, 2018 at 8:09

3 votes

1 answer

111 views

Robbins estimate Empirical Bayes

From the compound sampling model where: $Y_i | \theta_i \sim Poi(\theta_i)$ The marginal distribution of $\theta_i$ is $G$, non-parametric. We get that the Bayes estimate of $\theta_i$ under ...

Raxel

347

asked Nov 1, 2017 at 20:18

2 votes

1 answer

195 views

Reference for poor sampler mixing in large Bayesian models

I keep seeing this in various presentations, but never saw a reference for it. Although it makes intuitive sense that samplers can face mixing issues when operating on a large space of ...

user3639557

1,502

asked Oct 9, 2017 at 13:26

1 vote

0 answers

31 views

Estimating Gamma PDF parameters from data with negative increments

Say we have collected data, and from a physical perspective we know that the collected data should increase positively with time. However the data looks more like this: This data shown in the figure ...

AnarKi

565

asked Sep 22, 2017 at 11:37

1 vote

1 answer

18 views

Measuring quality of random items - probability that quality exceeds $a$ without any assumptions

Say I draw $n$ random items and measure their quality in the interval $[0,1]$. Now I would like to know: If I draw another item, what is the probability that this item has a quality larger than $0.5$? ...

J Fabian Meier

103

asked Aug 15, 2017 at 9:12

2 votes

1 answer

895 views

Combining triangular distributions

Vose (in Risk Analysis: A Quantitative Guide, 2008) argues that it is preferable to use non-parametric distributions when eliciting knowledge about an unknown distribution from experts. The argument is ...

Daniel C

33

asked Apr 20, 2017 at 13:38

31 votes

2 answers

10k views

Is it true that Bayesian methods don't overfit?

Is it true that Bayesian methods don't overfit? (I saw some papers and tutorials making this claim) For example, if we apply a Gaussian Process to MNIST (handwritten digit classification), but only ...

MWB

1,337

asked Mar 2, 2017 at 21:51

6 votes

1 answer

2k views

What does the base distribution of the Dirichlet Process mean?

So far I only really understand the Dirichlet Process through its various metaphors. For the Polya Urn scheme, my understanding is that the "base distribution" is the original distribution of colors ...

cgreen

1,002

asked Feb 23, 2017 at 22:55

8 votes

2 answers

2k views

Bayesian nonparametric answer to deep learning?

As I understand it, deep neural networks are performing "representation learning" by layering features together. This allows learning very high dimensional structures in the features. Of course, it's ...

cgreen

1,002

asked Feb 8, 2017 at 23:36

1 vote

0 answers

53 views

Nonparametric density estimation, individual probabilities

Consider the problem of doing nonparametric density estimation using kernel density estimator in the common form $k(\frac{\textbf{x} - \mathbf{x_{j}}}{h})$, $k(\textbf{u}) = \begin{cases} 1 & \...

Martin

121

asked Jan 13, 2017 at 10:15

0 votes

1 answer

267 views

Understanding Gaussian Processes and their Priors

I am very interested in understanding why we use these priors, say in the context of regression. I know that the kernel depicts the distance between the points or let's ...

Xptrz

3

asked Nov 20, 2016 at 3:32

8 votes

1 answer

1k views

Nonparametric nonlinear regression with prediction uncertainty (besides Gaussian Processes)

What are state-of-the-art alternatives to Gaussian Processes (GP) for nonparametric nonlinear regression with prediction uncertainty, when the size of the training set starts becoming prohibitive for ...

lacerbi

5,226

asked Aug 5, 2016 at 14:53

8 votes

1 answer

276 views

Dirichlet process mixture MCMC

I'm reading Markov Chain Sampling Methods for Dirichlet Process Mixture Models by Radford M. Neal. Equation (3.6) states that $$ \text{If } c=c_{j} \text{ for some } j\neq i: P\left(c_{i}=c\;|\;c_{-i}...

Daeyoung

1,142

asked Apr 9, 2016 at 15:04

1 vote

0 answers

444 views

Need for iid in MLE

I am studying parametric estimation in supervised learning using maximum likelihood estimation. Here is what I learned: Separate our training data according to class; i.e., we have c data sets ...

nSv23

235

asked Jul 15, 2015 at 1:27

4 votes

0 answers

44 views

German tank variant: estimate resolution of camera given cropped photo sizes

Make whatever assumptions you like, but I like the flavor of nonparametric techniques. I have a list of the $x_i$ by $y_i$ resolutions of a number of photos, all cropped from photos taken at the same ...

Simon Kuang

2,121

asked May 25, 2015 at 5:23

4 votes

0 answers

408 views

Is this how a Bayesian bootstrap works?

I am a bit new to the whole nonparametric and Bayesian idea, so tell me if this is correct: to estimate, say, the mean of a dataset's population we do the following: We define a function $f(x)$ that ...

Simon Kuang

2,121

asked Apr 18, 2015 at 19:16
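
The excerpt above is cut off, so the following is not the asker's procedure. It is a minimal sketch of the standard Bayesian bootstrap for a population mean, assuming flat Dirichlet(1, ..., 1) weights on the observed points. The function name `bayesian_bootstrap_mean`, the data values, and the number of draws are made up for illustration.

```python
import random

def bayesian_bootstrap_mean(data, n_draws=2000, seed=0):
    """Return posterior draws of the population mean via the Bayesian bootstrap."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        # Dirichlet(1, ..., 1) weights: normalise independent Exp(1) variates.
        g = [rng.expovariate(1.0) for _ in data]
        total = sum(g)
        # Each draw is a weighted mean of the observed data.
        draws.append(sum(w * x for w, x in zip(g, data)) / total)
    return draws

data = [2.1, 3.4, 1.9, 5.0, 4.2, 3.3]  # made-up observations
draws = sorted(bayesian_bootstrap_mean(data))
lo = draws[int(0.025 * len(draws))]
hi = draws[int(0.975 * len(draws))]
print(f"95% credible interval for the mean: ({lo:.2f}, {hi:.2f})")
```

Unlike the classical bootstrap, which resamples points with integer counts, each draw here reweights all observations with continuous weights, so no observation is ever dropped entirely.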

1 vote

1 answer

513 views

Likely mean of a multinomial distribution with Dirichlet prior

I am working to create a Bayesian non-parametric estimate of the mean of a distribution given a distribution of observations. Ultimately I'd like to get to a credibility interval of the likely mean of ...

Justin Bozonier

1,229

asked Nov 11, 2014 at 3:58

2 votes

1 answer

177 views

What does "CRP is a marginalized version of PYP" mean?

I keep reading the phrase "CRP is a marginalized version of PYP" and I don't know what it means. What are the parameters/latent variables we are marginalizing out to derive CRP from PYP?

user3639557

1,502

asked Nov 11, 2014 at 1:29

5 votes

1 answer

857 views

Gaussian Process and Expectation Propagation time complexity?

What's the time complexity of training a Gaussian process and its Expectation Propagation approximation? (Before studying them, I'd like to understand if they are even feasible for my application)

MWB

1,337

asked May 15, 2014 at 10:36

5 votes

4 answers

251 views

Probabilistic counterpart for kNN

We know that the Gaussian Mixture Model is a probabilistic counterpart of the k-means algorithm. Is there a probabilistic counterpart for kNN? (which is similar to k-means, but supervised.)

Daniel

1,586

asked Mar 17, 2014 at 20:28

2 votes

1 answer

504 views

Why semi/nonparametric models?

Increasing the flexibility of models makes them prone to overfitting. On the other hand, it looks to me that, if the function class $\mathcal{F}$ is too big, it is hard to prove bounds on ...

Daniel

1,586

asked Oct 30, 2013 at 19:13
