Newest 'nonparametric+density-estimation' Questions

1 vote

0 answers

40 views

How to show $\sup_{x\in [a,b]}|f_n(x)-f(x)|=O_p(\sqrt{\frac{\log n}{nh}}+h^2)$ when the kernel $K(\cdot) $ is of bounded variation?

Consider the kernel estimate $f_n$ of a real univariate density defined by $$f_n(x)=\sum_{i=1}^{n}(nh)^{-1}K\left\{h^{-1}(x-X_i)\right\}$$ where $X_1,...,X_n$ are independent and identically ...

Kevin

31

asked Apr 8 at 5:01

1 vote

0 answers

43 views

Why is histogram density estimation nonparametric?

My understanding of histogram density estimation: For $k$ predefined equal-width bins $(b_0, b_1], (b_1, b_2], ..., (b_{k-1}, b_k]$ and $n$ observations $x_1,...,x_n \in (b_0,b_k]$, we estimate ...

fin

11

asked Sep 15, 2023 at 16:38

0 votes

0 answers

85 views

Expected value (and variance) of a Dirichlet Process

Suppose I have a measure $G$ that follows a Dirichlet Process, $$G \sim DP(H_0,\alpha)$$ where $H_0$ is some base measure. Is there a closed form solution for the expected value of $G$?

dogs4ever

1

asked Apr 25, 2023 at 18:32

5 votes

2 answers

549 views

Is density estimation the same as parameter estimation?

I was studying parameter estimation from Sheldon Ross' probability and statistics book. Here the task of parameter estimation is described as follows: Is this task the same of density estimation in ...

tail

151

asked Apr 20, 2023 at 10:25

1 vote

0 answers

251 views

Bias of kernel density estimator of pdf $f$, where $f$ has bounded first derivative $f'$

Let's say the kernel density estimator is given by $$\hat f(x) = \frac{1}{nh_n} \sum_{i=1}^n K\left(\frac{X_i-x}{h_n}\right),$$ where $h_n \to 0$, $nh_n \to \infty$, $K$ a symmetric probability ...

Phil

636

asked Nov 28, 2022 at 4:47

0 votes

0 answers

40 views

Kernel Density Estimator: Misunderstanding in Taylor Series and the bias of KDE [duplicate]

Let's say the kernel density estimator is given by $\hat f(x) = \frac{1}{nh_n} \sum_{i=1}^n K(\frac{X_i-x}{h_n})$, where $h_n \to 0$, $nh_n \to \infty$, $K$ a symmetric probability distribution ...

Phil

636

asked Nov 27, 2022 at 19:42

0 votes

0 answers

50 views

How to prove symmetry of a Uniform kernel?

I am trying to prove this kernel is valid, $$ K(x) = \frac{1}{2}I(-1 < x < 1) $$ So far I can integrate to 1, but how do I prove $$k(x) = k(-x)$$ Also, how do we satisfy that k(x) is $\ge$ 0 for ...

user359211

1

asked May 27, 2022 at 5:29

1 vote

0 answers

102 views

Optimal rate of convergence of nonparametric density estimators

Suppose that $X_1, X_2, \dots, X_n$ forms an independent and identically distributed sample from some $d$-dimensional probability distribution with unknown probability density function $f$. Let $x$ be ...

lmaosome

140

asked Oct 22, 2021 at 8:24

1 vote

0 answers

274 views

histogram vs. kernel in density estimation

Assume we have a problem of estimation of a density $f(x)$ over an interval $[0, 1]$. Can a regular histogram (i.e. with equal-sized bins) be viewed as some kind of a kernel?

ABK

676

asked Apr 13, 2021 at 8:01

1 vote

0 answers

135 views

Extraction of modes from a multi-modal density function

I am trying to extract modes from a multi-modal density function and not just peaks. For example, in the two density functions below (images), I would like to extract the curves contained in the black ...

curiosus

323

asked Mar 3, 2021 at 15:31

1 vote

0 answers

107 views

Convex hull version of density estimation (or lines of constant density)

Background: So I had a thought, tried it out, and liked what it did. I'm sure someone else has done this. It feels very convenient. It also gives an interesting take on robust nonparametric density ...

EngrStudent

9,580

asked Dec 8, 2020 at 15:09

0 votes

0 answers

289 views

Building a classifier using Parzen window

Considering the application of the Parzen window method to model a probability density function in a binary classification problem, and assume a training set where the 4 points {−5, −1, 1, 5} belong ...

AfonsoSalgadoSousa

113

asked Nov 3, 2020 at 19:57

2 votes

1 answer

39 views

Why might the functional form of a distribution be "inappropriate" for a particular application?

Working through Bishop's Pattern Recognition and Machine Learning(a great read so far!) and on page 67 he says: "One limitation of the parametric approach is that it assumes a specific ...

stochasticmrfox

1,617

asked Oct 30, 2020 at 21:47

2 votes

0 answers

41 views

Unexpected zero on posterior density of Dirichlet process mixture

I was reading this notebook from the PyMC3 documentation about Dirichlet Process Mixtures and, on the last figure, the estimated density reaches almost zero for a particular value, despite the ...

PedroSebe

2,680

asked Oct 27, 2020 at 5:18

4 votes

0 answers

442 views

Derivation of k nearest neighbor classification rule

One way to derive the k-NN decision rule based on the k-NN density estimation goes as follows: given $k$ the number of neighbors, $k_i$ the number of neighbors of class $i$ in the bucket, $N$ the ...

diegobatt

426

asked Oct 27, 2020 at 4:10

0 votes

0 answers

337 views

Is a non-parametric density estimation required for a bimodal distribution?

How to approach the following two cases is clear, I am mentioning them to set up my question. (Case 1): For data that appears to be a Gaussian distribution, we can assume the distribution is Gaussian ...

ManUtdBloke

893

asked Aug 12, 2020 at 10:37

1 vote

1 answer

353 views

How Parzen window density estimate $f_n$ converges to f

I am trying to understand how Parzen window density estimate converges to actual density function f(x).[Actually i am trying to learn machine learning on my own using available free resources. Please ...

Nascimento de Cos

167

asked Mar 18, 2020 at 13:09

3 votes

1 answer

100 views

Usefulness of MISE

I'm currently in a class on nonparametric smoothing, and, while talking about density estimation in general, the professor introduced the notion of MISE (mean integrated square error): $\text{MISE}\...

CLL

229

asked Feb 24, 2020 at 9:37

4 votes

1 answer

2k views

Is it appropriate to examine the density plot for time series data?

Usually we use time plot to examine the behaviour of time series data cause it reveals the chronological characteristic. Does it make sense that one looks at the data distribution using some non-...

Seymour

120

asked Feb 21, 2020 at 8:36

2 votes

1 answer

839 views

Convergence of kernel density estimate as the sample size grows

Let $X\sim\text{Normal}(0,1)$ and let $f_X$ be its probability density function. I conducted some numerical experiments in the software Mathematica to estimate $f_X$ via a kernel method. Let $\hat{f}...

user269666

285

asked Jan 11, 2020 at 15:19

1 vote

0 answers

131 views

What is the resulting distribution of a data set that was originally normally distributed but has been quantized and had all negative values removed?

I am trying to benchmark a seasonal forecasting model and calculate not just the point forecasts but the forecast densities from the model. To do this, I generated a simulated data set in the ...

Akaike's Children

1,381

asked Dec 4, 2019 at 23:56

5 votes

1 answer

698 views

Expected value and variance of KDE

I need to find the expected value and variance of KDE given that $$(i) E[u] = 0 \to \int u\phi(u)du=0\\ (ii)V[u] = \sigma^2 \to \int u^2\phi(u)du=\sigma^2$$ where $\phi$ is the kernel function. I've ...

thenac

361

asked Dec 3, 2019 at 20:55

1 vote

0 answers

42 views

Difficulties with orthogonal density estimation

I am working on an implementation of an orthogonal density estimator, using the basis $$ \psi_0(t) = 1, \quad \psi_{2j}(t) = \sqrt{2}\text{cos}(2\pi j t), \quad \psi_{2j+1}(t) = \sqrt{2}\text{sin}(2\...

chris75

21

asked Oct 18, 2019 at 5:42

4 votes

1 answer

1k views

Properties of Kernel Density Estimators

Given Let $X \in \mathbb{R}$ be a real-valued random variable with theoretical probability density function (pdf) $f(x)$ and corresponding cumulative distribution function (cdf) $F(x)$. Let $X_1, X_2,...

inkalchemist1994

335

asked Mar 13, 2019 at 14:47

1 vote

1 answer

160 views

Credibility evaluation - how to model conditional continuous density from multiple variables of various types?

I recently got dataset for 37000 households with declared income and a few dozens of other variables of various types: continuous, discrete, binary. The task is to automatically (unsupervised) ...

Jarek Duda

421

asked Jan 12, 2019 at 16:13

2 votes

2 answers

159 views

Dvoretzky-Kiefer-Wolfowitz Vs. KDE fractional convergence

The DKW bound says, roughly and under very general assumptions, that the empirical CDF of $n$ iid samples of a random variable $X$ converges to the exact CDF of $X$ exponentially with the number of ...

Amir Sagiv

223

asked Feb 25, 2018 at 11:42

1 vote

2 answers

173 views

Closeness of 2-parametric discrete distributions when first 2 moments are matching

Let $\mathcal{D}$ be a particular 2-parameter uni-variate discrete distribution family, and let $D(\theta_1, \theta_2) \in \mathcal{D}$ be one particular distribution from this family, where $\theta_i ...

Abhiram Natarajan

123

asked Sep 19, 2017 at 2:30

2 votes

1 answer

183 views

What are some of the common techniques for density estimation?

I'm trying to estimate the probability density function of a real random variable given its iid realizations. What are some of the standard techniques to do this? One method I have heard of is the ...

Richard Simmons

21

asked Jun 25, 2017 at 18:00

4 votes

2 answers

4k views

Leave one out cross validation in kernel density estimation

I am taking a look at : http://pages.cs.wisc.edu/~jerryzhu/cs731/kde.pdf Where they define the following loss function for kernel density estimates $$J(h) = \int \hat{f_n}^2(x)dx -2\int\hat{f_n}(x)...

user2879934

543

asked May 4, 2017 at 20:58

9 votes

2 answers

3k views

Estimating the gradient of log density given samples

I am interested in estimating the gradient of the log probability distribution $\nabla\log p(x)$ when $p(x)$ is not analytically available but is only accessed via samples $x_i \sim p(x)$. There ...

jkt

563

asked Apr 13, 2017 at 0:18

1 vote

0 answers

190 views

Optimal bandwidth selection in conditional density estimation

Consider the situation that we are estimating a $d$-dimensional density (with suitable regularity conditions) using kernel density estimation, [Method1,conditional density estimation] We can proceed ...

Henry.L

2,480

asked Mar 14, 2017 at 16:50

2 votes

1 answer

840 views

Scaling up the bandwidth for kernel density estimation

Suppose I have $(\mathbf{X}_1, \cdots, \mathbf{X}_n)$ from a multivariate distribution $f$. The multivariate KDE is \begin{align*} \widehat{f}_\mathbf{H}(\mathbf{x}) = n^{-1}\sum_{i=1}^{n}K_\mathbf{H}(...

Tom Chen

621

asked Feb 22, 2017 at 17:50

1 vote

0 answers

53 views

Nonparametric density estimation, individual probablities

Consider the problem of doing nonparametric density estimation using kernel density estimator in the common form $k(\frac{\textbf{x} - \mathbf{x_{j}}}{h})$, $k(\textbf{u}) = \begin{cases} 1 & \...

Martin

121

asked Jan 13, 2017 at 10:15

0 votes

0 answers

33 views

Density estimation for points regularly spaced on a grid? Infer spacing between pdf peaks?

Due to a fundamental characteristic of the data, points are clustered together on a 1-D grid-like structure with equal spacing. Plotting these points in a histogram shows a pdf with several ...

ShanZhengYang

693

asked Aug 4, 2016 at 6:44

9 votes

2 answers

6k views

Density estimation for large dataset

I have a unidimensional data set with more than 1000000 observations. Assuming that those observations are independent realizations of the same random variable I need to estimate the underling ...

Mur1lo

1,375

asked Jun 20, 2016 at 17:07

2 votes

1 answer

381 views

Learn a distribution from distributions on samples [closed]

There's many good ways to learn a distribution $p_X$ of an r.v. $X$ over $k$ symbols given many i.i.d. samples $X_1,\ldots, X_n$. The simplest is to use the sample relative frequencies $\hat{f}_X$ as ...

chausies

421

asked Feb 29, 2016 at 21:48

3 votes

3 answers

223 views

Literature on nonparametric density estimation

I am about to write my bachelor thesis about non-parametric density estimation, especially kernel density estimators and their application in classification. As I am quite new to looking for academic ...

Matt

33

asked Apr 4, 2014 at 18:02

16 votes

3 answers

5k views

Where is density estimation useful?

After going through some slightly terse mathematics, I think I have a slight intuition of kernel density estimation. But I am also aware that estimating multivariate density for more than three ...

lovekesh

469

asked Jan 17, 2014 at 11:37

4 votes

3 answers

251 views

Fast multivariate unimodal density estimator

I have a sample $\boldsymbol{x}_i$ for $i$ in $1,\dots, n$, from a $d$ dimensional density $f(\boldsymbol{x})$ and I would like to estimate this unknown density. In addition I know that $f(\boldsymbol{...

Matteo Fasiolo

3,264

asked May 29, 2013 at 21:35

All Questions

Related Tags