Newest 'nonparametric+machine-learning' Questions

4 votes

1 answer

52 views

In what ways is Gaussian Process Regression both parametric and non-parametric?

Gaussian Process Regression is considered a "non-parametric" model. However, the term "non-parametric" is often used imprecisely to mean different things, leading to questions ...

socialscientist

847

asked Apr 16 at 19:42

0 votes

0 answers

4 views

learning guarantees for gaussian weighting of training points

I have my training data for binary classification that consists of $N$ pairs $$(x_i\in R^F, y_i \in {-1, 1})$$ $i\in [1,\dots,N]$. My classification rule of a new point $x$ is simply $$ \hat{y}(x) = \...

Franco Marchesoni

131

asked Oct 7, 2023 at 10:12

1 vote

0 answers

171 views

Projection pursuit regression

Projection pursuit regression (PPR) is described in Hastie et al.'s The Elements of Statistical Learning in the chapter on neural networks. The algorithm was introduced by Friedman and Stuetzle (1981)....

Estacionario

751

asked Jul 24, 2023 at 8:15

5 votes

2 answers

549 views

Is density estimation the same as parameter estimation?

I was studying parameter estimation from Sheldon Ross' probability and statistics book. Here the task of parameter estimation is described as follows: Is this task the same of density estimation in ...

tail

151

asked Apr 20, 2023 at 10:25

4 votes

3 answers

479 views

Perfect Prediction: Why Would We Ever Use a Statistical Model?

Dear statistics experts I need your help with something that has bothered me for a while now. My problem revolves around perfect prediction and essentially boils down to: Why would we ever set up and ...

This_is_it

69

asked Nov 19, 2022 at 14:03

1 vote

0 answers

39 views

validation and calibration of crop yield data using conditional inference trees

I am trying to validate and calibrate the conditional inference tree model using the crop yield data, and I started by splitting my dataset into training and test sets. After splitting, I had to ...

Jovin Vicent

11

asked May 1, 2022 at 19:37

3 votes

0 answers

66 views

Looking for the Holy Grail of nonparametric regression

Unfortunately, to state precisely the question, I need some formal preliminaries. Let $d \in \mathbb{N}$. For each $d^* \in \{1,\dots,d\}$, define $\mathcal{M}_{d^*}$ be the set of probability ...

Bob

193

asked Apr 14, 2022 at 20:10

1 vote

0 answers

159 views

which non parametric test to use for anomalous NN model outputs

Assume I have a bunch of trained NN models for classifying MNIST. All of them except one was trained on the same training set while the one was trianed on a different training set (could have ...

Sam

403

asked Mar 1, 2022 at 12:55

1 vote

0 answers

436 views

AIPW and Cross-fitting (Stanford stat361)

I am reading lecture note (Stanford stat361: https://web.stanford.edu/~swager/stats361.pdf) written by Stefan Wager. At page 23-24 the author states dependent summands become independent after ...

Ivan.lee

39

asked Oct 27, 2021 at 15:23

3 votes

1 answer

215 views

Random forest with nonnegative dependent variable

I have a modeling framework with an outcome that must necessarily be positive. In the training data, the outcome ranges from close to zero to much higher (approximately 0.05 to 100). Is there a way to ...

bob

725

asked May 10, 2021 at 13:50

5 votes

1 answer

2k views

Is it possible to use variational autoencoders with Non-Gaussian data?

I am dealing with two scenarios: 1) Non-Gaussian data distribution and 2) non-stationary data). First, I am planning to use a variational autoencoder for modeling the probability distribution of the ...

Amhs_11

333

asked Mar 30, 2021 at 17:18

1 vote

0 answers

135 views

Extraction of modes from a multi-modal density function

I am trying to extract modes from a multi-modal density function and not just peaks. For example, in the two density functions below (images), I would like to extract the curves contained in the black ...

curiosus

333

asked Mar 3, 2021 at 15:31

3 votes

1 answer

337 views

What is the difference between sieve estimation and structural risk minimization?

I was wondering if you could help me out. I am quite confused about the difference between sieve estimators (Ulf Grenander) and structural risk minimization (SRM) (Vladimir Vapnik). Could anyone give ...

vshas

131

asked Jan 19, 2021 at 16:20

1 vote

0 answers

888 views

what are the main differences between parametric and non-parametric machine learning algorithms?

I am interested in parametric and non-parametric machine learning algorithms, their advantages and disadvantages and also their main differences regarding computational complexities. In particular I ...

john price

11

asked Dec 16, 2020 at 8:17

2 votes

1 answer

39 views

Why might the functional form of a distribution be "inappropriate" for a particular application?

Working through Bishop's Pattern Recognition and Machine Learning(a great read so far!) and on page 67 he says: "One limitation of the parametric approach is that it assumes a specific ...

stochasticmrfox

1,617

asked Oct 30, 2020 at 21:47

4 votes

0 answers

442 views

Derivation of k nearest neighbor classification rule

One way to derive the k-NN decision rule based on the k-NN density estimation goes as follows: given $k$ the number of neighbors, $k_i$ the number of neighbors of class $i$ in the bucket, $N$ the ...

diegobatt

426

asked Oct 27, 2020 at 4:10

1 vote

2 answers

451 views

Which Nonparametric Model to use for Small Time Series?

I have the following data: ...

caproki

129

asked Aug 7, 2020 at 20:50

1 vote

1 answer

258 views

Does a non-parametric model necessarily have zero bias?

For a parametric model like linear regression, the bias is often interpreted as "the parameters & architecture you chose are inappropriate for the shape of this dataset". For (one ...

kennysong

1,061

asked Jul 14, 2020 at 5:10

2 votes

1 answer

56 views

Quantifying importance of a parameter in neural networks' prediction

Say I'm given a neural network, parameterized by a $d$-dimensional vector $\theta$, and an input $x$. Given the prediction of this model $f_{\theta}(x)$, can I somehow quantify importance of each of $...

SpiderRico

213

asked May 16, 2020 at 6:36

1 vote

1 answer

50 views

What are the implications of a nonparametric machine learning algorithm?

I've been looking into the advantages of using a Random Forest classifier and stumbled upon this random forests are non-parametric Looking at the definition of what non-parametric statistics mean, ...

emilaz

111

asked May 7, 2020 at 11:40

4 votes

1 answer

1k views

Can someone explain why neural networks are highly parameterized?

I understand that neural networks by definition, are a parametric model. If I am correct, Parametric methods make an assumption about the functional form, or shape, of f. For a neural network, what ...

user277337

71

asked Apr 21, 2020 at 3:45

1 vote

1 answer

353 views

How Parzen window density estimate $f_n$ converges to f

I am trying to understand how Parzen window density estimate converges to actual density function f(x).[Actually i am trying to learn machine learning on my own using available free resources. Please ...

Nascimento de Cos

167

asked Mar 18, 2020 at 13:09

0 votes

0 answers

14 views

Doubt in kernel based method - unit hypercube(Parzan window estimate)

I recently started studying pattern recognition on my own. Please clarify me the following. https://books.google.co.in/books?id=T0S0BgAAQBAJ&pg=PA53&lpg=PA53&dq=hypercube+of+side+h&...

Nascimento de Cos

167

asked Mar 17, 2020 at 13:14

1 vote

0 answers

50 views

Why mixture model with Gibbs sampling works?

I just have a question about why Gibbs sampling can correctly estimate parameters with random initial value? That is to say,we can sample z by: \begin{align} p(z_i=k \,|\, \cdot) &\...

yi li

131

asked Aug 13, 2019 at 2:20

2 votes

0 answers

327 views

Is kernalized linear regression parametric or nonparametric?

We know that for linear regression, we can predict: $$\hat{y} = w^Tx +b$$ Where $w$ is the parameter that minimizes the square loss. It is easy to prove that for the final solution using gradient ...

Ibrahim

21

asked Jul 11, 2019 at 4:42

1 vote

0 answers

23 views

Why is "consistent nearest neighbour" Non-parametric? [duplicate]

Definition of "Consistent nearest neighbour", runs our usual KNN classifier but instead of viewing k as a hyper-parameter it always sets k = ceil[log(n)]. So far, I looked-up many references and ...

M.Hossein Rahimi

195

asked Apr 5, 2019 at 2:21

3 votes

0 answers

205 views

Parametric vs non-parametric machine learning methods [duplicate]

I looked-up many references and websites and researched on how to determine if a method is between parametric or non-parametric. I came up with below definitions, A parametric algorithm has a fixed ...

M.Hossein Rahimi

195

asked Mar 14, 2019 at 18:06

11 votes

1 answer

514 views

Do Stochastic Processes such as the Gaussian Process/Dirichlet Process have densities? If not, how can Bayes rule be applied to them?

The Dirichlet Pocess and Gaussian Process are often referred to as "distributions over functions" or "distributions over distributions". In that case, can I meaningfully talk about the density of a ...

snickerdoodles777

790

asked Feb 17, 2019 at 19:13

1 vote

1 answer

986 views

Estimating conditional probability with many samples

I am confused about the estimation of conditional probabilities. Suppose I want to predict a binary outcome variable $Y = 0,1$ given $n$ categorical features $X = (X_1, \ldots, X_n)$, i.e. to ...

user227451

63

asked Nov 19, 2018 at 17:33

0 votes

1 answer

184 views

How to know which two hyperparameters are more important in SVM, KNN and MLP?

I am trying to limit myself to a maximum two hyper-parameters that are important in KNN, SVM and ...

Kim Zac

5

asked Nov 4, 2018 at 14:59

2 votes

0 answers

133 views

Smooth regression algorithms that produce zero training error

I am looking to fit three regression functions $f_1, f_2, f_3:\mathbb{R}^2 \to \mathbb{R}$. For example, let's say $X_1$ is time, $X_2$ is geographic latitude, $f_1$ is the temperature, $f_2$ is the ...

User191919

201

asked Aug 7, 2018 at 21:05

1 vote

1 answer

262 views

Approximate a CDF

Suppose we have $n$ equations with an integral of the form $\int_0^{x_i} F(z)dz = c_i,\ i=1,\ldots,n$ where $F(y)=\mathbb{P}(X \le y)$ is an unknown cumulative distribution function of a non-negative ...

Kumar

719

asked Jul 21, 2018 at 10:47

1 vote

0 answers

36 views

Dimension reduction with semi-supervised embeddings

Is there a dimension reduction method (linear or non-linear) where the embeddings/projections of some of the input points are already known in advance and are taken into account during parameter ...

gkcn

113

asked Jan 11, 2018 at 17:10

1 vote

1 answer

2k views

Scikit Learn DBSCAN with Dice Coefficient

I am trying to cluster a high dimensional data set - Young People Survey Data https://www.kaggle.com/miroslavsabo/young-people-survey This is my first pass and wanted to give clustering the entire ...

jainp

43

asked Sep 12, 2017 at 5:18

44 votes

4 answers

69k views

What exactly is the difference between a parametric and non-parametric model?

I am confused with the definition of non-parametric model after reading this link Parametric vs Nonparametric Models and Answer comments of my another question. Originally I thought "parametric vs ...

Haitao Du

37.2k

asked Mar 20, 2017 at 13:54

0 votes

0 answers

402 views

Non-parametric non-linear regression with deep learning

I have a situation where I have an increasing list of real numbers $\vec a$ of variable length (generally about 50 numbers but sometimes more). It turns out that these numbers uniquely correspond to ...

rhombidodecahedron

3,152

asked Feb 17, 2017 at 16:33

8 votes

2 answers

2k views

Bayesian nonparametric answer to deep learning?

As I understand it, deep neural networks are performing "representation learning" by layering features together. This allows learning very high dimensional structures in the features. Of course, it's ...

cgreen

1,002

asked Feb 8, 2017 at 23:36

0 votes

0 answers

472 views

Relation between Nonparametric Statistics and Statistical Learning Theory

I used to hear some Statistics professor complaining about Machine Learning theories: "It is just Non-parametric Statistics". And, when I read Vapnik's book "Statistical Learning Theory", it seems he ...

user112758

768

asked Jan 27, 2017 at 5:04

2 votes

1 answer

520 views

Why is a parametric classifier faster to train than a non-parametric one?

In the tutorial Parametric and Nonparametric Machine Learning Algorithms it says that parametric classifiers are faster than non-parametric classifiers. The reason that non-parametric classifiers are ...

AdiT

295

asked Dec 18, 2016 at 12:59

2 votes

0 answers

116 views

How to adjust ratings of N items by pairwise comparisons

I have been keeping a list of movies I've seen in a spreadsheet and assigning them numerical rankings that approximate how I feel about them. A few years ago I implemented a program to read in the ...

Pavel Komarov

1,347

asked Dec 3, 2016 at 5:48

10 votes

1 answer

9k views

Why KNN and SVM with a gaussian are non-parametric models?

I was told that these two are non-parametric models. But I can't figure out why, especially for KNN. Could anyone answer my questions?

Hanamichi

653

asked Sep 27, 2016 at 6:42

8 votes

1 answer

1k views

Nonparametric nonlinear regression with prediction uncertainty (besides Gaussian Processes)

What are state-of-the-art alternatives to Gaussian Processes (GP) for nonparametric nonlinear regression with prediction uncertainty, when the size of the training set starts becoming prohibitive for ...

lacerbi

5,226

asked Aug 5, 2016 at 14:53

3 votes

0 answers

57 views

Family of flexible parametric mappings $f_\theta:(0,1) \rightarrow \mathbb{R}$?

For the purpose of reparameterizing a model (mostly with the goal of improving MCMC efficiency), I am looking for a family of flexible parametric mappings $f_\theta:(0,1) \rightarrow \mathbb{R}$ such ...

lacerbi

5,226

asked Jul 25, 2016 at 13:23

2 votes

1 answer

381 views

Learn a distribution from distributions on samples [closed]

There's many good ways to learn a distribution $p_X$ of an r.v. $X$ over $k$ symbols given many i.i.d. samples $X_1,\ldots, X_n$. The simplest is to use the sample relative frequencies $\hat{f}_X$ as ...

chausies

421

asked Feb 29, 2016 at 21:48

1 vote

2 answers

730 views

Machine Learning Procedure for Fractional/Proportional Data?

I am looking for some suggestions of machine learning procedures that work to predict fraction outcomes where the outcome variables $\in [0,1]$. Can you provide me with any suggestions? I thought ...

StatsStudent

11.5k

asked Jan 30, 2016 at 1:17

1 vote

1 answer

236 views

Kernel nonparametric regression

One of the methods for nonparametric regression is using kernels. My question is what are the conditions on the kernels functions in this method? In other words how can I decide if a given function ...

toroto

109

asked Jan 12, 2016 at 8:51

0 votes

1 answer

30 views

How to compute the unconditioned density in $1NN$ classier?

Suppose I have $50$ training points $x_1$, $x_2,\ldots,x_{50}$ and they are distributed via bimodal Gaussian on real line. Now, given a new point, for $1NN$, I am trying to find a interval around $x$ ...

JumpJump

210

asked Oct 1, 2015 at 17:36

53 votes

9 answers

3k views

Are all models useless? Is any exact model possible -- or useful?

This question has been festering in my mind for over a month. The February 2015 issue of Amstat News contains an article by Berkeley Professor Mark van der Laan that scolds people for using inexact ...

Russ Lenth

20.8k

asked Apr 2, 2015 at 0:59

1 vote

0 answers

124 views

What are some examples of applied machine learning problems that requires using mixed models?

What are some examples of applied machine learning problems that requires using mixed models? I'm just introduced to the notion of mixed models. As I understand it, it is a combination of parametric ...

qazwsx

737

asked May 19, 2014 at 23:58

4 votes

1 answer

2k views

Friedman's test to identify best of multiple classifiers on multiple domains

I have several classifiers $f_i\ (i=1, \cdots, N)$ and calculated performance measures on multiple domains $(D)$ for each. Thus, there are $N \times D$ values. I want to find out (increasing ...

Chris

599

asked May 6, 2014 at 15:09

All Questions

Related Tags