Questions tagged [high-dimensional]
Pertains to a large number of features or dimensions (variables) for data. (For a large number of data points, use the tag [large-data]; if the issue is a larger number of variables than data, use the [underdetermined] tag.)
406
questions
0
votes
0
answers
12
views
Bound of operator norm for Gaussian ensemble (wainwright example 6.2)
Consider $W \in \mathbb{R}^{n \times d}$ generated with i.i.d. $N(0,1)$ entries, theorem 6.1 in the Martin Wainwright HDS implies that
$$
\frac{\sigma_\max(W)}{\sqrt{n}} \leq 1 + \delta + \sqrt{\frac{...
1
vote
0
answers
13
views
Sampling from a hypersphere subject to a linear constraint? [duplicate]
I'm running into efficiency issues when trying to sample from a "hypercone" using rejection sampling. By a hypercone, I mean the set of vectors $C_{v,\beta} = \{w \sim N(0,1)\ |\ w^T v \geq \...
4
votes
1
answer
45
views
The sum of $O_p$ --$ O_p \left(s^2\frac{\log d}{n}+s\sqrt{\frac{\log d}{n}} \right) $
I read papers in the area of inference for high-dimensional graphical models and these papers always state the convergence rate of the estimator. Using $O_p$ is a good choice.
Maybe I made some ...
1
vote
0
answers
37
views
how to approximate the eigendecomposition of a correlation matrix when the data have been standardized?
Context
I am working to develop a penalized regression framework that will scale up to analyzing high dimensional data with a certain correlation structure. Let $X$ represent an $n \times p$ matrix of ...
0
votes
0
answers
21
views
Wide Shallow Neural Networks VS Deep NN
I have several key points of understanding, but I cannot reach a final conclusion on why shallow neural networks cannot model data as effectively as deep neural networks.
I understand that we can ...
6
votes
1
answer
111
views
Bound on Rademacher complexity using polynomial discrimination
This is lemma 4.14 in Wainwright's textbook on High-Dimensional Statistics, it states that given a class of function $\mathcal{F}$ has polynomial discrimination of order $v$, then for all integer $n$ ...
0
votes
1
answer
34
views
High dimensional regression with millions of covariates/features
as a matter of preamble, I am a machine learning researcher. I am interested if this community can point me to research and work showing settings that have performed regression where the number of ...
1
vote
1
answer
23
views
The covering number of a d-dim cube
In Martin Wainwright's textbook, equation (5.5) states that the $\delta$-covering number of the d-dimensional cube satisfies
$$
\log N(\delta; [0,1]^d) \asymp d \log(\frac{1}{\delta}),
$$
for small ...
2
votes
1
answer
54
views
Should we routinely conduct unsupervised learning when reporting descriptive statistics on data?
A standard approach prior to conducting a predictive or inferential analysis is to report some basic univariate descriptive statistics on the study variables: mean, median, minimum, maximum, variance, ...
0
votes
1
answer
44
views
What is the meaning of $\asymp$ and $\lesssim$ in Martin wainwright's high dim textbook? [closed]
Unfortunately, this text book did not provide a table of notations he used.
Can anyone provide me with a definition of $\asymp$ and $\lesssim$ and few examples?
For an example in the book, in display (...
1
vote
1
answer
47
views
Modeling a high dimensional multicollinear data
I am trying to predict a Plant physiology trait (y) from hyper spectral reflectance data from 400 to 2400nm (X). So far i have done the following
Skew correction with Square root (sqrt) on y
Scaling ...
1
vote
0
answers
61
views
Maximum Likelihood in High Dimensions [closed]
What are some examples of high-dimensional random variables for which MLE are solved using numerical methods because we are unable to explicitly solve the equations nicely? The only example to comes ...
1
vote
0
answers
20
views
Large $N$, small $T$ in SUR: workaround using system GMM
Consider a system of linear equations as in seemingly unrelated regression (SUR). If the number of equations $N$ is large relative to the sample size $T$, the weighting matrix in SUR (i.e. the error ...
1
vote
0
answers
16
views
Expected value of Cosinus in High dimension
I would like to prove that the cosinus of the angle formed by 3 randomly points tends to $\frac{1}{2}$ as the dimensionality tends to $\infty$. Could it be solved with the expected value formula ? It ...
1
vote
0
answers
21
views
Benjamini Hochberg Procedure [closed]
I am working on a problem for class related to multiple testing where I would like to run the BH procedure with a known $\pi_{0}$, denoting the proportion of hypothesis that are truly null, given ...