Questions tagged [statistics]
Mathematical statistics is the study of statistics from a mathematical standpoint, using probability theory and other branches of mathematics such as linear algebra and analysis.
11,528
questions with no upvoted or accepted answers
7
votes
0
answers
260
views
Expectation of maximum of minimums of permutations
Assume $n$ random permutations $\pi_1,\pi_2,\ldots,\pi_n: \lbrace 1,2,\ldots,m \rbrace \rightarrow \lbrace 1,2,\ldots,m \rbrace$. Let $X_i = \min(\pi_1(i),\pi_2(i),\ldots,\pi_n(i))$ and $Y = \max(X_1, ...
7
votes
1
answer
6k
views
Expectation of truncated log-normal
Let's assume that $y=e^x$, where $x\sim N(\mu,\sigma^2)$, that is, $y$ follows a lognormal distribution.
I'm interested in finding how $\mathbb{E}\left[y|y\geq a\right]$ varies with $\mu$ and $\...
7
votes
1
answer
146
views
Find a function such that follows to normal in distribution
Suppose that $X_{n}\sim \text{Binomial}(n,\theta)$, where $n=1,2,\ldots$ and $0<\theta<1$. Find a function $g$ such that $\sqrt{n}(g(\frac{1}{n}X_n)-g(\theta))\xrightarrow{D} N(0,1)$ for each ...
7
votes
1
answer
483
views
An estimator for the c.d.f $F$ at a point $x_0$?
Problem: Let $X_1,X_2,\ldots,X_n$ be independent identically distributed random variables (i.i.d's) with common CDF $F$. Fix $x_0\in\mathbb{R}$ and find an unbiased estimator for $F(x_0)$. Show that ...
7
votes
0
answers
1k
views
The distribution of the ith order statistic for discrete random variables
Assume $(X_i)_{i=1,...,n}$ are a sequence of real iid random variables with continuous density $p_x$.
We know that $$Y:=\sum_{i=1}^n 1\{X_i\leq u\}\sim Bin(n,F_x(u)),$$ since $1\{X_i\leq u\}\sim Ber(...
7
votes
0
answers
308
views
Decrease of entropy when iterating a random discrete function
Let $m$ be a positive integer. Let $S$ be the set of non-negative integers $x$ less than $m$, with $|S|=m$. Let $X_0$ be the discrete uniform distribution over $S$, with $P(x)=\begin{cases}
1/m & \...
7
votes
1
answer
8k
views
What is the Fisher information for the parameter $\theta$ in a uniform distribution with likelihood $f(X,\theta)=\frac1\theta 1\{0\le x\le\theta\}$?
If X is U[$0$,$\theta$], then the likelihood is given by $f(X,\theta) = \dfrac{1}{\theta}\mathbb{1}\{0\leq x \leq \theta\}$. The definition of Fisher information is $I(\theta) = \mathbb{E} \left[ \...
6
votes
0
answers
115
views
Smallest eigenvalue of matrix with random elements (non-central Wishart)
Suppose that $X \in \mathbb R^{d \times n}$ is a random matrix with independent entries, each of which follows the standard normal law $\mathcal N(0, 1)$, and that $M \in \mathbb R^{d \times n}$ is a ...
6
votes
0
answers
121
views
The statistical average of a continuous value: $\overline{O} = \int O(x) \rho(x) dx$, but coordinate invariant
I am trying to solve a Lagrange multiplier problem for the following equation
$$
L= - \int_{-\infty}^\infty \rho(x) \ln \frac{\rho(x)}{q(x)} dx + \alpha \left( 1- \int_{-\infty}^\infty \rho(x) dx \...
6
votes
1
answer
516
views
How to lower bound $\tau$ based on the expression of $H$?
Let $A=\{a_{ij}\}_{1\le i,j\le n}$ be an $n$ by $n$ normalized symmetric Gaussian random matrix with $E[a_{ij}]=0$ and $E[a_{ij}^2]=1/n$. Ordering its eigenvalues by $\lambda_1\le \lambda_2\le \cdots \...
6
votes
0
answers
251
views
Is the Fisher-Information even continuous in a regular statistical model?
Definition (Regular Model [1, p. 203]).
A standard statistical model $\big( X, \mathcal F, (\mathbb P_{\vartheta})_{\vartheta \in \Theta}\big)$, where $\Theta \subset \mathbb R$ is an open interval, $...
6
votes
0
answers
245
views
Why doesn't the Borel-Kolmogorov paradox cause problems in practice?
The Borel-Kolmogorov paradox shows that the usual formula for conditional density $f_{X|Y}(x|y) = f_{X,Y}(x, y)/f_Y(y)$ can lead to inconsistent results depending on the coordinate system that is used ...
6
votes
0
answers
143
views
Correct measure in concentration inequalities or hypothesis testing
In most discussions of concentration inequalities or calculations of rejection region in hypothesis testing, the measure used is left vague. For example, for independent random variables $X_1, \ldots, ...
6
votes
0
answers
205
views
Is there a pattern to the coefficients in the piecewise equations of the Irwin–Hall distributions?
Intro and Problem Statement
The Irwin–Hall distribution is a probability distribution of the sum of $n$ independent, uniformly-distributed, continuous random variables in the interval $[0, 1]$. The ...
6
votes
0
answers
365
views
Estimate nearly-singular Gaussian covariance matrix
Suppose I have $m$ samples drawn from a Gaussian in $\mathbb{R}^n$, and need sample covariance $\Sigma_m$ to be $\epsilon$-close to true covariance $\Sigma$:
$$E\|\Sigma_m-\Sigma\| \le \epsilon \|\...
6
votes
0
answers
163
views
Creating a Skyrim Alchemy metric: combining the binomial theorem and birthday paradox
I am trying to create a metric to sort alchemy ingredients by in Skyrim, I wanted to sort the ingredients by "probability of yielding a potion with the number of ingredients I have". The ...
6
votes
0
answers
390
views
What is the expected value of the inverse of a Wishart matrix plus a scaled identity matrix?
I'm trying to find the following:
$$\mathbb{E}\left[\left(\mathbf{W}+\lambda\mathbf{I}\right)^{-1}\right]$$
where $\mathbf{W}\sim\mathcal{W}_{p}(n,\mathbf{\Sigma})$ is Wishart-distributed, $\lambda\ge ...
6
votes
0
answers
295
views
Book recommendation for convergence (Almost Surely, probability, distribution) concepts in statistics.
I want to study the convergence concepts in statistics. I am mainly interested in topics like
Convergence in Probability
Convergence in Distribution
Almost Surely convergence
I can't seem to find ...
6
votes
0
answers
254
views
Given a derived random variable $Z=f(X_1,\dots, X_n)$ on $X_j$, what can we say about $X$?
Let $Z$ be a derived random variable given by some function $f$, i.e.
$Z=f(X_1,\dots, X_n)$, where the $X_i\sim X$ are continuous/non-atomic and independently and identically distributed. What can we ...
6
votes
0
answers
186
views
About solving $f(z) = \frac{1}{4} (f(-1-z) +2 f(-1-2z)+ 2f(-1+2z) +f(-1+z))$
Final update on 11/29/2019: I have worked on this a bit more, and wrote an article summarizing all the main findings. You can read it here.
I am looking for a solution where $f(z)\geq 0$ and $\int_{-\...
6
votes
2
answers
731
views
Distribution of sample variance of Cauchy distributed variables
Assume $X_i,i\in\left\{1,...,n\right\}$ are i.i.d. standard Cauchy distributed random variables.
I know that $\bar{X}_n:=\frac{1}{n}\sum_{i=1}^n X_i$ is standard Cauchy distributed.
I would like to ...
6
votes
1
answer
4k
views
Show that the Normal distribution is a member of the exponential family
I want to show that the Normal distribution is a member of the exponential family.
I have been working under the assumption that a distribution is a member of the exponential family if its pdf/pmf ...
6
votes
0
answers
131
views
Proof of a technical fact in the book of Schapire and Freund on boosting
I am currently looking at Exercise 10.3, Chapter 10 of the book on Boosting by Schapire and Freund. More precisely, in the middle of the exercise they propose to use, without proof, the technical fact ...
6
votes
0
answers
8k
views
When to use cumulative moving average vs a simple moving average?
After reviewing the Wikipedia page on moving averages, the difference between the simple moving average and cumulative moving average are clear:
1) Simple moving average only considers the last n ...
6
votes
0
answers
2k
views
Variance of the Euclidean norm of a vector of Gaussians
Given a vector of correlated Gaussian random variables $\vec{X}=\left(X_1, ..., X_k\right)$ that are normally distributed $\vec{X} \sim \mathcal{N}\left(\vec{\mu}, \Sigma\right)$ with arbitrary $\...
6
votes
0
answers
233
views
Area of a triangle where the three vertices are randomly chosen on a circle; also $3D$ version.
My teacher gave us an interesting problem today. Consider a circle of radius $1$, choose three points on that circle at random and make a triangle connecting the three. On average what will the area ...
6
votes
0
answers
664
views
Efficient way of integrating a 2-dimensional Gaussian over a convex polygonal domain
I am looking for a reference for the calculation of integrals of bivariate normal distributions over (convex) polygons.
$$P\{X\in\mathcal{A}\} =
\int_{\Omega} \frac{1}{\sqrt{(2\pi)^2|\Sigma|}}e^{-\...
6
votes
0
answers
400
views
Proof of multivariate change of variable technique in statistics
I am having hard time conceptually grasping how the bivariate change of variable technique works in statistics. The technique can be summarized as follows: Given random variables $X$ and $Y$, and ...
6
votes
0
answers
583
views
Radon-Nikodym on a Process wrt to filtration
Given a probability space $(\Omega,\mathcal{F},P)$. Let $(X_t)_{t\geq0}$ be a stochastic process defined on it with cadlag paths, lets say on $(\mathcal{X},\mathcal{B}(X))$.
Let be $\mathcal{F}_{t}$ ...
6
votes
2
answers
129
views
References for information theoretic statistical tools
Statistical concepts like spaces of probability distributions, "metrics" like Fisher information or relative entropy, and convergence with respect to these quantities are needed in my ...