Consider the Glauber-Sudarshan $P$ representation of a state $\rho$, which is the function $\mathbb C\ni\alpha\mapsto P_\rho(\alpha)\in\mathbb R$ such that $$\rho = \int d^2\alpha \, P_\rho(\alpha) |\alpha\rangle\!\langle\alpha|.$$ Something that is often mentioned about the $P$ representation is that a state is nonclassical when $P_\rho$ is either non-positive or more singular than a Dirac delta function.
On the other hand, the $P$ representation of a coherent state $|\alpha\rangle$ is $P_\alpha(\beta)=\delta^2(\alpha-\beta)$, which does also arguably look "more singular than a $\delta$ function", in that it contains the square of a delta function, while $|\alpha\rangle$ is clearly not nonclassical.
So what gives? What exactly does "more singular than a delta" mean? The Wikipedia page gives the general expression for the $P$ function of a generic state, and this expression involves terms of the form $\big(\frac{\partial}{\partial r}\big)^\ell \delta(r)$, which I'm guessing are the source of "high degree of nonsingularity", and I'm aware that this is quite different than the $\delta^2$ term in the expression above, which is just the "standard" $\delta$ for the two-dimensional case, but I still find it quite unclear how exactly I should think about functions that are "more singular than a delta".
One possible way to understand what such functions actually represent is via some corresponding sequence of functions. For example, I can understand $\delta(x)$ as the limit of suitably normalised functions peaked around the origin, and $\delta^2(x)$ as the same thing for functions $\mathbb C\simeq \mathbb R^2\to\mathbb R$. Are there similar representations for the more nonsingular cases?