According to Wikipedia: https://en.wikipedia.org/wiki/Negative_binomial_distribution
In probability theory and statistics, the negative binomial distribution is a discrete probability distribution that models the number of successes in a sequence of independent and identically distributed Bernoulli trials before a specified (non-random) number of failures (denoted r) occurs.
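To check that I am reading this definition correctly, here is a minimal simulation sketch of my own (it is not from Wikipedia). The function names `successes_before_r_failures`, `simulate_mean` and `theoretical_mean` are hypothetical, and I am assuming that $p$ denotes the success probability of each trial, so the mean should be $rp/(1-p)$.

```python
import random


def successes_before_r_failures(r, p, rng):
    """Run Bernoulli(p) trials until the r-th failure; return the number of successes."""
    successes = 0
    failures = 0
    while failures < r:
        if rng.random() < p:   # success with probability p (assumed convention)
            successes += 1
        else:                  # failure with probability 1 - p
            failures += 1
    return successes


def simulate_mean(r, p, n_samples, seed=0):
    """Average number of successes over n_samples independent runs."""
    rng = random.Random(seed)
    draws = [successes_before_r_failures(r, p, rng) for _ in range(n_samples)]
    return sum(draws) / n_samples


def theoretical_mean(r, p):
    """Mean of the negative binomial under the 'successes before r failures' definition."""
    return r * p / (1 - p)


if __name__ == "__main__":
    r, p = 3, 0.4
    print("simulated mean:  ", simulate_mean(r, p, 20000))
    print("theoretical mean:", theoretical_mean(r, p))
```

The simulated mean should land close to the theoretical one, which is how I understand the "successes before $r$ failures" story.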
I see that the negative binomial distribution is often used to model count data, especially in the insurance industry. However, I don't see why it should be used, given that it models the number of successes before a certain number of failures occur. For example, it is used to model the number of catastrophic events happening in one year, and I don't see what that has to do with the "number of successes before some failures".
Could you please explain to me why we use the negative binomial distribution to model count data, even when the concept of "number of successes before $r$ failures" doesn't apply?
Thank you very much for your help!