This question was also asked on Reddit
I have recently completed my thesis and one of the comments was that I report on why this graph looks this way. I have tried to find a reason but the closest I can come to is that it means that there are false positives? Or that this is possibly due to population stratification?
The trait of interest was alcohol dependency with covariates of sex and age, in a non african american population.
I am also unsure as to how this will impact my research or the implications of this?
I don't understand why this population (non-African American) "reacted" this way - do I provide a different null model? The previous populations had "relatively normal" plots. They were all run through the same pipeline.
How the data was calculated:
There were 1061 individuals in the non-African American dataset. At QC phase 0, there were 2 individuals who had discordant sex information and 87 duplicate SNPs. When the MAF value was set to 0.000015, 125708 SNPs were removed when the MAF. Both the mind and geno parameters were set to 0.2, resulting in 127357 and 0 SNPs to be removed respectively. When setting the HWE to $1 \times 10^{-12}$, 4495 SNPs were removed. After all these parameters were set and the pipeline completed, there were 851773 SNPs left from 1044 individuals.