Skip to main content

Questions tagged [statistics]

For statistical questions related to bioinformatics. If the nature of question is purely statistical, consider stats.stackexchange.com instead.

3 votes
2 answers
68 views

short Read/percentage threshold for bacterium presence in metagenome

I have ~100 paired end short read human gut metagenome samples that I classified using Kraken2. Now I want to know if a specific bacterium is in any of those samples. As far as I've searched people ...
ahmet's user avatar
  • 33
1 vote
1 answer
50 views

Normalization in Sequence Analysis Research

I am currently engaged in a research project that involves DNA sequence analysis, utilizing nanopore and KMA alignment software (k-mer alignment). As I delve deeper into my research, I am faced with a ...
dim's user avatar
  • 31
1 vote
1 answer
23 views

time*treatment series with repeated (i.e: NOT independent) sampling of replicates

I am performing the analysis of cell cultures in suspension, untreated(U) and treatment A and B, at t0, t1 and t2, 4 replicates per treatment. The experiment started with 12 cultures, 4U, 4A and 4B, ...
Alfredo Pagliuca's user avatar
1 vote
0 answers
21 views

Is this the correct method for determining Power at a given alpha, VAF, and coverage threshold?

I'm trying to calculate power at a given VAF, target coverage, and alpha. For example, what would my power be for a 100x coverage site, detecting VAF of 0.10 with an alpha of 0.05. This is what I came ...
Blaze9's user avatar
  • 41
2 votes
1 answer
37 views

permutation test in edgeR

I have a simple RNA-seq experiment with treatment and control, each with 3 biological repeats. I run my data through edgeR and obtained differentially expressed genes (DEGs). Due to the low sample ...
Netanel Cohen's user avatar
1 vote
1 answer
19 views

SNP Signatures with Limited WGS Data:

On 80 WGS samples, I'm dissecting SNP signatures linked to milk production in a scarcely studied animal. Post-variant calling and QC association analysis have been tricky. I'm here to tap into our ...
M.Bioinfo's user avatar
  • 386
3 votes
0 answers
33 views

How to do post hoc comparisons after a repeated measures ANOVA in R

I have a data set of several samples with their expression of proteins, in response to four different doses of a drug and two genotypes. I have been able to generate the two-way ANOVA using ezANOVA ...
Johnson's user avatar
  • 31
2 votes
1 answer
59 views

Calculating Fisher's exact test for COG categories

This is a continuation of the following question - Fisher's exact test for COG categories in pan, core genome analysis Also, the dataset - How do I perform a Fisher's exact test on this data to ...
K_081's user avatar
  • 149
1 vote
0 answers
94 views

In medical and public health care, should we make use of "clinical" signficance testing (non-parametric tests) or parametric tests of significance?" [closed]

In certain cases, traditional significance test may not be able to identify that effect of a treatment or a drug is significant. However,clinical significance test may suggest a significant effect. ...
Subhash C. Davar's user avatar
0 votes
0 answers
29 views

limma linear model with interaction

I want to run some epigenome wide association studies based on city and pollutant type, could you please suggest if I can get DNA methylation sites (CpG) association based on pollutant type for each ...
bioinfonext's user avatar
1 vote
1 answer
108 views

Fisher's exact test for COG categories in pan, core genome analysis

I have come across several papers that do a Fisher's exact test to show over represented genes in COG categories specifically in a pan-core-accessory-species specific genome analysis. The test is done ...
K_081's user avatar
  • 149
4 votes
2 answers
82 views

probability of finding a 5 amino acids in a row within a proteome

How to calculate the probability of finding two proteins that share a 5 amino acid long motif from a proteome of around 1067 proteins that have an average length of 65 residues. The probability of a ...
saplingmagic's user avatar
0 votes
1 answer
104 views

Statistical approach to link DNA methylation with toxic element exposure and health outcome

I would be thankful to you if you can help with the statistical approach for case-control study to link DNA methylation (epic array) with toxic element exposure (arsenic) and health outcome (...
bioinfonext's user avatar
1 vote
1 answer
58 views

Intepreting and applying ordinal logistic regression coefficients to calculate probabilities?

Can someone help hint me how I can interpret ordinal logistic regression coefficients and how I can use the .L, .Q and .C terms to calculate probabilities? I am analysing a dataset, where people ...
Charles's user avatar
  • 547
1 vote
1 answer
95 views

Mendelian randomization

I would be thankful to you if you can help with how I can use MR from my case-control study to link DNA methylation (epic array) with toxic element exposure (arsenic) and health outcome (...
bioinfonext's user avatar

15 30 50 per page
1
2 3 4 5
13