Understanding False Discovery Rate (FDR) and the Benjamini-Hochberg Method © Karobben

This post provides an in-depth understanding of False Discovery Rate (FDR) and the Benjamini-Hochberg Method, crucial in statistical analysis with large datasets like genomics. It explains FDR's role in identifying false positives in multiple hypothesis testing and the Benjamini-Hochberg Method's effectiveness in controlling FDR. The post compares various p-value adjustment methods, discussing their advantages, limitations, and suitability for different data types. It emphasizes the BH method's balance in statistical power and error control, and its integration in software like R, highlighting its applicability across scientific fields.
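As a rough illustration of the adjustment this post describes, here is a minimal sketch of the Benjamini-Hochberg procedure in plain Python. The function name `benjamini_hochberg` and the example p-values are made up for illustration. The logic follows the standard step-up definition: scale each sorted p-value by m / rank, then enforce monotonicity from the largest p-value down. This is the same quantity that R's `p.adjust(p, method = "BH")` reports.

```python
def benjamini_hochberg(p_values):
    """Return BH-adjusted p-values in the same order as the input."""
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest p.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that an adjusted
    # value never exceeds the adjusted value of a larger p-value.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values from four tests.
p = [0.01, 0.04, 0.03, 0.005]
adj = benjamini_hochberg(p)
print(adj)  # approximately [0.02, 0.04, 0.04, 0.02]
```

Hypotheses whose adjusted value falls below the chosen FDR level (0.05, say) would be called discoveries; in this toy input all four qualify.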

Fruchterman Reingold layout

The Fruchterman-Reingold layout is a force-directed layout algorithm, which treats edges like springs that move vertices closer or further from each other in an attempt to find an equilibrium that minimizes the 'energy' of the system. © Bernie Hogan

How to do anova in R

When we compare a control with one other group, the t-test fits the goal very well. But when we need to compare more than two groups, three for example, each pairwise t-test carries its own independent chance of a false positive. That means three chances of a false positive in three comparisons, and this causes a problem: the error we now need to consider is at least one false positive across the three comparisons, which is more likely than a false positive in any single test. So we need to adjust the p-value or apply a more rigorous method to get a reliable result. ANOVA was introduced for this purpose.
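The arithmetic behind "at least one false positive" can be sketched in a few lines of Python. This assumes the tests are independent and each uses the same significance level; the function name `familywise_error_rate` and the level 0.05 are illustrative choices, not taken from the post.

```python
def familywise_error_rate(alpha, n_tests):
    """Chance of at least one false positive across n independent tests."""
    # Each test avoids a false positive with probability (1 - alpha);
    # all n avoid one with probability (1 - alpha) ** n.
    return 1 - (1 - alpha) ** n_tests


single = familywise_error_rate(0.05, 1)  # one t-test
fwer = familywise_error_rate(0.05, 3)    # three t-tests
print(round(single, 4), round(fwer, 4))  # 0.05 0.1426
```

Under these assumptions, three comparisons at the 0.05 level give roughly a 14% chance of at least one false positive, which is why a single overall test or a p-value adjustment is preferred.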
A quick guide for seaborn plot in Python © Karobben
Density plot

Plot the density distribution of your data with ggplot
Odds Ratio

The odds of an event are defined as the likelihood that the event will occur, expressed as a proportion of the like...
Correlation

Correlation is a statistical measure of how closely two variables are related to each other. We are interested in correlation because it helps us understand the relationship between two variables and make better decisions and predictions, and because it is used in fields such as finance, economics, psychology, and biology. Who said this?
Data cleaning
Data normalization

A few ways to normalize your data and reduce batch effects...