The null hypothesis (H0) states that there is no difference between two groups. In other words, a population parameter (such as the mean, the standard deviation, and so on) is equal to a hypothesized or previously observed value.
The alternative hypothesis (H1) states that there is a difference. This means a population parameter is smaller than, greater than, or different from the hypothesized or previously observed value stated in the null hypothesis.
Type I error means incorrectly rejecting the null hypothesis. In other words, you conclude that there is a difference when there is actually no difference. The probability of making a Type I error is denoted by alpha (α), also called the level of significance.
Type II error means incorrectly accepting the null hypothesis. In other words, you conclude that there is no difference when there is actually a difference. The probability of making a Type II error is denoted by beta (β).
Power of a study is (1 − β). Power is the probability of correctly rejecting the null hypothesis when it is false, meaning it detects a real difference when one truly exists.
There is a trade-off between α and β:
So, when α decreases, power also tends to decrease (and vice versa).
Traditionally:
If the p value obtained from the study is = or < 0.05, then the null hypothesis is rejected and the alternative hypothesis is accepted. The result is said to be statistically significant.
If the 95% confidence interval includes 0, then there is no significance and the null hypothesis is not rejected. If the 95% confidence interval for RR or odds ratio includes 1, then the null hypothesis is also not rejected.
Correlation and regression: Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables.
If both variables are normally distributed, then Pearson’s correlation coefficient ® is calculated. If one or both variables are not normally distributed, then a rank correlation coefficient such as Spearman’s rho (ρ) may be calculated.
The coefficient of determination (r square) denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x.
Using linear regression, if the value of either variable is known, the value of the other variable can be calculated.
Correlation analysis quantifies the strength of the association between two numerical variables. A scatter plot (or diagram) is used to depict the relationship between two variables:
The closer the points (dots) on the scatter plot are to each other, the stronger the association between the two variables.
The correlation coefficient quantifies the strength of the relationship between two variables and ranges between −1 and +1:
Correlation shows association; it does not prove causation.
In multiple regression, two or more independent variables are used to predict the value of a dependent variable.
Sign up for free to take 3 quiz questions on this topic