0.4924
24.2424
100
—
0.4924
24.2424
100
—
The Phi coefficient (φ) is a measure of association between two binary (dichotomous) variables arranged in a 2×2 contingency table. It is mathematically equivalent to the Pearson correlation coefficient applied to two binary variables and ranges from −1 to +1, where the sign indicates the direction of association and the magnitude indicates its strength.
Phi is closely related to the chi-square statistic: φ² = χ²/n. This calculator accepts the four cell values of a 2×2 table and computes both the Phi coefficient and the corresponding chi-square statistic, along with an interpretation of the effect size.
Given a 2×2 contingency table with cells a, b, c, d, the Phi coefficient is computed as:
$$\phi = \frac{ad - bc}{\sqrt{(a+b)(c+d)(a+c)(b+d)}}$$
This formula computes the cross-product difference (ad − bc) and normalizes it by the geometric mean of the four marginal totals. The result ranges from −1 to +1.
The relationship to chi-square is direct:
$$\chi^2 = n \cdot \phi^2$$
where n = a + b + c + d is the total sample size. Conversely, φ = √(χ²/n) when only the magnitude is needed.
The sign of φ indicates direction: a positive φ means that the two variables tend to co-occur (high values of one variable are associated with high values of the other in the binary coding), while a negative φ indicates an inverse relationship. The magnitude follows Cohen's conventions for correlation-like effect sizes: small (0.1), medium (0.3), and large (0.5).
Phi can also be expressed in terms of the 2×2 table's probabilities:
$$\phi = \frac{p_{11}p_{00} - p_{10}p_{01}}{\sqrt{p_{1\cdot}p_{0\cdot}p_{\cdot 1}p_{\cdot 0}}}$$
This probabilistic formulation shows that Phi is indeed the Pearson product-moment correlation between two indicator variables.
The Phi coefficient is interpreted using standard effect size benchmarks:
A positive φ indicates that the diagonal cells (a and d) dominate, meaning the two variables tend to agree. A negative φ indicates that the off-diagonal cells (b and c) dominate, showing an inverse relationship. The corresponding chi-square statistic can be used to test whether the association is statistically significant.
Inputs
Results
A study of 100 patients: 30 treated+recovered, 10 treated+not recovered, 15 untreated+recovered, 45 untreated+not recovered. φ = 0.316 shows a medium positive association between treatment and recovery.
Inputs
Results
A survey of 100 people shows nearly equal distribution across all cells. φ ≈ 0.01 indicates essentially no association between gender and preference.
For a 2×2 table, Cramer's V equals the absolute value of Phi (|φ|). Cramer's V is always non-negative because it uses min(r−1, c−1) = 1 for 2×2 tables, so V = √(χ²/n) = |φ|. Phi retains the sign information that V loses.
Phi is designed specifically for 2×2 tables (two binary variables). For larger tables, use Cramer's V instead. If you have continuous variables that you've dichotomized, be aware that dichotomization can reduce the apparent strength of association.
Yes. When both variables are coded as 0/1 indicator variables, the Pearson product-moment correlation equals the Phi coefficient. This mathematical equivalence means that all interpretive frameworks for Pearson's r apply to Phi as well.
A Phi of zero means that the two binary variables are completely independent — knowing the value of one variable provides no information about the other. The cross-product difference (ad − bc) equals zero, and the chi-square statistic is also zero.
Since φ = √(χ²/n), you can use the Fisher z-transformation: z = arctanh(φ), compute the confidence interval for z as z ± 1.96/√(n−3), then back-transform using φ = tanh(z). This provides an approximate 95% confidence interval for the population Phi.
Yes, but only when the marginal distributions are identical. If (a+b)/(c+d) = (a+c)/(b+d), then Phi can reach ±1. If the marginals differ, the theoretical maximum of |φ| is less than 1. This is an important consideration when comparing Phi values across studies with different marginal distributions.
Roboculator Team
The Roboculator Team explains calculations, planning tools, and practical formulas in clear language for real-life situations.
How helpful was this calculator?
Be the first to rate!
Random Number Generator
Advanced & Specialized Statistical Tools
Central Limit Theorem Calculator
Advanced & Specialized Statistical Tools
Empirical Rule Calculator
Advanced & Specialized Statistical Tools
Chebyshev's Theorem Calculator
Advanced & Specialized Statistical Tools
Monte Carlo Estimation Calculator
Advanced & Specialized Statistical Tools
Power Analysis Calculator
Advanced & Specialized Statistical Tools