Glenn Drive: Statistical Associates Publishing, 2012. — 52 p.
Discriminant function analysis, also known as discriminant analysis or simply DA, is used to classify cases into the values of a categorical dependent, usually a dichotomy. If discriminant function analysis is effective for a set of data, the classification table of correct and incorrect estimates will yield a high percentage correct. Discriminant function analysis is found in SPSS under Analyze>Classify>Discriminant. If the specified grouping variable has two categories, the procedure is considered "discriminant analysis" (DA). If there are more than two categories the procedure is considered "multiple discriminant analysis" (MDA).
Multiple discriminant analysis (MDA) is a cousin of multiple analysis of variance (MANOVA), sharing many of the same assumptions and tests. MDA is sometimes also called discriminant factor analysis or canonical discriminant analysis.
While binary and multinomial logistic regression, treated in a separate Statistical Associates "Blue Book" volume, is often used in place of DA or MDA respectively, if the assumptions of discriminant analysis are met, it has greater power than logistic regression: there is less chance of Type II errors - accepting a false null hypothesis. If the data violate assumptions of discriminant analysis, outlined below, then logistic regression may be preferred because it usually involves fewer violations of assumptions (independent variables needn't be normally distributed, linearly related, or have equal within-group variances), is robust, handles categorical as well as continuous variables, and has coefficients which many find easier to interpret. Logistic regression is preferred when data are not normal in distribution or group sizes are very unequal.
There are several purposes for DA and/or MDA:
To classify cases into groups using a discriminant prediction equation.
To test theory by observing whether cases are classified as predicted.
To investigate differences between or among groups.
To determine the most parsimonious way to distinguish among groups.
To determine the percent of variance in the dependent variable explained by the independents.
To determine the percent of variance in the dependent variable explained by the independents over and above the variance accounted for by control variables, using sequential discriminant analysis.
To assess the relative importance of the independent variables in classifying the dependent variable.
To discard variables which are little related to group distinctions.
To infer the meaning of MDA dimensions which distinguish groups, based on discriminant loadings.