Principal Components Analysis

Purpose: Reduce a large set of p correlated variables [ X1, X2, …, Xp ]

down to a smaller set of q < p uncorrelated components [ Q1, Q2, …, Qq ]

such that the components

a. Are formed by linear combinations of the original variables
b. Are orthogonal
c. Reproduce the maximum amount of variance in the original variables.


 
 

Definitions:

X is an N x p observed data matrix with N rows of observations on p columns of variables (expressed in deviation scores so that the mean of each column is zero).

SX = (X'X)/(N-1) is the p x p sample variance-covariance matrix

SX = PDP' with P'P = PP' = I and D = Diag[ d1 , d2 , …, dq , …, dp ]

Std = Diag(SX)^.5 is the diagonal matrix of standard deviations

The eigenvalues are ordered in magnitude so that d1 > d2 > … > dp

P = [ Pq | Pp-q ],

Pq is a p x q matrix formed from the first q eigenvector columns of P, and Pp-q is the p x (p-q) matrix of the remaining columns

Dq = Diag[ d1 , d2 , …, dq ] is a diagonal matrix of the first q eigenvalues
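
As a concrete illustration of these definitions, here is a minimal MATLAB sketch (the data matrix and the choice q = 2 are hypothetical; note that eig does not order the eigenvalues, so they are sorted by hand):

X = randn(100, 5);                      % hypothetical N x p raw data matrix
[N, p] = size(X);
X = X - ones(N,1)*mean(X);              % deviation scores: each column mean is zero
SX = (X'*X)/(N-1);                      % p x p sample variance-covariance matrix
Std = sqrt(diag(diag(SX)));             % diagonal matrix of standard deviations
[P, D] = eig(SX);                       % SX = P*D*P' with P'*P = PP' = I
[d, order] = sort(diag(D), 'descend');  % order the eigenvalues d1 >= d2 >= ... >= dp
P = P(:, order);  D = diag(d);
q = 2;                                  % hypothetical number of retained components
Pq = P(:, 1:q);                         % first q eigenvector columns
Dq = D(1:q, 1:q);                       % first q eigenvalues on the diagonal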


 
 

PC Model: X = QB + E

Q is an N x q matrix with N rows of scores on q columns of components

Q = XA for some full column rank p x q matrix A, and Q'Q is diagonal

B is a q x p matrix of coefficients used to reproduce X from Q

E is an N x p matrix of residuals

Goal: Find A and B such that Trace[E'E] is minimized. 


 
 

Solution:   X = QB + E = (XA)B + E

A = Pq and B = Pq'

SQ = Q'Q/(N-1) = Pq'X'XPq / (N-1) = Pq'PDP'Pq = Dq

Proportion Reproduced: 

R2 = 1 - Trace[E'E]/Trace[X'X] 

= [ d1 + d2 + … + dq ] / [ d1 + d2 + … + dp ]

Cov[X,Q] = X'Q/(N-1) = X'XPq / (N-1) = PDP'Pq = PqDq
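
A brief numerical check of this solution, continuing the hypothetical MATLAB sketch from the Definitions section (it reuses X, Pq, Dq, d, q, and N defined there):

Q  = X*Pq;                              % N x q component scores (A = Pq)
SQ = (Q'*Q)/(N-1);                      % equals Dq: the components are uncorrelated
E  = X - Q*Pq';                         % residuals, with B = Pq'
R2 = 1 - trace(E'*E)/trace(X'*X);       % proportion of variance reproduced
R2check = sum(d(1:q))/sum(d);           % the same value from the eigenvalues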

 

Alternative Form of Solution:

X = QB + E   = (XPq)Pq' + E   = (XPqDq^-.5)(Dq^.5Pq') + E

  = VW + E

V = X(PqDq^-.5) = ( X × Std^-1 )( Std × PqDq^-.5 )

C = ( Std × PqDq^-.5 )

W = ( Dq^.5Pq' ),    or    W' = PqDq^.5

SV = V'V/(N-1) = Dq^-.5Pq'X'XPqDq^-.5 /(N-1) = Dq^-.5Pq'PDP'PqDq^-.5 = Dq^-.5 Dq Dq^-.5 = I

Now the components are orthonormal.

SPSS prints W' as the Component Matrix and C as the Component Score Coefficient Matrix.
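
Continuing the same hypothetical sketch (it also reuses Std), the rescaled components V have unit variances, and W' and C carry the SPSS labels just noted:

V  = X*Pq*Dq^(-0.5);                    % ortho-normal component scores
SV = (V'*V)/(N-1);                      % equals the identity matrix
WT = Pq*Dq^(0.5);                       % W' : the p x q Component Matrix
C  = Std*Pq*Dq^(-0.5);                  % Component Score Coefficient Matrix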


 

Residuals from PC Analysis

E = X - QB = X - XPqPq' = X(I-PqPq') = XPp-qPp-q'

E'E/(N-1) = ( Pp-qPp-q' )' X'X( Pp-qPp-q' )/(N-1)

         = ( Pp-qPp-q' )PDP'( Pp-qPp-q' )  =  Pp-qDp-qPp-q'

as required, since

SX = PDP' = Σj dj Pj Pj'  =  PqDqPq'  +  Pp-qDp-qPp-q' .
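
Continuing the hypothetical MATLAB sketch, the residual covariance can be checked against the discarded eigenvalues and eigenvectors:

E   = X*(eye(p) - Pq*Pq');              % residuals X - QB
SE  = (E'*E)/(N-1);                     % residual covariance matrix
Ppq = P(:, q+1:p);                      % discarded eigenvector columns, Pp-q
Dpq = D(q+1:p, q+1:p);                  % discarded eigenvalues, Dp-q
gap = norm(SE - Ppq*Dpq*Ppq');          % near zero: E'E/(N-1) = Pp-qDp-qPp-q'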


 

Elements of Factor Analysis

1. Common Factor Model

X is an N x p matrix of scores from N subjects on p variables

The Common Factor model is based on four assumptions:

a.  X = FA + e                 (Linearity)
b.  E[ F'F / (N-1) ] = C   (correlated common factors)
c.  E[ e'e / (N-1) ] = U = diag[ u1 , ..., up ]   (uncorrelated errors)
d.  E[ F'e / (N-1)] = 0     (Factors are uncorrelated with residuals)

Then

E[ SX ]   = E[ X'X / (N-1) ]
               = E[ (FA+e)'(FA+e) ] / (N-1)
               = E[ (A'F' + e')(FA+e) ] / (N-1)
               = E[ A'F'FA + A'F'e + e'FA + e'e ] / (N-1)
               = A' E[ F'F/(N-1) ] A + E[ e'e/(N-1) ]      (the cross terms vanish by assumption d)
               = A'CA + U

If C = I  then the factors are uncorrelated and  E[ SX ] = A'A + U
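
A small MATLAB illustration of this covariance structure (the loadings A, factor correlations C, and uniquenesses U below are hypothetical, chosen only to show the computation):

A = [ .8  .7  .6   0   0 ;              % hypothetical q x p loading matrix (q = 2, p = 5)
       0   0   0  .7  .8 ];
C = [ 1  .3 ; .3  1 ];                  % hypothetical factor correlation matrix
U = diag([ .36 .51 .64 .51 .36 ]);      % hypothetical uniqueness diagonal matrix
Sigma = A'*C*A + U;                     % implied covariance matrix E[SX] = A'CA + U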

 

2. Principal Axis Extraction

Extract eigenvalues and eigenvectors from the reduced covariance matrix

( E[SX ] - U ) = PDP’  ,   where U is the uniqueness diagonal matrix

If we choose to use q factors then we assume

 E[SX ] = PqDq Pq' + U 

             = (PqDq^.5)( Dq^.5Pq' ) + U
  
             = A'A + U,

where A = Dq^.5Pq'  , and  Dq^.5 = Diag[ Sqrt(d1), Sqrt(d2), …, Sqrt(dq) ] .
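
A minimal MATLAB sketch of one principal-axis step (the correlation matrix R and the choice q = 1 are hypothetical; squared multiple correlations are used here as initial communality estimates, which is one common choice):

R  = [ 1  .6  .5 ; .6  1  .4 ; .5  .4  1 ];   % hypothetical p x p correlation matrix
h2 = 1 - 1./diag(inv(R));                     % initial communalities (squared multiple correlations)
U  = diag(1 - h2);                            % uniqueness estimates
[P, D] = eig(R - U);                          % eigendecomposition of the reduced matrix
[d, order] = sort(diag(D), 'descend');        % order the eigenvalues
q  = 1;                                       % hypothetical number of factors
Pq = P(:, order(1:q));  Dq = diag(d(1:q));
A  = Dq^(0.5)*Pq';                            % q x p principal-axis loading matrix A = Dq^.5 Pq'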

 

 

3. Rotation: Linear Transformation of Factors

T is a q x q full rank matrix used for rotation

E [ SX  ] = A'A + U  (orthogonal solution)

    = A'(TT^-1)(T'^-1T')A + U

    = (A'T)(T^-1T'^-1)(T'A) + U

    = A*'CA* + U,

where A* = T'A are the rotated factor loadings, and C = T^-1T'^-1.

If T is orthonormal, then C = (T^-1)(T'^-1) = I and E[ SX ] = A*'A* + U.
Otherwise, the factors are correlated, and the rotation is called oblique.

Cov[X,F] = E[ X'F/(N-1) ] = E[ (FA* + e)'F/(N-1) ]

                = E[ (A*'F'F + e'F)/(N-1) ] = A*' E[ F'F/(N-1) ]

                = A*'C = A'T(T^-1T'^-1)

                = A'T'^-1

Cov[F,X] = Cov[X,F]' = T^-1A
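
A small MATLAB sketch of this rotation algebra (the orthogonal-solution loadings A and the rotation matrix T below are hypothetical; T is full rank but not orthonormal, so the rotated factors are correlated):

A  = [ .8  .7  .6   0   0 ;             % hypothetical q x p orthogonal-solution loadings
        0   0   0  .7  .8 ];
T  = [ 1  .4 ; 0  1 ];                  % hypothetical q x q full-rank rotation matrix
Astar = T'*A;                           % rotated loadings A* = T'A
C   = inv(T'*T);                        % C = T^-1*T'^-1 (not rescaled to unit diagonal here)
gap = norm(Astar'*C*Astar - A'*A);      % near zero: A*'CA* = A'A
CovXF = A'/T';                          % Cov[X,F] = A'T'^-1  (p x q)
CovFX = CovXF';                         % Cov[F,X] = T^-1A    (q x p)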



 

Code for PC analysis in MATLAB (assume X is a deviation score matrix with n rows):

S = X'*X/(n-1);             % S is the sample variance-covariance matrix
Std = sqrt(diag(diag(S)));  % diagonal matrix of standard deviations
[P, D] = eig(S);            % P = eigenvectors, D = diagonal matrix of eigenvalues
                            % (note: eig does not order the eigenvalues by size)
WT = P*D^.5;                % W' = Component Matrix in the SPSS printout
C = Std*P*inv(sqrt(D));     % Component Score Coefficient Matrix

Proofs of Solution to PC Problem

Homework