Article

An Entropy-Based Approach for Measuring Factor Contributions in Factor Analysis Models

by
Nobuoki Eshima
1,*,
Minoru Tabata
2 and
Claudio Giovanni Borroni
3
1
Center for Educational Outreach and Admissions, Kyoto University, Kyoto 606-8501, Japan
2
Department of Mathematical Sciences, Osaka Prefecture University, Osaka 599-8532, Japan
3
Department of Statistics and Quantitative Methods, University of Milano Bicocca, 20126 Milano, Italy
*
Author to whom correspondence should be addressed.
Entropy 2018, 20(9), 634; https://doi.org/10.3390/e20090634
Submission received: 12 July 2018 / Revised: 20 August 2018 / Accepted: 20 August 2018 / Published: 24 August 2018

Abstract:
In factor analysis, the contributions of latent variables are conventionally assessed by the sums of the squared factor loadings related to the variables. The present paper first examines issues with this conventional method, and then proposes an alternative entropy-based approach for measuring factor contributions. The proposed method measures the contribution of the common factor vector to the manifest variable vector and decomposes it into contributions of the individual factors. A numerical example is also provided to demonstrate the approach.

1. Introduction

Factor analysis is a statistical method for extracting simple structures that explain the inter-relations between manifest and latent variables. It originated with the work of Spearman [1], and the single-factor model was later extended to the multiple-factor model [2]. Today, factor analysis is widely applied in the behavioral sciences [3]; it is therefore important to interpret the extracted factors, and it is critical to explain how these factors influence the manifest variables, that is, to measure factor contributions. Let X_i be manifest variables; \xi_j latent variables (common factors); \varepsilon_i unique factors related to X_i; and \lambda_{ij} factor loadings, that is, the weights of factors \xi_j in explaining X_i. The factor analysis model is then given as follows:
  X_i = \sum_{j=1}^{m} \lambda_{ij} \xi_j + \varepsilon_i, \quad i = 1, 2, \ldots, p,    (1)
where
  E(X_i) = E(\xi_j) = E(\varepsilon_i) = 0; \quad \mathrm{var}(\xi_j) = 1; \quad \mathrm{cov}(\xi_j, \varepsilon_i) = 0; \quad \mathrm{cov}(\varepsilon_i, \varepsilon_k) = 0 \ (i \neq k); \quad \mathrm{var}(\varepsilon_i) = \sigma_i^2 > 0.
For simplicity of discussion, the common factors \xi_j are assumed to be mutually independent in this section; that is, we first consider an orthogonal factor analysis model. In the conventional approach, the contribution C_j of factor \xi_j to all manifest variables X_i is defined as follows:
  C_j = \sum_{i=1}^{p} \mathrm{cov}(X_i, \xi_j)^2 = \sum_{i=1}^{p} \lambda_{ij}^2    (2)
The above definition of factor contributions is based on the following decomposition of the total variance of the observed variables X_i [4] (p. 59):
  \sum_{i=1}^{p} \mathrm{var}(X_i) = \sum_{j=1}^{m} \sum_{i=1}^{p} \lambda_{ij}^2 + \sum_{i=1}^{p} \sigma_i^2
What physical meaning does the above quantity have? Applied to the manifest variables as observed, the decomposition leads to scale-variant results. For this reason, factor contribution is usually considered for the standardized versions of the manifest variables X_i. What, then, does it mean to measure factor contributions by (2)? For standardized manifest variables X_i, we have
  \lambda_{ij} = \mathrm{cor}(X_i, \xi_j)    (3)
Then, (2) is the sum of the coefficients of determination of all standardized manifest variables X_i with respect to a single latent variable \xi_j. The squared correlation coefficients (3), that is, \mathrm{cor}(X_i, \xi_j)^2, are the ratios of explained variance of the manifest variables X_i, and in this sense they can be interpreted as the contributions (effects) of factor \xi_j to the manifest variables X_i. But what does their sum over all manifest variables X_i, that is, (2), mean? The conventional method may be intuitively reasonable for measuring factor contributions; however, we think it sensible to propose a method that measures factor contributions as the effects of the factors on the manifest variable vector X = (X_1, X_2, \ldots, X_p), effects that are interpretable and have a theoretical basis. To the best of our knowledge, there has been no previous research on this topic. The present paper provides an entropy-based solution to the problem. Entropy is a useful concept for measuring the uncertainty in systems of random variables and sample spaces [5], and it can be applied to measure multivariate dependence of random variables [6,7].
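For concreteness, the conventional contribution (2) is simply a column-wise sum of squared loadings. A minimal sketch follows; the 3 x 2 loading matrix is hypothetical, not taken from this paper:

```python
import numpy as np

# Hypothetical 3 x 2 loading matrix: rows are manifest variables X_i,
# columns are common factors xi_j (orthogonal model, standardized X_i).
Lam = np.array([[0.8, 0.1],
                [0.7, 0.2],
                [0.1, 0.9]])

# Conventional contribution (2): C_j = sum_i lambda_ij^2.
C = (Lam ** 2).sum(axis=0)
```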
This paper proposes an entropy-based method for measuring factor contributions of   ξ j to the manifest variable vector   X = ( X 1 , X 2 , , X p ) concerned, which can treat not only orthogonal factors, but also oblique cases. The present paper has five sections in addition to this section. In Section 2, the conventional method for measuring factor contributions is reviewed. Section 3 considers the factor analysis model in view of entropy and makes a preliminary discussion on measurement of factor contribution. In Section 4, an entropy-based path analysis is applied as a tool to measure factor contributions. Contributions of factors ξ j   are defined by the total effects of the factors on the manifest variable vector, and the contributions are decomposed into those to manifest variables and subsets of manifest variables. Section 5 illustrates the present method using a numerical example. Finally, in Section 6, some conclusions are provided.

2. Relative Factor Contributions in the Conventional Method

In the conventional approach, for the orthogonal factor model (1), the contribution ratio of ξ j is defined by
  \mathrm{RC}_j = \frac{C_j}{\sum_{l=1}^{m} C_l} = \frac{\sum_{i=1}^{p} \lambda_{ij}^2}{\sum_{l=1}^{m} \sum_{k=1}^{p} \lambda_{kl}^2}    (4)
The above measure is referred to as the factor contribution ratio in the common factor space. Let R i   be the multiple correlation coefficient of latent variable vector ξ = ( ξ 1 , ξ 2 , , ξ m ) T and manifest variable   X i . Then, for standardized manifest variable   X i , we have
  R_i^2 = \sum_{j=1}^{m} \lambda_{ij}^2    (5)
The above quantity can be interpreted as the effect (explanatory power) of latent variable vector ξ = ( ξ j )   on manifest variable   X i ; however, the denominator of (4) is the sum of those effects (5) and there is no theoretical basis to interpret it. Another contribution ratio of   ξ j   is referred to as that in the whole space of X = ( X i ) , and is defined by
  \widetilde{\mathrm{RC}}_j = \frac{C_j}{\sum_{i=1}^{p} \mathrm{var}(X_i)} = \frac{\sum_{k=1}^{p} \lambda_{kj}^2}{\sum_{k=1}^{p} \left( \sum_{l=1}^{m} \lambda_{kl}^2 + \sigma_k^2 \right)}    (6)
If the manifest variables are standardized, we have
  \widetilde{\mathrm{RC}}_j = \frac{C_j}{p} = \frac{\sum_{k=1}^{p} \lambda_{kj}^2}{p}
Here, there is an issue similar to (4), because the denominator in (6) does not express the variation of the manifest variable vector X = ( X i ) . Indeed, it is the sum of the variances of manifest variables and does not include covariances between them. In the next section, the factor analysis model (1) is reconsidered in the framework of generalized linear models (GLMs), and the effects (contributions) of latent variables ξ j on the manifest variable vector X = ( X i ) , that is, factor contributions, are discussed through entropy [8].
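The two conventional ratios can be sketched numerically; the loadings below are hypothetical, and the manifest variables are assumed standardized so that (6) reduces to C_j / p:

```python
import numpy as np

Lam = np.array([[0.8, 0.1],
                [0.7, 0.2],
                [0.1, 0.9]])   # hypothetical loadings (standardized X_i)
p, m = Lam.shape

C = (Lam ** 2).sum(axis=0)     # conventional contributions C_j of (2)
RC = C / C.sum()               # ratio (4): contribution in the common factor space
RC_tilde = C / p               # ratio (6) for standardized variables: C_j / p
```

By construction the ratios (4) sum to one over the factors, while the ratios (6) sum to the total communality divided by p.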

3. Factor Analysis Model and Entropy

It is assumed that factors ε i and ξ j are normally distributed, and the factor analysis model (1) is reconsidered in the GLM framework. Let Λ = ( λ i j ) be a p × m factor loading matrix; let Φ be an m × m correlation matrix of common factor vector ξ = ( ξ 1 , ξ 2 , , ξ m ) T ; and let Ω be the   p × p variance-covariance matrix of unique factor vector ε = ( ε 1 , ε 2 , , ε p ) T . The conditional density function of X given   ξ , f ( x | ξ ) , is normal with mean Λ ξ and variance matrix   Ω , and is given as follows:
  f(x \mid \xi) = \frac{1}{(2\pi)^{p/2} |\Omega|^{1/2}} \exp\left( \frac{x^T \widetilde{\Omega} \Lambda \xi - \frac{1}{2} \xi^T \Lambda^T \widetilde{\Omega} \Lambda \xi}{|\Omega|} - \frac{x^T \widetilde{\Omega} x}{2|\Omega|} \right)
where   Ω ˜ is the cofactor matrix of   Ω . Let f ( x ) and g ( ξ ) be the marginal density functions of X and   ξ , respectively. Then, a basic predictive power measure for GLMs [9] is based on the Kullback–Leibler information [6], and applying it to the above model, we have
  \mathrm{KL}(X, \xi) = \iint f(x \mid \xi) g(\xi) \log \frac{f(x \mid \xi)}{f(x)} \, dx \, d\xi + \iint f(x) g(\xi) \log \frac{f(x)}{f(x \mid \xi)} \, dx \, d\xi = \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \Phi \Lambda^T}{|\Omega|}    (7)
The above measure was derived from a discussion on log odds ratios in GLMs [9], and is scale-invariant with respect to manifest variables   X i . The numerator of (7) is the explained entropy of X by   ξ , and the denominator is the dispersion of the unique factors in entropy, that is, the generalized variance of   ε = ( ε 1 , ε 2 , , ε p ) T . Thus, (7) expresses the total effect (contribution) of factor vector ξ = ( ξ j )   on manifest variable vector   X = ( X i )   in entropy, and is denoted by C ( ξ X ) in the present paper. The entropy coefficient of determination (ECD) is calculated as follows [9]:
  \mathrm{ECD}(X, \xi) = \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \Phi \Lambda^T}{\mathrm{tr}\, \widetilde{\Omega} \Lambda \Phi \Lambda^T + |\Omega|}    (8)
The denominator of the above measure is interpreted as the variation of manifest variable vector   X = ( X i ) in entropy and the numerator is the explained variation of random vector X in entropy. In this sense, ECD (8) is the factor contribution ratio of ξ = ( ξ j ) for the whole entropy space of X = ( X i ) , and it expresses the standardized total effect of ξ = ( ξ 1 , ξ 2 , , ξ m ) T on the manifest variable vector   X = ( X 1 , X 2 , , X p ) T , which is denoted by e T ( ξ X ) [8,10]. As for (6), in the present paper, the ECD is denoted by RC ˜ ( ξ X ) , that is, the relative contribution of factor vector ξ   for the whole space of manifest variable vector   X in entropy.
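Since \Omega is diagonal, its cofactor matrix satisfies \widetilde{\Omega} = |\Omega| \Omega^{-1}, so (7) and (8) reduce to expressions in \Omega^{-1} that are cheap to compute. A sketch (our own, not the authors' code):

```python
import numpy as np

def total_contribution(Lam, Phi, sigma2):
    """C(xi -> X) of (7): tr(Omega~ Lam Phi Lam^T) / |Omega|.

    For diagonal Omega the cofactor matrix satisfies
    Omega~ = |Omega| * Omega^{-1}, so the ratio equals
    tr(Omega^{-1} Lam Phi Lam^T).
    """
    Omega_inv = np.diag(1.0 / np.asarray(sigma2, dtype=float))
    return float(np.trace(Omega_inv @ Lam @ Phi @ Lam.T))

def ecd(Lam, Phi, sigma2):
    """Entropy coefficient of determination (8), rewritten as C / (C + 1)."""
    C = total_contribution(Lam, Phi, sigma2)
    return C / (C + 1.0)
```

In the orthogonal case (Phi = I) the trace collapses to the double sum of lambda_ij^2 / sigma_i^2, matching the later equation (10).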
Remark 1.
Let Σ be the p × p variance-covariance matrix of manifest variable vector X = ( X 1 , X 2 , , X p ) T and let Φ be the m × m correlation matrix of   ξ . Then, we have
  \Sigma = \Lambda \Phi \Lambda^T + \Omega    (9)
For assessing the goodness-of-fit of the models, the following overall coefficient of determination (OCD) is suggested ([11], p. 60) on the basis of (9):
  \mathrm{OCD}(X, \xi) = 1 - \frac{|\Omega|}{|\Sigma|} \left( = \frac{|\Sigma| - |\Omega|}{|\Sigma|} \right)
Determinant   | Ω |   is the generalized variance of unique factor vector ε = ( ε 1 , ε 2 , , ε p ) T and | Σ | is that of manifest variable vector X = ( X 1 , X 2 , , X p ) T . Then, OCD is interpreted as the ratio of the explained generalized variance of manifest variable vector X = ( X 1 , X 2 , , X p ) T by common factor vector ξ = ( ξ 1 , ξ 2 , , ξ m ) T   in the p-dimensional Euclidian space. On the other hand, from (8), it follows that
  \mathrm{ECD}(X, \xi) = 1 - \frac{|\Omega|}{\mathrm{tr}\, \widetilde{\Omega} \Lambda \Phi \Lambda^T + |\Omega|}
Hence, ECD is viewed as the ratio of the explained variation of the manifest variable vector in entropy.
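The contrast between OCD and ECD can be sketched numerically, assuming a diagonal \Omega with unique variances sigma_i^2:

```python
import numpy as np

def ocd(Lam, Phi, sigma2):
    """Overall coefficient of determination: OCD = 1 - |Omega| / |Sigma|,
    with Sigma = Lam Phi Lam^T + Omega as in (9)."""
    Omega = np.diag(np.asarray(sigma2, dtype=float))
    Sigma = Lam @ Phi @ Lam.T + Omega
    return 1.0 - np.linalg.det(Omega) / np.linalg.det(Sigma)
```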
Cofactor matrix \widetilde{\Omega} is diagonal and its (i, i) elements are \prod_{k \neq i} \sigma_k^2, i = 1, 2, \ldots, p. If the common factors are statistically independent, it follows that
  \mathrm{tr}\, \widetilde{\Omega} \Lambda \Phi \Lambda^T = \sum_{i=1}^{p} \prod_{k \neq i} \sigma_k^2 \sum_{j=1}^{m} \lambda_{ij}^2 = \sum_{i=1}^{p} \sum_{j=1}^{m} \lambda_{ij}^2 \prod_{k \neq i} \sigma_k^2
Thus, (7) is decomposed as
  \mathrm{KL}(X, \xi) = \sum_{i=1}^{p} \sum_{j=1}^{m} \frac{\lambda_{ij}^2}{\sigma_i^2}
As detailed below, in the present paper, the contribution of factor ξ j to X , C ( ξ j X ) , is defined by
  C(\xi_j \rightarrow X) = \sum_{i=1}^{p} \frac{\lambda_{ij}^2}{\sigma_i^2}    (10)
Remark 2.
The above contribution differs from the conventional definition of factor contribution (2) unless \sigma_i^2 = 1, i = 1, 2, \ldots, p. In this sense, we may say that standardization of the manifest variables in entropy amounts to setting all the unique factor variances to one.
In the next section, the contributions (effects) of factors   ξ j   to manifest variable vector   X are discussed in a general framework through an entropy-based path analysis [8].

4. Measurement of Factor Contribution Based on Entropy

A path diagram for the factor analysis model is given in Figure 1, in which the single-headed arrows indicate the directions of the effects of the factors and the double-headed curved arrows indicate the associations between the related variables. In this section, the common factors are assumed to be correlated, that is, we consider an oblique case, and an entropy-based path analysis [8] is applied to provide a general discussion of the measurement of factor contributions.
Theorem 1.
In the factor analysis model (1),
  \mathrm{KL}(X, \xi) = \sum_{i=1}^{p} \mathrm{KL}(X_i, \xi)
Proof. 
Let f i ( x i | ξ ) be the conditional density functions of manifest variables X i , given factor vector ξ ; let f i ( x i ) be the marginal density functions of X i ; let f ( x ) be the marginal density function of X ; and let g ( ξ ) be the marginal density function of common factor vector   ξ . As the manifest variables are conditionally independent, given factor vector ξ , the conditional density function of X is
  f(x \mid \xi) = \prod_{i=1}^{p} f_i(x_i \mid \xi)
From (7), we have
  \mathrm{KL}(X, \xi) = \iint \prod_{i=1}^{p} f_i(x_i \mid \xi) g(\xi) \log \frac{\prod_{k=1}^{p} f_k(x_k \mid \xi)}{f(x)} \, dx \, d\xi + \iint f(x) g(\xi) \log \frac{f(x)}{\prod_{k=1}^{p} f_k(x_k \mid \xi)} \, dx \, d\xi
  = \iint \left( \prod_{i=1}^{p} f_i(x_i \mid \xi) g(\xi) - f(x) g(\xi) \right) \log \prod_{k=1}^{p} f_k(x_k \mid \xi) \, dx \, d\xi
since the terms involving \log f(x) cancel after integration, because \int f(x \mid \xi) g(\xi) \, d\xi = f(x). For the same reason, \log \prod_{k=1}^{p} f_k(x_k) can be inserted, giving
  = \iint \prod_{i=1}^{p} f_i(x_i \mid \xi) g(\xi) \log \frac{\prod_{k=1}^{p} f_k(x_k \mid \xi)}{\prod_{k=1}^{p} f_k(x_k)} \, dx \, d\xi + \iint f(x) g(\xi) \log \frac{\prod_{k=1}^{p} f_k(x_k)}{\prod_{k=1}^{p} f_k(x_k \mid \xi)} \, dx \, d\xi
  = \sum_{k=1}^{p} \iint f_k(x_k \mid \xi) g(\xi) \log \frac{f_k(x_k \mid \xi)}{f_k(x_k)} \, dx_k \, d\xi + \sum_{k=1}^{p} \iint f_k(x_k) g(\xi) \log \frac{f_k(x_k)}{f_k(x_k \mid \xi)} \, dx_k \, d\xi
  = \sum_{i=1}^{p} \mathrm{KL}(X_i, \xi)
 ☐
In model (1) with correlation matrix Φ = ( φ i j ) , we have
  \mathrm{KL}(X_i, \xi) = \frac{\sum_{k=1}^{m} \sum_{l=1}^{m} \lambda_{ik} \varphi_{kl} \lambda_{il}}{\sigma_i^2}    (11)
The above quantity is referred to as the contribution of ξ to X i , and is denoted as C ( ξ X i ) . Let R i be the multiple correlation coefficient of   X i   and ξ = ( ξ j ) . Then,
  C(\xi \rightarrow X_i) = \frac{R_i^2}{1 - R_i^2} \left( = \mathrm{KL}(X_i, \xi) \right)    (12)
From Theorem 1, we then have
  C(\xi \rightarrow X) = \sum_{i=1}^{p} \frac{R_i^2}{1 - R_i^2} \left( = \mathrm{KL}(X, \xi) \right)    (13)
Hence, Theorem 1 gives the following decomposition of the contribution of ξ on X into those on the single manifest variables X i (11):
  C(\xi \rightarrow X) = \sum_{i=1}^{p} C(\xi \rightarrow X_i)
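Theorem 1 and (11) to (13) can be checked numerically: the per-variable terms (\Lambda \Phi \Lambda^T)_{ii} / \sigma_i^2, which equal R_i^2 / (1 - R_i^2), sum to the matrix-level quantity (7). A sketch with hypothetical oblique inputs:

```python
import numpy as np

Lam = np.array([[0.8, 0.1],
                [0.7, 0.2],
                [0.1, 0.9]])          # hypothetical loadings
Phi = np.array([[1.0, 0.3],
                [0.3, 1.0]])          # factor correlations (oblique case)
sigma2 = np.array([0.3, 0.4, 0.2])    # unique variances

common = Lam @ Phi @ Lam.T

# (11): KL(X_i, xi) = (Lam Phi Lam^T)_ii / sigma_i^2 = C(xi -> X_i)
per_variable = np.diag(common) / sigma2

# (12): the same values via R_i^2 / (1 - R_i^2), with var(X_i) from (9)
var_x = np.diag(common) + sigma2
R2 = np.diag(common) / var_x

# Theorem 1: the total contribution (7) is the sum of the per-variable ones
total = float(np.trace(np.diag(1.0 / sigma2) @ common))
```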
Remark 3.
Notice that in the denominator of (4), the total contribution of all factors \xi_j is simply defined as the sum of the individual contributions:
  \sum_{l=1}^{m} C_l = \sum_{i=1}^{p} R_i^2
On the other hand, in the present approach, the total effect (contribution) of factor vector \xi on manifest variable vector X is decomposed into the contributions to the individual manifest variables X_i, as in (12) and (13).
Let X s u b be any sub-vector of manifest variable vector X = ( X 1 , X 2 , , X p ) T . Then, the contribution of factor vector ξ to X s u b is defined by
  C(\xi \rightarrow X_{sub}) = \mathrm{KL}(X_{sub}, \xi)
From Theorem 1, we have the following corollary.
Corollary 1.
Let X ( 1 ) = ( X i 1 , X i 2 , , X i q ) T and X ( 2 ) = ( X j 1 , X j 2 , , X j p q ) T be a decomposition of manifest variable vector X = ( X 1 , X 2 , , X p ) T , where q < p . Then, for factor analysis model (1), it follows that
  C(\xi \rightarrow X) = C(\xi \rightarrow X^{(1)}) + C(\xi \rightarrow X^{(2)}),
  C(\xi \rightarrow X^{(1)}) = \sum_{k=1}^{q} C(\xi \rightarrow X_{i_k}), \quad C(\xi \rightarrow X^{(2)}) = \sum_{k=1}^{p-q} C(\xi \rightarrow X_{j_k})
Proof: 
From a similar discussion to the proof of Theorem 1, we have
  \mathrm{KL}(X, \xi) = \mathrm{KL}(X^{(1)}, \xi) + \mathrm{KL}(X^{(2)}, \xi),
  \mathrm{KL}(X^{(1)}, \xi) = \sum_{k=1}^{q} \mathrm{KL}(X_{i_k}, \xi), \quad \mathrm{KL}(X^{(2)}, \xi) = \sum_{k=1}^{p-q} \mathrm{KL}(X_{j_k}, \xi)
Hence, the corollary follows.
Next, the standardized total effects of the single factors \xi_j on manifest variable vector X, that is, e_T(\xi_j \rightarrow X), are calculated [8,10]. Let \xi_{/j} = (\xi_1, \ldots, \xi_{j-1}, \xi_{j+1}, \ldots, \xi_m)^T; let f(x, \xi_{/j} \mid \xi_j) be the conditional density function of X and \xi_{/j} given \xi_j; f(x \mid \xi_j) the conditional density function of X given \xi_j; g(\xi_{/j} \mid \xi_j) the conditional density function of \xi_{/j} given \xi_j; and g_j(\xi_j) the marginal density function of \xi_j. Then, we have
  \mathrm{KL}(X, \xi_{/j} \mid \xi_j) = \iiint f(x, \xi) \log \frac{f(x, \xi_{/j} \mid \xi_j)}{f(x \mid \xi_j) g(\xi_{/j} \mid \xi_j)} \, dx \, d\xi_{/j} \, d\xi_j + \iiint f(x \mid \xi_j) g(\xi_{/j} \mid \xi_j) g_j(\xi_j) \log \frac{f(x \mid \xi_j) g(\xi_{/j} \mid \xi_j)}{f(x, \xi_{/j} \mid \xi_j)} \, dx \, d\xi_{/j} \, d\xi_j = \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \mathrm{cov}(\xi, X \mid \xi_j)}{|\Omega|}
where \mathrm{cov}(\xi, X \mid \xi_j) is the m \times p conditional covariance matrix given \xi_j, whose (k, i) elements are \mathrm{cov}(\xi_k, X_i \mid \xi_j). The standardized total effect e_T(\xi_j \rightarrow X) is given by
  e_T(\xi_j \rightarrow X) = \frac{\mathrm{KL}(X, \xi) - \mathrm{KL}(X, \xi_{/j} \mid \xi_j)}{\mathrm{KL}(X, \xi) + 1} = \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \left( \mathrm{cov}(\xi, X) - \mathrm{cov}(\xi, X \mid \xi_j) \right)}{\mathrm{tr}\, \widetilde{\Omega} \Lambda \mathrm{cov}(\xi, X) + |\Omega|}
The standardized total effect   e T ( ξ j X ) [8] is interpreted as the contribution ratio of factor   ξ j   in the whole entropy space of   X , and in the present paper, it is denoted by R C ˜ ( ξ j X ) . The contribution of factor   ξ j measured in entropy is defined by
  C(\xi_j \rightarrow X) = \mathrm{KL}(X, \xi) - \mathrm{KL}(X, \xi_{/j} \mid \xi_j) = \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \mathrm{cov}(\xi, X)}{|\Omega|} - \frac{\mathrm{tr}\, \widetilde{\Omega} \Lambda \mathrm{cov}(\xi, X \mid \xi_j)}{|\Omega|}
As for (6), the relative contribution of factor   ξ j on X is given by
  \mathrm{RC}(\xi_j \rightarrow X) = \frac{\widetilde{\mathrm{RC}}(\xi_j \rightarrow X)}{\widetilde{\mathrm{RC}}(\xi \rightarrow X)} = \frac{C(\xi_j \rightarrow X)}{C(\xi \rightarrow X)}
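For the single-factor contribution, a closed form is available under the normality assumption: with var(\xi_j) = 1, conditioning on \xi_j reduces \mathrm{cov}(\xi, X) = \Phi \Lambda^T by \Phi e_j e_j^T \Phi \Lambda^T, so the trace difference collapses to \sum_i (\Lambda \Phi)_{ij}^2 / \sigma_i^2. This simplification is our own algebra, not stated explicitly in the paper; a sketch:

```python
import numpy as np

def single_factor_contribution(Lam, Phi, sigma2, j):
    """C(xi_j -> X) = KL(X, xi) - KL(X, xi_{/j} | xi_j).

    Under normality with var(xi_j) = 1, the conditional covariance is
    cov(xi, X | xi_j) = Phi Lam^T - Phi e_j e_j^T Phi Lam^T, and the
    trace difference simplifies to sum_i (Lam Phi)_{ij}^2 / sigma_i^2.
    """
    w = (Lam @ Phi)[:, j]     # column j of Lam Phi
    return float(np.sum(w ** 2 / np.asarray(sigma2, dtype=float)))
```

In the orthogonal case Phi = I this reduces to the sum of lambda_ij^2 / sigma_i^2, in line with Theorem 3 below.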
Concerning factor contributions of ξ j on the single manifest variables X i , that is, C ( ξ j X i ) , the following theorem can be stated.
Theorem 2.
In the factor analysis model (1),
  C(\xi_j \rightarrow X) = \sum_{i=1}^{p} C(\xi_j \rightarrow X_i)
Proof: 
From Theorem 1, it follows that
  \mathrm{KL}(X, \xi_{/j} \mid \xi_j) = \sum_{i=1}^{p} \mathrm{KL}(X_i, \xi_{/j} \mid \xi_j)
Then, we have
  C(\xi_j \rightarrow X_i) = \mathrm{KL}(X_i, \xi) - \mathrm{KL}(X_i, \xi_{/j} \mid \xi_j)
and,
  C(\xi_j \rightarrow X) = \mathrm{KL}(X, \xi) - \mathrm{KL}(X, \xi_{/j} \mid \xi_j) = \sum_{i=1}^{p} \mathrm{KL}(X_i, \xi) - \sum_{i=1}^{p} \mathrm{KL}(X_i, \xi_{/j} \mid \xi_j) = \sum_{i=1}^{p} \left( \mathrm{KL}(X_i, \xi) - \mathrm{KL}(X_i, \xi_{/j} \mid \xi_j) \right) = \sum_{i=1}^{p} C(\xi_j \rightarrow X_i). ☐
From the above theorem, we have the following corollary.
Corollary 2.
Let X^{(1)} = (X_{i_1}, X_{i_2}, \ldots, X_{i_q})^T and X^{(2)} = (X_{j_1}, X_{j_2}, \ldots, X_{j_{p-q}})^T be a decomposition of manifest variable vector X = (X_1, X_2, \ldots, X_p)^T, where q < p. Then, for factor analysis model (1), it follows that
  C(\xi_j \rightarrow X) = C(\xi_j \rightarrow X^{(1)}) + C(\xi_j \rightarrow X^{(2)}),
  C(\xi_j \rightarrow X^{(1)}) = \sum_{k=1}^{q} C(\xi_j \rightarrow X_{i_k}), \quad C(\xi_j \rightarrow X^{(2)}) = \sum_{k=1}^{p-q} C(\xi_j \rightarrow X_{j_k})
Proof: 
From a similar discussion in the proof of Theorem 2, the corollary follows. ☐
Remark 4.
Let X s u b be any sub-vector of manifest variable vector X = ( X 1 , X 2 , , X p ) T . By substituting X for X s u b in the above discussion, C ( ξ X s u b ) , C ( ξ j X s u b ) , RC ˜ ( ξ j X s u b ) , and RC ( ξ j X s u b ) can be defined.
For orthogonal factor analysis models, the following theorem holds true.
Theorem 3.
In factor analysis model (1), if common factors ξ j are statistically independent, then
  C(\xi \rightarrow X) = \sum_{j=1}^{m} \sum_{i=1}^{p} C(\xi_j \rightarrow X_i).
Proof: 
From model (1), we have
  C(\xi_j \rightarrow X_i) = \mathrm{KL}(X_i, \xi) - \mathrm{KL}(X_i, \xi_{/j} \mid \xi_j) = \frac{\lambda_{ij}^2}{\sigma_i^2}
This completes the proof. ☐
From the above discussion, if common factors ξ j are statistically independent, (10) is derived. Moreover, we have
  \widetilde{\mathrm{RC}}(\xi_j \rightarrow X) = \frac{\mathrm{KL}(X, \xi) - \mathrm{KL}(X, \xi_{/j} \mid \xi_j)}{\mathrm{KL}(X, \xi) + 1} = \frac{\sum_{i=1}^{p} \lambda_{ij}^2 / \sigma_i^2}{\mathrm{KL}(X, \xi) + 1}
This measure is the relative contribution ratio of ξ j   for the variation of X in entropy. The relative contributions of ξ j   on X in entropy are calculated as follows:
  \mathrm{RC}(\xi_j \rightarrow X) = \frac{C(\xi_j \rightarrow X)}{C(\xi \rightarrow X)} = \frac{\sum_{i=1}^{p} \lambda_{ij}^2 / \sigma_i^2}{\sum_{j=1}^{m} \sum_{i=1}^{p} \lambda_{ij}^2 / \sigma_i^2}
Remark 5.
It is difficult to use OCD for assessing factor contributions, because | Σ | cannot be decomposed as in the above discussion.

5. Numerical Example

In order to illustrate the present method, we use the data shown in Table 1 [12]. In this table, manifest variables X_1, X_2, and X_3 are subjects in the liberal arts, and variables X_4 and X_5 are subjects in the sciences. First, orthogonal factor analysis (varimax method, S-PLUS ver. 8.2) is applied to the data, and the results are shown in Table 2. From the estimated factor loadings, the first factor is interpreted as an ability relating to the liberal arts, and the second factor as one relating to the sciences. According to the factor contributions C(\xi_j \rightarrow X) shown in Table 3, the contribution of factor \xi_2 is about twice as large as that of factor \xi_1 from the viewpoint of entropy, and from the relative contributions \widetilde{\mathrm{RC}}(\xi_j \rightarrow X), about 30% of the variation of manifest variable vector X in entropy is explained by factor \xi_1 and about 60% by factor \xi_2. The relative contribution \widetilde{\mathrm{RC}}(\xi \rightarrow X) in Table 3 implies that about 90% of the entropy of manifest variable vector X is explained by the two factors. On the other hand, in the conventional method, the measured factor contributions C_j of \xi_1 and \xi_2 are almost equal (Table 4). As discussed in the present paper, the conventional method is intuitive and lacks a logical foundation for measuring contributions of factors to manifest variable vectors multidimensionally. Table 5 decomposes the contribution of \xi to X into components C(\xi_j \rightarrow X_i). The contribution of \xi_2 to X_5 is prominent compared with the other contributions.
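The orthogonal contributions in Table 3 can be approximately reproduced from the rounded loadings published in Table 2 via (10). Because the loadings are rounded to two decimals, the second factor's total comes out near 6.38 rather than the paper's 6.23 (the X_5 cell is the most sensitive); a sketch:

```python
import numpy as np

# Loadings and uniquenesses as published (rounded) in Table 2.
Lam = np.array([[0.60, 0.39],
                [0.75, 0.24],
                [0.65, 0.00],
                [0.32, 0.59],
                [0.00, 0.92]])
sigma2 = np.array([0.50, 0.38, 0.58, 0.55, 0.16])

# Orthogonal case (10): C(xi_j -> X) = sum_i lambda_ij^2 / sigma_i^2,
# whose summands are the cells of Table 5.
cells = Lam ** 2 / sigma2[:, None]
C_j = cells.sum(axis=0)
RC_tilde = C_j / (C_j.sum() + 1.0)   # relative contributions in entropy
```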
From the discussion in the previous section, the contributions of factors are flexibly calculated. For example, it is reasonable to divide the manifest variable vector into X ( 1 ) = ( X 1 , X 2 , X 3 ) and X ( 2 ) = ( X 4 , X 5 ) , because the first sub-vector is related to the liberal arts and the second one to the sciences. First, the contributions of ξ 1 and ξ 2 to X ( 1 ) are calculated according to the present method, and the details are given as follows:
  C(\xi_1 \rightarrow X^{(1)}) = C(\xi_1 \rightarrow X_1) + C(\xi_1 \rightarrow X_2) + C(\xi_1 \rightarrow X_3) = 0.72 + 1.49 + 0.72 = 2.93,
  C(\xi_2 \rightarrow X^{(1)}) = 0.30 + 0.15 + 0.00 = 0.45
  C(\xi \rightarrow X^{(1)}) = C(\xi_1 \rightarrow X^{(1)}) + C(\xi_2 \rightarrow X^{(1)}) = 2.93 + 0.45 = 3.38,
  \widetilde{\mathrm{RC}}(\xi \rightarrow X^{(1)}) = \frac{C(\xi \rightarrow X^{(1)})}{C(\xi \rightarrow X^{(1)}) + 1} = \frac{3.38}{3.38 + 1} = 0.77    (14)
  \widetilde{\mathrm{RC}}(\xi_1 \rightarrow X^{(1)}) = \frac{C(\xi_1 \rightarrow X^{(1)})}{C(\xi \rightarrow X^{(1)}) + 1} = \frac{2.93}{3.38 + 1} = 0.67    (15)
  \widetilde{\mathrm{RC}}(\xi_2 \rightarrow X^{(1)}) = \frac{0.45}{3.38 + 1} = 0.10    (16)
  \mathrm{RC}(\xi_1 \rightarrow X^{(1)}) = \frac{C(\xi_1 \rightarrow X^{(1)})}{C(\xi \rightarrow X^{(1)})} = \frac{2.93}{2.93 + 0.45} = 0.87    (17)
  \mathrm{RC}(\xi_2 \rightarrow X^{(1)}) = \frac{0.45}{2.93 + 0.45} = 0.13    (18)
From (14), 77% of the entropy of manifest variable sub-vector X^{(1)} is explained by the two factors: 67% is explained by factor \xi_1 (15) and 10% by factor \xi_2 (16). From the relative contributions (17) and (18), 87% of the total contribution of the two factors is made by factor \xi_1 and 13% by factor \xi_2.
On the other hand, the contributions of \xi_1 and \xi_2 to X^{(2)} = (X_4, X_5) are calculated as follows:
  C(\xi_1 \rightarrow X^{(2)}) = C(\xi_1 \rightarrow X_4) + C(\xi_1 \rightarrow X_5) = 0.19 + 0.00 = 0.19,
  C(\xi_2 \rightarrow X^{(2)}) = 0.63 + 5.14 = 5.77
  C(\xi \rightarrow X^{(2)}) = C(\xi_1 \rightarrow X^{(2)}) + C(\xi_2 \rightarrow X^{(2)}) = 0.19 + 5.77 = 5.96,
  \widetilde{\mathrm{RC}}(\xi \rightarrow X^{(2)}) = \frac{C(\xi \rightarrow X^{(2)})}{C(\xi \rightarrow X^{(2)}) + 1} = \frac{5.96}{5.96 + 1} = 0.86    (19)
  \widetilde{\mathrm{RC}}(\xi_1 \rightarrow X^{(2)}) = \frac{C(\xi_1 \rightarrow X^{(2)})}{C(\xi \rightarrow X^{(2)}) + 1} = \frac{0.19}{5.96 + 1} = 0.03    (20)
  \widetilde{\mathrm{RC}}(\xi_2 \rightarrow X^{(2)}) = \frac{5.77}{5.96 + 1} = 0.83    (21)
  \mathrm{RC}(\xi_1 \rightarrow X^{(2)}) = \frac{C(\xi_1 \rightarrow X^{(2)})}{C(\xi \rightarrow X^{(2)})} = \frac{0.19}{5.96} = 0.03    (22)
  \mathrm{RC}(\xi_2 \rightarrow X^{(2)}) = \frac{5.77}{5.96} = 0.97    (23)
From (19), 86% of the entropy of manifest variable sub-vector X^{(2)} is explained by the two factors: 3% is explained by factor \xi_1 (20) and 83% by factor \xi_2 (21). The contribution ratios of the factors to sub-vector X^{(2)} are calculated in (22) and (23); 97% of the total contribution is due to factor \xi_2.
Second, factor contributions in an oblique case are calculated. The estimated factor loadings and the correlation matrix of the factors based on the covarimin method are shown in Table 6 and Table 7, respectively. Based on the factor loadings in Table 6, factor \xi_1 is interpreted as an ability for subjects in the liberal arts and factor \xi_2 as an ability for subjects in the sciences. The results are similar to those in the orthogonal case mentioned above, because the correlation between the factors is not strong. Table 8 shows the decomposition of C(\xi \rightarrow X) based on Theorems 1 and 2. In this case, it is noted that C(\xi \rightarrow X) \neq C(\xi_1 \rightarrow X) + C(\xi_2 \rightarrow X); however, C(\xi \rightarrow X) = \sum_{i=1}^{5} C(\xi \rightarrow X_i). As in the orthogonal analysis above, the contributions of \xi_1 and \xi_2 to sub-vectors of manifest variable vector X can also be calculated from the table. Table 9 shows the contributions of the factors to manifest variable vector X; relative to the total contribution C(\xi \rightarrow X), factor \xi_1 accounts for 42% and factor \xi_2 for 71% (the ratios exceed 100% in total because the factors are correlated).
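The oblique decomposition in Table 8 can likewise be approximated from Tables 6 and 7. Using the simplification C(\xi_j \rightarrow X_i) = (\Lambda \Phi)_{ij}^2 / \sigma_i^2 (our own algebra, derived from the conditional covariance under normality), the rounded published estimates reproduce most cells to about two decimals; the X_5 entries drift (5.29 versus the paper's 5.43) because of rounding in the loadings. A sketch:

```python
import numpy as np

# Rounded oblique estimates from Table 6 (loadings, uniquenesses)
# and Table 7 (factor correlation 0.315).
Lam = np.array([[0.59,  0.24],
                [0.77,  0.00],
                [0.68, -0.12],
                [0.29,  0.52],
                [0.00,  0.92]])
Phi = np.array([[1.000, 0.315],
                [0.315, 1.000]])
sigma2 = np.array([0.50, 0.41, 0.58, 0.55, 0.16])

# Cells of Table 8: C(xi_j -> X_i) = (Lam Phi)_{ij}^2 / sigma_i^2
W = Lam @ Phi
cells = W ** 2 / sigma2[:, None]
C_j = cells.sum(axis=0)          # row totals C(xi_j -> X)

# Column totals C(xi -> X_i) and grand total C(xi -> X) from (7);
# note C(xi -> X) != C_j.sum() in the oblique case.
C_xi = np.diag(Lam @ Phi @ Lam.T) / sigma2
C_total = float(C_xi.sum())
```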

6. Discussion

For orthogonal factor analysis models, the conventional method measures factor contributions (effects) by the sums of the squared factor loadings related to the factors (2); however, there is no logical foundation for interpreting these sums. It is more reasonable to measure factor contributions as the effects of the factors on the manifest variable vector concerned. The present paper has proposed a method for measuring factor contributions through entropy, that is, by applying an entropy-based path analysis approach. The method measures the contribution of factor vector \xi to manifest variable vector X and decomposes it into the contributions of factors \xi_j to manifest variables X_i and/or to sub-vectors of X. Comparing (2) and (10), under the standardization of unique factor variances \sigma_i^2 = 1, the present method coincides with the conventional one. As discussed in this paper, the present method can be employed in oblique factor analysis as well, as illustrated in the numerical example. The present method has a theoretical basis for measuring factor contributions in a framework of entropy, and it is a novel approach to factor analysis. The present paper confines itself to the usual factor analysis model. More complicated models, such as mixtures of normal factor analysis models [13], are excluded, and further study is needed to apply the entropy-based method to such models.

Author Contributions

N.E. conceived the study; N.E., M.T., and C.G.B. discussed the idea for measuring factor contributions; N.E. and M.T. proved the theorems in the paper; N.E. and C.G.B. computed the numerical example; N.E. wrote the paper, and the coauthors reviewed it.

Funding

Grant-in-aid for Scientific Research 18993038, Ministry of Education, Culture, Sports, Science, and Technology of Japan.

Acknowledgments

The authors would like to thank the referees for their useful comments and suggestions to improve the first version of the present paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Spearman, C. "General intelligence," objectively determined and measured. Am. J. Psychol. 1904, 15, 201–293.
  2. Thurstone, L.L. The Vectors of Mind: Multiple Factor Analysis for the Isolation of Primary Traits; The University of Chicago Press: Chicago, IL, USA, 1935.
  3. Young, A.G.; Pearce, S. A beginner's guide to factor analysis: Focusing on exploratory factor analysis. Quant. Methods Psychol. 2013, 9, 79–94.
  4. Bartholomew, D.J. Latent Variable Models and Factor Analysis; Oxford University Press: New York, NY, USA, 1987.
  5. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
  6. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
  7. Joe, H. Relative entropy measures of multivariate dependence. J. Am. Stat. Assoc. 1989, 84, 157–164.
  8. Eshima, N.; Tabata, M.; Borroni, C.G.; Kano, Y. An entropy-based approach to path analysis of structural generalized linear models: A basic idea. Entropy 2015, 17, 5117–5132.
  9. Eshima, N.; Tabata, M. Entropy coefficient of determination for generalized linear models. Comput. Stat. Data Anal. 2010, 54, 1381–1389.
  10. Eshima, N.; Borroni, C.G.; Tabata, M. Relative importance assessment of explanatory variables in generalized linear models: An entropy-based approach. Stat. Appl. 2016, 16, 107–122.
  11. Everitt, B.S. An Introduction to Latent Variable Models; Chapman and Hall: London, UK, 1984.
  12. Adachi, K.; Trendafilov, N.T. Some mathematical properties of the matrix decomposition solution in factor analysis. Psychometrika 2018, 83, 407–424.
  13. Attias, H. Independent factor analysis. Neural Comput. 1999, 11, 803–851.
Figure 1. Path diagram for factor analysis model (1) ( m = 2 ) .
Table 1. Data for illustrating factor analysis.

Subject   Japanese X_1   English X_2   Social X_3   Mathematics X_4   Science X_5
1         64             65            83           69                70
2         54             56            53           40                32
3         80             68            75           74                84
4         71             65            40           41                68
5         63             61            60           56                80
6         47             62            33           57                87
7         42             53            50           38                23
8         54             17            46           58                58
9         57             48            59           26                17
10        54             72            58           55                30
11        67             82            52           50                44
12        71             82            54           67                28
13        53             67            74           75                53
14        90             96            63           87                100
15        71             69            74           76                42
16        61             100           92           53                58
17        61             69            48           63                71
18        87             84            64           65                53
19        77             75            78           37                44
20        57             27            41           54                30
Table 2. Factor loadings of orthogonal factor analysis (χ² = 0.55, df = 1, P = 0.45).

             X_1    X_2    X_3    X_4    X_5
ξ_1          0.60   0.75   0.65   0.32   0.00
ξ_2          0.39   0.24   0.00   0.59   0.92
uniqueness   0.50   0.38   0.58   0.55   0.16
Table 3. Factor contributions based on entropy (orthogonal case).

              ξ_1    ξ_2    Total
C(ξ_j → X)    3.11   6.23   9.34 = C(ξ → X)
RC̃(ξ_j → X)   0.30   0.60   0.90 = RC̃(ξ → X)
RC(ξ_j → X)   0.33   0.67   1
Table 4. Factor contributions with the conventional method.

        ξ_1    ξ_2    Total
C_j     1.44   1.39   2.83
RC̃_j    0.29   0.28   0.57
RC_j    0.51   0.49   1
Table 5. Decomposition of factor contribution C(ξ → X) into C(ξ_j → X_i).

                     X_1    X_2    X_3    X_4    X_5    Total = C(ξ_j → X)
ξ_1                  0.72   1.49   0.72   0.19   0.00   3.11
ξ_2                  0.30   0.15   0.00   0.63   5.14   6.23
Total = C(ξ → X_i)   1.01   1.64   0.72   0.82   5.14   9.34
Table 6. Factor loadings of oblique factor analysis (χ² = 0.55, df = 1, P = 0.45).

             X_1    X_2    X_3     X_4    X_5
ξ_1          0.59   0.77   0.68    0.29   0.00
ξ_2          0.24   0.00   -0.12   0.52   0.92
uniqueness   0.50   0.41   0.58    0.55   0.16
Table 7. Correlation matrix of factors.

       ξ_1     ξ_2
ξ_1    1       0.315
ξ_2    0.315   1
Table 8. Decomposition of factor contribution C(ξ → X) into C(ξ_j → X_i) (oblique case).

             X_1    X_2    X_3    X_4    X_5    Total = C(ξ_j → X)
ξ_1          0.90   1.44   0.70   0.37   0.54   3.95
ξ_2          0.37   0.14   0.01   0.68   5.43   6.65
C(ξ → X_i)   1.01   1.44   0.73   0.82   5.43   C(ξ → X) = 9.43
Table 9. Factor contributions based on entropy (oblique case).

              ξ_1    ξ_2    Effect of ξ on X
C(ξ_j → X)    3.95   6.65   C(ξ → X) = 9.43
RC̃(ξ_j → X)   0.38   0.64   RC̃(ξ → X) = 0.90
RC(ξ_j → X)   0.42   0.71
