+ - 0:00:00
Notes for current slide
Notes for next slide

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Internal Consistency of Multilevel Data

Cluster Means, Centering, and Construct Meanings

Mark Lai

University of Southern California

2020/11/09

1 / 57

Outline

Reliability in factor analysis

2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Outline

Reliability in factor analysis

Multilevel reliability

2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Outline

Reliability in factor analysis

Multilevel reliability

Issues of level-specific reliability coefficients

  1. Reliability of latent scores
  2. Cross-level invariance
  3. Construct meanings
2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Outline

Reliability in factor analysis

Multilevel reliability

Issues of level-specific reliability coefficients

  1. Reliability of latent scores
  2. Cross-level invariance
  3. Construct meanings

Reliability indices for observed composite scores

  • ω2l, ωb, ωw
2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Some alternative indices I proposed to solve these limitations

Looking forward to comments and suggestions; whether I'm doing something wrong or right

Outline

Reliability in factor analysis

Multilevel reliability

Issues of level-specific reliability coefficients

  1. Reliability of latent scores
  2. Cross-level invariance
  3. Construct meanings

Reliability indices for observed composite scores

  • ω2l, ωb, ωw

Longitudinal Data?

2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Some alternative indices I proposed to solve these limitations

Looking forward to comments and suggestions; whether I'm doing something wrong or right

Importance of Reliability

  • Psychological scales are not perfect
3 / 57

Importance of Reliability

  • Psychological scales are not perfect

  • Certain level of reliability needed

    • Statistical analyses are not trustworthy when the numbers are not consistent

Image credit: Reliability by Nick Youngson CC BY-SA 3.0 Alpha Stock Images

3 / 57

APA Journal Article Reporting Standards (JARS)

  • In the Psychometrics section (Appelbaum, Cooper, Kline, Mayo-Wilson, Nezu, and Rao, 2018), researchers were asked to

Estimate and report values of reliability coefficients for the scores analyzed (i.e., the research's sample) (p. 7)

4 / 57

Similar recommendations can be found in numerous journal and methodological guidelines

Reliability

5 / 57

Just a quick introduction on the foundational work on reliability that this research relies on.

Classical Test Theory

Lord & Novick (1968)

Observed score = True score + Error

Y=T+E

6 / 57

For example, we ask students report their attitutes toward math

Classical Test Theory

Lord & Novick (1968)

Observed score = True score + Error

Y=T+E

T and E independent, so

σY2=σT2+σE2

6 / 57

For example, we ask students report their attitutes toward math

Classical Test Theory

Lord & Novick (1968)

Observed score = True score + Error

Y=T+E

T and E independent, so

σY2=σT2+σE2

Reliability ρ=σT2σY2=σT2σT2+σE2=[Corr(Y,T)]2

6 / 57

For example, we ask students report their attitutes toward math

Latent Variable/Factor Analysis

(Essential) Tau-equivalence

p items: k=1,,p

Yk=νk+η+ϵk

7 / 57

When we have multiple items, we can estimate the error variance

For the true score proportion of Y, it's on the same metric/unit as the latent variable

Latent Variable/Factor Analysis

(Essential) Tau-equivalence

p items: k=1,,p

Yk=νk+η+ϵk Var(η)=ψ, Var(ϵk)=θk, ϵk and ϵk independent

Cov(Yk,Yk)=ψ

7 / 57

When we have multiple items, we can estimate the error variance

For the true score proportion of Y, it's on the same metric/unit as the latent variable

Latent Variable/Factor Analysis

(Essential) Tau-equivalence

p items: k=1,,p

Yk=νk+η+ϵk Var(η)=ψ, Var(ϵk)=θk, ϵk and ϵk independent

Cov(Yk,Yk)=ψ

Unweighted (unit-weight) composite: Z=kYj

Variance of unweighted composite: Var(Z)=p2ψ+kθk

7 / 57

When we have multiple items, we can estimate the error variance

For the true score proportion of Y, it's on the same metric/unit as the latent variable

Latent Variable/Factor Analysis

(Essential) Tau-equivalence

p items: k=1,,p

Yk=νk+η+ϵk Var(η)=ψ, Var(ϵk)=θk, ϵk and ϵk independent

Cov(Yk,Yk)=ψ

Unweighted (unit-weight) composite: Z=kYj

Variance of unweighted composite: Var(Z)=p2ψ+kθk Reliability = p2ψVar(Z), or Cronbach's α

7 / 57

When we have multiple items, we can estimate the error variance

For the true score proportion of Y, it's on the same metric/unit as the latent variable

There were different ways to justify the derivation of α

Latent Variable

Congeneric

Yk=νk+λkη+ϵk

8 / 57

Latent Variable

Congeneric

Yk=νk+λkη+ϵk

  • True Score Variance VTrue=k(λk)2ψ
  • Error Variance = VError=kθk

Composite reliability ω=VTrueVTrue+VError

8 / 57

Latent Variable

Congeneric

Yk=νk+λkη+ϵk

  • True Score Variance VTrue=k(λk)2ψ
  • Error Variance = VError=kθk

Composite reliability ω=VTrueVTrue+VError

More generally, with Cov([ϵ1,ϵ2,])=Θ, VError=1Θ1

8 / 57

Reliability is a property of observed test scores (Z), not the latent scores (η)

9 / 57

Multilevel Data

Lai, M. H. C. (2020). Composite reliability of multilevel data: It's about observed scores and construct meanings. Psychological Methods. Advance online publication. https://doi.org/10.1037/met0000287

10 / 57

Example

2007 Trends in International Mathematics and Science Study (TIMSS; Williams et al., 2009)

  • 7,896 students (4th grade) from 515 schools

Positive attitudes toward math (PATM)

Item Wording
AS4MAMOR Would like to do more math
AS4MAENJ I enjoy learning mathematics
AS4MALIK I like math
AS4MABOR Math is boring (reverse-coded)
11 / 57

Multilevel Reliability Not Consistently Reported

Kim et al. (2016): Only 54% reported reliability, among 39 articles using multilevel confirmatory factor analysis (MCFA)

  • Usually only one reliability reported for one scale
12 / 57

However, discussion on multilevel reliability is not new

Multilevel Reliability

  • Raykov and du Toit (2005); Raykov and Marcoulides (2006)

    • Two-level composite reliability
  • Cranford, Shrout, Iida, Rafaeli, Yip, and Bolger (2006)

    • Generalizability Theory framework
    • Reliability of change
  • Geldhof, Preacher, and Zyphur (2014)

    • Level-specific reliability (within and between)
    • Most popular with cross-sectional data
    • Only approach discussed in Kim et al. (2016)
13 / 57

Geldhof et al. (2014)

14 / 57

Geldhof et al. (2014)

"Unconstrained" Multilevel Factor Model

j indexes cluster

Yij=ν+λbηjb+λjwηijw+ϵij

ϵij=ϵjb+ϵijw Var(ηb)=ψb, Var(ηjw)=ψw

Var(ϵb)=θb, Var(ϵjw)=θw

Loading invariance across clusters: λjw=λw for all j

14 / 57

No cross-level invariance

ϵ is the uniqueness, separated into the within and the between level

Geldhof et al. (2014)

Fixed ψb=ψw=1 for identification

ω~b=(k=1pλkb)2(k=1pλkb)2+k=1pθkkbω~w=(k=1pλkw)2(k=1pλkw)2+k=1pθkkw.

For the TIMSS data

  • Est ω~w = .857, 95% CI [.849, .863]

  • Est ω~b = .977, 95% CI [.964, .987] !!

15 / 57

Use tilde to distinguish them with the indices I will discuss later

Why I got interested in this is that the reliability indices seem extremely large

ω~b is Usually High

Not uncommon in the literature . . .

16 / 57

ω~b is Usually High

Not uncommon in the literature . . .

Positive and negative affects: ω~b = .94 to .97 (Rush and Hofer, 2014)

Instructional Skills Questionnaire: α~b between .90 to .99 (Knol, Dolan, Mellenbergh, and van der Maas, 2016)

16 / 57
  1. Repeated measures within persons

  2. Multiple factors in ISQ, Team from Netherland

Are we that good at measuring between-level variables?

17 / 57

Three Issues

  1. Which "scores" are reliable?

    • Cluster means and centering
    • Latent vs. observed composites
  2. Cross-level invariance

  3. Construct meanings

18 / 57

Although it is a critique on the level-specific reliability, to be fair

Three Issues

  1. Which "scores" are reliable?

    • Cluster means and centering
    • Latent vs. observed composites
  2. Cross-level invariance

  3. Construct meanings

To be fair, most of these issues have only started getting attentions recently

18 / 57

Although it is a critique on the level-specific reliability, to be fair

Issue 1: "Scores" in Multilevel Studies

First compute a composite of the 4 PATM items

If we use composite PATM to predict student's math achievement, we can compute

19 / 57

Issue 1: "Scores" in Multilevel Studies

First compute a composite of the 4 PATM items

If we use composite PATM to predict student's math achievement, we can compute

IDSCHOOL AS4MAMOR AS4MAENJ AS4MALIK AS4MABORr Z Zb Zw
1 2 2 1 2 7 6.5000 0.5000
1 2 1 1 1 5 6.5000 -1.5000
1 2 1 1 1 5 6.5000 -1.5000
1 2 1 2 1 6 6.5000 -0.5000
1 1 1 1 1 4 6.5000 -2.5000
2 3 2 2 2 9 6.5625 2.4375
2 1 2 2 1 6 6.5625 -0.5625
2 1 1 1 1 4 6.5625 -2.5625
2 3 2 1 1 7 6.5625 0.4375
2 2 2 3 1 8 6.5625 1.4375
19 / 57

Three Sets of Scores

  • Raw/Overall composite PATM (Zij)

  • School means of composite PATM (cluster mean; Zjb)

  • Student deviations from school means (cluster-mean centered; Zijw=ZijZjb)

20 / 57

Three Sets of Scores

  • Raw/Overall composite PATM (Zij)

  • School means of composite PATM (cluster mean; Zjb)

  • Student deviations from school means (cluster-mean centered; Zijw=ZijZjb)

We should compute reliability for each of them

20 / 57

Which Score Is ω~b for?

Is ω~b the reliability of the school means?

21 / 57

Not clear in the original paper

Which Score Is ω~b for?

Is ω~b the reliability of the school means?

Var(Y1b)=(λ1b)2+θ11b

Var(kYkb)=(kλkb)2+kθkkb

ω~b=(kλkb)2(kλkb)2+kθkkb

21 / 57

Not clear in the original paper

But What is Ykb?

Yjkb (in circle) is the latent school mean of item k

  • True/Population mean of all students of school j
22 / 57

Let's say the school has 500 students. The one in circle is the mean of everyone from that school. But the sample may only contain 50 students

May be easier to think in terms of a population mean vs a sample mean

But What is Ykb?

Yjkb (in circle) is the latent school mean of item k

  • True/Population mean of all students of school j

Different from the observed school mean, Y¯.jk=i=1njYijk/nj

  • Mean of students in the sample from school j
22 / 57

Let's say the school has 500 students. The one in circle is the mean of everyone from that school. But the sample may only contain 50 students

May be easier to think in terms of a population mean vs a sample mean

But What is Ykb?

Yjkb (in circle) is the latent school mean of item k

  • True/Population mean of all students of school j

Different from the observed school mean, Y¯.jk=i=1njYijk/nj

  • Mean of students in the sample from school j

Raudenbush and Bryk (2002): Reliability of cluster means

Var(YijkYjkb)=σkkw/nj

22 / 57

Let's say the school has 500 students. The one in circle is the mean of everyone from that school. But the sample may only contain 50 students

May be easier to think in terms of a population mean vs a sample mean

  1. Raudenbush & Bryk also talks about the reliability of the cluster mean, which is not perfect with a finite sample
  2. The observed mean converges to the latent mean when nj

Therefore, ω~b is the internal consistency of a latent composite.

Is that a problem?

23 / 57

Therefore, ω~b is the internal consistency of a latent composite.

Is that a problem?

Let's go back to Y=T+E, where T is a latent variable. What is the reliability of T?

23 / 57

Therefore, ω~b is the internal consistency of a latent composite.

Is that a problem?

Let's go back to Y=T+E, where T is a latent variable. What is the reliability of T?

It should be 1 as T is the true score

23 / 57

Therefore, ω~b is the internal consistency of a latent composite.

Is that a problem?

Let's go back to Y=T+E, where T is a latent variable. What is the reliability of T?

It should be 1 as T is the true score

But if we know the true score, we don't need to worry about reliability

23 / 57

Illustration Using Simulated Data

ψb=ψw=1, nj=10 for all j

Five items

  • λb=0.25, θb=0.1
  • λw=0.5, θw=1
24 / 57

Just to make things more clear, I simulated a data set

Ten observations in each cluster

Illustration Using Simulated Data

ψb=ψw=1, nj=10 for all j

Five items

  • λb=0.25, θb=0.1
  • λw=0.5, θw=1

24 / 57

Just to make things more clear, I simulated a data set

Ten observations in each cluster

Illustration Using Simulated Data

ψb=ψw=1, nj=10 for all j

Five items

  • λb=0.25, θb=0.1
  • λw=0.5, θw=1

Sources of measurement error:

Latent Mean item uniqueness
Observed Mean item uniqueness + sampling error

25 / 57

26 / 57

ηb is the true score at the school level

Left: Correlation between latent score and latent composite

Right: Correlation between latent score and observed composite, which is smaller

ω~b=.76=[Corr(ηb,kYkb)]2

26 / 57

ηb is the true score at the school level

Left: Correlation between latent score and latent composite

Right: Correlation between latent score and observed composite, which is smaller

ω~b=.76=[Corr(ηb,kYkb)]2

However, [Corr(ηb,Zb)]2=.49, as

VError=k=1pθkkb+[(k=1pλkw)2+k=1pθkkw]/n

ωb=.49ω~b

26 / 57

ηb is the true score at the school level

Left: Correlation between latent score and latent composite

Right: Correlation between latent score and observed composite, which is smaller

Overly optimistic information

imagine in a single-level context, saying that the reliability of the instrument was .76, but when it was less than .5

ω~b=.76=[Corr(ηb,kYkb)]2

However, [Corr(ηb,Zb)]2=.49, as

VError=k=1pθkkb+[(k=1pλkw)2+k=1pθkkw]/n

ωb=.49ω~b

For the TIMSS items, ωb=.719, 95% CI [.668, .771]

  • as opposed to ω~b=.977
26 / 57

ηb is the true score at the school level

Left: Correlation between latent score and latent composite

Right: Correlation between latent score and observed composite, which is smaller

Overly optimistic information

imagine in a single-level context, saying that the reliability of the instrument was .76, but when it was less than .5

How About ω~w?

27 / 57

How About ω~w?

  • ω~w is composite reliability of latent-mean-centered scores

    • Also latent variables
    • But algebraically, ω~w=ωw

27 / 57

Issue 2: Cross-Level Loading Invariance

28 / 57

Without Constraints on Loadings

  • ηb: school-level construct, no connection to ηw

  • ηw: purely student-level construct (i.e., ICC = 0)

    • E.g., PATW relative to the school mean

(See e.g., Mehta & Neale, 2005)

29 / 57

Can only compare relative standing, not absolute value

Without Constraints on Loadings

  • ηb: school-level construct, no connection to ηw

  • ηw: purely student-level construct (i.e., ICC = 0)

    • E.g., PATW relative to the school mean

(See e.g., Mehta & Neale, 2005)

29 / 57

Can only compare relative standing, not absolute value

With Cross-Level Invariance

One construct η: ηij=ηjb+ηijw

ICC = ψbψb+ψw

30 / 57

This is more consistent with the way we use cluster means and do centering in MLM

Strong/Scalar Invariance Across Clusters

Implies that θkkb=0 for all ks

  • ω~b=1.0 (Jak, Oort, and Dolan, 2014)

For an individual construct, ω~b is roughly a measure of strong invariance

31 / 57

Construct Meanings

Based on Stapleton, Yang, and Hancock (2016); Stapleton and Johnson (2019)

32 / 57

What is the Target Construct?

  • What is your attitude toward math?

  • What is your attitude toward math, relative to the school norm?

  • What is your school's overall attitude toward math?

33 / 57

Individual/Configural Construct

34 / 57

Individual/Configural Construct

What is your attitude toward math?

Individual construct η

Partitioning: η=ηb+ηw

Configural construct ηb (i.e., true cluster mean)

  • Var(ηb)/Var(η) = ICC

Within-cluster component ηw

34 / 57

Matching Composites and Constructs

  • Individual construct--Raw composite Zij=kYijk
    • VTrue=(k=1pλk)2(ψw+ψb)
    • VError=1Θb1+1Θw1
    • Discussed in Raykov and du Toit (2005)
35 / 57

Matching Composites and Constructs

  • Individual construct--Raw composite Zij=kYijk
    • VTrue=(k=1pλk)2(ψw+ψb)
    • VError=1Θb1+1Θw1
    • Discussed in Raykov and du Toit (2005)
  • Configural construct--Composite cluster mean Zjb=kY¯jk
    • VTrue=(k=1pλk)2ψb
    • VError=1Θb1+[(k=1pλk)2ψw+1Θw1]/n
    • For unbalanced cluster sizes, use the harmonic mean n~
35 / 57

Matching Composites and Constructs

  • Individual construct--Raw composite Zij=kYijk
    • VTrue=(k=1pλk)2(ψw+ψb)
    • VError=1Θb1+1Θw1
    • Discussed in Raykov and du Toit (2005)
  • Configural construct--Composite cluster mean Zjb=kY¯jk
    • VTrue=(k=1pλk)2ψb
    • VError=1Θb1+[(k=1pλk)2ψw+1Θw1]/n
    • For unbalanced cluster sizes, use the harmonic mean n~
  • Within-cluster construct--Composite of deviation scores Zijw=k(YijkY¯jk)
    • VTrue=(k=1pλk)2ψw
    • VError=1Θw1
35 / 57

Replace n with the harmonic mean for unequal cluster sizes

Within-Cluster Construct

36 / 57

Within-Cluster Construct

What is your attitude toward math, relative to the school norm?

Within-cluster construct ηw

Expected ICC = 0

ωw reliability of Zijw

(k=1pλk)2ψw(k=1pλk)2ψw+1Θw1

36 / 57

Shared Construct

37 / 57

Shared Construct

What is your school's attitude toward math?

Shared construct ηb: Cluster-level attribute (aka climate)

ωb reliability of Zjb

  • VTrue=(k=1pλk)2ψb
  • VError=1Θb1+1Σw1/n~

37 / 57

Shared + Configural/Individual Constructs

38 / 57

Shared + Configural/Individual Constructs

What is your school's attitude toward math?

There may be rater acquiescence

Shared construct ηs: School climate

Individual construct ηw: Acquiescence

Configural construct ηb: School means of Acquiescence

38 / 57

Shared + Configural/Individual Constructs

The school-level composite, Zjb, measures both ηs and ηb

ωb(s): construct reliability of Zjb measuring ηs

  • VTrue=(k=1pλks)2ψs
  • VError=(k=1pλk)2(ψb+ψw/n~)
    +1Θb1+1Θw1/n~

39 / 57

Extensions of α

α2l=pp1(kk(σkkb+σkkw)1Σb1+1Σw1)αb=pp1(kkσkkb1Σb1+1Σw1/n~)αw=pp1(kkσkkw1Σw1)

40 / 57

41 / 57

Which One to Report?

  • If a variable is partitioned in a multilevel model (most likely an individual construct), all three (ω2l,ωb,ωw) should be reported
    • Cluster means and cluster-mean centered predictors
    • Outcome variable
42 / 57

Which One to Report?

  • If a variable is partitioned in a multilevel model (most likely an individual construct), all three (ω2l,ωb,ωw) should be reported
    • Cluster means and cluster-mean centered predictors
    • Outcome variable
  • Otherwise, reliability at the corresponding level (ωb or ωw)
42 / 57

Summary (Lai, 2020)

Computing and reporting reliability information is important for multilevel data

43 / 57

Summary (Lai, 2020)

Computing and reporting reliability information is important for multilevel data

Reliability information is needed for raw, cluster means, and cluster-mean centered scores

43 / 57

Summary (Lai, 2020)

Computing and reporting reliability information is important for multilevel data

Reliability information is needed for raw, cluster means, and cluster-mean centered scores

Previous approach to between-level reliability is an overestimate when cluster size is small

43 / 57

Summary (Lai, 2020)

Computing and reporting reliability information is important for multilevel data

Reliability information is needed for raw, cluster means, and cluster-mean centered scores

Previous approach to between-level reliability is an overestimate when cluster size is small

Nature of target construct should be considered, and it has implications on reliability computation

43 / 57
Construct ω2l, α2l ωb, αb ωw, αw ωb(s)
Individual X X X
Configural X
Within-Cluster X
Shared X X
44 / 57

Longitudinal Data

Preliminary ideas. Suggestions are greatly appreciated.

45 / 57

Midlife in the United States

Data from MIDUS 2: Daily Stress Project, 2004-2009 (Ryff and Almeida, 2009)

  • 2,022 participants, 8 days each

  • Target construct: Positive affect

Item Wording
b2dc24 Did you feel attentive?
b2dc25 Did you feel proud?
b2dc26 Did you feel active?
b2dc27 Did you feel confident?
  • Type of scores: raw composite, person means, person-mean centered
46 / 57

From MCFA

Est ICC(η)=.778

Composite Est ω 95% CI
Raw .812 [.801, .822]
Within .609 [.595, .623]
Between .852 [.839, .864]

47 / 57

Can We Incorporate Time?

Cross-Classified CFA (Jeon and Rabe-Hesketh, 2012; Asparouhov and Muthén, 2012)

Assuming cross-level invariance for an individual construct, with decomposition ηti=ηiP+ηtT+ηtiW

48 / 57

Relation to the Generalizability Theory

Most meaningful when participants are measured on the same days/times

Cranford, Shrout, Iida, et al. (2006): generalizability coefficients for diary studies

49 / 57

Not the case for the MIDUS data, as everyone starts on a different day

Relation to the Generalizability Theory

Most meaningful when participants are measured on the same days/times

Cranford, Shrout, Iida, et al. (2006): generalizability coefficients for diary studies

  • Fixed vs. Random item facet (in estimation)
  • Relax the essential parallel test assumption
    • Item-specific loadings and uniqueness
  • Flexible SEM modeling
49 / 57

Not the case for the MIDUS data, as everyone starts on a different day

Some Possible Reliability Coefficients

Reliability of raw scores

  • VTrue=(kλk)2(ψP+ψT+ψW)
  • VError=1(ΘP+ΘT+ΘW)1
50 / 57

Some Possible Reliability Coefficients

Reliability of raw scores

  • VTrue=(kλk)2(ψP+ψT+ψW)
  • VError=1(ΘP+ΘT+ΘW)1

Reliability of person means (across T time points)

  • VTrue=(kλk)2ψP
  • VError=1ΘP1+[(kλk)2ψW+1ΘW1]/T
50 / 57

Some Possible Reliability Coefficients

Reliability of raw scores

  • VTrue=(kλk)2(ψP+ψT+ψW)
  • VError=1(ΘP+ΘT+ΘW)1

Reliability of person means (across T time points)

  • VTrue=(kλk)2ψP
  • VError=1ΘP1+[(kλk)2ψW+1ΘW1]/T

Reliability of deviation from person mean

  • VTrue=(kλk)2(ψT+ψW)
  • VError=1(ΘT+ΘW)1
50 / 57

Person-level (trait-level) variance is not part of true score for the deviation score

In this example, there is essentially no time-level variance

  • E.g., no day of participation effect
Composite Est ω 95% CI
Raw .829 [.820, .837]
Within .646 [.635, .660]
Between .859 [.849, .868]
51 / 57

Many Questions Remain

  1. Linkage to generalizability coefficients by Cranford et al. (2006)

  2. Discrete indicators?

  3. Should constructs at the within-person level and the between-person level be on the same metric?

  4. Are there "shared" constructs at the person level?

    • E.g., Intensively measuring a stable trait?
  5. Reliability of change? (Rogosa, Brandt, and Zimowski, 1982)

    • Related to reliability of within-person deviation?
52 / 57

Thanks!

Slides created via the R package xaringan.

53 / 57

References

Appelbaum, M., H. Cooper, R. B. Kline, et al. (2018). "Journal article reporting standards for quantitative research in psychology". In: American Psychologist 73.1, pp. 3-25. ISSN: 0003066X. DOI: 10.1037/amp0000191.

Asparouhov, T. and B. Muthén "General random effect latent variable modeling: Random subjects, items, contexts, and parameter". In: Advances in multilevel modeling for educational research: Addressing practical issues found in real-world applications. Charlotte, NC: Information Age, pp. 163-192.

Cranford, J. A., P. E. Shrout, M. Iida, et al. (2006). "A procedure for evaluating sensitivity to within-person change: Can mood measures in diary studies detect change reliably?" En. In: Personality and Social Psychology Bulletin 32.7, pp. 917-929. ISSN: 0146-1672, 1552-7433. DOI: 10.1177/0146167206287721. URL: http://journals.sagepub.com/doi/10.1177/0146167206287721 (visited on Nov. 08, 2020).

Geldhof, G. J., K. J. Preacher, and M. J. Zyphur (2014). "Reliability estimation in a multilevel confirmatory factor analysis framework". In: Psychological Methods 19.1, pp. 72-91. ISSN: 1082989X. DOI: 10.1037/a0032138.

54 / 57

References (cont'd)

Jak, S., F. J. Oort, and C. V. Dolan (2014). "Measurement bias in multilevel data". In: Structural Equation Modeling: A Multidisciplinary Journal 21.1, pp. 31-39. ISSN: 1070-5511. DOI: 10.1080/10705511.2014.856694.

Jeon, M. and S. Rabe-Hesketh (2012). "Profile-likelihood approach for estimating generalized linear mixed models with factor structures". En. In: Journal of Educational and Behavioral Statistics 37.4, pp. 518-542. ISSN: 1076-9986, 1935-1054. DOI: 10.3102/1076998611417628. URL: http://journals.sagepub.com/doi/10.3102/1076998611417628 (visited on Nov. 08, 2020).

Knol, M. H., C. V. Dolan, G. J. Mellenbergh, et al. (2016). "Measuring the quality of university lectures: Development and validation of the Instructional Skills Questionnaire (ISQ)". In: PLOS ONE 11.2. Ed. by D. S. Courvoisier, p. e0149163. ISSN: 1932-6203. DOI: 10.1371/journal.pone.0149163.

Lai, M. H. C. (2020). "Composite reliability of multilevel data: It’s about observed scores and construct meanings." En. In: Psychological Methods. ISSN: 1939-1463, 1082-989X. DOI: 10.1037/met0000287. URL: http://doi.apa.org/getdoi.cfm?doi=10.1037/met0000287 (visited on Nov. 08, 2020).

55 / 57

References (cont'd)

Raudenbush, S. W. and A. S. Bryk (2002). Hierarchical linear models: Applications and data analysis methods. 2nd ed. Thousand Oaks, CA: Sage. ISBN: 076191904X.

Raykov, T. and G. A. Marcoulides (2006). "On multilevel model reliability estimation from the perspective of structural equation modeling". In: Structural Equation Modeling: A Multidisciplinary Journal 13.1, pp. 130-141. ISSN: 1070-5511. DOI: 10.1207/s15328007sem1301_7.

Raykov, T. and S. H. C. du Toit (2005). "Estimation of reliability for multiple-component measuring instruments in hierarchical designs". In: Structural Equation Modeling: A Multidisciplinary Journal 12.4, pp. 536-550. ISSN: 1070-5511. DOI: 10.1207/s15328007sem1204_2.

Rogosa, D., D. Brandt, and M. Zimowski (1982). "A growth curve approach to the measurement of change." En. In: Psychological Bulletin 92.3, pp. 726-748. ISSN: 0033-2909. DOI: 10.1037/0033-2909.92.3.726. URL: http://content.apa.org/journals/bul/92/3/726 (visited on Nov. 08, 2020).

56 / 57

References (cont'd)

Rush, J. and S. M. Hofer (2014). "Differences in within- and between-person factor structure of positive and negative affect: Analysis of two intensive measurement studies using multilevel structural equation modeling.". In: Psychological Assessment 26.2, pp. 462-473. ISSN: 1939-134X. DOI: 10.1037/a0035666.

Ryff, C. D. and D. M. Almeida (2009). Midlife in the United States (MIDUS 2): Daily Stress Project, 2004-2009: Version 2. En. type: dataset. DOI: 10.3886/ICPSR26841.V2. URL: http://www.icpsr.umich.edu/icpsrweb/NACDA/studies/26841/version/2 (visited on Nov. 08, 2020).

Stapleton, L. M. and T. L. Johnson (2019). "Models to examine the validity of cluster-level factor structure using individual-level data". In: Advances in Methods and Practices in Psychological Science, p. 251524591985503. ISSN: 2515-2459. DOI: 10.1177/2515245919855039.

Stapleton, L. M., J. S. Yang, and G. R. Hancock (2016). "Construct meaning in multilevel settings". In: Journal of Educational and Behavioral Statistics 41.5, pp. 481-520. ISSN: 1076-9986. DOI: 10.3102/1076998616646200.

57 / 57

Outline

Reliability in factor analysis

2 / 57

Such an honor to be here. Co-chairs of my dissertation are both graduates of ASU, and they would share with me how good the program at ASU is

Lineage of ASU Quant . . .

Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow