Study-design reference

The design families side by side — what each controls, claims, and leaves open

This one-page reference lays the course’s design families next to each other. The questions to ask of any study are always the same: what is the unit of analysis? was there random sampling? was there random assignment? what claim does that license? what threat is left open? All example numbers come from the synthetic recurring studies and are illustrative.

The design families at a glance

Design	Random assignment?	Random sampling?	Strongest claim it supports	Main threat it leaves open
Completely randomized experiment	yes	usually no	a causal effect for units like these	external validity (generalization)
Randomized block / paired design	yes (within blocks/pairs)	usually no	a causal effect, more precisely estimated	external validity; blocking on the wrong variable
Factorial experiment	yes	usually no	causal main effects and interactions	reading a main effect when an interaction is large
Observational study (with adjustment)	no	varies	an adjusted association, causal only under the measured-confounder assumption	unmeasured confounding; bad controls
Survey (SRS / stratified / cluster)	no	yes	a population estimate (with uncertainty)	coverage error; nonresponse bias

The throughline: assignment buys causation; sampling buys generalization. No single design buys both for free, which is why real evidence is usually assembled from several.

Experiments: sharpening and structuring the comparison

Design	What it does	Effect on the estimate	Worked anchor (synthetic)
Completely randomized (CRD)	assign at random; compare group means	the baseline	Focus experiment: \(d = 3.0\), \(\operatorname{SE} \approx 1.55\), randomization \(p \approx 0.06\)
Blocking (RCBD)	randomize within pre-treatment blocks	same effect, smaller SE	\(\operatorname{SE} \approx 1.03\), \(p \approx 0.005\)
Pairing / matching	compare within pairs / within unit	same effect, smaller SE	paired \(\operatorname{SE} \approx 0.91\), \(p \approx 0.003\)
Factorial (2×2)	two factors at once	main effects and interaction	cells \(5, 8, 7, 13\); \(A = +4.5\), \(B = +3.5\), \(AB = +1.5\)

Blocking and pairing change the standard error, not the effect: in the Focus experiment the effect stays \(d = 3.0\) while the SE falls from \(1.55\) (CRD) to \(1.03\) (blocked) to \(0.91\) (paired).

Observational evidence: adjustment and its limits

When there is no random assignment, groups differ at baseline. Adjustment (stratification or regression) compares within levels of a measured confounder to close a backdoor path. In the tutoring-center study the naive user-vs-nonuser difference is \(+8.0\); adjusting for prior ability shrinks it to \(+3.0\). Adjustment is only as good as the confounders you measured — adjust for pre-treatment confounders, never for a post- treatment mediator or collider (a bad control). See the causal-diagram guide.

Survey sampling: precision by design

Sampling design	Cost	Variance vs SRS	Worked anchor (synthetic)
Simple random (SRS)	baseline	baseline	\(\hat p = 0.45\), \(\operatorname{SE} \approx 0.026\)
Stratified	similar	lower (deff < 1) when strata differ	deff \(\approx 0.72\), \(\operatorname{SE} \approx 0.022\)
Cluster	cheaper	higher (deff > 1)	deff \(\approx 2.4\), \(\operatorname{SE} \approx 0.041\)

A cluster sample must not be analyzed as if it were an SRS — that ignores the design effect and understates the uncertainty. And no amount of sampling precision fixes nonresponse: in the study-habits survey the sampling CI is \((0.40, 0.50)\), but the nonresponse sensitivity bound is \([0.27, 0.67]\) — far wider.

The questions to ask of any study

What is the question, and the unit of analysis?
Was there random sampling? (If so, you can generalize to the frame’s population.)
Was there random assignment? (If so, you can make a causal claim for units like these.)
What are the threats? (Selection, measurement, confounding, attrition, nonresponse, bad controls.)
What can the design support — and what would strengthen it? (This is the design memo.)

This page is a study reference. For graded specifics — deadlines, submissions, and policies — Blackboard (the LMS) is authoritative.