Some common misunderstandings about randomization

28 Aug, 2018 at 14:22 | Posted in Statistics & Econometrics | Comments Off

Randomization is an alternative when we do not know enough to control, but is generally inferior to good control when we do. We suspect that at least some of the popular and professional enthusiasm for RCTs, as well as the belief that they are precise by construction, comes from misunderstandings about … random or realized confounding on the one hand and confounding in expectation on the other …

The RCT strategy is only successful if we are happy with estimates that are arbitrarily far from the truth, just so long as the errors cancel out over a series of imaginary experiments. In reality, the causality that is being attributed to the treatment might, in fact, be coming from an imbalance in some other cause in our particular trial; limiting this requires serious thought about possible covariates.

Angus Deaton & Nancy Cartwright

The point of making a randomized experiment is often said to be that it ‘ensures’ that any correlation between a supposed cause and effect indicates a causal relation. This is believed to hold since randomization (allegedly) ensures that a supposed causal variable does not correlate with other variables that may influence the effect.

The problem with that simplistic view on randomization is that the claims made are both exaggerated and false:

• Even if you manage to do the assignment to treatment and control groups ideally random, the sample selection certainly is — except in extremely rare cases — not random. Even if we make a proper randomized assignment, if we apply the results to a biased sample, there is always the risk that the experimental findings will not apply. What works ‘there,’ does not work ‘here.’ Randomization hence does not ‘guarantee ‘ or ‘ensure’ making the right causal claim. Although randomization may help us rule out certain possible causal claims, randomization per se does not guarantee anything!

• Even if both sampling and assignment are made in an ideal random way, performing standard randomized experiments only give you averages. The problem here is that although we may get an estimate of the ‘true’ average causal effect, this may ‘mask’ important heterogeneous effects of a causal nature. Although we get the right answer of the average causal effect being 0, those who are ‘treated’ may have causal effects equal to -100 and those ‘not treated’ may have causal effects equal to 100. Contemplating being treated or not, most people would probably be interested in knowing about this underlying heterogeneity and would not consider the average effect particularly enlightening.

• There is almost always a trade-off between bias and precision. In real-world settings, a little bias often does not overtrump greater precision. And — most importantly — in case we have a population with sizeable heterogeneity, the average treatment effect of the sample may differ substantially from the average treatment effect in the population. If so, the value of any extrapolating inferences made from trial samples to other populations is highly questionable.

• Since most real-world experiments and trials build on performing a single randomization, what would happen if you kept on randomizing forever, does not help you to ‘ensure’ or ‘guarantee’ that you do not make false causal conclusions in the one particular randomized experiment you actually do perform. It is indeed difficult to see why thinking about what you know you will never do, would make you happy about what you actually do.

Randomization is not a panacea. It is not the best method for all questions and circumstances. Proponents of randomization make claims about its ability to deliver causal knowledge that are simply wrong. There are good reasons to be sceptical of the now popular — and ill-informed — view that randomization is the only valid and best method on the market. It is not.

Blog at WordPress.com.
Entries and Comments feeds.

	fredtorssander on DSGE models — worse than…
	Jorge Morales Meoqui on DSGE models — worse than…
	Lars Syll on DSGE models — worse than…
	Jorge Morales Meoqui on DSGE models — worse than…
	Lars Syll on DSGE models — worse than…
	Jorge Morales Meoqui on DSGE models — worse than…
	rsm on The ‘Just One More…
	Jan Milch on Keynes — en ständigt akt…
	rsm on Brownian motion (student …
	Nanikore on The total incompetence of peop…
	Bruce Wilder on The total incompetence of peop…
	rsm on Ergodicity — a questiona…
	Edward Fullbrook on Susan Neiman on why left is no…
	rsm on The non-existence of economic…
	fredtorssander on The non-existence of economic…

LARS P. SYLL

Some common misunderstandings about randomization

Recent Posts

Comments Policy

Recent Comments

Reading List

Categories

Archives