
L11: Causal Relationships (pdf, video)


This lecture is the second part of a series on designing experiments. We discussed what it means for something to be a “cause” or an “effect,” the three ingredients needed to establish a causal relationship (the cause precedes the effect, the cause covaries with the effect, and no plausible alternative explanation accounts for that covariation), and how experiments, as a research method, are particularly well suited to establishing causal relationships.

We also discussed and critiqued two example studies using experiments: a true (randomized) experiment in the Tomkins et al. study of double-blind reviewing at the Conference on Web Search and Data Mining (WSDM), and a quasi-experiment in the Sobel & Clarkson study of teaching formal methods.
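As a toy illustration of the design feature that separates the two studies, the sketch below randomly assigns a hypothetical reviewer pool to single-blind and double-blind conditions, the step that makes the Tomkins et al. design a true experiment. The pool, its size, and the even split are invented for illustration; this is not the authors' procedure or code.

```python
# Toy sketch of random assignment, the defining feature of a true experiment.
# The reviewer pool and the 50/50 split are hypothetical, not taken from
# Tomkins et al. (2017).
import random

rng = random.Random(42)  # fixed seed so the toy example is reproducible

reviewers = [f"reviewer_{i:02d}" for i in range(20)]  # hypothetical pool
rng.shuffle(reviewers)

half = len(reviewers) // 2
single_blind = reviewers[:half]   # condition 1: author identities visible
double_blind = reviewers[half:]   # condition 2: author identities hidden

# Because assignment is random, systematic differences in outcomes between
# the groups (e.g., in bidding or acceptance recommendations) can be
# attributed to the reviewing condition rather than to reviewer
# self-selection, which is the guarantee a quasi-experiment lacks.
print(f"{len(single_blind)} single-blind, {len(double_blind)} double-blind reviewers")
```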

Lecture Readings

Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and quasi-experimental designs for generalized causal inference. Wadsworth Publishing.

The discussion of cause as an INUS condition – an “insufficient but nonredundant part of an unnecessary but sufficient condition” – follows Chapter 1 of the book (“Experiments and Generalized Causal Inference”).
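As a sketch of the idea in symbols (my notation, using Mackie's classic short-circuit example, not necessarily the one in the chapter): write E for a house fire and C for a short circuit. C is an INUS condition for E because it sits inside one of several jointly sufficient, individually unnecessary conjunctions:

```latex
% INUS sketch: C (short circuit) as a cause of E (house fire).
% A, B, D, F are illustrative placeholders, not notation from the book.
E \impliedby
    \underbrace{(A \land B \land C)}_{\text{sufficient for } E,\ \text{not necessary}}
    \;\lor\; (D \land F) \;\lor\; \dots
% A = flammable material nearby, B = no working sprinkler,
% D, F = parts of some other route to a fire (e.g., a dropped cigarette).
% C alone is insufficient for E; dropping C from (A ∧ B ∧ C) breaks that
% route, so it is nonredundant; and the conjunction as a whole is
% unnecessary, because the other disjuncts can also bring about E.
```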


Sobel, A. E. K., & Clarkson, M. R. (2002). Formal methods application: An empirical tale of software development. IEEE Transactions on Software Engineering, 28(3), 308-320.

Berry, D. M., & Tichy, W. F. (2003). Comments on “Formal methods application: an empirical tale of software development”. IEEE Transactions on Software Engineering, 29(6), 567-571.

Sobel, A. E. K., & Clarkson, M. R. (2003). Response to “Comments on ‘Formal methods application: an empirical tale of software development’”. IEEE Transactions on Software Engineering, 29(6), 572-575.

A rare example of an academic feud over the validity of a study design that played out in public. Read the papers in the order listed above: the second paper is a critique of the original study design, and the third is the original authors’ response.


Tomkins, A., Zhang, M., & Heavlin, W. D. (2017). Single versus double blind reviewing at WSDM 2017. arXiv preprint arXiv:1702.00502.

A nice example of a randomized experiment carried out to assess the impact of single- versus double-blind reviewing of conference papers. The paper reports that: