empirical-methods

Homepage for 17-803 "Empirical Methods" at Carnegie Mellon University


Project maintained by bvasiles Hosted on GitHub Pages — Theme by mattgraham

Empirical Methods (thanks for the promo, @JoshQuicksall!)

This is the Spring 2026 offering of this course. For older versions, see here: Spring 2024Fall 2022Spring 2021Fall 2018.

Overview

Empirical methods play a key role in the design and evaluation of tools and technologies, and in testing the social and technical theories they embody. No matter what your research area is, chances are you will be conducting some empirical studies as part of your work. Are you looking to evaluate a new algorithm? New tool? Analyze (big) data? Understand what challenges practitioners face in some domain?

This course is a survey of empirical methods designed primarily for computer science PhD students, that teaches you how to go about each of these activities in a principled and rigorous way. You will learn about and get hands-on experience with a core of qualitative and quantitative empirical research methods, including interviews, qualitative coding, survey design, and many of the most useful statistical analyses of (large-scale) data, such as various forms of regression, time series analysis, and causal inference. And you will learn how to design valid studies applying and combining these methods.

There will be extensive reading with occasional student presentations about the reading in class, homework assignments, and a semester-long research project for which students must prepare in-class kickoff and final presentations as well as a final report.

After completing this course, you will:

As a side effect, this course helps you develop a healthy dose of skepticism towards scientific results in general. Does the study design really allow the authors to make certain claims? Does the analysis technique? Is the evidence provided as strong as it could be? Are there fundamental flaws and threats to validity?

Coordinates

Course Syllabus and Policies

The syllabus covers course overview and objectives, evaluation, time management, late work policy, and collaboration policy.

Learning Goals

The learning goals describe what I want students to know or be able to do by the end of the semester. I evaluate whether learning goals have been achieved through assignments, written project reports, and in-class presentations.

Schedule

Below is a preliminary schedule for Spring 2026. Each link points to a dedicated page with materials and more details. All videos are published on this YouTube channel.

Note: The schedule is subject to change and will be updated as the semester progresses.

Date Topic Notes
Tue, Jan 13 No class (Bogdan out)  
Thu, Jan 15 No class (Bogdan out)  
Tue, Jan 20 Introduction slidesvideo
Thu, Jan 22 Formulating research questions slidesvideo
Tue, Jan 27 Literature review slidesvideo
Thu, Jan 29 Role of Theory slidesvideo
Tue, Feb 3 Exemplar interview papers  
Thu, Feb 5 Conducting interviews  
Tue, Feb 10 Qualitative data analysis  
Thu, Feb 12 In-class activity: qualitative coding & thematic analysis  
Tue, Feb 17 Qualitative analysis in the age of LLMs  
Thu, Feb 19 Types of errors in probability survey research  
Tue, Feb 24 Questionnaire design and multi-item scales  
Thu, Feb 26 Causal relationships and experimental design  
Tue, Mar 3 Spring break, no class  
Thu, Mar 5 Spring break, no class  
Tue, Mar 10 Experimental design papers  
Thu, Mar 12 Interaction effects  
Tue, Mar 17 Regression modeling diagnostics  
Thu, Mar 19 In-class activity: Galton families regression  
Tue, Mar 24 In-class activity: Simpson’s paradox  
Thu, Mar 26 Interrupted time series design  
Tue, Mar 31 Interrupted time series analysis with control  
Thu, Apr 2 In-class activity: ITS with control  
Tue, Apr 7 Dealing with controversy  
Thu, Apr 9 Carnival, no class  
Tue, Apr 14 Guest lecture (Bogdan @ICSE)  
Thu, Apr 16 Guest lecture (Bogdan @ICSE)  
Tue, Apr 21 Final presentations (I)  
Thu, Apr 23 Final presentations (II)