Schedule
Please, check out this schedule frequently as it will likely change a bit throughout the quarter.
Legend:
- PA0 stands for Programming Assignment Primer
- PA stands for Programming Assignment
- DSP stands for Data Science Project
- SR stands for Issue Report
- R stands for reading assignment
- IP stands for individual quarter-long project
#Lecture | Date | Lecture | Keywords | Readings | Important Dates |
---|---|---|---|---|---|
1 | 03/21 | Course Overview and Introduction to the Data Science Process | Data science lifecycle. Ethics, fairness, responsibility, and privacy issues. | Reading 1.1: John P. A. Ioannidis Why Most Published Research Findings Are False PLOS Medicine. 2005 Reading 1.2: Michael Jordan Artificial Intelligence: The Revolution Hasn’t Happened Yet. HDSR 2019. | PA0 assigned SR assigned R1 assigned IP assigned |
2 | 03/23 | Pitfalls in Inferential Statistics | Multiple hypothesis, Bonferroni correction, false discovery rate, statistical vs practical significance | ||
3 | 03/28 | Data Context and Quality | collection, preparation, cleaning, missing data | Reading 2.1: Mark D. Wilkinson et al. The FAIR Guiding Principles for scientific data management and stewardship. Nature Scientific Data. 2016 Reading 2.2: Stephen Stigler. Data Have a Limited Shelf Life. HDSR 2019. | R1 due PA0 due R2 assigned PA assigned DSP assigned |
4 | 03/30 | Causality and Experiments 1/2 | causal models, experiments (RCT) | ||
5 | 04/04 | Causality and Experiments 2/2 | causal inference from observational data, human subjects, AB testing, experimental design | Reading 3.1: Department of Health, Education, and Welfare. The Belmont Report. April 18, 1979. Reading 3.2 Robert Bond, Christopher Fariss et al.A 61-million-person experiment in social influence and political mobilization. Nature 2012. | R2 due R3 assigned |
6 | 04/06 | No class – Individual Project Work | |||
7 | 04/11 | Introduction to Machine Learning 1/2 | optimization vs generalization, training and test data, models, learning | Reading 4.1: Nithya Sambasivan et al. Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI. CHI 2021. 4.2 Wendy Parker Model Evaluation: An Adequacy-for-Purpose View. 2022 (read the introduction and (optionally) the rest) | R3 due R4 assigned |
8 | 04/13 | Machine Learning in the Wild 2/2 | training data, feature engineering, information leakage, concept drift, algorithmic decision making | ||
9 | 04/18 | Fairness and Interpretability in Machine Learning | fairness definitions | Reading 5.1 Deirdre K. Mulligan, Joshua A. Kroll, Nitin Kohli, Richmond Y. Wong This Thing Called Fairness: Disciplinary Confusion Realizing a Value in Technology CSCW 2019 Reading 5.2 Julia Angwin, Jeff Larson, Surya Mattu, Lauren Kirchner. Machine Bias. ProPublica, May 23, 2016 | R4 due R5 assigned |
10 | 04/20 | Discussion 1/2 | |||
11 | 04/25 | Visualization and Communication | packaging data products, reproducibility, repeatibility, visualization, communication | R5 due | |
12 | 04/27 | Introduction to Privacy 1/2 | privacy definitions, law, technology | ||
13 | 05/02 | Introduction to Privacy 2/2 | data anonymization and deanonymization, k-anonimity, attacks, indigenous data sovereignty | Reading 6: Shoshana Zuboff. Big other: surveillance capitalism and the prospects of an information civilization. Journal of Information Technology 2015. | R6 assigned |
14 | 05/04 | Statistical Data Privacy | differential privacy, sensitivity | ||
15 | 05/09 | Data Flows, Lifecyles, Data Markets | provenance, right to be forgotten, data portability, data brokers, data ownership, value of data, data unions, cooperatives, strikes | Reading 7 . Edith Ramirez, Julie Brill, Maureen K. Ohlhausen, Joshua D. Wright, Terrell McSweeny Data Brokers: A call for transparency and accountability. Federal Trade Commission, May, 2014 (Read Executive Summary and then Section 4 “Types of Products”) | R6 due R7 assigned SR due |
16 | 05/11 | Course Summary | summary and discussion | PA due | |
17 | 05/16 | SR Presentation (Selection of students) | R7 due IP due DSP due | ||
18 | 05/18 | Poster Presentation |