Date | Lecture | Required Reading | Reading Assignments | Project |
Apr 8 | Lecture 1 Biases and Fairness slides, video | Barocas: Ch 2 Bishop: Ch 8.2 | - On Formalizing Fairness in Prediction with Machine Learning, ArXiv 2017
- k-NN as an implementation of situation testing for discrimination discovery and prevention, SIGKDD 2011
- Equality of Opportunity in Supervised Learning, NeurIPS 2016
- Discrimination-aware Data Mining, SIGKDD 2008
- Fairness Constraints: Mechanisms for Fair Classification, AISTATS 2017
| Project Guideline Out |
Apr 10 | Lecture 2 Fair Representation Learning slides, video | |
- Data decisions and theoretical implications when adversarially learning fair representations, FAT 2017
- Inherent trade-offs in the fair determination of risk scores, ArXiv 2016
- Fairgan: Fairness-aware generative adversarial networks. IEEE Big Data 2018
- Flexibly fair representation learning by disentanglement, ICML 2019
- Wasserstein Fair Classification. UAI 2019
| |
Apr 15 | Lecture 3 Interpretability and Transparency slides, video | Molnar: Ch 2, Ch 4 | - The mythos of model interpretability. Queue 2018
- Peeking inside the black-box: A survey on Explainable Artificial Intelligence (XAI), IEEE Access 2018
- Towards a rigorous science of interpretable machine learning. Arxiv 2017
- Falling rule lists. AISTATS 2015
- Discovering interpretable representations for both deep generative and discriminative models, ICML 2018
| |
Apr 17 | Lecture 4 Post Hoc Interpretability and Proxy Models slides, video | Molnar: Ch 5.7, Ch 5.8 | - "Why should i trust you?" Explaining the predictions of any classifier, SIGKDD 2016
- Interpretable decision sets: A joint framework for description and prediction, SIGKDD 2016
- Distilling knowledge from deep networks with applications to healthcare domain, Arxiv 2015
- Model agnostic supervised local explanations, NeurIPS 2018
- Explaining classifications for individual instances. IEEE Transactions on Knowledge and Data Engineering 2008
| |
Apr 22 | Lecture 5 Feature Interaction for interpretability slides, video | Molnar: Ch 5.9, Ch 5.10 | - Towards Efficient Data Valuation Based on the Shapley Value, ICML 2019
- A Game Theoretic Approach to Class-wise Selective Rationalization, NeurIPS 2019
- Granger-causal attentive mixtures of experts: Learning important features with neural networks, AAAI 2019
- GNNExplainer: Generating explanations for graph neural networks, NeurIPS 2019
- Ancona, Marco, Cengiz Öztireli, and Markus Gross. Explaining deep neural networks with a polynomial time algorithm for shapley values approximation, ICML 2019
| |
Apr 24 | Lecture 6 Example and Visualization Based Methods for Interpretability slides, video | Molnar: Ch 6 | - Counterfactual explanations without opening the black box: Automated decisions and the GDPR, Harv. JL & Tech, 2017
- INVASE: Instance-wise variable selection using neural networks, ICLR 2018
- On Concept-Based Explanations in Deep Neural Networks, arXiv 2019
- Efficient Data Representation by Selecting Prototypes with Importance Weights, ICDM 2019
- The bayesian case model: A generative approach for case-based reasoning and prototype classification. NeurIPS 2014
| Project Proposal Due |
Apr 29 | Lecture 7 Interpreting deep neural networks slides, video | |
- Learning how to explain neural networks: PatternNet and PatternAttribution, ICLR 2018
- Towards robust interpretability with self-explaining neural networks, NeurIPS 2018
- Visualizing and understanding convolutional networks, ECCV 2014
- Real time image saliency for black box classifiers, NeurIPS 2017
- Sanity checks for saliency maps, NeurIPS 2018
| |
May 1 | Lecture 8 Fairness Through Input Manipulation slides, video | |
- From parity to preference-based notions of fairness in classification, NeurIPS 2017
- A reductions approach to fair classification, ICML 2018
- On fairness and calibration, NeurIPS 2017
- Predict responsibly: improving fairness and accuracy by learning to defer, NeurIPS 2018
- A common framework to provide explanations and analyse the fairness and robustness of black-box models, AIES 2020
| |
May 6 | Lecture 9 Fair NLP slides, video | |
- Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them, NAACL 2019
- Gender Bias in Contextualized Word Embeddings, NAACL 2019
- Understanding the Origins of Bias in Word Embeddings, ICML 2019
- The Woman Worked as a Babysitter: On Biases in Language Generation, EMNLP 2019
- The risk of racial bias in hate speech detection, ACL 2019
| |
May 8 | Lecture 10 Fairness for Vision Representations slides, video | |
- Counterfactual fairness, NeurIPS 2017
- Men also like shopping: Reducing gender bias amplification using corpus-level constraints, EMNLP 2017
- Feature transfer learning for face recognition with under-represented data, CVPR 2019
- Fairness of exposure in rankings, KDD 2017
- Gender shades: Intersectional accuracy disparities in commercial gender classification, FAccT 2018
| |
May 13 | Lecture 11 Mid-way Presentation | | | Midterm Report Due |
May 15 | Lecture 12 Robustness and Adversarial Attacks slides, video1, video2 | |
- Darts: Deceiving autonomous cars with toxic signs. arXiv 2018
- Adversarial examples are not bugs, they are features, NeurIPS 2019
- Synthesizing robust adversarial examples, ICML 2018
- Universal adversarial perturbations, CVPR 2017
- Deepfool: a simple and accurate method to fool deep neural networks, CVPR 2016
| |
May 20 | Lecture 13 Adversarial Defense slides, video | | - On detecting adversarial perturbations, ICLR 2017
- Certified defenses against adversarial examples, ICLR 2018
- Certified adversarial robustness via randomized smoothing, ICML 2019
- Defense-gan: Protecting classifiers against adversarial attacks using generative models, ICLR 2018
- Ensemble adversarial training: Attacks and defenses, ICLR 2018
| |
May 22 | Lecture 14 Fair Causal Reasoning slides, video | | - PC-Fairness: A Unified Framework for Measuring Causality-based Fairness, NeurIPS 2019
- Path-specific counterfactual fairness, AAAI 2019
- Envy-free classification, NeurIPS 2019
- Causal inference for social discrimination reasoning, Journal of Intelligent Information Systems 2019
- Equality of opportunity in classification: A causal approach, NeurIPS 2018
| |
May 27 | Lecture 15 (cancaled) | | - Conditional learning of fair representations, ICLR 2020
- Inherent tradeoffs in learning fair representations, NeurIPS 2019
- A Geometric Solution to Fair Representations, AAAI/ACM AI, Ethics, and Society 2020
- Learning Certified Individually Fair Representations, arXiv 2020
- A general approach to fairness with optimal transport, AAAI 2020
| |
May 29 | Lecture 16 Disentangled Fair Representation and ML Auditing slides, video | | - Concrete problems in AI safety, arXiv 2016
- Inherent trade-offs in the fair determination of risk scores, arXiv 2016
- The concept of fairness in the GDPR: a linguistic and contextual interpretation, FAccT 2020
- European Union regulations on algorithmic decision-making and a “right to explanation”, AI magazine 2017
- AI Fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias, arXiv 2018
| |
Jun 3 | Lecture 17 Privacy slides, video | | - Practical secure aggregation for privacy-preserving machine learning, ACM SIGSAC Conference on Computer and Communications Security, 2017
- Federated learning: Strategies for improving communication efficiency, arXiv 2016
- Towards federated learning at scale: System design, SysML 2019
- Federated multi-task learning, NeurIPS 2017
- Calibrating noise to sensitivity in private data analysis, Theory of cryptography conference 2006
| |
Jun 5 | Final Project Presentation | | | |
Jun 10 | Final Project Presentation | | | |
Jun 12 | | | | 11:59PM PT: Final Report Due
|