User profiles for Pang Wei Koh

Pang Wei Koh

University of Washington
Verified email at cs.washington.edu
Cited by 17292

Wilds: A benchmark of in-the-wild distribution shifts

PW Koh, S Sagawa, H Marklund… - International …, 2021 - proceedings.mlr.press
Distribution shifts—where the training distribution differs from the test distribution—can
substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. …

Just train twice: Improving group robustness without training group information

…, AS Chen, A Raghunathan, PW Koh… - International …, 2021 - proceedings.mlr.press
Standard training via empirical risk minimization (ERM) can produce models that achieve low
error on average but high error on minority groups, especially in the presence of spurious …

Distributionally robust neural networks for group shifts: On the importance of regularization for worst-case generalization

S Sagawa, PW Koh, TB Hashimoto, P Liang - arXiv preprint arXiv …, 2019 - arxiv.org
Overparameterized neural networks can be highly accurate on average on an iid test set yet
consistently fail on atypical groups of the data (e.g., by learning spurious correlations that …

Understanding black-box predictions via influence functions

PW Koh, P Liang - International conference on machine …, 2017 - proceedings.mlr.press
How can we explain the predictions of a black-box model? In this paper, we use influence
functions—a classic technique from robust statistics—to trace a model’s prediction through the …

On the opportunities and risks of foundation models

…, G Keeling, F Khani, O Khattab, PW Koh… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Concept bottleneck models

PW Koh, T Nguyen, YS Tang… - International …, 2020 - proceedings.mlr.press
We seek to learn models that we can interact with using high-level concepts: if the model
did not think there was a bone spur in the x-ray, would it still predict severe arthritis? State-of-the…

Mobility network models of COVID-19 explain inequities and inform reopening

S Chang, E Pierson, PW Koh, J Gerardin, B Redbird… - Nature, 2021 - nature.com
The coronavirus disease 2019 (COVID-19) pandemic markedly changed human mobility
patterns, necessitating epidemiological models that can capture the effects of these changes in …

Stronger data poisoning attacks break data sanitization defenses

PW Koh, J Steinhardt, P Liang - Machine Learning, 2022 - Springer
Machine learning models trained on data from the outside world can be corrupted by
data poisoning attacks that inject malicious points into the models’ training sets. A common …

Accuracy on the line: on the strong correlation between out-of-distribution and in-distribution generalization

…, A Raghunathan, S Sagawa, PW Koh… - International …, 2021 - proceedings.mlr.press
For machine learning systems to be reliable, we must understand their performance in
unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution …

Peer and self assessment in massive online classes

C Kulkarni, KP Wei, H Le, D Chia… - ACM Transactions on …, 2013 - dl.acm.org
Peer and self-assessment offer an opportunity to scale both assessment and learning to global
classrooms. This article reports our experiences with two iterations of the first large online …