Slice-Based Learning: A Programming Model for Residual Learning in Critical Data Slices
Abstract
In real-world machine learning applications, certain data subsets correspond to especially critical outcomes: vulnerable cyclist detections are safety-critical in an autonomous driving task, and "question" sentences might be important to a dialogue agent's language understanding for product purposes. While machine learning models can achieve high-quality performance on coarse-grained metrics like F1-score and overall accuracy, they may underperform on these critical subsets. We define these subsets as slices, the key abstraction in our approach. To address slice-level performance, practitioners often train separate "expert" models on slice subsets or use multi-task hard parameter sharing. We propose Slice-based Learning, a new programming model in which the slicing function (SF), a programmer abstraction, is used to specify additional model capacity for each slice. Any model can leverage SFs to learn slice-specific representations, which are combined with an attention mechanism to make slice-aware predictions. We show that our approach improves over baselines in computational complexity, in slice-specific performance by up to 19.0 points, and in overall performance by up to 4.6 F1 points, on applications spanning natural language understanding and computer vision benchmarks as well as production-scale industrial systems.
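To make the slicing-function abstraction concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It assumes an SF is a boolean predicate over a single example, and it stands in for the attention mechanism with a simple softmax-weighted sum of per-slice representation vectors. All names here (`sf_question`, `sf_short`, `slice_membership`, `combine`) and the three-word threshold are hypothetical and chosen only for illustration.

```python
import math
from typing import Callable, List, Sequence


# Hypothetical slicing functions: each maps one example to True/False.
def sf_question(text: str) -> bool:
    """Heuristic slice: sentences that look like questions."""
    return text.strip().endswith("?")


def sf_short(text: str) -> bool:
    """Heuristic slice: very short utterances (the threshold is arbitrary)."""
    return len(text.split()) <= 3


def slice_membership(
    texts: Sequence[str], sfs: Sequence[Callable[[str], bool]]
) -> List[List[int]]:
    """Return one 0/1 row per example, with one column per slicing function."""
    return [[int(sf(t)) for sf in sfs] for t in texts]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine(
    representations: Sequence[Sequence[float]], scores: Sequence[float]
) -> List[float]:
    """Softmax-weighted sum of per-slice representation vectors.

    This is a simplified stand-in for attention; how the scores are
    produced is left open and is an assumption of this sketch.
    """
    weights = softmax(scores)
    dim = len(representations[0])
    return [
        sum(w * r[i] for w, r in zip(weights, representations))
        for i in range(dim)
    ]


texts = ["Where is my order?", "Thanks", "Please cancel the subscription today"]
membership = slice_membership(texts, [sf_question, sf_short])
```

In this sketch, `membership` marks which examples fall into which slice, and `combine` shows how several slice-specific vectors could be merged into one slice-aware vector once each slice has been assigned a relevance score.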
Cite
Text
Chen et al. "Slice-Based Learning: A Programming Model for Residual Learning in Critical Data Slices." Neural Information Processing Systems, 2019.
Markdown
[Chen et al. "Slice-Based Learning: A Programming Model for Residual Learning in Critical Data Slices." Neural Information Processing Systems, 2019.](https://mlanthology.org/neurips/2019/chen2019neurips-slicebased/)
BibTeX
@inproceedings{chen2019neurips-slicebased,
title = {{Slice-Based Learning: A Programming Model for Residual Learning in Critical Data Slices}},
author = {Chen, Vincent and Wu, Sen and Ratner, Alexander J and Weng, Jen and Ré, Christopher},
booktitle = {Neural Information Processing Systems},
year = {2019},
pages = {9397--9407},
url = {https://mlanthology.org/neurips/2019/chen2019neurips-slicebased/}
}