The Teaching Dimension of Kernel Perceptron

Abstract

Algorithmic machine teaching has been studied under the linear setting where exact teaching is possible. However, little is known for teaching nonlinear learners. Here, we establish the sample complexity of teaching, aka teaching dimension, for kernelized perceptrons for different families of feature maps. As a warm-up, we show that the teaching complexity is $\Theta(d)$ for the exact teaching of linear perceptrons in $\mathbb{R}^d$, and $\Theta(d^k)$ for kernel perceptron with a polynomial kernel of order $k$. Furthermore, under certain smooth assumptions on the data distribution, we establish a rigorous bound on the complexity for approximately teaching a Gaussian kernel perceptron. We provide numerical examples of the optimal (approximate) teaching set under several canonical settings for linear, polynomial and Gaussian kernel perceptions.

Cite

Text

Kumar et al. "The Teaching Dimension of Kernel Perceptron." Artificial Intelligence and Statistics, 2021.

Markdown

[Kumar et al. "The Teaching Dimension of Kernel Perceptron." Artificial Intelligence and Statistics, 2021.](https://mlanthology.org/aistats/2021/kumar2021aistats-teaching/)

BibTeX

@inproceedings{kumar2021aistats-teaching,
  title     = {{The Teaching Dimension of Kernel Perceptron}},
  author    = {Kumar, Akash and Zhang, Hanqi and Singla, Adish and Chen, Yuxin},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2021},
  pages     = {2071-2079},
  volume    = {130},
  url       = {https://mlanthology.org/aistats/2021/kumar2021aistats-teaching/}
}