Dataset Cataloging Metadata for Machine Learning Applications Research

Abstract

As the field of machine learning (ML) matures, two types of data archives are developing: collections of benchmark data sets used to test the performance of new algorithms, and data stores to which machine learning/data mining algorithms are applied to create scientific or commercial applications. At present, the catalogs of these archives are ad hoc and not tailored to machine learning analysis. This paper considers the cataloging metadata required to support these two types of repositories, and discusses the organizational support necessary for archive catalog maintenance.

Cite

Text

Cunningham. "Dataset Cataloging Metadata for Machine Learning Applications Research." Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics, 1997.

Markdown

[Cunningham. "Dataset Cataloging Metadata for Machine Learning Applications Research." Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics, 1997.](https://mlanthology.org/aistats/1997/cunningham1997aistats-dataset/)

BibTeX

@inproceedings{cunningham1997aistats-dataset,
  title     = {{Dataset Cataloging Metadata for Machine Learning Applications Research}},
  author    = {Cunningham, Sally Jo},
  booktitle = {Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics},
  year      = {1997},
  pages     = {139-146},
  volume    = {R1},
  url       = {https://mlanthology.org/aistats/1997/cunningham1997aistats-dataset/}
}