Certifai: A Toolkit for Building Trust in AI Systems

Abstract

As more companies and governments build and use machine learning models to automate decisions, there is an ever-growing need to monitor and evaluate these models' behavior once they are deployed. Our team at CognitiveScale has developed a toolkit called Cortex Certifai to answer this need. Cortex Certifai is a framework that assesses aspects of robustness, fairness, and interpretability of any classification or regression model trained on tabular data, without requiring access to its internal workings. Additionally, Cortex Certifai allows users to compare models along these different axes and only requires 1) query access to the model and 2) an “evaluation” dataset. At its foundation, Cortex Certifai generates counterfactual explanations, which are synthetic data points close to input data points but differing in terms of model prediction. The tool then harnesses characteristics of these counterfactual explanations to analyze different aspects of the supplied model and delivers evaluations relevant to a variety of different stakeholders (e.g., model developers, risk analysts, compliance officers). Cortex Certifai can be configured and executed using a command-line interface (CLI), within jupyter notebooks, or on the cloud, and the results are recorded in JSON files and can be visualized in an interactive console. Using these reports, stakeholders can understand, monitor, and build trust in their AI systems. In this paper, we provide a brief overview of a demonstration of Cortex Certifai's capabilities.

Cite

Text

Henderson et al. "Certifai: A Toolkit for Building Trust in AI Systems." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/759

Markdown

[Henderson et al. "Certifai: A Toolkit for Building Trust in AI Systems." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/henderson2020ijcai-certifai/) doi:10.24963/IJCAI.2020/759

BibTeX

@inproceedings{henderson2020ijcai-certifai,
  title     = {{Certifai: A Toolkit for Building Trust in AI Systems}},
  author    = {Henderson, Jette and Sharma, Shubham and Gee, Alan H. and Alexiev, Valeri and Draper, Steve and Marin, Carlos and Hinojosa, Yessel and Draper, Christine and Perng, Michael and Aguirre, Luis and Li, Michael and Rouhani, Sara and Consul, Shorya and Michalski, Susan and Prasad, Akarsh and Chutani, Mayank and Kumar, Aditya and Alam, Shahzad and Kandarpa, Prajna and Jesudasan, Binnu and Lee, Colton and Criscolo, Michael and Williamson, Sinead and Sanchez, Matt and Ghosh, Joydeep},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {5249-5251},
  doi       = {10.24963/IJCAI.2020/759},
  url       = {https://mlanthology.org/ijcai/2020/henderson2020ijcai-certifai/}
}