Linking Industry Sectors and Financial Statements: A Hybrid Approach for Company Classification
Abstract
The identification of the financial characteristics of industry sectors has a large importance in accounting audit, allowing auditors to prioritize the most important area during audit. Existing company classification standards such as the Standard Industry Classification (SIC) code allow to map a company to a category based on its activity and products. In this paper, we explore the potential of machine learning algorithms and language models to analyze the relationship between those categories and companies' financial statements. We propose a supervised company classification methodology and analyze several types of representations for financial statements. Existing works address this task using solely numerical information in financial records. Our findings show that beyond numbers, textual information occurring in financial records can be leveraged by language models to match the performance of dedicated decision tree-based classifiers, while providing better explainability and more generic accounting representations. We think this work can serve as a preliminary work towards semi-automatic auditing.
Cite
Text
Dzuyo et al. "Linking Industry Sectors and Financial Statements: A Hybrid Approach for Company Classification." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I16.33806Markdown
[Dzuyo et al. "Linking Industry Sectors and Financial Statements: A Hybrid Approach for Company Classification." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/dzuyo2025aaai-linking/) doi:10.1609/AAAI.V39I16.33806BibTeX
@inproceedings{dzuyo2025aaai-linking,
title = {{Linking Industry Sectors and Financial Statements: A Hybrid Approach for Company Classification}},
author = {Dzuyo, Guy Stephane Waffo and Guibon, Gaël and Cerisara, Christophe and Belmar-Letelier, Luis},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {16444-16452},
doi = {10.1609/AAAI.V39I16.33806},
url = {https://mlanthology.org/aaai/2025/dzuyo2025aaai-linking/}
}