Expertise-Centric Prompting Framework for Financial Tabular Data Generation Using Pre-Trained Large Language Models
Abstract
Access to financial tabular data is often restricted owing to strict regulations surrounding personal information. Despite the advanced generative capabilities of large language models (LLMs), methodologies for effectively creating or expanding financial tabular datasets remain underdeveloped. The complexity of attribute relationships and the diverse data ranges in financial services present significant challenges in processing and understanding these datasets. To address these issues, we propose an expertise-centric prompting framework for synthesizing realistic and accessible pseudo-financial data. This framework involves a collaboration between financial experts and LLMs, focusing on schema calibration and attribute constraints. Moreover, we introduce new metrics to evaluate the realism of these pseudo datasets. We validated the effectiveness of the proposed framework and metrics on both English and Korean datasets, encompassing card transactions, loan statements, and deposits and savings, utilizing pre-trained LLMs such as KoGPT, ClovaX, LLAMA 2-Chat, GPT-3.0, and ChatGPT-3.5/4.0.
Cite
Text
Kim et al. "Expertise-Centric Prompting Framework for Financial Tabular Data Generation Using Pre-Trained Large Language Models." NeurIPS 2024 Workshops: TRL, 2024.
Markdown
[Kim et al. "Expertise-Centric Prompting Framework for Financial Tabular Data Generation Using Pre-Trained Large Language Models." NeurIPS 2024 Workshops: TRL, 2024.](https://mlanthology.org/neuripsw/2024/kim2024neuripsw-expertisecentric/)
BibTeX
@inproceedings{kim2024neuripsw-expertisecentric,
title = {{Expertise-Centric Prompting Framework for Financial Tabular Data Generation Using Pre-Trained Large Language Models}},
author = {Kim, Subin and Son, Jungmin and Jung, Minyoung and Kwak, Youngjun},
booktitle = {NeurIPS 2024 Workshops: TRL},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/kim2024neuripsw-expertisecentric/}
}