AI-Enhanced Semantic Feature Norms for 786 Concepts
Abstract
Semantic feature norms have been foundational in the study of human conceptual knowledge, yet traditional methods face trade-offs between concept/feature coverage and verifiability of quality due to the labor-intensive nature of norming studies. Here, we introduce a novel approach that augments a dataset of human-generated feature norms with responses from large language models (LLMs) while verifying the quality of norms against reliable human judgments. We find that our AI-enhanced feature norm dataset shows much higher feature density and overlap among concepts while outperforming a comparable human-only norm dataset and word-embedding models in predicting people’s semantic similarity judgments. Taken together, we demonstrate that human conceptual knowledge is richer than captured in previous norm datasets and show that, with proper validation, LLMs can serve as powerful tools for cognitive science research.
Cite
Text
Suresh et al. "AI-Enhanced Semantic Feature Norms for 786 Concepts." ICLR 2025 Workshops: Bi-Align, 2025.Markdown
[Suresh et al. "AI-Enhanced Semantic Feature Norms for 786 Concepts." ICLR 2025 Workshops: Bi-Align, 2025.](https://mlanthology.org/iclrw/2025/suresh2025iclrw-aienhanced/)BibTeX
@inproceedings{suresh2025iclrw-aienhanced,
title = {{AI-Enhanced Semantic Feature Norms for 786 Concepts}},
author = {Suresh, Siddharth and Mukherjee, Kushin and Giallanza, Tyler and Yu, Xizheng and Patil, Mia and Cohen, Jonathan D. and Rogers, Timothy T.},
booktitle = {ICLR 2025 Workshops: Bi-Align},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/suresh2025iclrw-aienhanced/}
}