Design Principle Transfer in Neural Architecture Search via Large Language Models

Zhou, Xun; Wu, Xingyu; Feng, Liang; Lu, Zhichao; Tan, Kay Chen

doi:10.1609/AAAI.V39I21.34463

AAAI 2025 pp. 23000-23008

Abstract

Transferable neural architecture search (TNAS) has been introduced to design efficient neural architectures for multiple tasks and thereby enhance the practical applicability of NAS in real-world scenarios. In TNAS, architectural knowledge accumulated in previous search processes is reused to warm up the architecture search for new tasks. However, existing TNAS methods still search in an extensive search space, necessitating the evaluation of numerous architectures. To overcome this challenge, this work proposes a novel transfer paradigm, i.e., design principle transfer. In this work, linguistic descriptions of the effects of various structural components on architectural performance are termed design principles. They are learned from established architectures and can then be reused to reduce the search space by discarding unpromising architectures. Searching in the refined search space can boost both search performance and efficiency for new NAS tasks. To this end, a large language model (LLM)-assisted design principle transfer (LAPT) framework is devised. In LAPT, an LLM is applied to automatically infer the design principles from a set of given architectures, and a principle adaptation method then refines these principles progressively based on the search results. Experimental results demonstrate that LAPT can beat the state-of-the-art TNAS methods on most tasks and achieve comparable performance on the remainder.

Cite

Text

Zhou et al. "Design Principle Transfer in Neural Architecture Search via Large Language Models." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I21.34463

Markdown

[Zhou et al. "Design Principle Transfer in Neural Architecture Search via Large Language Models." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhou2025aaai-design/) doi:10.1609/AAAI.V39I21.34463

BibTeX

@inproceedings{zhou2025aaai-design,
  title     = {{Design Principle Transfer in Neural Architecture Search via Large Language Models}},
  author    = {Zhou, Xun and Wu, Xingyu and Feng, Liang and Lu, Zhichao and Tan, Kay Chen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23000-23008},
  doi       = {10.1609/AAAI.V39I21.34463},
  url       = {https://mlanthology.org/aaai/2025/zhou2025aaai-design/}
}