A Teacher-Teacher Framework for Clinical Language Representation Learning

Abstract

In recent years, there has been a proliferation of ready-to-use large language models (LLMs) designed for various applications, both general-purpose and domain-specific. Instead of advocating for the development of a new model or continuous pretraining of an existing one, this paper introduces a pragmatic teacher-teacher framework to facilitate mutual learning between two pre-existing models.By leveraging two teacher models possessing complementary knowledge, we introduce a LIghtweight kNowledge alignmEnt (LINE) module aimed at harmonizing their knowledge within a unified representation space. This framework is particularly valuable in clinical settings, where stringent regulations and privacy considerations dictate the handling of detailed clinical notes. Our trained LINE module excels in capturing critical information from clinical notes, leveraging highly de-identified data. Validation and downstream tasks further demonstrate the effectiveness of the proposed framework.

Cite

Text

Huang et al. "A Teacher-Teacher Framework for Clinical Language Representation Learning." Neural Information Processing Systems, 2024. doi:10.52202/079017-3681

Markdown

[Huang et al. "A Teacher-Teacher Framework for Clinical Language Representation Learning." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/huang2024neurips-teacherteacher/) doi:10.52202/079017-3681

BibTeX

@inproceedings{huang2024neurips-teacherteacher,
  title     = {{A Teacher-Teacher Framework for Clinical Language Representation Learning}},
  author    = {Huang, Feiqing and Zhang, Shenghan and Sweet, Sara Morini and Cai, Tianxi},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3681},
  url       = {https://mlanthology.org/neurips/2024/huang2024neurips-teacherteacher/}
}