Self-Supervised Vision for Climate Downscaling

Abstract

Generative artificial intelligence in music has made significant strides, yet it still falls short of the substantial achievements seen in natural language processing, primarily due to the limited availability of music data. Knowledge-informed approaches have been shown to enhance the performance of music generation models, even when only a few pieces of musical knowledge are integrated. This paper seeks to leverage comprehensive music theory in AI-driven music generation tasks, such as algorithmic composition and style transfer, which traditionally require significant manual effort with existing techniques. We introduce a novel automatic music lexicon construction model that generates a lexicon, named CompLex, comprising 37,432 items derived from just 9 manually input category keywords and 5 sentence prompt templates. A new multi-agent algorithm is proposed to automatically detect and mitigate hallucinations. CompLex demonstrates impressive performance improvements across three state-of-the-art text-to-music generation models, encompassing both symbolic and audio-based methods. Furthermore, we evaluate CompLex in terms of completeness, accuracy, non-redundancy, and executability, confirming that it possesses the key characteristics of an effective lexicon.

Cite

Text

Singh et al. "Self-Supervised Vision for Climate Downscaling." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/825

Markdown

[Singh et al. "Self-Supervised Vision for Climate Downscaling." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/singh2024ijcai-self/) doi:10.24963/ijcai.2024/825

BibTeX

@inproceedings{singh2024ijcai-self,
  title     = {{Self-Supervised Vision for Climate Downscaling}},
  author    = {Singh, Karandeep and Jeong, Chaeyoon and Shidqi, Naufal and Park, Sungwon and Nellikkattil, Arjun and Zeller, Elke and Cha, Meeyoung},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {7456-7464},
  doi       = {10.24963/ijcai.2024/825},
  url       = {https://mlanthology.org/ijcai/2024/singh2024ijcai-self/}
}