Automating Geospatial Vision Tasks with a Large Language Model Agent

Chen, Yuxing; Wang, Weijie; Kurtz, Camille; Lobry, Sylvain

doi:10.1007/978-3-662-72243-5_13

Automating Geospatial Vision Tasks with a Large Language Model Agent

Yuxing Chen, Weijie Wang, Camille Kurtz, Sylvain Lobry

ECML-PKDD 2025 pp. 218-235

doi:10.1007/978-3-662-72243-5_13 /ecmlpkdd/2025/chen2025ecmlpkdd-automating/

Abstract

Large Language Models (LLMs) have shown promise in automating code generation for data science tasks, yet they struggle with complex task sequences, especially in geospatial vision tasks. These difficulties stem from challenges in managing stepwise dependencies, aligning diverse data sources with spatial constraints, and accurately applying various geospatial libraries—often resulting in logical errors or hallucinations. To address these limitations, we introduce GeoAgent, an interactive framework designed to enable LLMs to automate geospatial vision tasks effectively. GeoAgent integrates a code interpreter, static analysis, and retrieval generation within a Monte Carlo Tree Search framework, creating a robust solution tailored to the geospatial data processing workflow. We introduce a new benchmark to evaluate GeoAgent’s performances on single- and multi-turn tasks, including geospatial data acquisition, analysis, and visualization across multiple Python libraries. Our experiments reveal that GeoAgent significantly outperforms baseline LLMs in function call accuracy, task pass rate and task completion, marking a substantial advancement in automating geospatial vision tasks and setting a new standard for LLM-driven geospatial data analysis.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Chen et al. "Automating Geospatial Vision Tasks with a Large Language Model Agent." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-662-72243-5_13

Markdown

[Chen et al. "Automating Geospatial Vision Tasks with a Large Language Model Agent." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/chen2025ecmlpkdd-automating/) doi:10.1007/978-3-662-72243-5_13

BibTeX

@inproceedings{chen2025ecmlpkdd-automating,
  title     = {{Automating Geospatial Vision Tasks with a Large Language Model Agent}},
  author    = {Chen, Yuxing and Wang, Weijie and Kurtz, Camille and Lobry, Sylvain},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {218-235},
  doi       = {10.1007/978-3-662-72243-5_13},
  url       = {https://mlanthology.org/ecmlpkdd/2025/chen2025ecmlpkdd-automating/}
}