Evaluating Vision-Enabled LLMs a Comparative Study on Cloud Detection using Horizon Camera Imagery

Research Article

Authors

  • Allan Cerentini PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.
  • Juliana Marian Arrais Arrais PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.
  • Bruno Juncklaus Martins PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.
  • Sylvio Luiz Mantelli Neto Mantelli Neto FOTOVOLTAICA-UFSC, INPE Brazilian National Institute for Space Research, São José dos Campos, São Paulo, Brazil.
  • Tiago Oliveira da Luz PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.
  • Aldo von Wangenheim PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Keywords:

Remote Sensing; Large Language Models; Sky Clouds; Multimodal LLMs

Abstract

The rapid advancement of vision-enabled large language models (LLMs) presents transformative opportunities for specialized domains such as atmospheric science. This study evaluates the efficacy of multimodal LLMs in cloud identification tasks by leveraging a curated subset of the Clouds-1500 dataset, annotated with World Meteorological Organization (WMO) cloud classes. We introduce a novel pipeline that converts segmentation masks into text-based spatial, coverage, and class representations, enabling structured LLM analysis through custom prompts and the BAML library for response standardization. Benchmarking 18 state-of-the-art models revealed significant performance variations, with Anthropic’s Claude 3.5 Sonnet (71.67% class accuracy), OpenAI’s GPT-4o (68.89%), and xAI’s Grok Vision Beta (70.00%) emerging as top performers. However, challenges persist in low-coverage scenarios, where even leading models exhibited accuracy drops of 30–50%. The study demonstrates that while LLMs show promise in interpreting complex meteorological data, their effectiveness depends on task complexity, model architecture, and domain-specific adaptations. These findings provide a framework for integrating LLMs into remote sensing workflows, balancing automation with the precision required for operational meteorology.

 

Keywords Remote Sensing; Large Language Models; Sky Clouds; Multimodal LLMs

Author Biographies

Allan Cerentini, PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

1PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Juliana Marian Arrais Arrais, PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

1PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Bruno Juncklaus Martins, PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

1PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Sylvio Luiz Mantelli Neto Mantelli Neto, FOTOVOLTAICA-UFSC, INPE Brazilian National Institute for Space Research, São José dos Campos, São Paulo, Brazil.

2FOTOVOLTAICA-UFSC, INPE Brazilian National Institute for Space Research, São José dos Campos, São Paulo, Brazil. 

Tiago Oliveira da Luz, PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

1PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Aldo von Wangenheim, PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

1PPGCC - Federal University of Santa Catarina, Florianópolis, Santa Catarina, Brazil.

Downloads

Published

2025-04-09

How to Cite

Cerentini, A. ., Arrais, J. M. A., Martins, B. J. ., Mantelli Neto, S. L. M. N., da Luz, T. O. ., & Wangenheim, A. von . (2025). Evaluating Vision-Enabled LLMs a Comparative Study on Cloud Detection using Horizon Camera Imagery: Research Article. International Journal of Advanced Remote Sensing and GIS, 13(1), pp. 3557–3569. Retrieved from https://cloudjl.com/index.php/RemoteSensing/article/view/72

Issue

Section

Articles

Most read articles by the same author(s)