Enhancing photovoltaic power generation nowcasting with sky image analysis using multi-modal attention networks

  • Oluwatoyosi Bamisile (Lead / Corresponding author)
  • , She Kun (Lead / Corresponding author)
  • , Chiagoziem C. Ukwuoma
  • , Dongsheng Cai
  • , Chibueze D. Ukwuoma
  • , Chinedu I. Otuka
  • , Chidera O. Ukwuoma
  • , Olusola Bamisile

Research output: Contribution to journalArticlepeer-review

Abstract

The growing demand for renewable energy has heightened the importance of accurate photovoltaic (PV) power forecasting, especially under fluctuating weather conditions. Traditional and many deep learning-based models struggle to interpret the complex visual patterns in sky images, resulting in reduced prediction accuracy, particularly during periods of cloudiness or seasonal variability. In this study, we introduce a novel deep learning model that enhances solar nowcasting by dynamically identifying and emphasising the most informative features across multiple visual dimensions. The model introduces a compact multi-scale CNN backbone specifically tailored to the spectral–spatial characteristics of sky images, ensuring efficient feature extraction under real-time constraints. A custom lightweight attention mechanism is embedded to enhance cloud–irradiance saliency detection, delivering transformer-like selectivity while retaining low parameter and computational cost. These features are coupled with a regularised deep regression head that integrates high-level interactions, forming a unified architecture that advances photovoltaic nowcasting by balancing accuracy, interpretability, and efficiency. The study used the novel Sky Images and Photovoltaic Power Generation Dataset provided by Stanford University to evaluate the proposed model using evaluation metrics such as RMSE, MSE, MAE, MAPE, and R2. The proposed model achieves an overall RMSE of 2.259 and R2 of 0.913, with particularly strong performance on sunny days (RMSE of 0.461, R2 of 0.996) and consistent results under cloudy conditions (RMSE of 3.158, R2 of 0.824). Seasonal analysis reveals that the model maintains robust accuracy across different climatic conditions, with channel attention excelling during high-irradiance summer and autumn days, and spatial attention effectively capturing complex cloud structures in winter and spring. These outcomes depict the model’s ability to deliver more reliable short-term power forecasts by leveraging deeper visual understanding, ultimately contributing to more efficient solar energy management.

Original languageEnglish
Article number114117
JournalSolar Energy
Volume303
Early online date7 Nov 2025
DOIs
Publication statusPublished - Jan 2026

Keywords

  • Attention Mechanism
  • Deep Learning
  • Photovoltaic Output Prediction
  • Sky Images
  • Solar Forecasting

ASJC Scopus subject areas

  • Renewable Energy, Sustainability and the Environment
  • General Materials Science

Fingerprint

Dive into the research topics of 'Enhancing photovoltaic power generation nowcasting with sky image analysis using multi-modal attention networks'. Together they form a unique fingerprint.

Cite this