Fixation is a critical step in green tea processing, and the moisture content of the leaves after fixation is a key indicator of the fixation quality. Near-infrared spectroscopy (NIRS)-based moisture detection technology is often applied in the tea processing industry. However, temperature fluctuations during processing can cause changes in the NIRS curves, which in turn affect the accuracy of moisture prediction models based on the spectral data. To address this challenge, NIRS data were collected from samples at various stages of fixation and at different temperatures, and a novel deep learning network (DiSENet) was proposed, which integrates multi-scale feature fusion and attention mechanisms. Using a global modeling approach, the proposed method achieved a coefficient of determination (R