Enhancing food recognition accuracy using hybrid transformer models and image preprocessing techniques.


Authors: B N Jagadesh, Kranthi Kumar Lella, Srihari Varma Mantena, Shyam Sunder Pabboju, T Prabhakara Rao, Asha P Sathe, Ramesh Vatambeti

Language: eng

Classification number: 794.147 King

Publication information: England : Scientific reports, 2025

Physical description:

Collection: NCBI

ID: 142801


This study presents a robust approach for continuous food recognition essential for nutritional research, leveraging advanced computer vision techniques. The proposed method integrates Mutually Guided Image Filtering (MuGIF) to enhance dataset quality and minimize noise, followed by feature extraction using the Visual Geometry Group (VGG) architecture for intricate visual analysis. A hybrid transformer model, combining Vision Transformer and Swin Transformer variants, is introduced to capitalize on their complementary strengths. Hyperparameter optimization is performed using the Improved Discrete Bat Algorithm (IDBA), resulting in a highly accurate and efficient classification system. Experimental results highlight the superior performance of the proposed model, achieving a classification accuracy of 99.83%, significantly outperforming existing methods. This study underscores the potential of hybrid transformer architectures and advanced preprocessing techniques in advancing food recognition systems, offering enhanced accuracy and efficiency for practical applications in dietary monitoring and personalized nutrition recommendations.
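
The abstract does not say how the Vision Transformer and Swin Transformer branches are combined. One common option, shown here purely as an illustration and not as the authors' method, is late fusion: each branch produces class logits, the logits are turned into probabilities, and the two distributions are averaged with a weight. The function names, the `vit_weight` parameter, and the example logits are all hypothetical; a weight like this is the kind of setting a hyperparameter search could tune.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_predictions(vit_logits, swin_logits, vit_weight=0.5):
    """Late fusion of two classifier branches (an assumed scheme).

    vit_logits / swin_logits: per-class scores from each branch.
    vit_weight: hypothetical mixing weight in [0, 1] for the first branch.
    Returns (predicted_class_index, fused_probabilities).
    """
    if len(vit_logits) != len(swin_logits):
        raise ValueError("both branches must score the same classes")
    if not 0.0 <= vit_weight <= 1.0:
        raise ValueError("vit_weight must lie in [0, 1]")

    p_vit = softmax(vit_logits)
    p_swin = softmax(swin_logits)
    fused = [
        vit_weight * a + (1.0 - vit_weight) * b
        for a, b in zip(p_vit, p_swin)
    ]
    label = max(range(len(fused)), key=fused.__getitem__)
    return label, fused


# Made-up logits for three food classes, for illustration only.
label, fused = fuse_predictions([2.0, 0.5, 0.1], [0.2, 0.3, 3.0])
```

With equal weights the two branches disagree here and the more confident one decides the label; setting `vit_weight` to 1.0 or 0.0 reduces the fusion to a single branch.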
