OBJECTIVES: Knee osteoarthritis (KOA), a prevalent degenerative joint disease, is primarily diagnosed through X-ray imaging. The Kellgren-Lawrence grading system (K-L) is the gold standard for evaluating KOA severity through X-ray analysis. However, this method is highly subjective and non-quantifiable, limiting its effectiveness in detecting subtle joint changes on X-rays. Recent researchers have been directed towards developing deep-learning (DL) techniques for a more accurate diagnosis of KOA using X-ray images. Despite advancements in these intelligent methods, the debate over their diagnostic sensitivity continues. Hence, we conducted the current meta-analysis. METHODS: A comprehensive search was conducted in PubMed, Cochrane, Embase, Web of Science, and IEEE up to July 11, 2023. The QUADAS-2 tool was employed to assess the risk of bias in the included studies. Given the multi-classification nature of DL tasks, the sensitivity of DL across different K-L grades was meta-analyzed. RESULTS: A total of 19 studies were included, encompassing 62,158 images. These images consisted of 22,388 for K-L CONCLUSIONS: The DL multi-classification methods based on X-ray imaging generally demonstrate a favorable sensitivity rate (over 50%) in distinguishing between K-L CLINICAL RELEVANCE STATEMENT: Deep-learning methods have been useful to some extent in assessing the effectiveness of X-rays for osteoarthritis of the knee. However, this requires further research and reliable data to provide specific recommendations for clinical practice. KEY POINTS: X-ray deep-learning (DL) methods are debatable for evaluating knee osteoarthritis (KOA) under The Kellgren-Lawrence system (K-L). Multi-classification deep-learning methods are more clinically relevant for assessing K-L grading than dichotomous results. For K-L3 and K-L4, X-ray-based DL has high diagnostic performance
early KOA needs to be further improved.