In cancer pathology diagnosis, the analysis of Whole Slide Images (WSIs) is complicated by invalid data, tissue features that vary across magnifications, and a large number of hard samples. Multiple Instance Learning (MIL) is a powerful tool for weakly supervised classification in WSI-based pathology diagnosis, but existing MIL frameworks cannot tackle all three issues simultaneously. We propose an integrated recognition framework comprising three complementary components: a preprocessing selection method, an Efficient Feature Pyramid Network (EFPN) model for multi-instance learning, and a Similarity Focal Loss. The preprocessing selection method identifies and selects representative image patches, reducing interference from invalid data and improving the efficiency of subsequent model training. The EFPN model, inspired by pathologists' diagnostic process, captures the different tissue features in a WSI by constructing a multi-scale feature pyramid, enhancing the model's ability to recognize tumor tissue features. The Similarity Focal Loss further improves the model's discriminative power and generalization performance by focusing on hard samples and emphasizing classification boundary information. For binary tumor classification, test accuracy reached 93.58%, 84.74%, and 99.91% on CAMELYON16 and two private datasets, respectively, in each case outperforming existing techniques.
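To illustrate what "focusing on hard samples" means in a focal-style loss, the following is a minimal sketch of the standard binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t). It is not the Similarity Focal Loss proposed here, whose similarity term is not specified in this abstract. The function name `binary_focal_loss` and the defaults `gamma=2.0` and `alpha=0.25` are assumptions chosen for illustration only.

```python
import math


def binary_focal_loss(p, y, gamma=2.0, alpha=0.25, eps=1e-12):
    """Standard binary focal loss for a single sample (illustrative sketch).

    p     : predicted probability of the positive (tumor) class, in [0, 1]
    y     : ground-truth label, 0 or 1
    gamma : focusing parameter; larger values down-weight easy samples more
    alpha : class-balancing weight applied to the positive class
    """
    # p_t is the probability the model assigns to the true class.
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    # Clamp to avoid log(0).
    p_t = min(max(p_t, eps), 1.0)
    # The modulating factor (1 - p_t)^gamma shrinks the loss of
    # well-classified samples, so hard samples dominate the total loss.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


# An easy positive (confidently correct) versus a hard positive.
easy = binary_focal_loss(0.95, 1)
hard = binary_focal_loss(0.30, 1)
```

With `gamma=0` the modulating factor disappears and the expression reduces to alpha-weighted cross-entropy, which makes the down-weighting of easy samples easy to verify in isolation.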