1  Introduction

Breast cancer remains one of the most prevalent malignancies among women globally. Accurate pathological evaluation of key biomarkers, such as estrogen receptor (ER), progesterone receptor (PR), HER2, and the Ki-67 proliferation index, is crucial in determining prognosis and informing therapeutic decisions. However, despite the existence of well-established guidelines such as those from the American Society of Clinical Oncology (ASCO) and College of American Pathologists (CAP), significant interobserver variability persists in the interpretation of these immunohistochemistry (IHC) assays. This variability is particularly pronounced in cases of equivocal or low HER2 expression, low-positive hormone receptors (1-10% positivity), and Ki-67 scoring, leading to inconsistencies in molecular subtyping that can potentially impact treatment decisions (Ivanova et al. 2024; Eren et al. 2026).

Recent updates to HER2 testing guidelines (2023 ASCO/CAP) and the emerging clinical significance of HER2-low breast cancer have further emphasized the need for precise and reproducible biomarker assessment (Ivanova et al. 2024). Studies demonstrate considerable concordance challenges, particularly in distinguishing HER2 IHC 0 from 1+ scores, a distinction that has become clinically relevant with the development of HER2-targeted antibody-drug conjugates for HER2-low disease (Eren et al. 2026; Hou, Nitta, and Li 2023). Similarly, Ki-67 assessment remains problematic despite its prognostic and predictive value, with Gown (2023) characterizing it as showing “promise, potential, and problems” due to persistent interobserver variability and a lack of standardization across laboratories.

The implementation of digital pathology has opened new possibilities for artificial intelligence (AI) integration in routine histopathological practice (Niazi, Parwani, and Gurcan 2019; Soliman, Li, and Parwani 2024). Deep learning approaches have demonstrated remarkable capabilities in breast cancer histopathology, including prediction of biomarker status directly from H&E-stained sections and automated grading with improved reproducibility (Shamai et al. 2022; Wang et al. 2022; Chan et al. 2023). Computational pathology methods using weakly supervised deep learning on whole slide images have achieved clinical-grade performance, suggesting feasibility for routine diagnostic implementation (Campanella et al. 2019).

AI-based digital image analysis has shown significant potential in reducing observer-dependent variability and improving reproducibility of biomarker scoring (Baxi et al. 2022; Abele et al. 2023; Liu et al. 2023). In ER and PR evaluation, automated image analysis demonstrates strong correlation with pathologists’ manual assessments and improved inter-laboratory consistency (Shafi et al. 2022). For HER2 interpretation, AI assistance has been reported to enhance diagnostic concordance, especially in challenging HER2-low and equivocal categories (Jung et al. 2024; Wu et al. 2023).

Ki-67 presents particular challenges that make it an important target for AI-assisted quantification. A comprehensive evaluation by Dawe et al. (2024) compared five digital image analysis (DIA) methods for Ki-67 scoring in 278 breast cancer cases, finding that while a deep learning approach (piNET) achieved the highest agreement with manual counts (ICC: 0.850), none of the tested methods reached the clinically desired Cohen’s κ ≥ 0.8 at common diagnostic cutoffs. The study identified tumor heterogeneity, algorithm implementation differences, and image segmentation as primary contributors to variability, emphasizing the need for standardized, fully automated pipelines to achieve robust, reproducible scoring. Other studies have confirmed that AI-assisted Ki-67 quantification significantly improves reproducibility and reduces the time burden of manual assessment (Dy et al. 2024; Acs et al. 2021).
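To make the agreement statistic concrete, the cutoff-based comparison can be sketched in a few lines of Python. The Ki-67 percentages, the two raters, and the 20% cutoff below are purely hypothetical illustrations, not data from Dawe et al. (2024); only the kappa formula itself, κ = (p_o − p_e)/(1 − p_e), is standard.

```python
def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length label sequences."""
    assert len(rater_a) == len(rater_b) and rater_a
    n = len(rater_a)
    labels = sorted(set(rater_a) | set(rater_b))
    # Observed agreement: fraction of cases where the raters agree.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement under independence, from the marginal frequencies.
    p_e = sum((rater_a.count(l) / n) * (rater_b.count(l) / n) for l in labels)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical Ki-67 percentages from a manual count and a DIA method,
# dichotomized at a commonly used 20% cutoff (low vs. high).
manual = [12, 25, 8, 40, 18, 22, 5, 30]
dia    = [15, 28, 6, 35, 23, 19, 4, 33]
calls_m = ["high" if v >= 20 else "low" for v in manual]
calls_d = ["high" if v >= 20 else "low" for v in dia]
print(round(cohens_kappa(calls_m, calls_d), 3))  # → 0.5, well below the 0.8 target
```

Note how two borderline cases (18 vs. 23 and 22 vs. 19) straddle the cutoff and alone pull κ far below 0.8, mirroring the sensitivity of cutoff-based concordance to near-threshold tumors.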

A critical gap in the current literature is the paucity of real-world studies examining how pathologists actually integrate AI recommendations into clinical practice. Most validation studies focus on algorithm performance metrics (sensitivity, specificity, concordance) without examining pathologist behavior or the impact on clinical decision-making (Li et al. 2025; Tan-Garcia, Chua, and Leow 2025). Large-scale studies comparing local versus central laboratory assessments, such as the WSG ADAPTcycle trial (n=5,292) by Hamann et al. (2025), demonstrate ongoing challenges in achieving consistent Ki-67 measurements even without AI, highlighting the need for standardization tools.

Overall, these findings position AI as a promising tool for standardizing IHC biomarker assessment, minimizing diagnostic discrepancies, and improving clinical decision-making in breast cancer (Reis-Filho and Kather 2023; Ahn et al. 2023; Yan, Li, and Wu 2023). However, critical questions remain about real-world implementation, individual pathologist adoption patterns, cases where AI may increase rather than decrease disagreement, and potential systematic biases introduced by automated systems. The present study addresses these gaps by evaluating the impact of AI assistance (Aiforia platform) on interobserver variability, individual pathologist behavior, and clinical decision implications in ER, PR, HER2, and Ki-67 assessments. Using a two-phase design (pre-AI and post-AI evaluations by the same pathologists), we provide a comprehensive analysis of AI’s real-world impact on diagnostic concordance, systematic biases, and treatment-relevant molecular classification in a representative clinical dataset.
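As an illustration of how multi-rater concordance can be quantified in such a two-phase design, the sketch below computes Fleiss’ kappa for hypothetical pre-AI and post-AI HER2 IHC category calls by three readers. All counts are invented for the example and are not study data; the formula is the standard generalization of Cohen’s kappa to a fixed number of raters per case.

```python
def fleiss_kappa(ratings):
    """Fleiss' kappa for n cases, each rated by the same number of raters.

    `ratings` is a list of dicts mapping category -> number of raters
    assigning that category to the case.
    """
    n_cases = len(ratings)
    n_raters = sum(ratings[0].values())
    cats = set()
    for r in ratings:
        cats |= r.keys()
    # Per-case agreement P_i: fraction of agreeing rater pairs.
    P = [(sum(c * c for c in r.values()) - n_raters)
         / (n_raters * (n_raters - 1)) for r in ratings]
    p_bar = sum(P) / n_cases
    # Chance agreement from overall category proportions.
    p_j = {c: sum(r.get(c, 0) for r in ratings) / (n_cases * n_raters)
           for c in cats}
    p_e = sum(v * v for v in p_j.values())
    return (p_bar - p_e) / (1 - p_e)

# Hypothetical HER2 IHC calls: 3 pathologists x 5 cases, before and
# after AI assistance; each dict maps category -> rater count.
pre  = [{"0": 2, "1+": 1}, {"1+": 3}, {"0": 1, "1+": 2},
        {"1+": 2, "2+": 1}, {"2+": 3}]
post = [{"0": 3}, {"1+": 3}, {"1+": 3}, {"1+": 3}, {"2+": 3}]
print(round(fleiss_kappa(pre), 3), fleiss_kappa(post))  # → 0.338 1.0
```

Comparing the pre-AI and post-AI kappas on the same cases is one simple way to express whether AI assistance moved the readers toward or away from consensus.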