Can AI Help Make Screening Mammography “Lean”?

Carta Acesso aberto Revisado por pares

Can AI Help Make Screening Mammography “Lean”?

2019; Radiological Society of North America; Volume: 293; Issue: 1 Linguagem: Inglês

10.1148/radiol.2019191542

ISSN

1527-1315

Autores

Despina Kontos, Emily F. Conant,

Tópico(s)

Digital Radiography and Breast Imaging

Resumo

HomeRadiologyVol. 293, No. 1 PreviousNext Reviews and CommentaryFree AccessEditorialCan AI Help Make Screening Mammography “Lean”?Despina Kontos , Emily F. ConantDespina Kontos , Emily F. ConantAuthor AffiliationsFrom the Perelman School of Medicine, University of Pennsylvania, 3400 Spruce St, Philadelphia Pa 19104.Address correspondence to D.K. (e-mail: [email protected]).Despina Kontos Emily F. ConantPublished Online:Aug 6 2019https://doi.org/10.1148/radiol.2019191542MoreSectionsPDF ToolsImage ViewerAdd to favoritesCiteTrack CitationsPermissionsReprints ShareShare onFacebookTwitterLinked In See also the article by Yala et al in this issue.Download as PowerPointOpen in Image Viewer Dr Despina Kontos is an associate professor of radiology in the Center for Biomedical Image Computing and Analytics at the University of Pennsylvania. She holds a C.Eng. diploma in computer engineering and informatics from the University of Patras, Greece, and a PhD in computer science from Temple University, Philadelphia. Dr Kontos’s National Institutes of Health–funded research focuses on machine learning and imaging biomarkers for cancer screening, prognostication, and therapy response assessment.Download as PowerPointOpen in Image Viewer Dr Emily F. Conant is a professor of radiology and the vice chair of faculty development in the Department of Radiology at the Perelman School of Medicine at the University of Pennsylvania. Her research focuses on the analysis of outcomes from various breast imaging modalities, the optimization of the delivery of breast imaging services, and the quantitation of breast imaging data for precision risk assessment, screening, and diagnosis.IntroductionAlthough it is well known that mammographic screening can reduce breast cancer mortality, there is substantial variability in mammography reader performance (1). This variability, coupled with the increasing volume of screening studies, is stressing an already limited workforce. Methods to improve the sensitivity and specificity of readers have included computer-aided detection and the double reading of cases. Computer-aided detection, which is generally focused on improving detection, has had mixed results with little impact on sensitivity in large population studies (2). Double reading, which is frequently performed in service screening programs, has also been shown to have variable impact on screening outcomes and has only further strained an already limited workforce, even when performed with technologist assistants (3). Therefore, finding ways to maximize screening efficiency by eliminating wasteful activity while also maintaining or improving outcomes, a so-called lean approach to screening mammography, is necessary (4).Deep learning applications are rapidly gaining momentum with advances in hardware technology and have been shown to outperform conventional computer-aided detection algorithms (5). Most deep learning applications in breast cancer screening have focused on classifying suspicious findings (6). In this issue of Radiology, Yala et al (7) present an artificial intelligence, or AI, application that focuses on efficiency. Their goal was to reliably identify cancer-free mammograms (ie, true-negative findings) for triaging screening examinations at high confidence of being cancer free and, therefore, reduce the radiologist’s caseload to improve the efficiency while not compromising detection.In this study, a deep convolutional neural network, or CNN (ResNet18), was implemented. The CNN was trained, validated, and tested on a retrospective sample of a total 223 109 screening mammograms from 66 661 women. These mammograms were originally read by one of 23 breast imaging radiologists as part of routine clinical screening between 2009 and 2016 at the authors’ institution. The CNN incorporated pretraining from the ImageNet competition with data augmentation further refined with the authors’ training set, where the entire image, resized for optimal graphics processing unit processing, was fed as the network input. This model was previously trained by the authors to predict diagnosis of breast cancer within 1 year based on all the routine mammographic screening views for each patient (8). In the new application, Yala et al specifically chose the model’s threshold on the validation set to be that of the minimum score of a true-positive case, therefore maximizing the likelihood of triaging out true-negative mammograms while, ideally, maintaining sensitivity for cancer detection. To evaluate a clinical scenario where the developed model would be used for triage of true-negative assessments, the authors compared the performance of the radiologists during their original screening assessment to that of a retrospectively simulated scenario where the radiologists did not read any of the mammogram scored below the model’s “cancer-free” threshold, while reading the remainder of the nontriaged mammograms as before.The model showed good discriminatory ability, with an overall area under the curve of 0.82. The model had a similar performance across all age groups and races. The model was also discriminative across women with different breast density categories, with reported areas under the curve of 0.82, 0.81, 0.85, and 0.71 for women with fatty, scattered, heterogeneously, and extremely dense breasts, respectively. The model triaged a total of 5120 mammograms as cancer free, representing a reduction in caseload of 19%. In this simulated scenario, radiologists achieved a sensitivity and specificity of 90% and 94% (24 814 of 26 349; 95% confidence interval: 94.0%, 94.6%), respectively. The increase in specificity was significant (P = .002), and the sensitivity was noninferior by a margin of 5% (P < .001) compared with the complete caseload read according to standard of care. Two cases of cancer were missed with the new AI-driven model, one that was originally deemed as a false-negative finding by a radiologist and one deemed a true-positive finding and detected by a radiologist.Overall, the results reported by Yala et al are in agreement with data published by Rodriguez-Ruiz et al (8,9), who also suggest a 17% reduction in caseload and a 1% drop in sensitivity in a simulated enriched data set when applying a computer-aided model. Kyono et al (10) also recently showed a CNN model was able to maintain a negative predictive value of 99% while identifying 34% and up to 91% of negative mammograms for test sets with cancer prevalence of 15% and 1%, respectively.The findings of the study by Yala et al, therefore, add to the body of evidence that AI, when incorporated into the clinical workflow of mammographic screening interpretations, has potential advantages, including reducing the overall number of cases that require interpretation by a breast imaging radiologist. However, it still must be shown clinically whether this reduction of caseload leads to a reduction in overall time to read one’s “workload of cases.” It could be that with fewer cases in a radiologist’s worklist, the radiologist may spend more time on complex cases when the simple cases no longer require review. This approach would, ideally, improve the sensitivity of screening without necessarily decreasing total time of interpretation. This, nevertheless, remains to be seen. Most likely, a decrease in screening caseload will allow the radiologist to spend more time on other tasks such as biopsy procedures, diagnostic imaging, and multimodality imaging correlations.Although Yala et al report promising results, limitations of the study must be noted. As acknowledged by the authors, their retrospective study represented a simulated scenario. It is therefore unclear how a radiologist would not only interpret but also pace the reading of the remaining cases in the presence of the proposed reduced caseload. In addition, no tomosynthesis studies were included in the training of the CNN model. It is estimated that tomosynthesis images, which are rapidly becoming the standard for screening, take approximately twice the time to interpret due to the large data set. Thus, it will be important to develop similar algorithms to triage such cases to determine whether similar or potentially greater gains in efficiency could be achieved. From a technical implementation perspective, the training of the CNN model was done on “For Presentation” images and it is difficult to appreciate how much the characteristics captured may have been impacted by vendor-specific postprocessing. A tomosynthesis implementation could further compound computational efficiency of the CNN model due to the larger size and three-dimensional format of the imaging data. Finally, although the sensitivity of reading dropped slightly with the theoretical CNN model, it is unclear what type of cancers these were (eg, advanced invasive cancers vs in-situ lesions) and how such a reduction in sensitivity would translate into clinically significant declines in patient outcomes when scaled up to a population level.Ultimately, larger and more ethnically diverse studies (potentially incorporating additional risk factors) are needed to reliably triage true-negative screening mammograms while not compromising cancer detection and most importantly, patient outcomes. Toward this end, Yala et al made their convolutional neural network model publicly available. This can accelerate independent validation by other studies. Ultimately, this innovative application of artificial intelligence may prove more effective and reliable than conventional computer-aided detection in advancing a so-called lean approach to mammographic screening.Disclosures of Conflicts of Interest: D.K. disclosed no relevant relationships. E.F.C. Activities related to the present article: disclosed no relevant relationships. Activities not related to the present article: is a board member of Hologic; has grants/grants pending with Hologic and iCAD; received payment for lectures including service on speakers bureaus from International Institute for Continuing Medical Education and International Postgraduate Medical Education. Other relationships: disclosed no relevant relationships.Supported by National Institutes of Health (5R01CA161749-07, 5R01CA207084-03).References1. Barlow WE, Chi C, Carney PA, et al. Accuracy of screening mammography interpretation by characteristics of radiologists. J Natl Cancer Inst 2004;96(24):1840–1850. Crossref, Medline, Google Scholar2. Fenton JJ, Taplin SH, Carney PA, et al. Influence of computer-aided detection on performance of screening mammography. N Engl J Med 2007;356(14):1399–1409. Crossref, Medline, Google Scholar3. Posso M, Puig T, Carles M, Rué M, Canelo-Aybar C, Bonfill X. Effectiveness and cost-effectiveness of double reading in digital mammography screening: a systematic review and meta-analysis. Eur J Radiol 2017;96:40–49. Crossref, Medline, Google Scholar4. Kooi T, Litjens G, van Ginneken B, et al. Large scale deep learning for computer aided detection of mammographic lesions. Med Image Anal 2017;35:303–312. Crossref, Medline, Google Scholar5. Aboutalib SS, Mohamed AA, Berg WA, Zuley ML, Sumkin JH, Wu S. Deep learning to distinguish recalled but benign mammography images in breast cancer screening. Clin Cancer Res 2018;24(23):5902–5909. Crossref, Medline, Google Scholar6. Yala A, Schuster T, Miles R, Barzilay R, Lehman C. A deep learning model to triage screening mammograms: a simulation study. Radiology 2019;293:38–46. Link, Google Scholar7. Yala A, Lehman C, Schuster T, Portnoi T, Barzilay R. A Deep learning mammography-based model for improved breast cancer risk prediction. Radiology 2019;292(1):60–66. Link, Google Scholar8. Rodríguez-Ruiz A, Krupinski E, Mordang JJ, et al. Detection of breast cancer with mammography: effect of an artificial intelligence support system. Radiology 2019;290(2):305–314. Link, Google Scholar9. Rodriguez-Ruiz A, Lång K, Gubern-Merida A, et al. Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study. Eur Radiol 2019 Apr 16 [Epub ahead of print] https://doi.org/10.1007/s00330-019-06186-9. Google Scholar10. Kyono T, Gilbert FJ, van der Schaar M. Improving workflow efficiency for mammography using machine learning. J Am Coll Radiol 2019 May 30 [Epub ahead of print]. Google ScholarArticle HistoryReceived: July 9 2019Revision requested: July 16 2019Revision received: July 17 2019Accepted: July 18 2019Published online: Aug 06 2019Published in print: Oct 2019 FiguresReferencesRelatedDetailsCited ByLearning multi-frequency features in convolutional network for mammography classificationYimingWang, YunliangQi, ChunboXu, MengLou, YideMa2022 | Medical & Biological Engineering & Computing, Vol. 60, No. 7Using Deep Neural Network Approach for Multiple-Class Assessment of Digital MammographyShih-YenHsu, Chi-YuanWang, Yi-KaiKao, Kuo-YingLiu, Ming-ChiaLin, Li-RenYeh, Yi-MingWang, Chih-IChen, Feng-ChenKao2022 | Healthcare, Vol. 10, No. 12Lecture Notes in Computer ScienceZhenjieCao, ZhichengYang, YuxingTang, YanboZhang, MeiHan, JingXiao, JieMa, PengChang2021 | , Vol. 12907DeepCAT: Deep Computer-Aided Triage of Screening MammographyPaul H.Yi, DhananjaySingh, Susan C.Harvey, Gregory D.Hager, Lisa A.Mullen2021 | Journal of Digital Imaging, Vol. 34, No. 1Artificial Intelligence: A Primer for Breast Imaging RadiologistsManishaBahl2020 | Journal of Breast Imaging, Vol. 2, No. 4The Power of Triage (CADt) in Breast ImagingLisaWatanabe2020 | Applied RadiologyAccompanying This ArticleA Deep Learning Model to Triage Screening Mammograms: A Simulation StudyAug 6 2019RadiologyRecommended Articles Clinical Performance of Synthesized Two-dimensional Mammography Combined with Tomosynthesis in a Large Screening PopulationRadiology2017Volume: 283Issue: 1pp. 70-76A Deep Learning Model to Triage Screening Mammograms: A Simulation StudyRadiology2019Volume: 293Issue: 1pp. 38-46Breast Cancer Screening with Digital Breast Tomosynthesis Improves Performance of Mammography ScreeningRadiology2023Volume: 307Issue: 3Mammographic Breast Density Assessment Using Deep Learning: Clinical ImplementationRadiology2018Volume: 290Issue: 1pp. 52-58Lessons Learned from the Randomized Controlled TOmosynthesis plus SYnthesized MAmmography (TOSYMA) TrialRadiology2022Volume: 306Issue: 2See More RSNA Education Exhibits Artificial Intelligence In Breast Imaging: Whatâs Going On In Real World?Digital Posters2021Contrast-Enhanced Mammography: Current Indications and Future Directions Â Digital Posters2019Artificial Intelligence in Breast Imaging: Past, Present, and FutureDigital Posters2020 RSNA Case Collection Deodorant ArtifactRSNA Case Collection2021Invasive Lobular CarcinomaRSNA Case Collection2021Breast edemaRSNA Case Collection2021 Vol. 293, No. 1 Metrics Altmetric Score PDF download

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Can AI Help Make Screening Mammography “Lean”?