AI Can Reduce Overdiagnosis in Ultrasound Screening for Breast Cancer

RESEARCH BRIEF RESEARCH BRIEF By Pawel Slabiak • September 7, 2021 Center for Advanced Imaging Innovation and Research

Breast cancer is the most common and deadliest malignant disease affecting women worldwide.

[{"selector":"#anim-3a60a996-6d93-4cbf-8cc1-c2ea638a0169","keyframes":{"opacity":[0,1]},"delay":1500,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}]

But "dense" breasts, which are at higher risk of cancer, appear opaque on mammograms, making interpretation more difficult.

[{"selector":"#anim-ad2aa563-4f02-405f-81c3-dc6a77d53551","keyframes":{"opacity":[0,1]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b4881784-7ed7-4234-8a4e-986f1e729cac","keyframes":{"transform":["rotate(-356deg) translate3d(-97.70115%, 0px, 0) rotate(356deg)","rotate(-356deg) translate3d(0px, 0px, 0) rotate(356deg)"]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-4d56529f-0d59-449f-a9df-a33f882e0bec","keyframes":{"opacity":[0,1]},"delay":6000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-11149628-4474-4f94-83ff-78fee2369410","keyframes":{"opacity":[0,1]},"delay":6000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-3ddd1c91-a729-45bb-b979-4fb2031e4e43","keyframes":{"opacity":[0,1]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-eeafeaac-87b4-4433-8b6a-5833a9e98eef","keyframes":{"transform":["translate3d(-115.14196%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-189ec668-ad2c-4662-9060-182e933e401c","keyframes":{"opacity":[0,1]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b5e65767-d2bd-4e70-b93d-7ea1beb79261","keyframes":{"transform":["translate3d(-98.58156%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":2000,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] Dense and extremely dense breasts contain a high proportion of "radiopaque" parenchyma. Breasts that have low mamographic density comprise mostly "radiolucent" fat.

When mammography is hard to decipher, doctors often turn to ultrasound for more information.

Less than 10 percent of which confirm cancer.

[{"selector":"#anim-f0326ec6-cd63-4a63-b37e-4517089c4412","keyframes":{"opacity":[0,1]},"delay":1600,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-8fc6a42f-db21-472a-91fc-fddc4b973cbd","keyframes":{"opacity":[0,1]},"delay":2400,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-34fac49d-2f4a-40b7-a0c9-a160eb1b96ba","keyframes":{"opacity":[0,1]},"delay":2000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}]

To investigate whether AI can help doctors make fewer false-positive findings, a team of researchers from NYU Langone Health, New York University, and NYU Abu Dhabi created a deep learning model to detect breast cancer in ultrasounds.

[{"selector":"#anim-0a9b0339-8f30-43d2-9b00-9d54b9c6aa8a [data-leaf-element=\"true\"]","keyframes":{"transform":["translate(1.6814370082872596e-15%, 2.3160158246516834e-7%) scale(0.7633587786259542)","translate(0%, 0%) scale(1)"]},"delay":0,"duration":20000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"forwards"}]

Scientists prepared a set of deidentified images and anonymized medical reports from more than 143,000 patients who underwent screening at NYU Langone Health.

[{"selector":"#anim-514d762c-9521-4c24-8bdf-351409ced3c1","keyframes":{"opacity":[0,1]},"delay":5000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-ab3c9c60-ce1a-4103-a17d-d18c139e40cc","keyframes":{"opacity":[0,1]},"delay":7000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-1c7cf53e-039a-4cac-a5e4-2e45824e584d","keyframes":{"opacity":[0,1]},"delay":3000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-546b8988-6c4e-40dc-bab9-58975d2e3fe3","keyframes":{"opacity":[0,1]},"delay":3000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-d3add4d5-08c7-4bac-9255-ab6c07601809","keyframes":{"opacity":[0,1]},"delay":5000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-98f605fe-c36a-4130-bd47-6949bbe85843","keyframes":{"opacity":[0,1]},"delay":7000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] VALIDATION 10% TEST 30% TRAINING 60% 143K PATIENT CASES

To compare the model's performance with that of human experts, the researchers had 10 radiologists read a subset of the exams.

[{"selector":"#anim-b8c5ed06-29c4-4635-9099-6b3698d77347","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-db575575-1663-4305-9894-5ad5b5936798","keyframes":{"transform":["translate3d(-80.42836%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}]

In most cases, the AI model agreed with human readers.

[{"selector":"#anim-74229d5e-8b7c-4d40-ac3b-763a081a68f0","keyframes":{"opacity":[0,1]},"delay":4000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b7f2b610-956d-4dac-addb-93fd4e088043","keyframes":{"opacity":[0,1]},"delay":4000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9d5f72f5-788c-4349-b9af-021de0ae8f4a","keyframes":{"opacity":[0,1]},"delay":4000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-1a50365b-613b-40e9-a60d-d52f2497dd48","keyframes":{"opacity":[0,1]},"delay":1000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] In this ultrasound exam, all 10 radiologists found cancer. And so did the AI.

But in many cases in which the radiologists erroneously suspected cancer, the AI correctly found none.

[{"selector":"#anim-c47036d5-ac5a-48b7-ac72-93c5fcceb722","keyframes":{"opacity":[0,1]},"delay":1000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-3d6b078d-ee39-468f-8cb0-d1d37c0f959e","keyframes":{"opacity":[0,1]},"delay":5000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9bfd4075-2d29-4d14-a49f-01e589ffd4ff","keyframes":{"opacity":[0,1]},"delay":5000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-a49def68-86a8-43a8-b639-4c13c9e25c76","keyframes":{"opacity":[0,1]},"delay":5000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-34766986-e04d-4078-8e90-a8e938a02dad","keyframes":{"opacity":[0,1]},"delay":5000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-d9b2bd38-3d6b-4e5f-a408-60951ddd5251","keyframes":{"opacity":[0,1]},"delay":5000,"duration":10000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] In both exams below, all 10 radiologists suspected malignancy and ordered biopsies. But the AI correctly classified the lesions as benign.

Overall, the AI was as accurate as the experts in identifying malignant lesions (it matched radiologists' sensitivity).

[{"selector":"#anim-40f6a2ae-3a45-47d6-bf9e-e5b88e2f9aae","keyframes":{"opacity":[0,1]},"delay":1600,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7a9bc8e3-b4bb-4f65-9e94-1274ae41394c","keyframes":{"opacity":[0,1]},"delay":3200,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9e899bdc-58fd-4a83-89ed-369834e6456c [data-leaf-element=\"true\"]","keyframes":{"transform":["translate(0%, 0%) scale(1.5)","translate(0%, 0%) scale(1)"]},"delay":0,"duration":30000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"forwards"}]

But with their powers combined, the human experts and the machine learning model achieved even higher specificity and even lower biopsy rates...

[{"selector":"#anim-8a5022fa-66fa-40cf-bbb2-bb7e8c6e5eb0","keyframes":{"opacity":[0,1]},"delay":4000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-337e3ba3-73f6-481b-a1eb-ca115ca8787c","keyframes":{"opacity":[0,1]},"delay":2500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9fbc2d88-afc3-422d-89f8-ca0a28d21cc4","keyframes":{"opacity":[0,1]},"delay":2500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-ce1dd5f9-5495-4405-a2db-c5259b64d719","keyframes":{"opacity":[0,1]},"delay":6500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-5011d53b-0661-459a-9cd7-10407ae9856a","keyframes":{"opacity":[0,1]},"delay":6500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-10d72426-d9fd-4714-907c-a16295fce7bc","keyframes":{"opacity":[0,1]},"delay":4000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-62c91e0b-edfa-4cab-bfbf-2adb9d5a82ce","keyframes":{"opacity":[0,1]},"delay":6500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-a2eec993-407f-4aa7-a308-b736c3170908","keyframes":{"opacity":[0,1]},"delay":2500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] AI Hybrid Radiologists 0.0 0.1 0.2 0.3 Biopsy rate

...while also identifying cancer cases more accurately than either the AI or the radiologists did on their own.

[{"selector":"#anim-4760e235-902b-4bdd-979a-b99359746035","keyframes":{"opacity":[0,1]},"delay":3500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-fbef19b2-8dad-4b9b-bde6-90f3d02a8424","keyframes":{"opacity":[0,1]},"delay":6000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-2c291ada-1cf5-4074-ba7c-0e8d2fbf68a1","keyframes":{"opacity":[0,1]},"delay":6000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-e8873b14-dd95-44f9-8566-057e5c752875","keyframes":{"opacity":[0,1]},"delay":2000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-16c70171-4974-407a-85a3-545fa94614c3","keyframes":{"opacity":[0,1]},"delay":2000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-2e985d3f-0a00-43c7-a583-0be63c3e859e","keyframes":{"opacity":[0,1]},"delay":3500,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-aad2279f-713f-47be-8cb4-667e8f3f09b9","keyframes":{"opacity":[0,1]},"delay":2000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7b539692-24ba-4551-bcbc-5a7abc4bc6f5","keyframes":{"opacity":[0,1]},"delay":6000,"duration":3000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] AI Hybrid Radiologists Positive predictive value 0.0 0.1 0.3 0.5 0.2 0.4

To accept AI's assistance in the reading room, radiologists need better insight into how deep learning algorithms arrive at recommendations.

[{"selector":"#anim-1c55eef5-e35d-4c66-a01d-5a746c5e1b4e [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(-68.35937480800591%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":80000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}] [{"selector":"#anim-dd9386c6-b7ab-401d-b14d-e8e0a22a47b8","keyframes":{"opacity":[0,1]},"delay":1600,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}]

"We're looking for more intuitive ways to show how AI is making predictions," said Yiqiu "Artie" Shen, who co-led the research.

[{"selector":"#anim-7f0a7fb1-330f-4d6a-a344-5b9af9e68c03 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(-8.169852830971289%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":20000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

"Our suggestion is to use saliency maps," said Shen, referring to the red- and green-tinted heat maps that indicate where the model bases its findings of malignant or benign lesions.

[{"selector":"#anim-4b3c5871-8da7-433b-b5d1-3fa4fd86e373","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-8e0b59c2-576f-44dc-9789-c96eca646b83","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-050aaba5-f169-413e-a190-012085a1168d","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-908619c1-dd56-4530-93a6-d4e2315421db","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b7715443-8d00-41d7-87ec-a13bf4e923dc","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-a56beafa-af8e-4cd6-8c16-c63754e8128d","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b6ea6fb2-e1e5-49b5-a542-90cf592f7e6c","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-04808fc1-fe9c-47df-bfd0-618b434ba284","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b78a7c05-a35e-49a9-801f-1528b4be8139","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-c2f32313-adcb-4b25-969e-db00eeae843b","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-2c3125b5-2056-4e13-8753-8b971cf25c9e","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9259047a-74e7-4f46-823d-04f0d4e290ab","keyframes":{"opacity":[0,1]},"delay":2000,"duration":7000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}]

"This is one of the first ultrasound models to do that with breast cancer," said Farah Shamout, DPhil, who co-led the study.

[{"selector":"#anim-e4c7af16-3e58-4f1f-a0a8-cfe9f40e6e82 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(13.617286530592274%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":20000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

"Maybe the next step would be to give reasons," he said. "Maybe we could develop an AI able to describe its reasoning."

[{"selector":"#anim-2f055a10-5536-4464-9914-64094c1ab721","keyframes":{"opacity":[0,1]},"delay":1600,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-86fd6536-e892-4394-9484-985ef1098916","keyframes":{"opacity":[0,1]},"delay":3200,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7d57a9e9-8d54-4f6e-981c-85bd58d57b9a [data-leaf-element=\"true\"]","keyframes":{"transform":["translate(0%, 0%) scale(1.5)","translate(0%, 0%) scale(1)"]},"delay":0,"duration":30000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"forwards"}]

Related Preprint

[{"selector":"#anim-38d0d241-fcbc-4dcb-a538-2db401c382b5","keyframes":{"opacity":[0,1]},"delay":2000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Shen Y, Shamout FE, Oliver JR, et al. Artificial Intelligence System Reduces False-Positive Findings in the Interpretation of Breast Ultrasound Exams. medRxiv 2021.04.28.21256203 doi: 10.1101/2021.04.28.21256203 Research images, data, and photo of Yiqiu "Artie" Shen courtesy of Yiqiu "Artie" Shen. Photo of Farah Shamout by Sam Hollenshead/NYU Photo Bureau. Body photos by Ivan Stern/Unsplash, Annie Spratt/Unsplash. Abstract illustrations by Oleksii Lishchyshyn/Shutterstock. Text, media editing, and production by Pawel Slabiak. Credits cai2r.net

Related Publication

[{"selector":"#anim-ba171c72-d471-49ee-973e-3cc13ff3cf8c","keyframes":{"opacity":[0,1]},"delay":2000,"duration":5000,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Shen Y, Shamout FE, Oliver JR, et al. Artificial intelligence system reduces false-positive findings in the interpretation of breast ultrasound exams. Nat Commun. 2021 Sep 24;12(1):5645. doi: 10.1038/s41467-021-26023-2 Update September 24, 2021 Research featured in this story has been peer reviewed and published. cai2r.net

AI Can Reduce Overdiagnosis in Ultrasound Screening for Breast Cancer

Breast cancer is the most common and deadliest malignant disease affecting women worldwide.

Screening is fundamental to early detection, found to cut mortality from the disease by about half.

But "dense" breasts, which are at higher risk of cancer, appear opaque on mammograms, making interpretation more difficult.

Most routine screening in developed countries is performed with mammography.

When mammography is hard to decipher, doctors often turn to ultrasound for more information.

Less than 10 percent of which confirm cancer.

But secondary screening with ultrasound results in up to 8 percent more biopsies.

To investigate whether AI can help doctors make fewer false-positive findings, a team of researchers from NYU Langone Health, New York University, and NYU Abu Dhabi created a deep learning model to detect breast cancer in ultrasounds.

Scientists prepared a set of deidentified images and anonymized medical reports from more than 143,000 patients who underwent screening at NYU Langone Health.

To compare the model's performance with that of human experts, the researchers had 10 radiologists read a subset of the exams.

👩🏾‍⚕️👨🏻‍⚕️👩🏻‍⚕️👩🏽‍⚕️👨🏼‍⚕️👨🏾‍⚕️👨🏿‍⚕️👩🏼‍⚕️👨🏽‍⚕️👩‍⚕️

🤖

vs.

In most cases, the AI model agreed with human readers.

But in many cases in which the radiologists erroneously suspected cancer, the AI correctly found none.

Overall, the AI was as accurate as the experts in identifying malignant lesions (it matched radiologists' sensitivity).

The model had a lower tendency to overdiagnose (had higher specificity).

And the AI's positive findings more often correlated with actual cancer cases (had greater positive predictive value).

But with their powers combined, the human experts and the machine learning model achieved even higher specificity and even lower biopsy rates...

...while also identifying cancer cases more accurately than either the AI or the radiologists did on their own.

To accept AI's assistance in the reading room, radiologists need better insight into how deep learning algorithms arrive at recommendations.

For now, the "hybrid model" of human readers and the algorithm is a statistical construct.

"We're looking for more intuitive ways to show how AI is making predictions," said Yiqiu "Artie" Shen, who co-led the research.

Shen, PhD candidate at NYU Center for Data Science, developed weakly supervised machine learning methods for mammography before investigating ultrasound.

"Our suggestion is to use saliency maps," said Shen, referring to the red- and green-tinted heat maps that indicate where the model bases its findings of malignant or benign lesions.

"This is one of the first ultrasound models to do that with breast cancer," said Farah Shamout, DPhil, who co-led the study.

Shamout, assistant professor at NYU Abu Dhabi, investigates AI technolgies and their potential to support decision-making in healthcare.

"Maybe the next step would be to give reasons," he said. "Maybe we could develop an AI able to describe its reasoning."

"There's a lot of potential here," Shamout said. "We can show that a hybrid approach improves performance."

Shen stressed that machine learning research needs to focus more on collaboration and explainability.

Related Preprint

Related Publication