Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans?

Cetera, Giulia Emily; Tozzi, Alberto Eugenio; Chiappa, Valentina; Castiglioni, Isabella; Merli, Camilla Erminia Maria; Vercellini, Paolo

doi:10.3390/jcm13102950

Open AccessReview

Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans?

by

Giulia Emily Cetera

^1,2

,

Alberto Eugenio Tozzi

³

,

Valentina Chiappa

⁴,

Isabella Castiglioni

⁵

,

Camilla Erminia Maria Merli

¹ and

Paolo Vercellini

^1,2,*

¹

Gynecology Unit, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, 20122 Milan, Italy

²

Academic Center for Research on Adenomyosis and Endometriosis, Department of Clinical Sciences and Community Health, Università degli Studi di Milano, 20122 Milan, Italy

³

Predictive and Preventive Medicine Research Unit, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy

⁴

Gynaecologic Oncology, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133 Milan, Italy

⁵

Department of Physics, Università Milano Bicocca, 20126 Milan, Italy

^*

Author to whom correspondence should be addressed.

J. Clin. Med. 2024, 13(10), 2950; https://doi.org/10.3390/jcm13102950

Submission received: 13 March 2024 / Revised: 8 April 2024 / Accepted: 6 May 2024 / Published: 16 May 2024

(This article belongs to the Special Issue Endometriosis and Adenomyosis: Modern Concepts on Clinical Outcomes, Treatment and Management)

Download Review Reports Versions Notes

Abstract

:

Artificial intelligence (AI) is experiencing advances and integration in all medical specializations, and this creates excitement but also concerns. This narrative review aims to critically assess the state of the art of AI in the field of endometriosis and adenomyosis. By enabling automation, AI may speed up some routine tasks, decreasing gynecologists’ risk of burnout, as well as enabling them to spend more time interacting with their patients, increasing their efficiency and patients’ perception of being taken care of. Surgery may also benefit from AI, especially through its integration with robotic surgery systems. This may improve the detection of anatomical structures and enhance surgical outcomes by combining intra-operative findings with pre-operative imaging. Not only that, but AI promises to improve the quality of care by facilitating clinical research. Through the introduction of decision-support tools, it can enhance diagnostic assessment; it can also predict treatment effectiveness and side effects, as well as reproductive prognosis and cancer risk. However, concerns exist regarding the fact that good quality data used in tool development and compliance with data sharing guidelines are crucial. Also, professionals are worried AI may render certain specialists obsolete. This said, AI is more likely to become a well-liked team member rather than a usurper.

Keywords:

artificial intelligence; endometriosis; adenomyosis

1. Introduction

Back in 1954, Roald Dahl wrote The Great Automatic Grammatizator, a story of a young engineer who developed a machine to mass-produce literature. The machine was so successful that the engineer opened a literary agency and bought out real authors, paying them to never write again. At the end of the story, over half of all the novels published in the United Kingdom had been written by the machine and writers who had resisted the temptation of being bought out by the engineer dieds of hunger [1].

In those days, authors feared that the advent of the computer would sabotage artistic writing, in a similar way to how nowadays we are concerned that artificial intelligence (AI) will dehumanize medical care and usurp both physicians and researchers.

However, can AI really be worse than humans? As AI is rapidly being integrated in all medical specializations, time has come to abandon magical thoughts and catastrophic interpretations and to start critically appraising this novel form of technology.

As critical thought stems from knowledge, we will briefly describe what the term AI actually refers to.

A unique definition of AI does not exist. Today, we are witnessing an enormous effort in trying to resolve this issue, in particular due to the implications that AI is having in the availability of innovative solutions on the market. Europe is trying to provide a definition that has legal value with the EU AI ACT, classifying AI technologies according to the risk they pose to the user. For the purpose of this review, we can refer to AI as any machine that can carry out a complex task, which would typically involve biological brainpower (including physical tasks) [2]. Among the variety of different technologies that make up the family of AI, machine learning (ML) is the most basic form. ML consists of teaching computers to perform tasks with a specific goal by using examples instead of explicit programming [3,4,5]. Although we may not know it by its actual name, ML is already part of our everyday life. It is the system used by Netflix to recommend which film we may want to watch; by our email applications to filter spam; or even by our car to assist us in driving and parking [6]. Deep learning (DL) is a subset of ML, which does not require the direct intervention of humans as it learns how to make predictions or decisions through complex algorithms known as artificial neural networks. This makes it practically impossible to understand how the DL model arrives at the answers it provides [7]. DL is particularly useful in the recognition of images and of speech [8]. More complex forms of AI include natural language processing (NLP) models, which are able to comprehend human language and/or unstructured text and transform it into structured data [5], and large language models (LLMs) and NLP tools that can understand and generate human-like language [8]. Both NLP and LLM models learn from vast amounts of data in a self-supervised manner.

ChatGPT is an example of an LLM, which has been trained on data obtained from open sources on the internet [9,10,11]. Although ChatGPT was not developed to provide medical knowledge, other systems such as Google’s Med-PaLM 2 have been specifically trained on medical data for such purposes [12,13].

Considering how the body of medical knowledge required to treat a patient has exponentially grown in the last forty years (while, in 1980, it doubled every seven years, in 2010, the doubling time was less than 75 days), access to a comprehensive and readily available source of such knowledge may support clinical teams, ensuring all patients receive the best possible care [5].

The possible applications of AI to medicine are not limited to providing updated and accessible medical knowledge. ML and NLP technologies may, in fact, help in the screening of clinical and radiological diagnoses; in the choice of medical treatments and in tailoring patients’ management; in surgery; and in the reduction of the administrative burden [9,14,15]. Moreover, when applied to research settings, AI technologies may significantly alleviate the burden of recruitment, monitoring, and insertion of data in clinical trials [16,17].

At the present moment, most applications of AI to health care have been validated in prospective clinical studies with proven diagnostic performance and safety and are yet to be validated in randomized controlled trials [5]. As such, their use is still mostly hypothetical or limited to research settings. However, this form of technology is rapidly evolving and is expected to soon overflow in all medical fields, including gynecology, where some tools are already present [18]. Given this background, we aimed to summarize the available evidence regarding the possible application of AI to the management of women with endometriosis and adenomyosis (Table 1). Whether we like it or not, medicine, like many other aspects of our lives, has taken this direction and it is our responsibility as physicians and researchers to build upon our knowledge in this regard. AI could actually represent the missing element which may help us overcome the limitations in both endometriosis and adenomyosis clinical management and research that human brains have not yet been able to master.

2. Methods

We considered all original studies analyzing the possible role of AI in the management of endometriosis or adenomyosis. Articles were included if they were written in English and were published in peer-reviewed journals. No limit to the year of publication or to the patients’ age was applied. Abstracts, conference papers and articles not reporting original data were excluded.

Articles were identified by a PubMed search that was carried out on 11th December 2023 using the keywords “endometriosis” and “adenomyosis” in combination with “artificial intelligence”, “AI”, ”machine learning”, “deep learning”, “Chat GPT”, “Med-PaLM 2”, “large language models”, and “natural language processing”. References from relevant publications were also screened and further articles were searched using PubMed’s “Similar articles” and “Cited by” functions.

3. Applications of AI to the Clinical Management of Endometriosis and Adenomyosis

3.1. Role in the Formulation of Clinical Diagnoses

Artificial intelligence technologies may represent a novel opportunity to decrease the diagnostic delay that notoriously characterizes both endometriosis and adenomyosis [19]. In fact, by providing updated medical knowledge, accessible both to physicians and to the public, they may ensure clinicians have considered all differential diagnoses that explain patients’ symptoms, as well as increasing patients’ awareness of the disease, deconstructing the common belief that menses are painful by definition [20]. On the one hand, it is of uttermost importance that algorithms are accurate and characterized by high specificity in order to minimize the risk of generating inappropriate referrals and unjustified anxiety. On the other hand, AI technology should be further trained to sort through the myriad of symptoms and information a patient may report, deducing and selecting only the most relevant.

The minimization of pain is a major determinant of diagnostic delay in endometriosis and adenomyosis, and is especially common among teenagers and individuals with superficial endometriosis [21,22]. In the former case, minimization often arises from a low awareness of the disease in young women among clinicians and among patients and their families [23]. In the latter, both insufficient medical knowledge and radiological challenges (superficial endometriosis is not visible by the means of an ultrasonographic examination) play a role [22]. In both cases, AI may represent a game changer.

By encouraging women’s referral to specialists, and by increasing physicians’ medical knowledge, AI may help redefine the epidemiology of endometriosis and adenomyosis. This may be particularly true for adenomyosis, which is rarely diagnosed in adolescents, although it actually appears to be more frequent in this specific population than commonly believed [24].

The accuracy of chatbots in resolving medical cases and in providing accurate medical knowledge has been proven in various studies, although their performance is still far from that expected in clinical settings. When submitting a series of diagnostically challenging medical cases to ChatGPT4, Kanjee and colleagues found that the chatbot gave the right answer in 39% of cases, and included the correct answer among differential diagnoses in 64% of cases [25]. When questioned with the United States Medical Licensing Examination quiz, chatbots with no specialized training passed or nearly passed the test [26]. Astonishingly, in another study comparing the performance of ChatGPT4 with that of medical journal readers in resolving real-life medical cases, the chatbot correctly diagnosed 57% of cases, compared to 36% correct diagnoses given by the journal readers [27]. It must be pointed out that, in the latter study, the population of human journal readers was poorly characterized and that their level of medical skills, as well as the effort they put into answering questions, was unknown. As to what specifically regards gynecology, Ozgor and Simavi recently analyzed the accuracy of ChatGPT in answering questions about endometriosis. As many as 91% of questions were answered accurately, although among the questions based on the ESHRE endometriosis guidelines [28], accuracy was lower (67.5%) [29]. Considering the speed at which LLM technologies are expanding, and, consequently, how fast they may improve in preciseness and accuracy, these results are encouraging.

As well as representing a source of medical knowledge, AI may provide algorithms for the prediction of the likelihood of endometriosis in patients with chronic pelvic pain or with infertility. This will be made possible by the fact AI is able to detect patterns in large volumes of data. Such algorithms will have to be validated on large populations from multiple centers in order for their data to be applicable on a vast scale.

Heterogeneous approaches have been suggested by various study groups to build such algorithms, most of which achieved sensitivity and specificity above 85%. Data used so far to build prediction or diagnostic models include the following: clinical features (age, presence and severity of symptoms, comorbidities, infertility, previous surgery) [30,31,32,33,34]; serum and salivary biomarkers [35,36,37,38]; genomics, transcriptomics, metabolomics, proteomics and methylomics data [39,40,41,42,43,44]; lipidomic data from endometrial fluid [45]; gene, mRNA and proteomic and transcriptomic expression in the endometrium [46,47,48]; mixed data [49,50]; and radiologic images [51,52]. However, the majority of these studies, which have been comprehensively analyzed in Sivajohan and co-workers’ recent review [19], were retrospective, meaning that the models were trained and validated on patient datasets, rather than in vivo on humans. Moreover, the efficacy of AI in predicting and/or diagnosing endometriosis and adenomyosis was not compared with that of existing decision algorithms and of clinical diagnostic tools. Further research is needed in this regard.

3.2. Role in the Formulation of Radiological Diagnoses

As to what specifically concerns the radiological diagnosis of endometriosis and adenomyosis, AI has a great potential to improve its quality by learning to detect anomalies in ultrasonographic and MRI images. This is made possible by the fact AI is able to match imaging findings with previously registered data [2,53,54].

Computer-assisted interpretation of radiological data is already in use in other medical fields and, in some cases, appears to be as accurate as experienced radiologists [55,56]. Moreover, DL methods may improve diagnostic accuracy by eliminating subjectivity, and may provide diagnoses in a few seconds [53]. Various studies have been published on this topic so far; however, most algorithms are lacking adequate validation and generalizability and are currently limited to research purposes [57,58,59].

In the management of endometriosis and adenomyosis, this kind of technology may be of particular aid in distinguishing patients with and without the disease, especially in complex cases with atypical presentations or in settings in which expert radiologists or ultrasound examiners are not available. In their recent pilot study on 50 individuals with a surgical diagnosis of endometriosis, and an equal number of individuals with at least one symptom of endometriosis but without a diagnosis of endometriosis, Balica and co-workers used five different DL methods to aid the sonographic diagnosis of the disease. AI-assisted diagnosis was feasible and efficacious and was able to predict endometriosis with a 90% AUC and 80% accuracy [60].

AI may also prove useful in the differential diagnosis of endometriosis from other benign conditions. In Hu and colleagues’ retrospective study, DL was used to distinguish ovarian endometriosis from tubo-ovarian abscesses (TOAs) on ultrasonographic images [57]. Like endometriomas, TOAs may, in fact, present as hypoechoic avascular cystic masses within the context of a pelvis distorted by adhesions. Astonishingly, when comparing AI’s performance with that of three ultrasound examiners and of the plasma concentrations of carbohydrate antigen 125 (CA 125), DL’s performance was superior [57].

A particular aspect of the ultrasonographic diagnosis of endometriosis is represented by the evaluation of the Pouch of Douglas (POD), which may be obliterated by adhesions between the retrocervix, the anterior wall of the rectum and the uterosacral ligaments [61]. POD obliteration is particularly important to recognize pre-operatively as it increases surgical complexity and the risk of complications, as well as plays a role in the disease’s prognosis [62,63,64]. A sonographic marker of PD obliteration has been described (“sliding sign”); however, it relies on the operator’s expertise and largely depends on inter-observer variability [65,66]. To overcome such limitations, Maicas and colleagues analyzed the ability of DL in defining the state of the POD by classifying transvaginal ultrasound videos depicting positive and negative “sliding signs”. The accuracy, sensitivity and specificity of such a model were all just short of 90%, indicating high diagnostic performance [64].

However, not all studies on this topic have confirmed these encouraging results. When comparing DL’s ability to recognize adenomyosis on uterine ultrasonographic images with that of intermediate skilled trainees, Raimondo and co-workers found that the trainees’ accuracy was higher than DL’s (70% versus 51%). However, DL’s specificity, i.e., the ability to correctly identify healthy uteruses, was higher than the trainees’ (82% versus 69%). The authors concluded that the DL model could prove useful in limiting the over-diagnosis of adenomyosis, although the literature appears to suggest we are facing the opposite problem (under-diagnosis), especially in some categories of patients, first and foremost adolescents [67].

3.3. Role in the Choice of Medical Treatments and in the Customized Management of Patients

It has been estimated that the third leading cause of death in the United States is represented by medical errors [68]. Fortunately, human error in medical practice usually leads to less serious consequences than death; however, even mistakes with less catastrophic consequences require attention.

Errors are an inevitable limitation of human actions, which may arise from distraction, work overload, or lack of knowledge. Prescribing medical treatments for which an individual presents contraindications, overlooking the presence of the interactions between medications in patients with comorbidities, or simply not choosing the most adequate molecule for a given patient are extremely frequent events [68]. Integrating human activity with AI-driven control systems may represent an innovative solution to mitigate the frequency of such errors and limit their consequences in clinical practice.

Monophasic low-dose hormonal contraceptives and progestins are considered first-line options for the treatment of endometriosis, as they have the most favorable safety/efficacy/tolerability/cost profile [28,69]. As to what regards adenomyosis, no guidelines have been approved at the present time, although levonorgestrel-releasing intrauterine devices (LNG-IUDs) appear to be an effective first-line treatment [70,71,72]. Oral progestins, dienogest, in particular, and combined oral contraceptives (COCs) have also been proven to be effective in these patients [73,74,75]. Second-line treatment consists of GnRH analogues, both for endometriosis and for adenomyosis [73]. Surgery is the only therapeutic option in specific cases of endometriosis, including obstructive uropathy; bowel occlusion or subocclusion; ovarian cysts with a diameter greater than 5 cm or suspected of malignancy; and cases in which hormonal therapies are not tolerated or contraindicated [28]. Conversely, for women with adenomyosis, especially during childbearing age, surgery is rarely an option, and is usually limited to hysterectomy in patients in perimenopause [73].

However, no size fits all. Patients’ age, ongoing treatments for other conditions, comorbidities, response to treatment and life plans all strongly influence the choice of medical and surgical treatment. AI may aid in facilitating such choice, starting from the choice of prescribing treatment at all. In fact, not all patients with endometriosis or adenomyosis are promptly prescribed adequate therapy. This applies especially to adolescents, for whom hormonal treatment does not appear to be the standard of care [74], although it could prove particularly beneficial in improving painful symptoms and in reducing the risk of disease progression [70].

The World Health Organization has provided recommendations for contraceptive use in women with medical conditions or medically relevant characteristics, which should be routinely applied by gynecologists to avoid drug–disease interactions. Some of these recommendations are routinely addressed in medical practice, although not all are. Moreover, authors suggest that within the same class of molecules, some hormonal therapies are more adequate than others for specific populations.

AI algorithms based on such recommendations and on the most recent literature may guide physicians in an accurate manner towards the adoption of customized therapies, also providing alerts for drug–drug interactions. For example, AI algorithms may advise and remind clinicians to avoid molecules which have a greater effect on bone mass density (BMD) (dienogest monotherapies, GnRH analogues) in younger women; those with greater androgenic effects (NETA) in women with hyperlipidemia, hypercholesterolemia, or signs of hyperandrogenism; and those associated with a higher risk of thromboembolic events (COCs containing third- and fourth-generation progestins, COCs with ≥ 30 mcg ethinyl estradiol, transdermal patches, vaginal rings) in women with known risk factors. Conversely, they may suggest the adoption of therapies which have none or minor adverse effects on BMD (LNG-IUD, continuous use of COCs, estrogen–progestin transdermal patches, vaginal rings) in adolescents or in women with known risk factors for osteoporosis; those which are less likely to induce adverse serum lipid changes (COCs containing micronized 17β-estradiol (E2), or E2 valerate, or estetrol) in women with hyperlipidemia or hypercholesterolemia; those associated with a reduced risk of venous thromboembolisms (second-generation progestins, LNG-IUD, subdermal implant progestins, COCs containing micronized 17β-E2, or E2 valerate, or estetrol) in those with known risk factors for cardiovascular accidents; and those which are approved as contraceptives (COCs, LNG-IUDs, desogestrel monotherapies and etonogestrel subdermal implants) in women desiring contraception [70,75,76,77,78,79,80,81,82].

AI technologies may not only assist clinicians in the choice of the most adequate treatment for a given patient, but they may also guide clinical decisions by predicting outcomes such as reproductive prognosis and cancer risk. Knowledge regarding their reproductive prognosis may empower patients, enabling them to adjust their life projects around their condition and increasing their perception of being taken care of, ultimately improving their satisfaction and their adherence to treatment [22,83,84]. Risk models helping clinicians predict which patients are more likely to encounter a malignant transformation of endometriosis may help identify who requires a timely surgical treatment [85]. Chao and co-workers recently developed a risk model through ML that can predict the risk of endometriosis-associated ovarian cancer, with sensitivity and specificity both short of 90%. The model was built using clinical characteristics including, among others, age, age at menopause and size of the ovarian cysts. Although it was created within a pilot study, and certainly requires further validation, it is a promising example of how AI may facilitate the early identification of malignant transformation, helping clinicians recognize those patients in which risk-reducing medical or surgical interventions should be carried out [85].

Back in 2005, Awaysheh and co-workers reviewed 97 studies analyzing the effects of computer-based clinical decision support systems in various medical fields (cardiology, general surgery and psychiatrics) and found that such systems improved practitioners’ performance in 64% of studies, as well as improved patients’ outcomes in 13% of studies. Although dated, this study provides encouraging results which can only be improved by the advancing application of AI technology [86].

3.4. Role in Surgical Treatment

Surgery may also benefit from AI, especially through its integration with robotic surgery systems. In fact, DL may assist surgeons by providing real-time guidance in the interpretation of anatomy and in the individuation of endometrial foci (by matching robotic surgery images with pre-operatory imaging data). By combining such information with surgeons’ movements, it may warn operators of the risk of complications at an early stage, and, by comparing surgeons’ movements with the data acquired from expert surgeons, it may also be of aid in training. Moreover, by memorizing such movements, it may convert the entire process to an automated surgery. In fact, AI systems can be programmed to cut tissue or suture with a high degree of precision [2,87,88].

At the present moment, the literature regarding the application of DL to endometriosis and adenomyosis surgery is limited to Hernández and colleagues’ 2022 publication [88]. In their observational study, the authors used DL algorithms to quantify the level of indiocyanine green, and consequently of blood perfusion, in bowel anastomoses during laparoscopic bowel resection for endometriosis. The model obtained 92% accuracy and may represent a first step towards the adoption of AI in surgical settings to increase precision and reduce the frequency of complications [89].

3.5. Role in Reducing the Burden Linked to Administrative Work

The tight link between productivity pressure and burnout is clear. It has been estimated that physicians now spend more than 50% of their time updating electronic health records. Since the advent of COVID-19, they have also been spending an increasing amount of time in their out-of-office hours taking care of the exorbitant volume of electronic communication with patients [22,90,91]. This comes at the expense of efficacious communication, empathy, and clinicians’ psychological wellbeing [22], which all affect patients’ empowerment, satisfaction with treatment, adherence to treatment, symptom perception, and ability to remain integrated in society despite suffering from a chronic condition [92,93,94]. However, the solution may be round the corner. In fact, AI-powered technologies may not only be of aid in providing summaries of large medical records, filtering and drafting medical notes and e-mails, generating laboratory and prescription orders, cataloguing diseases according to their ICD, and scheduling appointments; they may do so at a greater speed than humans and with greater accuracy [10,58]. This would enable physicians to have more time actively interacting with their patients, while reducing working hours, and ultimately reducing their risk of burnout [95]. Machines can actually augment our humanity [12] and this has been proven in Ayers and colleagues’ study, in which ChatGPT was found to respond with higher quality and more empathetic answers to patients’ health care questions compared to physicians [13].

Further administrative work, which may be taken over by AI, includes the management of staff rotations and that of operating room slots. In the latter case, optimizing slots by using systems which are able to predict operating room use time may considerably decrease waiting lists, improving health quality at a national level [5]. Also, by creating large datasets including electronic health records from all medical institutions in a given country, AI may be of aid in the establishment of the transition from a fee-for-service reimbursement model to a value-based care model. In fact, by comparing the indication of medical and surgical treatments with available protocols and guidelines, AI technologies may help identify and reduce low-value health care, where value is considered as the relation between the potential benefits, harms and costs of a given medical intervention [96,97].

4. Applications of AI to Endometriosis and Adenomyosis Research

As soon as privacy issues are resolved, AI technologies may significantly alleviate the burden of patient recruitment, site monitoring and the insertion of data in clinical trials by engaging both with participants and with researchers in chat, audio or avatar modes. From a practical point of view, by being able to adapt to any language and communication style, AI models may interact with patients, answering their questions about recruitment and checking they respond to inclusion criteria, while scheduling appointments and collecting data. As to what regards researchers’ tasks, AI technologies may be of aid in analyzing data, producing reports, interpreting results and searching the literature. This may not only lessen researchers’ workloads, but also considerably reduce the current global expenditures on clinical trials, which are estimated to be more than $50 billion [17,98].

In endometriosis and adenomyosis research, much effort has been made over the past decades to better understand disease pathophysiology. Despite numerous genetics and public health studies being carried out, there have been no breakthroughs in the understanding of these conditions. However, these kinds of studies, which are typically based on a vast amount of data, may be facilitated in the future by the use of AI-based technologies, which are able to classify highly complex data and identify patterns and associations while adjusting for co-exposures [99]. Authors have already started adopting ML for such purposes. In their epidemiological study, Matta and co-workers analyzed the presence of pollutants in the abdominal tissue of 55 women with deep endometriosis and 144 healthy controls [99]. ML models showed a high classification performance and were able to consistently reveal a number of pollutants associated with the disease. Genetic studies analyzing the expression of long non-coding RNAs [100] and of genes [101,102] reported an up-regulation of the genes involved in vasculogenesis, cell proliferation, cell–matrix adhesion, cell differentiation, extra-cellular matrix, remodeling muscle contraction, apoptosis, immune response and chemotaxis.

Electronic medical records are an underutilized source of longitudinal data which could be extracted and analyzed using AI technologies to aid such a field of research [103], along with data obtained from symptom-tracking apps.

As the transition to LLM-augmented medical research advances, the World Association of Medical Editors (WAMEs) has recently alerted editors regarding the possible downfalls of ChatGPT-generated medical writing, including the risk of plagiarism and of false content [104]. WAME recommendations included the following points: chatbots should not be considered as authors, as they cannot take responsibility for their paper; authors should disclose their use of chatbots and provide detailed information on how AI technologies were used; authors should be held responsible for contributions given to their manuscripts by chatbots; and editors should have access to updated tools to recognize AI-generated content [104]. Accordingly, medical journals have started publishing their own guidelines regarding the use of chatbots in medical writing [105].

5. Limitations and Challenges of AI

So far, we have enumerated the numerous ways in which AI may overcome human limitations and, as such, facilitate the work of physicians and clinicians and improve patients’ experience. However, AI also entails limitations and, as such, needs to be strictly regulated in order for it to be used in a safe manner, with respect to ethics and human rights. In this regard, while appraising AI’s potential to improve healthcare worldwide, the World Health Organization has outlined six areas for an efficacious regulation of AI, which all governments should adhere to [106]. Such areas include the following: protecting human autonomy (humans should remain in control of medical decisions and patients should understand the role AI plays in their care); promoting human wellbeing and safety (AI should not result in mental or physical harm to individuals); ensuring transparency and intelligibility (AI technologies should be understandable to developers, users and regulators); fostering responsibility (regulatory principles should be applied to the algorithms); ensuring equity (AI should be accessible worldwide, not only in high-income settings, and AI should not encode biases which may marginalize minority groups); and promoting responsiveness and sustainability (AI should respond to requirements and should be consistent with global efforts to reduce humans’ impact on the environment) [106].

Although it is unlikely physicians’ autonomy in making decisions for their patients will be taken over by AI, patients using LLMs as medical resources may be misled by receiving incorrect information. In fact, LLMs’ human-like conversational style and their apparently authoritative and plausible answers to human prompts easily gain users’ trust [16,107]. This poses safety risks, as AI technologies are surely promising, yet still far from being infallible, especially when answering medical queries [9]. On some occasions, they even make information up, as has been found when asking a chatbot to provide references for its answers [107]. In Goodman and colleagues’ cross-sectional study on the accuracy of chatbot-generated answers to questions based on medical guidelines, the percentage of completely incorrect answers was as high as 8% [9]. This is due to the fact chatbots are not conscious; they simply rearrange existing data which has been previously input in the system, without being able to discriminate between reliable and unreliable sources [98]. LLMs specifically trained on medical guidelines and on the most recent literature are being developed to overcome such pitfalls [108,109]. Moreover, at the present moment, AI technology is still not able to account for certain real-world aspects of making a clinical diagnosis or solving a medical case such as sorting through the myriad of symptoms and information a patient may report and deducing the most salient, as well as understanding nuances linked to the patient’s emotional wellbeing [58]. Such drawbacks of AI may have more serious repercussions on patients’ health if clinicians who use these technologies in their clinical practice are not aware of their limitations, and, as such, do not verify their veracity [5]. Understanding how different AI methods work, and what their competencies and their limitations are, is fundamental to safeguard individuals’ safety. It is the health workers’ duty to keep up to date in this regard to protect both their patients and themselves [19].

As international medical organizations and journals have already started doing, it is essential that regulatory principles for AI application in healthcare are defined at a global level. Guidance is needed as to what regards patient privacy, the protection of sensitive data [14,110] and malpractice liability [111]. Considering that AI applications’ efficiency depends on the quality of the data they are fed, it is urgent to find a balance between protecting sensitive data and feeding AI systems with the most vast and complete amount of data possible. Also, currently, no LLM has been reviewed by the US Food and Drug Administration, nor by its equivalents worldwide. As such, if a patient experiences malpractice as a result of the use of AI-based technology, no one may be liable for the patient’s injury except the clinician, who should always verify AI’s outputs before relying on its use [111].

A further aspect of AI, which will need to be brought to the attention of users, is the risk of introducing, in AI applications, systemic biases which result in the marginalization of minority groups [112]. Training AI models with data that are not representative in toto of the population the algorithm is going to be used on is the basis of such biases [113]. In a recent study exploring the use of an AI model to aid in the interpretation of chest radiographs, the authors noted that, although trained on datasets of thousands of images, the AI model under-diagnosed diseases in underserved and minority groups [114]. This may be the case not only of ethnic minority groups, but of all of those individuals whose data were too scanty or difficult to retrieve for the AI models to be trained on. Access to the internet and to AI facilities is another aspect which should be considered when analyzing the risk of the marginalization of underserved communities.

6. Conclusions

Although human intellect has brought about an astonishing improvement of life expectancy and of the quality of life in the last few centuries, human interventions are intrinsically prone to error. Errors may limit the quality of care and the ability to further improve both clinical research and medical practice. This is where AI may fit in; while it may not be able to reduce human errors to zero, it can reduce them significantly.

Despite the fact most AI applications have not been subject to randomized controlled clinical trials, their potential in improving healthcare is evident. In the management of endometriosis and adenomyosis, AI technology may not only improve patients’ experience by contributing to a reduction of diagnostic delay, guaranteeing a more efficient patient–clinician interaction, enhancing surgical outcomes, and facilitating customized forms of care, but it may also improve physicians’ experience by decreasing the administrative workload, aiding diagnoses and the choice of medical treatments, increasing surgeons’ expertise and facilitating clinical research. At a global level, AI may improve health services’ efficiency and speed up the transition from a fee-for-service reimbursement model to a value-based care mode. Such improvements may occur through AI’s ability to carry out automation, doing so with greater precision and in a fraction of the time compared to humans.

More research is needed before such technologies can be implemented in clinical practice and strict regulation is necessary before these algorithms are introduced in everyday practice. In the meantime, it is our standing as clinicians and researchers to expand our knowledge in this regard, overcoming our skeptical view of machines as usurpers ad recognizing that this form of man-made technology may actually be the solution to overcome men’s intrinsic limitations.

Author Contributions

Conceptualization, P.V. and G.E.C.; writing—original draft preparation, G.E.C.; writing—review and editing, A.E.T., V.C., I.C., C.E.M.M. and P.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to acknowledge the precious contribution of Ernesto and Cameron Sticki to the editing of this manuscript.

Conflicts of Interest

P.V. serves as Associate Editor for Human Reproduction; is a member of the Editorial Board of the Journal of Obstetrics and Gynaecology Canada, of the Italian Journal of Obstetrics and Gynaecology, and of the International Editorial Board of Acta Obstetricia et Gynecologica Scandinavica; has received royalties from Wolters Kluwer for chapters on endometriosis management in the clinical decision support resource UpToDate and maintains both a public and private gynaecological practice. Valentina Chiappa is a Member of Medical Advisory Board in DeepTrace Tech. All other authors declare they have no conflict of interest.

References

Dahl, R. The Great Automatic Grammatizator. In Someone Like You; Penguin: Harmondsworth, UK, 1986; p. 209. [Google Scholar]
Weichert, J.; Welp, A.; Scharf, J.L.; Dracopoulos, C.; Becker, W.H.; Gembicki, M. The Use of Artificial Intelligence in Automation in the Fields of Gynaecology and Obstetrics—An Assessment of the State of Play. Geburtshilfe Frauenheilkd. 2021, 81, 1203–1216. [Google Scholar] [CrossRef]
Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Li, F.-F. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 20–25. [Google Scholar]
Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, B.; et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef]
Sahni, N.R.; Carrus, B. Artificial Intelligence in U.S. Health Care Delivery. N. Engl. J. Med. 2023, 389, 348–358. [Google Scholar]
Lee, J.H.; Kwon, S.Y.; Chang, J.; Yuk, J.S. Machine Learning Approach to find the relation between Endometriosis, benign breast disease, cystitis and non-toxic goiter. Sci. Rep. 2019, 9, 5410. [Google Scholar] [CrossRef]
Babic, B.; Gerke, S.; Evgeniou, T.; Cohen, I.G. Beware explanations from AI in health care. Science 2021, 373, 284–286. [Google Scholar] [CrossRef]
Jeyaraman, M.; Balaji, S.; Jeyaraman, N.; Yadav, S. Unraveling the Ethical Enigma: Artificial Intelligence in Healthcare. Cureus 2023, 15, e43262. [Google Scholar] [CrossRef] [PubMed]
Goodman, R.S.; Patrinely, J.R.; Stone, C.A.; Zimmerman, E.; Donald, R.R.; Chang, S.S.; Berkowitz, S.T.; Finn, A.P.; Jahangir, E.; Scoville, E.A.; et al. Accuracy and Reliability of Chatbot Responses to Physician Questions. JAMA Netw. Open 2023, 6, e2336483. [Google Scholar] [CrossRef] [PubMed]
Lee, P.; Bubeck, S.; Petro, J. Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine. N. Engl. J. Med. 2023, 388, 1233–1239. [Google Scholar] [CrossRef] [PubMed]
Grünebaum, A.; Chervenak, J.; Pollet, S.L.; Katz, A.; Chervenak, F.A. The exciting potential for ChatGPT in obstetrics and gynecology. Am. J. Obstet. Gynecol. 2023, 228, 696–705. [Google Scholar] [CrossRef]
Voelker, R. The Promise and Pitfalls of AI in the Complex World of Diagnosis, Treatment, and Disease Management. JAMA 2023, 330, 1416–1419. [Google Scholar] [CrossRef]
Ayers, J.W.; Poliak, A.; Dredze, M.; Leas, E.C.; Zhu, Z.; Kelley, J.B.; Faix, D.J.; Goodman, A.M.; Longhurst, C.A.; Hogarth, M.; et al. Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum. JAMA Intern. Med. 2023, 183, 589–596. [Google Scholar] [CrossRef] [PubMed]
Minssen, T.; Vayena, E.; Cohen, I.G. The Challenges for Regulating Medical Use of ChatGPT and Other Large Language Models. JAMA 2023, 330, 315–316. [Google Scholar] [CrossRef] [PubMed]
Cutler, D.M. What Artificial Intelligence Means for Health Care. JAMA Health Forum 2023, 4, e232652. [Google Scholar] [CrossRef] [PubMed]
Menz, B.D.; Modi, N.D.; Sorich, M.J.; Hopkins, A.M. Health Disinformation Use Case Highlighting the Urgent Need for Artificial Intelligence Vigilance: Weapons of Mass Disinformation. JAMA Intern. Med. 2024, 184, 92–96. [Google Scholar] [CrossRef] [PubMed]
Hernandez, A.F.; Lindsell, C.J. The Future of Clinical Trials: Artificial to Augmented to Applied Intelligence. JAMA 2023, 330, 2061–2063. [Google Scholar] [CrossRef] [PubMed]
Chiappa, V.; Interlenghi, M.; Bogani, G.; Salvatore, C.; Bertolina, F.; Sarpietro, G.; Signorelli, M.; Ronzulli, D.; Castiglioni, I.; Raspagliesi, F. A decision support system based on radiomics and machine learning to predict the risk of malignancy of ovarian masses from transvaginal ultrasonography and serum CA-125. Eur. Radiol. Exp. 2021, 5, 28. [Google Scholar] [CrossRef] [PubMed]
Sivajohan, B.; Elgendi, M.; Menon, C.; Allaire, C.; Yong, P.; Bedaiwy, M.A. Clinical use of artificial intelligence in endometriosis: A scoping review. npj Digit. Med. 2022, 5, 109. [Google Scholar] [CrossRef]
Centini, G.; Lazzeri, L.; Dores, D.; Pianigiani, L.; Iannone, P.; Stefano, L.; Petraglia, F. Chronic pelvic pain and quality of life in women with and without endometriosis. J. Endometr. Pelvic Pain Disord. 2013, 5, 27–33. [Google Scholar] [CrossRef]
Racine, M.; Dion, D.; Dupuis, G.; Guerriere, D.N.; Zagorski, B.; Choinière, M.; Canadian STOP-PAIN Research Group. The Canadian STOP-PAIN project: The burden of chronic pain-does sex really matter? Clin. J. Pain 2014, 30, 443–452. [Google Scholar] [CrossRef]
Cetera, G.E.; Facchin, F.; Viganò, P.; Merli, C.E.M.M.; Frassineti, A.; Fiorini, J.; Somigliana, E.; Vercellini, P. “SO FAR AWAY” How doctors can contribute to making endometriosis hell on earth. A call for humanistic medicine and empathetic practice for genuine patient-centered care. Int. J. Womens Health 2024, 16, 273–287. [Google Scholar] [CrossRef]
Greene, R.; Stratton, P.; Cleary, S.D.; Ballweg, M.L.; Sinaii, N. Diagnostic experience among 4,334 women reporting surgically diagnosed endometriosis. Fertil. Steril. 2009, 91, 32–39. [Google Scholar] [CrossRef] [PubMed]
Exacoustos, C.; Lazzeri, L.; Martire, F.G.; Russo, C.; Martone, S.; Centini, G.; Piccione, E.; Zupi, E. Ultrasound Findings of Adenomyosis in Adolescents: Type and Grade of the Disease. J. Minim. Invasive Gynecol. 2022, 29, 291–299. [Google Scholar] [CrossRef] [PubMed]
Kanjee, Z.; Crowe, B.; Rodman, A. Accuracy of a Generative Artificial Intelligence Model in a Complex Diagnostic Challenge. JAMA 2023, 330, 78–80. [Google Scholar] [CrossRef] [PubMed]
Kung, T.H.; Cheatham, M.; Medenilla, A.; Sillos, C.; De Leon, L.; Elepaño, C.; Madriaga, M.; Aggabao, R.; Diaz-Candido, G.; Maningo, J.; et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLoS Digit. Health 2023, 2, e0000198. [Google Scholar] [CrossRef] [PubMed]
Eriksen, A.V.; Möller, S.; Ryg, J. Use of GPT-4 to Diagnose Complex Clinical Cases. NEJM AI 2023, 1, AIp2300031. [Google Scholar] [CrossRef]
Becker, C.M.; Bokor, A.; Heikinheimo, O.; Horne, A.; Jansen, F.; Kiesel, L.; King, K.; Kvaskoff, M.; Nap, A.; Petersen, K.; et al. ESHRE guideline: Endometriosis. Hum. Reprod. Open 2022, 2022, hoac009. [Google Scholar] [CrossRef]
Ozgor, B.Y.; Simavi, M.A. Accuracy and reproducibility of ChatGPT’s free version answers about endometriosis. Int. J. Gynaecol. Obstet. 2023, 165, 691–695. [Google Scholar] [CrossRef]
Bendifallah, S.; Puchar, A.; Suisse, S.; Delbos, L.; Poilblanc, M.; Descamps, P.; Golfier, F.; Touboul, C.; Dabi, Y.; Daraï, E. Machine learning algorithms as new screening approach for patients with endometriosis. Sci. Rep. 2022, 12, 639. [Google Scholar] [CrossRef] [PubMed]
Nnoaham, K.E.; Hummelshoj, L.; Kennedy, S.H.; Jenkinson, C.; Zondervan, K.T.; World Endometriosis Research Foundation Women’s Health Symptom Survey Consortium. Developing symptom-based predictive models of endometriosis as a clinical screening tool: Results from a multicenter study. Fertil. Steril. 2012, 98, 692–701. [Google Scholar] [CrossRef]
Tore, U.; Abilgazym, A.; Asunsolo-Del-Barco, A.; Terzic, M.; Yemenkhan, Y.; Zollanvari, A.; Sarria-Santamera, A. Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach. Biomedicines 2023, 11, 3015. [Google Scholar] [CrossRef]
Goldstein, A.; Cohen, S. Self-report symptom-based endometriosis prediction using machine learning. Sci. Rep. 2023, 13, 5499. [Google Scholar] [CrossRef] [PubMed]
Kiser, A.C.; Schliep, K.C.; Hernandez, E.J.; Peterson, C.M.; Yandell, M.; Eilbeck, K. An artificial intelligence approach for investigating multifactorial pain-related features of endometriosis. PLoS ONE 2024, 19, e0297998. [Google Scholar] [CrossRef] [PubMed]
Bendifallah, S.; Dabi, Y.; Suisse, S.; Jornea, L.; Bouteiller, D.; Touboul, C.; Puchar, A.; Daraï, E. MicroRNome analysis generates a blood-based signature for endometriosis. Sci. Rep. 2022, 12, 4051. [Google Scholar] [CrossRef] [PubMed]
Vodolazkaia, A.; El-Aalamat, Y.; Popovic, D.; Mihalyi, A.; Bossuyt, X.; Kyama, C.M.; Fassbender, A.; Bokor, A.; Schols, D.; Huskens, D.; et al. Evaluation of a panel of 28 biomarkers for the non-invasive diagnosis of endometriosis. Hum. Reprod. 2012, 27, 2698–2711. [Google Scholar] [CrossRef] [PubMed]
Zhang, H.; Zhang, H.; Yang, H.; Shuid, A.N.; Sandai, D.; Chen, X. Machine learning-based integrated identification of predictive combined diagnostic biomarkers for endometriosis. Front. Genet. 2023, 14, 1290036. [Google Scholar] [CrossRef] [PubMed]
Dabi, Y.; Suisse, S.; Marie, Y.; Delbos, L.; Poilblanc, M.; Descamps, P.; Golfier, F.; Jornea, L.; Forlani, S.; Bouteiller, D.; et al. New class of RNA biomarker for endometriosis diagnosis: The potential of salivary piRNA expression. Eur. J. Obstet. Gynecol. Reprod. Biol. 2023, 291, 88–95. [Google Scholar] [CrossRef] [PubMed]
Akter, S.; Xu, D.; Nagel, S.C.; Bromfield, J.J.; Pelch, K.E.; Wilshire, G.B.; Joshi, T. GenomeForest: An Ensemble Machine Learning Classifier for Endometriosis. AMIA Jt. Summits Transl. Sci. Proc. 2020, 2020, 33–42. [Google Scholar]
Ghazi, N.; Arjmand, M.; Akbari, Z.; Mellati, A.O.; Saheb-Kashaf, H.; Zamani, Z. [1]H NMR-based metabolomics approaches as non- invasive tools for diagnosis of endometriosis. Int. J. Reprod. Biomed. 2016, 14, 1–8. [Google Scholar] [CrossRef]
Dutta, M.; Joshi, M.; Srivastava, S.; Lodh, I.; Chakravarty, B.; Chaudhury, K. A metabolomics approach as a means for identification of potential biomarkers for early diagnosis of endometriosis. Mol. Biosyst. 2012, 8, 3281–3287. [Google Scholar] [CrossRef]
Braga, D.P.A.F.; Montani, D.A.; Setti, A.S.; Turco, E.G.L.; Oliveira-Silva, D.; Borges, E., Jr. Metabolomic profile as a noninvasive adjunct tool for the diagnosis of Grades III and IV endometriosis-related infertility. Mol. Reprod. Dev. 2019, 86, 1044–1052. [Google Scholar] [CrossRef]
Wang, L.; Zheng, W.; Mu, L.; Zhang, S.Z. Identifying biomarkers of endometriosis using serum protein fingerprinting and artificial neural networks. Int. J. Gynaecol. Obstet. 2008, 101, 253–258. [Google Scholar] [CrossRef] [PubMed]
Zou, L.; Meng, L.; Xu, Y.; Wang, K.; Zhang, J. Revealing the diagnostic value and immune infiltration of senescence-related genes in endometriosis: A combined single-cell and machine learning analysis. Front. Pharmacol. 2023, 14, 1259467. [Google Scholar] [CrossRef] [PubMed]
Domínguez, F.; Ferrando, M.; Díaz-Gimeno, P.; Quintana, F.; Fernández, G.; Castells, I.; Simón, C. Lipidomic profiling of endometrial fluid in women with ovarian endometriosis†. Biol. Reprod. 2017, 96, 772–779. [Google Scholar] [CrossRef]
Li, B.; Wang, S.; Duan, H.; Wang, Y.; Guo, Z. Discovery of gene module acting on ubiquitin-mediated proteolysis pathway by co-expression network analysis for endometriosis. Reprod. Biomed. Online 2021, 42, 429–441. [Google Scholar] [CrossRef]
Fassbender, A.; Verbeeck, N.; Börnigen, D.; Kyama, C.M.; Bokor, A.; Vodolazkaia, A.; Peeraer, K.; Tomassetti, C.; Meuleman, C.; Gevaert, O.; et al. Combined mRNA microarray and proteomic analysis of eutopic endometrium of women with and without endometriosis. Hum. Reprod. 2012, 27, 2020–2029. [Google Scholar] [CrossRef]
Akter, S.; Xu, D.; Nagel, S.C.; Bromfield, J.J.; Pelch, K.; Wilshire, G.B.; Joshi, T. Machine Learning Classifiers for Endometriosis Using Transcriptomics and Methylomics Data. Front. Genet. 2019, 10, 766. [Google Scholar] [CrossRef]
Su, D.; Guo, Y.; Yang, R.; Fang, Z.; Lu, X.; Xu, Q.; Teng, Y.; Sun, H.; Yang, C.; Dong, J.; et al. Identifying a panel of nine genes as novel specific model in endometriosis noninvasive diagnosis. Fertil. Steril. 2024, 121, 323–333. [Google Scholar] [CrossRef]
Blass, I.; Sahar, T.; Shraibman, A.; Ofer, D.; Rappoport, N.; Linial, M. Revisiting the Risk Factors for Endometriosis: A Machine Learning Approach. J. Pers. Med. 2022, 12, 1114. [Google Scholar] [CrossRef] [PubMed]
Guerriero, S.; Pascual, M.; Ajossa, S.; Neri, M.; Musa, E.; Graupera, B.; Rodriguez, I.; Alcazar, J.L. Artificial intelligence (AI) in the detection of rectosigmoid deep endometriosis. Eur. J. Obstet. Gynecol. Reprod. Biol. 2021, 261, 29–33. [Google Scholar] [CrossRef]
Guo, Z.; Feng, P.; Chen, X.; Tang, R.; Yu, Q. Developing Preoperative Nomograms to Predict Any-Stage and Stage III-IV Endometriosis in Infertile Women. Front. Med. 2020, 7, 570483. [Google Scholar] [CrossRef]
Rajpurkar, P.; Lungren, M.P. The Current and Future State of AI Interpretation of Medical Images. N. Engl. J. Med. 2023, 388, 1981–1990. [Google Scholar] [CrossRef] [PubMed]
Ferryman, K.; Mackintosh, M.; Ghassemi, M. Considering Biased Data as Informative Artifacts in AI-Assisted Health Care. N. Engl. J. Med. 2023, 389, 833–838. [Google Scholar] [CrossRef] [PubMed]
Rudolph, J.; Huemmer, C.; Ghesu, F.-C.; Mansoor, A.; Preuhs, A.; Fieselmann, A.; Fink, N.; Dinkel, J.; Koliogiannis, V.; Schwarze, V.; et al. Artificial intelligence in chest radiography reporting accuracy: Added clinical value in the emergency unit setting with- out 24/7 radiology coverage. Investig. Radiol. 2022, 57, 90–98. [Google Scholar] [CrossRef] [PubMed]
Christiansen, F.; Epstein, E.L.; Smedberg, E.; Åkerlund, M.; Smith, K.; Epstein, E. Ultrasound image analysis using deep neural networks for discriminating between benign and malignant ovarian tumors: Comparison with expert subjective assessment. Ultrasound Obstet. Gynecol. 2021, 57, 155–163. [Google Scholar] [CrossRef] [PubMed]
Hu, P.; Gao, Y.; Zhang, Y.; Sun, K. Ultrasound image-based deep learning to differentiate tubal-ovarian abscess from ovarian endometriosis cyst. Front. Physiol. 2023, 14, 1101810. [Google Scholar] [CrossRef] [PubMed]
Kulkarni, P.A.; Singh, H. Artificial Intelligence in Clinical Diagnosis: Opportunities, Challenges, and Hype. JAMA 2023, 330, 317–318. [Google Scholar] [CrossRef] [PubMed]
Littmann, M.; Selig, K.; Cohen-Lavi, L.; Frank, Y.; Hönigschmid, P.; Kataka, E.; Mösch, A.; Qian, K.; Ron, A.; Schmid, S.; et al. Validity of machine learning in biology and medicine increased through collaborations across fields of expertise. Nat. Mach. Intell. 2020, 2, 18–24. [Google Scholar] [CrossRef]
Balica, A.; Dai, J.; Piiwaa, K.; Qi, X.; Green, A.N.; Philips, N.; Egan, S.; Hachihaliloglu, I. Augmenting endometriosis analysis from ultrasound data with deep learning. In Proceedings of the Medical Imaging 2023: Ultrasonic Imaging and Tomography, San Diego, CA, USA, 10 April 2023. [Google Scholar]
Cullen, T.S. Adenomyoma of the rectovaginal septum. JAMA 1914, 62, 835–839. [Google Scholar] [CrossRef]
Purohit, R.; Sharma, J.G.; Meher, D.; Rakh, S.R.; Malik, S. Completion of vaginal hysterectomy by electro surgery using anteroposterior approach in benign cases faced with obliterated posterior cul-de-sac. Int. J. Women’s Health 2018, 10, 529–536. [Google Scholar] [CrossRef]
Leonardi, M.; Martin, E.; Reid, S.; Blanchette, G.; Condous, G. Deep endometriosis transvaginal ultrasound in the workup of patients with signs and symptoms of endometriosis: A cost analysis. BJOG 2019, 126, 1499–1506. [Google Scholar] [CrossRef]
Maicas, G.; Leonardi, M.; Avery, J.; Panuccio, C.; Carneiro, G.; Hull, M.L.; Condous, G. Deep learning to diagnose pouch of Douglas obliteration with ultrasound sliding sign. Reprod. Fertil. 2021, 2, 236–243. [Google Scholar] [CrossRef] [PubMed]
Menakaya, U.; Infante, F.; Lu, C.; Phua, C.; Model, A.; Messyne, F.; Brainwood, M.; Reid, S.; Condous, G. Interpreting the real-time dynamic ‘sliding sign’ and predicting pouch of douglas obliteration: An interobserver, intraobserver, diagnostic-accuracy and learning- curve study. Ultrasound Obstet. Gynecol. 2016, 48, 113–120. [Google Scholar] [CrossRef] [PubMed]
Leonardi, M.; Martins, W.P.; Espada, M.; Georgousopoulou, E.; Condous, G. Prevalence of negative sliding sign representing pouch of douglas obliteration during pelvic transvaginal ultrasound for any indication. Ultrasound Obstet. Gynecol. 2020, 56, 928–933. [Google Scholar] [CrossRef] [PubMed]
Raimondo, D.; Raffone, A.; Aru, A.C.; Giorgi, M.; Giaquinto, I.; Spagnolo, E.; Travaglino, A.; Galatolo, F.A.; Cimino, M.G.C.A.; Lenzi, J.; et al. Application of Deep Learning Model in the Sonographic Diagnosis of Uterine Adenomyosis. Int. J. Environ. Res. Public Health 2023, 20, 1724. [Google Scholar] [CrossRef] [PubMed]
Makary, M.A.; Daniel, M. Medical error-the third leading cause of death in the US. BMJ 2016, 353, i2139. [Google Scholar] [CrossRef] [PubMed]
Vercellini, P.; Bandini, V.; Viganò, P.; Ambruoso, D.; Cetera, G.E.; Somigliana, E. Proposal for targeted, neo-evolutionary-oriented secondary prevention of early-onset endometriosis and adenomyosis. Part II: Medical interventions. Hum. Reprod. 2024, 39, 18–34. [Google Scholar] [CrossRef] [PubMed]
Ozdegirmenci, O.; Kayikcioglu, F.; Akgul, M.A.; Kaplan, M.; Karcaaltincaba, M.; Haberal, A.; Akyol, M. Comparison of levonorgestrel intrauterine system versus hysterectomy on efficacy and quality of life in patients with adenomyosis. Fertil. Steril. 2011, 95, 497–502. [Google Scholar] [CrossRef] [PubMed]
Mansukhani, N.; Unni, J.; Dua, M.; Darbari, R.; Malik, S.; Verma, S.; Bathla, S. Are women satisfied when using levonorgestrel-releasing intrauterine system for treatment of abnormal uterine bleeding? J. Midlife Health 2013, 4, 31–35. [Google Scholar] [CrossRef] [PubMed]
Etrusco, A.; Barra, F.; Chiantera, V.; Ferrero, S.; Bogliolo, S.; Evangelisti, G.; Oral, E.; Pastore, M.; Izzotti, A.; Venezia, R.; et al. Current Medical Therapy for Adenomyosis: From Bench to Bedside. Drugs 2023, 83, 1595–1611. [Google Scholar] [CrossRef] [PubMed]
Pino, I.; Belloni, G.M.; Barbera, V.; Solima, E.; Radice, D.; Angioni, S.; Arena, S.; Bergamini, V.; Candiani, M.; Maiorana, A.; et al. “Better late than never but never late is better”, especially in young women. A multicenter Italian study on diagnostic delay for symptomatic endometriosis. Eur. J. Contracept. Reprod. Health Care 2023, 28, 10–16. [Google Scholar] [CrossRef]
Neriishi, K.; Hirata, T.; Fukuda, S.; Izumi, G.; Nakazawa, A.; Yamamoto, N.; Harada, M.; Hirota, Y.; Koga, K.; Wada-Hiraike, O.; et al. Long-term dienogest administration in patients with symptomatic adenomyosis. J. Obstet. Gynaecol. Res. 2018, 44, 1439–1444. [Google Scholar] [CrossRef]
Osuga, Y.; Watanabe, M.; Hagino, A. Long-term use of dienogest in the treatment of painful symptoms in adenomyosis. J. Obstet. Gynaecol. Res. 2017, 43, 1441–1448. [Google Scholar] [CrossRef] [PubMed]
Berlanda, N.; Somigliana, E.; Viganò, P.; Vercellini, P. Safety of medical treatments for endometriosis. Expert Opin. Drug Saf. 2016, 15, 21–30. [Google Scholar] [CrossRef] [PubMed]
Goshtasebi, A.; Subotic Brajic, T.; Scholes, D.; Beres Lederer Goldberg, T.; Berenson, A.; Prior, J.C. Adolescent use of combined hormonal contraception and peak bone mineral density accrual: A meta-analysis of international prospective controlled studies. Clin. Endocrinol. 2019, 90, 517–524. [Google Scholar] [CrossRef] [PubMed]
Gersten, J.; Hsieh, J.; Weiss, H.; Ricciotti, N.A. Effect of Extended 30 μg Ethinyl Estradiol with Continuous Low-Dose Ethinyl Estradiol and Cyclic 20 μg Ethinyl Estradiol Oral Contraception on Adolescent Bone Density: A Randomized Trial. J. Pediatr. Adolesc. Gynecol. 2016, 29, 635–642. [Google Scholar] [CrossRef] [PubMed]
Lidegaard, Ø.; Nielsen, L.H.; Skovlund, C.W.; Skjeldestad, F.E.; Løkkegaard, E. Risk of venous thromboembolism from use of oral contraceptives containing different progestogens and oestrogen doses: Danish cohort study, 2001-9. BMJ 2011, 343, d6423. [Google Scholar] [CrossRef] [PubMed]
Lidegaard, Ø.; Milsom, I.; Geirsson, R.T.; Skjeldestad, F.E. Hormonal contraception and venous thromboembolism. Acta Obstet. Gynecol. Scand. 2012, 91, 769–778. [Google Scholar] [CrossRef] [PubMed]
Haverinen, A.H.; Luiro, K.M.; Szanto, T.; Kangasniemi, M.H.; Hiltunen, L.; Sainio, S.; Piltonen, T.T.; Lassila, R.; Tapanainen, J.S.; Heikinheimo, O. Combined oral contraceptives containing estradiol valerate vs ethinylestradiol on coagulation: A randomized clinical trial. Acta Obstet. Gynecol. Scand. 2022, 101, 1102–1111. [Google Scholar] [CrossRef] [PubMed]
Champaloux, S.W.; Tepper, N.K.; Monsour, M.; Curtis, K.M.; Whiteman, M.K.; Marchbanks, P.A.; Jamieson, D.J. Use of combined hormonal contraceptives among women with migraines and risk of ischemic stroke. Am. J. Obstet. Gynecol. 2017, 216, 489.e1–489.e7. [Google Scholar] [CrossRef]
Ben-Sira, Z. The function of the professional’s affective behavior in client satisfaction: A revised approach to social interaction theory. J. Health Soc. Behav. 1976, 17, 3–11. [Google Scholar] [CrossRef]
DiMatteo, M.R.; Taranta, A.; Friedman, H.S.; Prince, L.M. Predicting patient satisfaction from physicians’ nonverbal communication skills. Med. Care 1980, 18, 376–387. [Google Scholar] [CrossRef] [PubMed]
Chao, X.; Wang, S.; Lang, J.; Leng, J.; Fan, Q. The application of risk models based on machine learning to predict endometriosis-associated ovarian cancer in patients with endometriosis. Acta Obstet. Gynecol. Scand. 2022, 101, 1440–1449. [Google Scholar] [CrossRef] [PubMed]
Awaysheh, A.; Wilcke, J.; Elvinger, F.; Rees, L.; Fan, W.; Zimmerman, K.L. Review of Medical Decision Support and Machine-Learning Methods. Vet. Pathol. 2019, 56, 512–525. [Google Scholar] [CrossRef] [PubMed]
Loukas, C.; Varytimidis, C.; Rapantzikos, K.; Kanakis, M.A. Keyframe extraction from laparoscopic videos based on visual saliency detection. Comput. Methods Programs Biomed. 2018, 165, 13–23. [Google Scholar] [CrossRef]
Teixeira, J. One Hundred Years of Evolution in Surgery: From Asepsis to Artificial Intelligence. Surg. Clin. N. Am. 2020, 100, XV–XVI. [Google Scholar] [CrossRef] [PubMed]
Hernández, A.; de Zulueta, P.R.; Spagnolo, E.; Soguero, C.; Cristobal, I.; Pascual, I.; López, A.; Ramiro-Cortijo, D. Deep Learning to Measure the Intensity of Indocyanine Green in Endometriosis Surgeries with Intestinal Resection. J. Pers. Med. 2022, 12, 982. [Google Scholar] [CrossRef] [PubMed]
Adler-Milstein, J.; Zhao, W.; Willard-Grace, R.; Knox, M.; Grumbach, K. Electronic health records and burnout: Time spent on the electronic health record after hours and message volume associated with exhaustion but not with cynicism among primary care clinicians. J. Am. Med. Inform. Assoc. 2020, 27, 531–538. [Google Scholar] [CrossRef] [PubMed]
Rotenstein, L.S.; Landman, A.; Bates, D.W. The Electronic Inbox-Benefits, Questions, and Solutions for the Road Ahead. JAMA 2023, 330, 1735–1736. [Google Scholar] [CrossRef]
O’Hara, R.; Rowe, H.; Fisher, J. Managing endometriosis: A cross-sectional survey of women in Australia. J. Psychosom. Obstet. Gynaecol. 2022, 43, 265–272. [Google Scholar] [CrossRef]
Patient empowerment–who empowers whom? Lancet 2012, 379, 1677, Erratum in: Lancet 2012, 380, 650. [CrossRef]
Ellingsen, D.M.; Isenburg, K.; Jung, C.; Lee, J.; Gerber, J.; Mawla, I.; Sclocco, R.; Grahl, A.; Anzolin, A.; Edwards, R.R.; et al. Brain-to-brain mechanisms underlying pain empathy and social modulation of pain in the patient-clinician interaction. Proc. Natl. Acad. Sci. USA 2023, 120, e2212910120. [Google Scholar] [CrossRef] [PubMed]
Hswen, Y.; Abbasi, J. AI Will-and Should-Change Medical School, Says Harvard’s Dean for Medical Education. JAMA 2023, 330, 1820–1823. [Google Scholar] [CrossRef] [PubMed]
Pandya, A. Adding Cost-effectiveness to Define Low-Value Care. JAMA 2018, 319, 1977–1978. [Google Scholar] [CrossRef] [PubMed]
Vercellini, P.; Giudice, L.C.; Evers, J.L.; Abrao, M.S. Reducing low-value care in endometriosis between limited evidence and unresolved issues: A proposal. Hum. Reprod. 2015, 30, 1996–2004. [Google Scholar] [CrossRef] [PubMed]
Jeyaraman, M.; Jeyaraman, N.; Nallakumarasamy, A.; Yadav, S.; Bondili, S.K. ChatGPT in Medical Education and Research: A Boon or a Bane? Cureus 2023, 15, e44316. [Google Scholar] [CrossRef] [PubMed]
Matta, K.; Vigneau, E.; Cariou, V.; Mouret, D.; Ploteau, S.; Le Bizec, B.; Antignac, J.P.; Cano-Sancho, G. Associations between persistent organic pollutants and endometriosis: A multipollutant assessment using machine learning algorithms. Environ. Pollut. 2020, 260, 114066. [Google Scholar] [CrossRef] [PubMed]
Zhou, C.; Zhang, T.; Liu, F.; Zhou, J.; Ni, X.; Huo, R.; Shi, Z. The differential expression of mRNAs and long noncoding RNAs between ectopic and eutopic endometria provides new insights into adenomyosis. Mol. Biosyst. 2016, 12, 362–370. [Google Scholar] [CrossRef] [PubMed]
Yu, L.; Shen, H.; Ren, X.; Wang, A.; Zhu, S.; Zheng, Y.; Wang, X. Multi-omics analysis reveals the interaction between the complement system and the coagulation cascade in the development of endometriosis. Sci. Rep. 2021, 11, 11926. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Lu, Y.; Sun, G.; Fang, Z.; Xing, Z.; Nong, W.; Wei, Y.; Wang, S.; Shi, G.; Dong, M.; et al. Machine learning algorithms for a novel cuproptosis-related gene signature of diagnostic and immune infiltration in endometriosis. Sci. Rep. 2023, 13, 21603. [Google Scholar] [CrossRef]
Giudice, L.C.; Oskotsky, T.T.; Falako, S.; Opoku-Anane, J.; Sirota, M. Endometriosis in the era of precision medicine and impact on sexual and reproductive health across the lifespan and in diverse populations. FASEB J. 2023, 37, e23130. [Google Scholar] [CrossRef]
Zielinski, C.; Winker, M.; Aggarwal, R.; Ferris, L.; Heinemann, M.; Lapeña, J.F.; Pai, S.; Ing, E.; Citrome, L. Chatbots, ChatGPT, and Scholarly Manuscripts: WAME Recommendations on ChatGPT and Chatbots in relation to scholarly publications. Natl. Med. J. India 2023, 36, 1–4. [Google Scholar] [CrossRef] [PubMed]
Flanagin, A.; Bibbins-Domingo, K.; Berkwits, M.; Christiansen, S.L. Nonhuman “Authors” and Implications for the Integrity of Scientific Publication and Medical Knowledge. JAMA 2023, 329, 637–639. [Google Scholar] [CrossRef] [PubMed]
World Health Organizaion. Ethics and Governance of Artificial Intelligence for Health: WHO Guidance; World Health Organization: Geneva, Switzerland, 2021. [Google Scholar]
Haupt, C.E.; Marks, M. AI-Generated Medical Advice-GPT and Beyond. JAMA 2023, 329, 1349–1350. [Google Scholar] [CrossRef] [PubMed]
Singhal, K.; Azizi, S.; Tu, T.; Mahdavi, S.S.; Wei, J.; Chung, H.W.; Scales, N.; Tanwani, A.; Cole-Lewis, H.; Pfohl, S.; et al. Large language models encode clinical knowledge. Nature 2023, 620, 172–180. [Google Scholar] [CrossRef] [PubMed]
Voelker, R.; Hswen, Y. Clinical AI Tools Must Be Fed the Right Data, Stanford Health Care’s Chief Data Scientist Says. JAMA 2023, 330, 2137–2139. [Google Scholar] [CrossRef] [PubMed]
Kanter, G.P.; Packel, E.A. Health Care Privacy Risks of AI Chatbots. JAMA 2023, 330, 311–312. [Google Scholar] [CrossRef] [PubMed]
Duffourc, M.; Gerke, S. Generative AI in Health Care and Liability Risks for Physicians and Safety Concerns for Patients. JAMA 2023, 330, 313–314. [Google Scholar] [CrossRef] [PubMed]
Hswen, Y.; Voelker, R. New AI Tools Must Have Health Equity in Their DNA. JAMA 2023, 330, 1604–1607. [Google Scholar] [CrossRef]
Ueda, D.; Kakinuma, T.; Fujita, S.; Kamagata, K.; Fushimi, Y.; Ito, R.; Matsui, Y.; Nozaki, T.; Nakaura, T.; Fujima, N.; et al. Fairness of artificial intelligence in healthcare: Review and recommendations. Jpn. J. Radiol. 2024, 42, 3–15. [Google Scholar] [CrossRef]
Seyyed-Kalantari, L.; Zhang, H.; McDermott, M.B.A.; Chen, I.Y.; Ghassemi, M. Under- diagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nat. Med. 2021, 27, 2176–2182. [Google Scholar] [CrossRef]

Table 1. Possible applications of AI to the management of endometriosis and adenomyosis.

	Possible Role of AI	Consequences on Disease Management
Formulation of clinical diagnoses	Providing updated medical knowledge accessible to physicians and to the public	It may aid differential diagnosis It may increase patients’ awareness of the disease It may redefine epidemiology
Formulation of clinical diagnoses	Providing algorithms for the prediction of the likelihood of endometriosis in women with CPP or infertility	It may decrease diagnostic delay It may improve clinical outcomes
Formulation of radiological diagnoses	Detection of anomalies in US and MRI images and performance of diagnoses in a few seconds	It may improve efficiency It may assist diagnosis in complex cases or in settings in which specialists are not available
Choice of medical treatments and customized patient management	Recognizing drug-disease and drug–drug interactions	It may reduce the risk of prescription errors
	Prediction of reproductive prognosis and cancer risk	It may improve patient empowerment It may help identify high-risk patients
Surgical treatment	Overlapping robotic surgical evaluations with pre-operatory imaging data	It may assist in the interpretation of anatomy and in lesion detection
	Comparing surgeons’ movements with those of experts	It may be of aid in training
	Warning of the risk of complications	It may improve surgical outcomes
Administrative work	Providing summaries of medical records, filtering and drafting notes and e-mails, generating prescription orders, cataloguing diseases according to their ICD at a greater speed than humans and with greater accuracy	It may reduce physicians’ risk of burnout It may increase patients’ satisfaction
	Managing operation room slots and scheduling appointments	It may decrease waiting lists
	Creating large datasets of electronic health records from all medical institutions	It may help in the transition from a fee-to-service reimbursement model to a value-based care model
Endometriosis and adenomyosis research	Engaging with participants and with researchers in chat, audio or avatar modes.	It may alleviate the burden of patient recruitment, site monitoring and insertion of data
Endometriosis and adenomyosis research	Analyzing vast amounts of data, classifying highly complex data, identifying patterns, producing reports and searching the literature	It may reduce the expenditure on clinical trials It may help improve the understanding of disease pathophysiology

Table legend: CPP: chronic pelvic pain; US: ultrasonographic; MRI: magnetic resonance imaging; ICD: International Classification of Diseases.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cetera, G.E.; Tozzi, A.E.; Chiappa, V.; Castiglioni, I.; Merli, C.E.M.; Vercellini, P. Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans? J. Clin. Med. 2024, 13, 2950. https://doi.org/10.3390/jcm13102950

AMA Style

Cetera GE, Tozzi AE, Chiappa V, Castiglioni I, Merli CEM, Vercellini P. Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans? Journal of Clinical Medicine. 2024; 13(10):2950. https://doi.org/10.3390/jcm13102950

Chicago/Turabian Style

Cetera, Giulia Emily, Alberto Eugenio Tozzi, Valentina Chiappa, Isabella Castiglioni, Camilla Erminia Maria Merli, and Paolo Vercellini. 2024. "Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans?" Journal of Clinical Medicine 13, no. 10: 2950. https://doi.org/10.3390/jcm13102950

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans?

Abstract

1. Introduction

2. Methods

3. Applications of AI to the Clinical Management of Endometriosis and Adenomyosis

3.1. Role in the Formulation of Clinical Diagnoses

3.2. Role in the Formulation of Radiological Diagnoses

3.3. Role in the Choice of Medical Treatments and in the Customized Management of Patients

3.4. Role in Surgical Treatment

3.5. Role in Reducing the Burden Linked to Administrative Work

4. Applications of AI to Endometriosis and Adenomyosis Research

5. Limitations and Challenges of AI

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI