Artificial intelligence (AI) is increasingly being adopted across medical specialties. It has proven to be an effective supporting tool in various fields and at different levels of healthcare management, from diagnosis to treatment and follow-up [1, 2]. Since its first introduction, surgery mediated by a video interface has evolved from a hardly conceivable concept to an efficient tool that can be harnessed to achieve various goals [3]. Surgical videos document intraoperative events more objectively and in greater detail than operative notes alone, enabling accurate review of surgical performance and, eventually, correlation with patient outcomes [4,5,6]. As such, groups in both industry and academia are exploring ways to efficiently record, collect, store, and analyze surgical videos, with the ultimate goal of providing intraoperative decision support to improve surgical quality and, consequently, outcomes [7, 8].

Video analysis per se is an important tool that allows surgeons to gain valuable insights into their surgical practice. However, full video review is extremely time-consuming, and surgeons rarely have the time or the technological tools to perform it routinely. As a result, significant amounts of actionable data are lost [9]. The introduction of AI presents an opportunity to close these gaps in video utilization and enables the surgical field to maintain continuous technological advancement.

Surgical Intelligence platforms available today integrate AI and computer vision tools, allowing automation of surgical video capture and analysis. They transform long, unedited surgical videos into segmented, user-friendly videos that present data on key surgical steps, important intraoperative events, and efficiency metrics. The eventual goal is to provide real-time information on ongoing operations, enabling surgical workflow optimization [10]. These technological capabilities hold great promise for a variety of clinical and quality assurance applications, from education and training to individual analytics and insights, alongside departmental identification of blind spots, efficiency metrics, and more [11].

High-accuracy detection of surgical steps provides an important foundation for these capabilities and has been studied recently across several surgical specialties [12,13,14] and procedures [15,16,17]. Segmenting procedural video into key steps allows for easier navigation of the video, enables the presentation of additional data like intraoperative adverse events and safety achievements, and provides valuable information for workflow optimization. Despite ongoing advancements in this field [17], prior step recognition models have been based on limited datasets. Even when they demonstrate good accuracy levels, these models lack generalizability [18] and are less likely to capture the variability and complexity of step detection in procedures performed by different surgeons, using varied techniques, at facilities of varying professional levels.

Inguinal hernia repair can be considered one of the most relevant fields to test and employ an AI video analysis model, for two main reasons. First, groin hernia repair is one of the most commonly performed operations in general surgery [19, 20]. Second, minimally invasive techniques have been successfully incorporated into inguinal hernia surgery and are used more commonly due to their proven advantage in comparison to open techniques, with fewer postoperative complications and faster recovery times [21, 22]. Still, the choice of surgical technique for groin hernia repair remains controversial due to the technical skills required to perform minimally invasive, as compared to conventional, techniques [19, 23]. Previous studies on automated step detection in hernia surgery focused mainly on transabdominal preperitoneal repair (TAPP) [24, 25]. The aim of this study was to develop, test, and demonstrate the accuracy of a novel computer vision algorithm for automated surgical step detection of laparoscopic totally extraperitoneal (TEP) inguinal hernia repair.

Materials and methods

Dataset

The dataset for this study comprised 619 TEP videos collected between October 25, 2019, and December 13, 2022. A Surgical Intelligence platform (Theator Inc., Palo Alto, CA) was used to automatically capture the videos during surgery and to de-identify and upload procedures to a secure cloud infrastructure. Data were then analyzed retrospectively. Of the videos, 602 were curated from 3 medical centers and 17 from other sources, with a total of 85 expert surgeons participating. The final set of 619 was reached after excluding videos documenting TAPP procedures as well as videos in which the recording quality and/or the number of steps documented were insufficient for analysis.

Surgical video data were organized and annotated (see “Annotation process” section) using a dedicated platform. Videos were divided into two groups based on the type of hernia repair they documented (unilateral or bilateral). Total procedure duration was defined from first camera insertion to the end of the final inspection and final camera extraction. The operative procedure was standard and comprised a succession of predefined steps, from access to the preperitoneal space to its closure after mesh placement [26, 27]. The type of access to the preperitoneal space, either with the aid of balloon dissection or via direct access without the use of a space-maker, was documented. Mesh was systematically used in this series and fixed over the myopectineal orifice (MPO) [3, 28, 29]. All videos were obtained following IRB approval.

Annotation process

In this context, the term annotation refers to the process of manually marking the start and end points of each procedural step. The general annotation process was based on the process developed and validated in a previous study [30].

A team of annotators previously trained with respect to the TEP procedure manually annotated all the videos. To maintain high standards, the process included three levels of validation. First, each video was randomly assigned to a trained annotator who reviewed it and manually assigned each second of the procedure to one of the predefined surgical steps. This was followed by review by a second trained annotator (i.e., validator), who validated the labels with the aim of reducing manual errors and enforcing an aligned annotation process. In cases with unclear workflow or non-typical events, a second validation was conducted by a general surgeon with experience in laparoscopic hernia repair procedures. The manually annotated videos were subsequently used to train and test the computer vision AI algorithm.

Step definitions

The annotation system was based on the recognition of 6 major procedural steps: (1) balloon dissection, (2) access to the preperitoneal space, (3) preperitoneal dissection, (4) hernia sac reduction, (5) mesh placement, and (6) mesh fixation. For bilateral hernias, an additional step, change of focus, designated the switching of sides during the procedure. The defined steps represent a complete framework for TEP, but not every documented procedure includes all of them; the steps are independent of one another and of chronological order. As such, when a step did not occur, it did not interfere with recognition of the other steps.

The term ‘out of body’ indicates video sequences during which the camera was extracted from the optical trocar, for example, to clean the camera or to introduce the mesh through wider access to the abdomen [27]. The start and end points of each step were defined based on technical surgical details (Table 1). A guiding rule for the entire annotation process was, ‘One step ends when the next step starts,’ such that every second of video was labeled as one and only one of the steps. Rules for step recognition were set ahead of the annotation process. Standardized definitions were produced based on pre-existing recommendations, international guidelines, and joint consultation with expert surgeons and AI engineers, taking into account clinically relevant and algorithmically meaningful considerations [26, 27].
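
A minimal sketch, assuming a simple (start, end, step) annotation format rather than the platform's actual schema, of how the "one step ends when the next step starts" rule yields exactly one label per second of video:

```python
# Hypothetical illustration: expand contiguous step annotations into one label per second.
# The (start_sec, end_sec, step_name) format and step names are assumptions for this sketch.

def per_second_labels(annotations, duration_sec):
    """annotations: list of (start_sec, end_sec, step_name), contiguous and non-overlapping."""
    labels = ["unlabeled"] * duration_sec
    for start, end, step in annotations:
        for t in range(start, min(end, duration_sec)):
            labels[t] = step  # every second belongs to one and only one step
    return labels

# Toy example: a 10-second clip in which one step ends exactly when the next starts.
print(per_second_labels([(0, 4, "balloon dissection"),
                         (4, 10, "access to preperitoneal space")], 10))
```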

Table 1 Procedural steps with visual triggers and associated surgical actions

Algorithm architecture

Videos in this study were all processed with the same AI model, which comprised several parts, as follows:

Video preprocessing

Raw video files were processed using FFmpeg. First, audio was removed and videos were encoded with libx264 at 25 frames per second. Then, video width was scaled to 480 pixels, with height set to maintain the aspect ratio of the original video. We considered only video segments from initial insertion to final extraction of the scope, so video footage before or after these timepoints was discarded. In addition, non-relevant frames, such as out-of-body frames recognized during the surgery itself, were de-identified by an image-blurring algorithm to maintain patient privacy and surgical team confidentiality [31].
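
A hedged sketch of this preprocessing stage using standard FFmpeg options invoked from Python; the study's exact invocation and file names are not reported, so the flags and paths below are illustrative only:

```python
# Illustrative preprocessing sketch (assumed filenames; requires ffmpeg on the system PATH).
import subprocess

def preprocess(src: str, dst: str) -> None:
    subprocess.run(
        [
            "ffmpeg", "-i", src,
            "-an",                  # remove audio
            "-c:v", "libx264",      # re-encode with libx264
            "-r", "25",             # 25 frames per second
            "-vf", "scale=480:-2",  # width 480 px; height preserves the aspect ratio
            dst,
        ],
        check=True,
    )

# preprocess("raw_tep_case.mp4", "tep_case_480p.mp4")  # hypothetical file names
```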

Step recognition model

The method utilized in this study, as previously described in Bar et al. [18], consisted of two key modules: (1) a feature extraction model that produced a mathematical representation of each second in the procedural video and (2) a temporal model that learned to predict surgical steps based on the sequence of features.

Generally, videos could be processed either in an end-to-end manner (i.e., frame-by-frame) or as short clips. Often, video-based methods use short clips, which are typically suitable when analyzing short videos. However, surgical videos can be hours long, and in cases like these, end-to-end processing achieves better performance and accuracy while also surveying the entire surgical procedure. In this study, we employed a Video Transformer Network (VTN) [32] as a feature extraction model. VTN processes videos as a sequence of images (video frames) and uses attention modules [33] to focus on specific important cues in the video. These temporal-visual features enable differentiation of important actions that take place in the video (such as the entry of a surgical tool into the visual field) from idle, prolonged segments (such as minutes-long adhesiolysis). When provided with surgical step annotations during training, our model learned to focus on the specific transitions necessary for accurate step recognition.
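
A simplified PyTorch sketch of a VTN-style feature extractor under stated assumptions: a 2D backbone embeds each sampled frame and a temporal self-attention encoder aggregates the frame embeddings into one feature vector per second. The backbone choice, layer sizes, and pooling are illustrative, not the study's actual configuration:

```python
import torch
import torch.nn as nn
from torchvision.models import resnet18

class VTNStyleFeatureExtractor(nn.Module):
    """Simplified VTN-style extractor: per-frame CNN embeddings + temporal self-attention."""
    def __init__(self, feat_dim: int = 512, n_heads: int = 8, n_layers: int = 2):
        super().__init__()
        backbone = resnet18(weights=None)   # spatial backbone per frame (illustrative choice)
        backbone.fc = nn.Identity()         # keep the 512-d frame embedding
        self.backbone = backbone
        layer = nn.TransformerEncoderLayer(d_model=feat_dim, nhead=n_heads, batch_first=True)
        self.temporal = nn.TransformerEncoder(layer, num_layers=n_layers)

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        # frames: (batch, time, 3, H, W) -> one feature vector per video second: (batch, feat_dim)
        b, t, c, h, w = frames.shape
        x = self.backbone(frames.reshape(b * t, c, h, w)).reshape(b, t, -1)
        x = self.temporal(x)                # attention over the frame sequence
        return x.mean(dim=1)                # pool the attended frames into one feature

# feats = VTNStyleFeatureExtractor()(torch.randn(1, 8, 3, 224, 224))  # toy 8-frame input
```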

Previous studies have demonstrated that transfer learning can be utilized for step recognition in several general surgery procedures and that it improves algorithm performance compared to traditional approaches [32]. In this study, we applied transfer learning by using a model that had been pre-trained on multiple general surgery procedures (e.g., cholecystectomy, appendectomy, sleeve gastrectomy) and was then fine-tuned on the TEP videos. Model adaptation to TEP steps was accomplished with a temporal model based on a Long Short-Term Memory (LSTM) network [34]. The LSTM network processes long sequences, taking into account the representation of the current second while maintaining a “memory” of relevant information from previous seconds, which contributes to the model's final predictions.
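
A minimal sketch of the temporal stage under assumptions: per-second features (e.g., from the extractor sketch above) are fed to an LSTM that outputs a step prediction for every second, and training starts from weights pre-trained on other procedures. Dimensions, the seven-class label set, and the checkpoint name are illustrative:

```python
import torch
import torch.nn as nn

class StepLSTM(nn.Module):
    """Illustrative temporal head: per-second features -> per-second step logits."""
    def __init__(self, feat_dim: int = 512, hidden: int = 256, n_steps: int = 7):
        super().__init__()
        self.lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_steps)  # 6 TEP steps + change of focus (assumed label set)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, seconds, feat_dim); the LSTM carries a "memory" of earlier seconds forward
        out, _ = self.lstm(feats)
        return self.head(out)                   # (batch, seconds, n_steps) step logits

# Transfer-learning sketch: initialize from weights trained on other general surgery
# procedures, then fine-tune on the annotated TEP videos (checkpoint name is hypothetical).
# model = StepLSTM()
# model.load_state_dict(torch.load("pretrained_step_model.pt"), strict=False)
```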

Statistical analysis

Performance accuracy, defined as the accuracy of the AI algorithm in automatically recognizing the different steps, was assessed by comparison to the human annotations. Per-second accuracy, calculated by comparing the manual annotation (ground truth) with the model's step prediction for each second, was used to evaluate the model's performance. Accuracy was defined as the ratio between the number of correct predictions and the overall number of seconds. Mean-unweighted accuracy was defined as the average of the individual class accuracies. An error analysis was also performed by identifying the videos with the lowest accuracy in the test set and reviewing them individually. Continuous data were analyzed using the Mann–Whitney U test to examine differences between unilateral and bilateral repairs when appropriate. A p value < 0.05 was considered indicative of statistical significance.
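
A short sketch of these metrics using toy labels (scikit-learn and SciPy assumed): per-second accuracy is the fraction of seconds whose predicted step matches the ground truth, mean-unweighted accuracy is the average of the per-class accuracies, and the Mann–Whitney U test compares a continuous per-video measure (e.g., accuracy or duration) between unilateral and bilateral cases:

```python
import numpy as np
from sklearn.metrics import accuracy_score, balanced_accuracy_score
from scipy.stats import mannwhitneyu

# Toy per-second step labels for a single video (integers index the predefined steps).
y_true = np.array([0, 0, 1, 1, 2, 2, 2, 3, 3, 4])
y_pred = np.array([0, 0, 1, 2, 2, 2, 2, 3, 4, 4])

per_second_acc = accuracy_score(y_true, y_pred)                # correct seconds / total seconds
mean_unweighted_acc = balanced_accuracy_score(y_true, y_pred)  # average of per-class accuracies

# Hypothetical per-video values compared between repair types with the Mann-Whitney U test.
unilateral = [0.91, 0.88, 0.90, 0.87, 0.92]
bilateral = [0.85, 0.89, 0.83, 0.86, 0.88]
stat, p_value = mannwhitneyu(unilateral, bilateral)

print(per_second_acc, mean_unweighted_acc, p_value)
```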

Results

A total of 619 full-length TEP videos were analyzed: 371 were used to train the model, 93 for internal validation, and the remaining 155 as a test set, to evaluate the algorithm accuracy. Among the included cases, 270 were unilateral inguinal repairs (166 training, 46 validation, 58 test) and 349 were bilateral (205 training, 47 validation, 97 test). The mean procedure duration was 37.5 min. The average duration of bilateral procedures was 43 min (std = 21 min, n = 349). The average duration of unilateral procedures was 29 min (std = 17 min, n = 270). There was a statistically significant difference in duration between unilateral and bilateral procedures (p < 0.001). A dissecting balloon was used in 352 cases (57%) [23].

For unilateral repairs, the mean-unweighted accuracy was 89.16% and the overall accuracy was 89.59%, while for bilateral repairs the mean-unweighted accuracy was 83.17% and the overall accuracy was 88.45% (p = 0.0394; see Fig. 1).

Fig. 1
figure 1

Box plots of the accuracy of the test set, with the x-axis indicating procedure type (all, bilateral, unilateral) and the y-axis indicating the accuracy of the model, with respect to manual annotations. Horizontal lines in the box and whisker plots indicate the 25th, 50th, 75th, and 100th percentiles. Dots represent outlying cases with the lowest accuracies

Mesh placement was not performed in eight procedures (1.3%). When mesh placement was performed, it was not fixated in 36 procedures (5.81%) and was fixated in 575 procedures (92.9%). In 566 procedures (98%) fixation was accomplished with tackers, in 5 (0.87%) with glue, and in 4 (0.7%) with both tackers and glue. Per-step accuracy was highest for the hernia sac reduction step (94.3%) and lowest for the preperitoneal dissection step (73%), which was mostly confused with the hernia sac reduction step (25%).

Figure 2A shows the confusion matrix (CM) and Fig. 2B the normalized confusion matrix (NCM) for the model, to demonstrate the model’s performance. Rows correspond to annotated steps (manual annotations, i.e., ground truth) and columns correspond to steps recognized by the model. Values on the diagonal represent correct model detections for each step (counts of seconds in the CM; proportions of time points, i.e., the true positive rate, in the NCM). Values off the diagonal represent time points for which the model detection was incorrect. The model showed true positive rates (diagonal) ranging from 94.3% (for sac reduction) to 72.2% (for preperitoneal dissection).

Fig. 2
figure 2

A Confusion matrix and B NCM presenting per-step accuracy of the automated step detection algorithm in reference to manual annotation in TEP hernia repairs. The matrices indicate the level of concordance between labels predicted by the model (x-axis) and those identified by human annotators (y-axis). Values on the diagonal represent correct detections for each step (counts of seconds in A; percentages, also called the true positive rate, in B). Values not on the diagonal represent the degree of confusion by indicating the proportion of time points in which the model incorrectly labeled one step as another. For example, for the “Preperitoneal Dissection” step, the model and human annotations had an agreement rate of 72.2%. The relatively high values along the diagonal show that the model is highly capable of automatically detecting the TEP steps. Bal Diss balloon dissection, Exper Acc extraperitoneal access (trocars), Per Diss preperitoneal dissection, Hern + SR hernia and sac reduction, Mesh Plac mesh placement, Mesh Fix mesh fixation, COF change of focus
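
For illustration, a row-normalized confusion matrix of this kind can be computed directly from the per-second labels; the data below are toy values and scikit-learn is assumed:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Toy per-second ground-truth and predicted step indices for a short segment.
y_true = np.array([0, 0, 0, 1, 1, 2, 2, 2, 2, 3])
y_pred = np.array([0, 0, 1, 1, 1, 2, 2, 3, 2, 3])

cm = confusion_matrix(y_true, y_pred)                     # counts of seconds, as in the CM
ncm = confusion_matrix(y_true, y_pred, normalize="true")  # rows sum to 1; diagonal = per-step true positive rate
print(np.round(ncm, 2))
```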

Figure 3 shows the NCM for unilateral (A) and bilateral repairs (B). The model shows true positive rates (diagonal) ranging from 94.9% (hernia sac reduction) to 70.7% (preperitoneal dissection) in unilateral repairs and from 94.1% (hernia and sac reduction) to 45.8% (change of focus) in bilateral repairs. Change of focus was mostly confused with preperitoneal dissection (25.1%).

Fig. 3
figure 3

Normalized confusion matrix presenting per-step accuracy of the automated step detection algorithm in reference to manual annotation in unilateral (A) and bilateral (B) TEP hernia repairs. The NCM indicates the level of concordance between labels predicted by the model (x-axis) and those identified by human annotators (y-axis). Values on the diagonal represent the proportion of time points where the detection was accurate for each step. Values not on the diagonal represent the degree of confusion by indicating the proportion of time points in which the model incorrectly labeled one step as another. For example, for the “Preperitoneal Dissection” step, the model and human annotations had agreement rates of 70.7% and 72.6% for unilateral and bilateral procedures, respectively. Bal Diss balloon dissection, Exper Acc extraperitoneal access (trocars), Per Diss preperitoneal dissection, Hern + SR hernia and sac reduction, Mesh Plac mesh placement, Mesh Fix mesh fixation, COF change of focus

Error analysis

Of the 155 videos in the test set, those with the lowest accuracy were identified and reviewed. Among the 97 bilateral repair videos, accuracy was below 70% in 2 videos (2%), between 70 and 75% in 3 videos (3%), and between 75 and 80% in 9 videos (9.2%).

Among the 58 unilateral repair videos, accuracy was below 70% in 1 video (1.7%), between 70 and 75% in 1 video (1.7%), and between 75 and 80% in 2 videos (3.4%).

Two common features were identified in the procedures with the lowest accuracy levels. First, many had an unusual or rare feature or occurrence (a different type of access, a large sac tear making the dissection intraperitoneal, etc.). For example, one case with 68% accuracy was a single-port procedure requiring an unconventional setup. The second feature associated with low accuracy was a very subtle transition between hernia sac reduction and preperitoneal dissection. Alternation between these two steps during the course of surgery might cause inaccuracies in the detected step start times.

Discussion

This study reports on the first AI-based automated recognition of surgical steps in videos of TEP inguinal hernia repair, reaching accuracy as high as 94.3% in recognition of key surgical steps. Our results show that per-step accuracy was highest for the hernia sac reduction step (94.3%). The lowest accuracy was observed for preperitoneal dissection (72.2%), which was mostly confused with hernia sac reduction (25%). This finding is not unexpected, since in the proposed TEP model, the hernia sac reduction step has a very clear trigger point corresponding to the first pulling of the sac. The preperitoneal dissection, however, usually starts in a more subtle way, as blunt dissection, which is less clearly detectable than sharp dissection. The temporal aspect may also contribute to the model’s confusion in this step. Namely, the hernia sac reduction step always takes place after a preperitoneal dissection is complete. The latter step is usually located after the access step, although an extra preperitoneal dissection might be necessary after the sac reduction, possibly disrupting the expected temporal sequence and creating another source of confusion.

An additional source of potential confusion was the transition between these steps, which can be unclear. As some dissection is always done together with the hernia reduction and some is done during trocar insertion, it is possible to mistakenly identify these steps, especially since dissection can occur more than once during the course of surgery.

Finally, the annotation process is “annotator-dependent” and inter-rater variability can be reflected in the annotator-model relationship. Small details are sometimes difficult to discern, even for an expert surgeon, and just as this could generate disagreement between two surgeons, it could be a source of disagreement between the model and the human annotator, hence lowering the accuracy of the model.

For unilateral repairs in this study, the mean-unweighted accuracy was 89.16% and the overall accuracy was 89.59%, while for bilateral repairs, the mean-unweighted accuracy was 83.17% and the overall accuracy was 88.45%. The task of surgical step recognition benefits from the fact that within the same procedure type, different procedures performed by different surgeons are still similarly structured. The procedural flow, the tools used, and the surrounding anatomical context remain similar no matter where the surgery is performed and who is performing it. Having said that, accurate step recognition is challenging due to the need to detect relatively small differences across time—a change in tools, a change in anatomic target, or a difference in how a tool is used.

By definition, AI is a mathematical approach that can be used to effectively reproduce tasks that normally require human intelligence [35]. To date, various researchers have applied several types of machine learning to step recognition, with accuracies ranging from 69 to 86% [17, 18, 36,37,38,39]. Laparoscopic cholecystectomy was the first procedure on which surgical step recognition was studied, given its easy reproducibility, the existence of well-defined safety practices, and its ubiquitous presence in general surgery departments [39,40,41]. Later, Takeuchi et al. first proposed an AI model for step detection in TAPP, with overall accuracies of 88.81% and 85.82% for unilateral and bilateral cases, respectively [22, 24].

The rationale for testing AI models on common procedures is twofold. The first is the ability to access large volumes of data in a relatively short period of time. The second is the need to give quick and useful feedback for the most commonly performed procedures, with the highest achievable accuracy, in order to justify the use of AI in clinical settings. Additionally, from a technological standpoint, this strategy allows us to explore and refine the models before applying them to more complex procedures [34].

Generalizability is one of the key characteristics contributing to the successful application of AI in the surgical field. It refers to the ability to accurately transfer the model’s knowledge to similar procedures, disregarding the many biases present when a variety of operating surgeons and venues are involved. Essentially, the ability to gather as much information as possible allows the model to adapt to new medical centers and surgeons without compromising accuracy [18]. To make this possible in the current study, transfer learning [35] was applied during the model’s training process, meaning that a model that had previously been trained on multiple general surgery procedures (e.g., cholecystectomy, appendectomy, sleeve gastrectomy) was adapted to be used for the TEP videos.

Another challenge in building AI models is the need for correctly labeled, representative data to drive the learning process. In the surgical field, the capture, labeling, and sharing of data are inherently more challenging than in other specialties [9]. As a result, most studies performed to date have been based on limited datasets [18] derived from single institutions [22, 24], and may therefore lack generalizability. The dataset used in the present study was composed of videos from a variety of venues and operating surgeons, providing significantly greater variability for training the model. In addition, the large-scale dataset and independent test set support the reliability and reproducibility of the results obtained.

The introduction of AI into surgery is an answer to the unmet need for standardization of surgical approaches. This need is well represented by quality improvement programs, which have been flourishing in recent years. One of these is the American College of Surgeons (ACS) National Surgical Quality Improvement Program (NSQIP) Quality Verification Program (QVP), a standards-based verification program designed to help sites improve quality across surgical departments [42, 43]. It operates under the motto, “you can’t improve quality if you can’t measure it,” which justifies effective implementation and integration of AI-supported surgical data science to analyze operative metrics in national quality programs.

The ultimate goal for AI, as for every other new technology that has been introduced into medical care, is to improve patient outcomes. Automated step detection facilitates surgical video navigation and review [39, 44]. Structuring the data (e.g., procedure steps and more) from surgical videos and connecting that information to preoperative and postoperative patient data and outcomes can offer novel insights when performed at scale. The step detection method presented in this study was conceived, amongst other use cases, to give surgeons the option of choosing whether to watch an entire procedure or to focus on a particular step. It also allows us to obtain analytics comparing performance across various procedures between a single surgeon and other surgeons from the same department, for example based on level of expertise and previous training. This type of analysis could be routinely used as an integrated tool, for example, in planned department meetings to address critical aspects of operative management that could affect both the efficiency and safety profile of the procedure.

The AI step detection model presented in this study can serve as a foundation for many other potential applications. In 2019, Bodenstedt et al. combined visual data from endoscopic video with surgical device data and trained an algorithm to predict the remaining operative time in a range of laparoscopic surgical procedures [38]. Regarding surgical workflow, benefits could apply to hospital scheduling, as the ability to predict surgical step duration in real time would allow surgical staff to more accurately assess the remaining procedural time [46, 47]. Other aspects of the context-aware operating room are also being investigated. Surgical tool recognition, for example, has been successfully applied to videos of laparoscopic cholecystectomy, with an average classification precision of 93.75% [45].

Surgical video analysis has been used for educational purposes, resulting in efficacy that is equivalent and even superior to conventional methods [46,47,48,49]. It can improve surgical perception and awareness and shorten the learning curve [50]. This is expressed in the approved definition of digital surgery: “the use of technology for the enhancement of preoperative planning, surgical performance, therapeutic support, or training, to improve outcomes and reduce harm” [9].

Lastly but perhaps most importantly, one of the most promising potential use cases of AI-based surgical intelligence platforms is the ability to provide real-time decision support to operating surgeons. This capability requires further work, as future models will need to recognize additional procedural components, such as intraoperative adverse events, and to provide predictions and guidance with very high accuracy.

This study does have several limitations. While the model performs well when faced with outliers, as described above, the surgical steps defined are rather generalized. Future efforts should focus on the production of additional standardized step detection rules to further minimize the variability and improve the reproducibility of the model. Additionally, the change of focus step might have added confusion specific to TEP, as compared to other procedures. The fact that the dataset was composed of both bilateral and unilateral procedures, which differ in workflow, may have reduced the model’s accuracy. Moreover, the videos included in our dataset were derived from different venues and operating surgeons in unequal proportions. Namely, 74.5% of videos were collected from a single medical center, possibly contributing to data bias in terms of surgical techniques.

Conclusions

This study demonstrates that an AI model can provide fully automated TEP video step detection with high accuracy. Accurate step detection constitutes the first step toward gaining video-based insights into the TEP procedure and may eventually enable its standardization and facilitate the prediction and prevention of negative outcomes. Accurate step detection is an important capability in itself, allowing videos to be navigated easily and intraoperative events to be presented in a simpler manner. It also enables extraction of efficiency and quality metrics based on step duration, among other measures. This capability paves the way for further important use cases in surgical training, surgeon self-assessment, performance review, and quality improvement. Future applications include predicting the remaining operative time in any given procedure as well as automating surgical reporting [39]. As machine learning continues to advance rapidly, capabilities will evolve from retrospective analyses to online, real-time analyses, with the holy grail being data-driven intraoperative decision support.