A systematic review on application of deep learning in digestive system image processing

Zhuang, Huangming; Zhang, Jixiang; Liao, Fei

doi:10.1007/s00371-021-02322-z

Survey
Published: 31 October 2021

Volume 39, pages 2207–2222, (2023)

The Visual Computer

Abstract

With the advent of the big data era, the application of artificial intelligence represented by deep learning in medicine has become a hot topic. In gastroenterology, deep learning has achieved remarkable results in endoscopy, imaging, and pathology. Artificial intelligence has been applied to benign gastrointestinal tract lesions, early cancer, tumors, inflammatory bowel diseases, and diseases of the liver, pancreas, and other organs. Computer-aided diagnosis can significantly improve diagnostic accuracy, reduce physicians’ workload, and provide evidence for clinical diagnosis and treatment. In the near future, artificial intelligence will have high application value in the field of medicine. This paper summarizes the latest research on artificial intelligence in diagnosing and treating digestive system diseases and discusses the future of artificial intelligence in this field. We sincerely hope that our work can become a stepping stone for gastroenterologists and computer experts in artificial intelligence research and facilitate the application and development of computer-aided image processing technology in gastroenterology.

1 Introduction

The diagnosis of digestive tract diseases depends on gastrointestinal endoscopy, imaging, and pathology. Deep learning (DL) has been widely applied in these fields. It can automatically build an image recognition system without manually engineered image features and achieve high diagnostic efficiency. In recent years, various advanced algorithms and models for computer-aided diagnosis (CAD) have been proposed, which are expected to reduce doctors’ workload and misdiagnosis rates (Fig. 1).

Artificial intelligence (AI) can be defined as the intelligence displayed by machines that mimic human cognitive functions [1, 2]. Machine learning (ML), a subdomain of AI, refers to algorithms that are trained from data to perform a task rather than directly executing an explicit program. Representation learning (RL) is a sub-category of ML that learns core features and implements algorithms through the autonomous classification of data [3]. DL is a kind of RL. DL acquires feature combinations that reflect the hierarchical structure of the data to provide detailed image classification output. At present, DL represented by convolutional neural networks (CNN) is the most widely used form of AI in medicine [4]. DL technology can extract pathological features by learning from massive clinical data, without features being provided in advance, and support CAD through these pathological features. CAD can significantly reduce clinicians’ workload and assist doctors in making more accurate and rapid diagnoses. Besides, advanced diagnosis and treatment technologies can be shared across a wider region, and medical resources can be rebalanced through CAD.

2 Application of DL in gastrointestinal endoscopy

Digestive endoscopy is an essential method for diagnosing and treating digestive tract diseases and plays a vital role in screening precancerous lesions and early cancers. The detection rate of early precancerous lesions under endoscopy is relatively low, so improving the endoscopic detection rate of early tumors is of great significance for the prognosis of patients with digestive tract tumors. AI-assisted endoscopic diagnosis is expected to improve endoscopists’ detection rate of gastrointestinal lesions and reduce misdiagnosis and missed diagnosis [5]. With the continuous iteration of computer technology and the arrival of the big data era, research on AI-assisted endoscopic diagnosis is flourishing.

DL has been applied in the endoscope-assisted diagnosis of tumors and precancerous lesions of the esophagus [6, 7], stomach [8], small intestine [9], and colorectum [10, 11]. The vast majority of scholars use endoscopic photographs or videos to carry out DL. The number and size of training sets adopted by different studies vary greatly, but most CAD systems’ accuracy in diagnosing tumors or precancerous lesions can exceed 80%.

Due to the lack of large-scale, authoritative public data sets, studies often used single-center endoscopic data. The number of patients is usually less than 100, which limits the accuracy and generalizability of DL and leads to selection bias. To address this, one study improved data utilization and the diagnostic accuracy for Barrett’s esophagus by building an adversarial network [12]. Multi-center randomized controlled trials are the most compelling studies. However, there have been few multi-center prospective studies of AI in gastrointestinal endoscopy so far. Wu and Xu et al. conducted two randomized controlled trials to verify the effectiveness of ENDOANGEL, a CAD system, in white-light imaging (WLI) and image-enhanced endoscopy (IEE) examination of early gastric cancer [13, 14].

Fully supervised CNN training is challenging in endoscopy because it is difficult to obtain depth maps that correspond directly to real endoscopic images. Weakly annotated images may be a cost-effective approach in future. A weakly supervised convolutional neural network (WCNN) can identify abnormal video frames and detect specific pathological points within those frames [15]. In this way, images need only image-level annotations instead of detailed pixel-level annotations. The system automatically analyzes detailed lesion areas from a rough partition, thus achieving favorable detection and localization performance. Mahmood et al. put forward an unsupervised reverse domain adaptation framework to avoid extensive annotation [16]. Their system used adversarial training to remove patient-specific details from real endoscopic images while preserving diagnostic details. However, their research was limited to static image recognition and could not adapt to endoscopic videos recorded in poor light or in scenes of unknown depth. Ozyoruk et al. proposed an unsupervised monocular visual odometry and depth estimation method to solve the problem of frequently changing lighting conditions and scale inconsistency between consecutive frames [17]. The algorithm was optimized with a hybrid loss function and used spatial attention modules to direct the network to focus on tissue areas. Besides, the system used a photometric loss to improve robustness to fast inter-frame illumination changes in endoscopic videos. Itoh et al. performed unsupervised DL by introducing the Lambertian reflection model as an auxiliary task for domain conversion between real and virtual colonoscopy images. The system can accurately extract 3D information, reducing the impact of specular reflection and colon wall texture on depth estimation [18]. Hwang et al. proposed a self-supervised monocular depth estimation method to assess spatio-temporal consistency in the colonic environment by detecting depth differences between adjacent frames [19]. They used a loss function and a depth feedback network to estimate depth information in the next frame from the data of previous frames.

The diagnostic accuracy of narrow-band imaging (NBI) for esophageal disease is higher than that of WLI, but there are few DL studies on NBI at present. Compared with traditional WLI, NBI images show no significant difference in AI diagnostic performance, because NBI improves lesion detection sensitivity but also increases the possibility of overdiagnosis, which reduces diagnostic specificity. However, NBI helps to improve the accuracy of histological diagnostic grading [6]. Moreover, NBI can enhance the ability to differentiate squamous cell carcinoma microvessels [20]. A multi-center study shows that a CAD system based on magnifying endoscopy narrow-band imaging (ME-NBI) matched senior endoscopic physicians’ predictive performance in early gastric cancer. Nevertheless, the system, which used images rather than videos for the study, requires endoscopic magnification of the suspected lesion site before the CAD system can be used. Moreover, the system cannot distinguish the depth of tumor invasion [21].

Colorectal cancer is the third most common cancer in the world [22]. Colorectal adenomas have a 50% chance of malignant transformation, so early detection plays a crucial role in reducing mortality. About a quarter of adenomas are missed during standard colonoscopy [23]. DL therefore has excellent application value in identifying and classifying colorectal polyps. Bora et al. collected WLI and NBI images of the colorectum to address the complex problem of systematic visualization [24]. They used Generic Fourier Descriptors (GFD) to quantify shapes and the Nonsubsampled Contourlet Transform (NSCT) to extract texture and color features, and performed variance analysis to confirm that the GFD and NSCT features of tumors and non-neoplastic polyps were significantly different. After constructing a CNN model, Lai et al. found that both full-color NBI and red-green dual-channel NBI had better sensitivity than WLI in detecting polyps under colonoscopy [25].

Endoscopic ultrasonography (EUS) can improve imaging function and provide various methods for treating biliary tract diseases. Its steep learning curve and over-reliance on operators limit its clinical application in remote areas. Seven et al. used DL to predict the mitotic index of gastrointestinal stromal tumors (GISTs) from EUS images, allowing the system to determine patients’ prognosis automatically [26]. The DL model designed by Yao’s research team can accurately identify the bile duct in EUS and automatically calibrate the anatomical position to measure the bile duct’s diameter, thus significantly improving the accuracy of the operator. Its ability to identify lesions needs to be further developed in future [27].

The practical application of AI in gastrointestinal endoscopy is strongly time-sensitive, so CAD must be integrated into the endoscopy workflow. Uneven light, gas, liquid, and surgical scars are the critical factors affecting the real-time application of AI in endoscopy. Using manually filtered or standardized images for DL may reduce the system’s robustness. Gutierrez et al. collected clinical endoscopic videos of patients with ulcerative colitis from hundreds of different sites using different equipment, significantly increasing the area under the receiver operating characteristic curve (AUROC). Besides, these videos do not need to be annotated by professional endoscopic physicians. The system automatically preprocesses and screens the original endoscopic videos and automatically trains the CNN, significantly reducing clinicians’ workload and the bias caused by manual selection [28].

Confocal laser endoscopy (CLE) can detect various focal lesions with an accuracy close to that of pathological examination. CLE can also dynamically observe lesions under a microscope, so it has great application value in diagnosing and treating inflammatory bowel disease (IBD). However, CLE requires accurate image interpretation, which only experienced endoscopic physicians can provide. Udristoiu’s team designed a DL system that can distinguish between ulcerated and healed Crohn’s disease in CLE images [29]. Still, the algorithm was unable to distinguish active ulcers from inactive ulcers.

Wireless capsule endoscopy (WCE) can move along the entire digestive tract to identify gastrointestinal polyps and other lesions and allows patients to avoid the discomfort of traditional endoscopes. At present, there are two research hotspots: one is how to use DL models to accurately find the lesion site among the thousands of pictures taken by WCE; the other is how to actively control the capsule so that it recognizes and reaches the lesion and administers drugs or takes a biopsy. Up to now, DL has been able to identify intestinal vascular dilatation [30], hemorrhage [31], polyps [32], colorectal tumors [33], and ulcers caused by Crohn’s disease [34] from the tens of thousands of photographs taken during each WCE. However, current research is mainly retrospective, and most of the data sets are composed of still images. Therefore, multi-center prospective studies with large samples are required to verify CNN’s effectiveness in WCE image recognition. Incetan et al. introduced VR technology into WCE. Their group used computed tomography (CT) images to create a 3D organ model and then removed interference such as bone, fat, and skin [35]. The system can precisely reproduce the patient’s real organ, with mucosal texture and vascular network images, to locate the capsule accurately through DL technology. Furthermore, the capsule contains magnets and is controlled by an external robotic arm, making it possible for physicians to observe and perform relevant tasks with the help of WCE (Table 1).

Table 1 Application of DL in gastrointestinal endoscopy

3 Application of DL in digestive system imaging

3.1 Computed tomography

Gastroscopy has been recommended to screen patients with cirrhosis for esophageal and gastric varices. This invasive procedure carries risks such as bleeding. Therefore, some studies suggested that platelet count, spleen length, and the ratio of platelet count to spleen length should be used to determine the shunt degree of esophageal varices and thus evaluate the risk of varices as a non-invasive examination [36]. Ma’s team used DL to assess the CT volume of the liver and spleen in patients with hepatitis B virus-related cirrhosis, combined with patients’ platelet ratio, to perform a computer-assisted assessment of patients’ risk of varices [37].

Zhang et al. established a 3D learning network to evaluate models from a data set of CT images collected from three medical centers, achieving promising performance in gastric tumor edge segmentation and lymph node classification [38]. Another study established a dual-energy computed tomography (DECT) radiology DL model [39]. Based on semi-automatic segmentation of advanced gastric cancer, the model was evaluated for its ability to predict patients’ treatment response during chemotherapy, which may help adjust treatment strategies in time. Due to the small sample size, a separate performance analysis for each chemotherapy regimen was impossible. The DL system developed by Jiang’s research team can predict occult peritoneal metastasis of gastric cancer preoperatively by analyzing CT images, thus reducing the risk of blindly performing extensive total gastrectomy [40]. The next research direction may be the assessment of peritoneal metastasis after neoadjuvant chemotherapy and DL on 3D images of other tumors.

DL also plays a role in the interpretation of CT in patients with cystic pancreatic lesions [41], pancreatic neuroendocrine tumors [42], and pancreatic cancer [43]. It can also achieve automatic localization and boundary segmentation of the pancreas in CT [44]. Because of its high degree of malignancy, pancreatic cancer presents irregular contours and an unclear periphery in CT, making it difficult to demarcate from surrounding tissues. Besides, it is challenging to label CT images manually because of the complex anatomy around the pancreas. Liu et al. manually labeled CT images of pancreatic cancer, improved data utilization by shifting and flipping the images, and reduced the number of convolutional layers to lower the model’s complexity [43]. Besides, they limited the patch size to 50 × 50 pixels, so that patches were neither too small to contain sufficient information about the relationship between the tumor and adjacent tissue nor so large that they added unrelated image interference. As a result, the model achieved a high AUROC in diagnosing pancreatic cancer in patients of different races.
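The shift-and-flip augmentation and fixed-size patching described above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the pipeline of [43]: the helper names `augment` and `extract_patches`, the shift of 2 pixels, the non-overlapping stride, and the random stand-in image are all assumptions made for the example.

```python
import numpy as np

def augment(patch, max_shift=2):
    """Return the patch plus flipped and shifted copies (hypothetical helper)."""
    out = [patch, np.fliplr(patch), np.flipud(patch)]
    for dy in (-max_shift, max_shift):
        for dx in (-max_shift, max_shift):
            # np.roll wraps pixels around the border; a real pipeline would
            # more likely pad or re-crop from the source image instead.
            out.append(np.roll(patch, shift=(dy, dx), axis=(0, 1)))
    return out

def extract_patches(image, size=50, stride=50):
    """Cut size x size patches from a 2D image, dropping incomplete edges."""
    h, w = image.shape
    return [image[y:y + size, x:x + size]
            for y in range(0, h - size + 1, stride)
            for x in range(0, w - size + 1, stride)]

rng = np.random.default_rng(0)
ct_slice = rng.random((200, 150))        # stand-in for one CT slice
patches = extract_patches(ct_slice)      # 4 rows x 3 columns of patches
augmented = [a for p in patches for a in augment(p)]
```

Each original patch yields seven training samples here (the original, two flips, and four shifts), which is how augmentation stretches a small labeled data set.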

3.2 Magnetic resonance imaging

Compared with CT, there are few DL studies on magnetic resonance imaging (MRI). Most current research has focused on the diagnosis of liver, pancreatic, and rectal diseases, such as liver cancer [45], liver fibrosis [46], liver fat segmentation [47], pancreatic tumors [48], and rectal cancer [49]. Abdominal organ segmentation and fat segmentation are the advantages of MRI. Automatic segmentation of high-risk organs has important application value in MRI-guided radiotherapy. The robotic abdominal multi-organ segmentation technology developed by Chen’s team can accurately segment nine abdominal organs with fewer parameters, although its duodenum segmentation needs further improvement [50].

The quantification of human adipose tissue depots can help doctors understand a patient’s health. Belly fat has been linked to high blood pressure, inflammation, and type 2 diabetes [51]. Langner’s multi-center study demonstrated the robustness of their DL model in fat quantification [52]. In recent years, studies have found interactions and pathological similarities between IBD and metabolic disorders, including metabolic tissue disorders, inadequate immune response, and inflammatory response [53]. Patients with non-alcoholic fatty liver diseases (NAFLD) or a high body fat percentage are at higher risk for IBD [54, 55]. Combined with the patient’s clinical symptoms, MRI fat quantification could be applied to CAD of IBD and diabetes in future.

3.3 Positron emission tomography

Positron emission tomography (PET) imaging is commonly used in clinical oncology for diagnosis, staging, restaging, and monitoring of treatment response [56]. Image quality is crucial for visual interpretation and quantitative analysis [57]. Scattered photons that fall outside the receiving energy window are rejected and contribute to attenuation. Within the receiving energy window, the path deviation of scattered photons needs to be corrected. Attenuation and scattering events result in a local decrease or increase of detected counts, which leads to underestimation or overestimation of tracer uptake, respectively. As a result, image contrast is reduced and quantification error is introduced. In PET imaging analysis, CNN has been applied to image reconstruction [58] and image denoising [59]. These technologies will help radiologists produce more accurate PET images without obtaining CT images. While earlier studies were limited to the brain, current studies tend to look at whole-body scans. The DL models of Shiri and Mostafapour can automatically correct attenuation and scatter in PET images [60, 61]. The most conspicuous advantage of their systems is that they do not require anatomical information as input. Nevertheless, they were susceptible to artifacts, leading to misjudgment of organ boundaries, especially between the lungs and liver and between the abdomen and pelvis. To avoid misjudgment in whole-body dynamic PET, appropriate function and kinetic models are required, along with whole-body motion correction.

In the digestive system, PET images are used to help detect lesions in liver CT scans. Using a combination of a generative adversarial network (GAN) and a fully convolutional network (FCN) to generate PET images from CT scans, a research team reduced the false positive rate by 28% [62]. Wang et al. introduced a GAN-based method for generating high-quality PET images from low-dose tracers, thereby reducing the risk from radioactive isotopes [63]. They introduced a 3D-based progressive refinement scheme to improve image quality.

3.4 Ultrasound

Although MRI is accurate and non-invasive, the cost of using MRI to assess liver fat is high, so some research teams want to quantify liver fat by ultrasound. For example, Byra et al. used MRI to obtain the proton density fat fraction (PDFF) of patients and then paired it with the corresponding ultrasound images to train the model, achieving acceptable diagnostic accuracy [64].

Ultrasound is the front line of screening for abdominal diseases. At present, research on the application of DL in ultrasound is gradually increasing. Yang’s team created a mouse model of intestinal inflammation to collect micro-ultrasound (μUS) images of the cecum, small intestine, and colon. Three DL networks were trained to distinguish between healthy tissue and early inflammation tissue [65]. A prospective five-center study applying DL to ultrasound videos of biliary atresia achieved higher diagnostic accuracy than human experts. The research team has also developed a mobile app based on DL of ultrasound pictures, enabling rural doctors in remote areas to perform CAD by taking and uploading photographs of suspected biliary atresia [66].

Hepatic cystic echinococcosis is still endemic in some areas and has five subtypes [67]. The ultrasonic appearance may change naturally over time or in response to treatment, making diagnosis difficult [68]. Although microscopic examination after surgical treatment is the gold standard for diagnosing subtypes and stages of hepatic cystic echinococcosis, accurate ultrasound diagnosis is of great value for patients who can be cured with medical treatment [69]. Wu et al. used three types of CNNs for DL, and because their architectures and feature extraction differed, the final results were not wholly consistent. The three systems complement each other to further improve the model’s accuracy and ultimately enable the exact classification of hepatic cystic echinococcosis under ultrasound [70].

Ultrasonography (US) has crucial diagnostic value for benign and malignant lesions of the liver. Due to the low contrast between lesions and normal liver tissue, the diagnosis of solid lesions is a challenge. Ryu et al. used 4309 US images of focal liver disease, including liver cysts, hemangioma, metastasis, and hepatocellular carcinomas, for DL-based segmentation and classification of focal liver lesions [71]. Contrast-enhanced ultrasound (CEUS) allows real-time scanning and provides dynamic perfusion information, so it has the potential to surpass CT and MRI in liver and gallbladder diseases [72, 73]. Hu’s CEUS system can help junior ultrasound physicians achieve higher sensitivity in diagnosing liver tumors [74].

Imaging is an essential method for the diagnosis of liver diseases. With CT, MRI, and ultrasound, clinicians can accurately determine whether a patient has liver fibrosis, cirrhosis, non-alcoholic fatty liver disease (NAFLD), benign tumors, or hepatocellular carcinoma (HCC). With the development of next-generation sequencing and multi-omics tools, precision medicine can help doctors more comprehensively understand the health status of patients [75, 76]. In future, omics information can be integrated into imaging data to facilitate the development of precision medicine, provide professional health care strategies for patients with sub-health, and design the best diagnosis and treatment plan for patients [77] (Table 2).

Table 2 Application of AI in digestive system imaging

4 Application of AI in digestive pathology

Pathological biopsy is the gold standard for diagnosing benign or malignant diseases of the digestive tract, but the number of pathologists is relatively small, so DL can effectively reduce pathologists’ workload. In recent studies, images at different magnifications were extracted from standardized HE-stained specimens, and affine transformations were used to make up for deficiencies in data sets. Whole slide image (WSI) learning could then be done using these pictures. Standardized images have the advantage of removing stained samples, but retrospective studies can also lead to selection bias, and different staining conditions can affect CAD diagnoses. There have been retrospective studies on DL in the pathological diagnosis and prognosis analysis of Helicobacter pylori gastritis [78], rectal cancer [79], pancreatic tumors [80], and gastrointestinal and endocrine tumors [81]. Prospective, multi-center, and large-scale trials have also begun to verify these algorithms’ usability [82]. However, the CAD results in these studies are generally difficult to interpret. The DL system developed by Ma et al. can distinguish between normal gastric mucosa, chronic gastritis, and gastric cancer. They used visualization techniques to display the DL model’s content and revealed how the AI program extracted the morphological characteristics of gastric mucosal lesions at different stages. Eventually, gastric cancer progression was revealed, and the black-box effect of CAD was reduced [83].

The number of metastatic lymph nodes is an essential determinant of the TNM staging of gastrointestinal malignant tumors and is also one of the most critical factors in evaluating gastric cancer prognosis. The clinical-pathological diagnosis of lymph nodes is influenced by subjective factors and requires much time and effort [84]. Pan’s and Ding’s DL system can quickly detect the number of esophageal and rectal lymph node metastases in a large field of view. However, as their system only supports rectangular annotation, it is not robust for small objects or complex contours [85]. Hu’s model improved on this basis and achieved excellent contour segmentation of single lymph nodes, thus effectively improving the accuracy of lymph node quantification [86]. Wang et al. developed a DL framework to analyze lymph node WSI of patients with gastric carcinoma. The system can accurately identify and segment the lymph node area and then compute the ratio of the tumor area to the mesenteric lymph node area to predict patients’ prognosis. The system even found several poorly differentiated tumor cells missed by pathologists [87]. Kwak’s team also used WSI for DL of lymph node metastasis in patients with colorectal cancer. They found that the peri-tumoral stroma (PTS) score was a reference for predicting the number of lymph node metastases [88].

A large-sample prospective study recently investigated the application of a DL system in the pathological diagnosis of gastric cancer. The algorithm achieved 100% sensitivity and 97% specificity for gastric epithelial-derived tumors, and the vast majority of false positives were due to ulcers or inflammation [89]. However, the system often mistakenly identified GIST as atypical hyperplasia because there was no targeted dataset for non-epithelial tumors. Therefore, it is still worthwhile to establish a new learning model for mesenchymal tumors.

The evaluation of surgical margins is inseparable from prognostic analysis. However, due to the excessively large resolution of WSI images, prognostic analysis based on WSI is often costly [90]. A promising approach is to divide the whole WSI into small pieces and then automatically analyze the prognosis of patients through DL. While the edge between tumors and normal tissues can be delineated manually, labeled tumor areas can also contain normal tissues. Pixel-level annotation can alleviate this problem, but it is a drain on the pathologist's energy. Saillard et al. randomly extracted regions for manual annotation to supplement the DL system. They found that vascular spaces, the macrotrabecular architectural pattern, and a lack of immune infiltration suggested a poor prognosis of HCC. Although highlighting these areas can increase the accuracy of their system, it still does not utilize all pathological information [91]. Weakly supervised learning (WSL) utilizes easily available image-level annotations to infer pixel-level information automatically. Pathologists label a WSI as cancer as long as a small portion of the image contains a cancerous area, marking only the lesion type without specifying the exact location of cancer cells, which greatly reduces the annotation burden and makes WSL particularly applicable to histopathology [92]. Shao et al. divided each WSI into about 1000 patches of 512 × 512 pixels and then used WSL to conduct DL recognition on all images, so as to fully capture the background information of pathological images. The effectiveness of the WSI-level patient prognosis assessment was validated in three cancer datasets from The Cancer Genome Atlas (TCGA) [93]. Due to the large size of WSI images and the small proportion of lesions in some cases, image-level labels make automatic diagnosis difficult. The recalibrated multi-instance deep learning method (RMDL) can automatically find the key instances. The high-precision localization network and recalibrated multi-instance learning were optimized, and the accuracy reached 86.5% [94].
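To make the patch-based, weakly supervised idea above more concrete, the following sketch tiles a slide into 512 × 512 patches and aggregates per-patch scores into a single slide-level score with max pooling, a common baseline form of multi-instance learning. It is not the method of [93] or the RMDL of [94]; `tile_slide`, `slide_score`, and the placeholder `toy_tile_model` are hypothetical names, and the synthetic array stands in for a real WSI.

```python
import numpy as np

def tile_slide(slide, size=512):
    """Split an H x W x C array into non-overlapping size x size tiles,
    dropping incomplete tiles at the right and bottom edges."""
    h, w = slide.shape[:2]
    return [slide[y:y + size, x:x + size]
            for y in range(0, h - size + 1, size)
            for x in range(0, w - size + 1, size)]

def toy_tile_model(tile):
    """Placeholder for a trained patch classifier (assumption for the demo)."""
    return float(tile.mean())

def slide_score(tile_scores):
    """Max-pooling aggregation: the slide is as suspicious as its worst tile,
    which matches an image-level label of 'contains cancer somewhere'."""
    return float(np.max(tile_scores))

slide = np.zeros((2048, 1536, 3), dtype=np.float32)   # synthetic "slide"
slide[512:1024, 512:1024] = 0.9                        # one suspicious region
tiles = tile_slide(slide)
scores = [toy_tile_model(t) for t in tiles]
key_tile = int(np.argmax(scores))                      # the "key instance"
```

Only the slide-level label is needed to train such a model, because the loss is applied to the aggregated score; the tile with the highest score then indicates where the evidence was found.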

Hyperspectral imaging (HSI) is a non-contact, non-contrast, non-invasive optical imaging technique that provides per-pixel spectral and spatial information about the analyzed region. It has been applied to both gastric and colorectal cancer. Jansen-Winkeln et al. combined HSI with AI technology to automatically distinguish colorectal cancer or adenomas from healthy mucosa on specimens. Besides, they used visualization techniques to help clinicians understand the model’s decisions [95]. In future, with increased time efficiency, the technology may be used in the operating room on freshly removed specimens or even integrated into the laparoscope to help surgeons determine the extent of lymph node dissection in real time.

In one study, 12 specimens of GIST were irradiated with near-infrared (NIR) light. NIR irradiation transparency distinguished the specific HSI information of GIST, and the lesion range of GIST was predicted by ML [96]. This technique may be utilized in the prediction of all submucosal tumors in future. However, light is often affected by the specimen’s thickness, and the training set is sometimes too small (Table 3).

Table 3 Application of AI in digestive pathology

5 Major techniques and issues

DL is a kind of ML technique that can recognize highly complex patterns in large data sets. As mentioned above, DL can be broadly divided into supervised learning and unsupervised learning. The most popular architectures in supervised learning are the CNN and the recurrent neural network (RNN) (Fig. 2). In addition, there are the spatial convolutional network (SCN), temporal convolutional network (TCN), and spatio-temporal attention convolutional network (STACN), which are used, respectively, to extract the appearance information of RGB images, capture the motion information of flow fields, and learn the appearance information of areas with significant attention to motion [97]. The latter three methods are used relatively infrequently in medicine.

A CNN is mainly composed of alternating convolutional layers and pooling layers, and each layer contains trainable filter banks [98]. A CNN can continuously learn abstract features and integrate them in the fully connected layer to calculate local weights and generate output values, thus completing tasks [99]. Many of the studies described above designed and optimized their systems by modifying the number of kernels, channels, or filter sizes.
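As a small illustration of the pooling layer mentioned above, the sketch below implements non-overlapping max pooling on a single 2D feature map with NumPy. The function name `max_pool2d` and the 2 × 2 window are assumptions for the example; real networks rely on framework implementations that also handle batches, channels, strides, and padding.

```python
import numpy as np

def max_pool2d(feature_map, k=2):
    """Non-overlapping k x k max pooling on a 2D feature map.
    Rows and columns that do not fill a complete window are dropped."""
    h, w = feature_map.shape
    h, w = h - h % k, w - w % k
    # Reshape so that each k x k window gets its own pair of axes (1 and 3).
    blocks = feature_map[:h, :w].reshape(h // k, k, w // k, k)
    return blocks.max(axis=(1, 3))

fmap = np.arange(16, dtype=float).reshape(4, 4)
pooled = max_pool2d(fmap)   # 4 x 4 input reduced to 2 x 2
```

Pooling halves each spatial dimension here while keeping the strongest response in every window, which is why stacking convolution and pooling layers yields progressively more abstract, lower-resolution features.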

The underlying mathematical expressions of the convolution operation in a CNN are as follows [98]:

$$ y\left( n \right) = x\left( n \right)*\omega \left( n \right) = \mathop \sum \limits_{m = - \infty }^{\infty } x\left( m \right)\omega \left( {n - m} \right), $$

(1)

$$ Y\left( {i,j} \right) = X\left( {i,j} \right)*\omega \left( {i,j} \right) = \mathop \sum \limits_{m} \mathop \sum \limits_{n} X\left( {m,n} \right)\omega \left( {i - m,j - n} \right). $$

(2)
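
As a rough illustration of Eqs. (1) and (2), the sketch below evaluates both sums directly for finite sequences and small matrices. It is not taken from [98]: the function names `conv1d` and `conv2d` are hypothetical, values outside the stored range are assumed to be zero, and plain Python lists stand in for image tensors.

```python
def conv1d(x, w):
    """Full discrete convolution y(n) = sum_m x(m) * w(n - m), as in Eq. (1)."""
    y = [0.0] * (len(x) + len(w) - 1)
    for m, x_val in enumerate(x):
        for k, w_val in enumerate(w):
            # The term x(m) * w(k) contributes to output index n = m + k.
            y[m + k] += x_val * w_val
    return y


def conv2d(X, W):
    """Full 2D convolution Y(i, j) = sum_m sum_n X(m, n) * W(i - m, j - n), as in Eq. (2)."""
    rows = len(X) + len(W) - 1
    cols = len(X[0]) + len(W[0]) - 1
    Y = [[0.0] * cols for _ in range(rows)]
    for m, x_row in enumerate(X):
        for n, x_val in enumerate(x_row):
            for p, w_row in enumerate(W):
                for q, w_val in enumerate(w_row):
                    # X(m, n) * W(p, q) contributes to output (i, j) = (m + p, n + q).
                    Y[m + p][n + q] += x_val * w_val
    return Y


print(conv1d([1, 2, 3], [0, 1, 0.5]))  # [0.0, 1.0, 2.5, 4.0, 1.5]
```

Many DL frameworks compute the unflipped variant (cross-correlation) in their convolutional layers; because the filters are learned, the distinction rarely matters in practice.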

RNNs are designed for the analysis of discrete sequences. Each point in the sequence generates an internal signal that is fed through the neural network to the next layer. The hidden layers preserve information from the observed sequence and update it in real time [100]. Medical reports are typically processed by RNNs. To integrate information from medical reports, it is often necessary to use a hybrid network.

A typical mathematical formulation of an RNN is [100]:

$$ a^{l} \left( t \right) = f^{l} \left( {n^{l} \left( t \right)} \right); $$

(3)

$$ \begin{aligned} n^{1} \left( t \right) = & {\text{IW}}^{1,1} \left[ {p\left( t \right);p\left( {t - 1} \right); \ldots p\left( {t - {\text{TDL}}_{{\text{in}}} } \right)} \right] \\ & + {\text{LW}}^{1,1} \left[ {a^{1} \left( {t - 1} \right); \ldots a^{1} \left( {t - {\text{TDL}}_{{\text{int}}} } \right)} \right] \\ & + {\text{LW}}^{1,2} \left[ {a^{2} \left( {t - 1} \right); \ldots a^{2} \left( {t - {\text{TDL}}_{{\text{int}}} } \right)} \right] \\ & + {\text{LW}}^{1,3} \left[ {a^{3} \left( {t - 1} \right); \ldots a^{3} \left( {t - {\text{TDL}}_{{\text{out}}} } \right)} \right] + \underline {b}^{1} ; \\ \end{aligned} $$

(4)

$$ \begin{aligned} n^{2} \left( t \right) = & {\text{LW}}^{2,1} a^{1} \left( t \right) + {\text{LW}}^{2,2} \left[ {a^{2} \left( {t - 1} \right); \ldots a^{2} \left( {t - {\text{TDL}}_{{\text{int}}} } \right)} \right] \\ & + {\text{LW}}^{2,3} \left[ {a^{3} \left( {t - 1} \right); \ldots a^{3} \left( {t - {\text{TDL}}_{{\text{int}}} } \right)} \right] + \underline {b}^{2} ; \\ \end{aligned} $$

(5)

$$ n^{3} \left( t \right) = {\text{LW}}^{3,2} a^{2} \left( t \right) + {\text{LW}}^{3,3} \left[ {a^{3} \left( {t - 1} \right); \ldots a^{3} \left( {t - {\text{TDL}}_{{\text{int}}} } \right)} \right] + \underline {c} . $$

(6)
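
Equations (3) to (6) describe a three-layer network with tapped delay lines. The sketch below is a much reduced version of the same recurrence, assuming a single recurrent layer, a delay of one step, a zero initial state, and tanh as the activation f. None of these choices come from [100], and `rnn_forward` is a hypothetical name.

```python
import math


def rnn_forward(inputs, iw, lw, b):
    """Run one recurrent layer over a sequence of input vectors.

    Per step: n(t) = IW p(t) + LW a(t-1) + b, then a(t) = tanh(n(t)).
    """
    hidden = len(b)
    a_prev = [0.0] * hidden  # initial state a(0), assumed to be zero
    states = []
    for p in inputs:
        n = [
            sum(iw[i][k] * p[k] for k in range(len(p)))           # input weights
            + sum(lw[i][j] * a_prev[j] for j in range(hidden))    # recurrent weights
            + b[i]                                                # bias
            for i in range(hidden)
        ]
        a_prev = [math.tanh(v) for v in n]
        states.append(a_prev)
    return states


# Two time steps, one input feature, one hidden unit.
states = rnn_forward([[1.0], [0.0]], iw=[[1.0]], lw=[[0.5]], b=[0.0])
print(states)
```

The second state is non-zero even though the second input is zero, which shows how the hidden layer carries information forward through the sequence.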

Although most studies are based on supervised learning with per-pixel annotation, WSL with image-level labels and even unsupervised learning have high application value. WSL uses labeled data to train the entire network and unlabeled data to train the encoders and decoders [101]. The original data for unsupervised learning are images without any expert-annotated labels. A common technique in unsupervised learning is to project the input data into a low-dimensional subspace and then group them. The most common unsupervised method is the GAN, which has been widely used in medical imaging for tasks such as denoising, modality transfer, anomaly detection, and image synthesis [102]. Unsupervised learning also includes auto-encoders (AEs), stacked auto-encoders (SAEs), restricted Boltzmann machines (RBMs), deep belief networks (DBNs), and variational auto-encoders (VAEs) [103]. AEs perform nonlinear dimensionality reduction by learning a compressed representation of the raw input in a low-dimensional space [104]. These techniques have rarely been used in medicine, but because unsupervised learning allows networks to be trained on large amounts of unlabeled data and makes the best use of the available information, it may have broad applications in the future.

A typical mathematical formulation of a GAN is [12]:

$$ \mathop {\min }\limits_{G} \mathop {\max }\limits_{D} \varphi \left( {D,G} \right) = E_{x} \left[ {\log D\left( x \right)} \right] + E_{z} \left[ {\log \left( {1 - D\left( {G\left( z \right)} \right)} \right)} \right], $$

(7)

$$ F_{{{\rm B}_{i} }} \left( {\varphi_{j} ,\phi } \right) = \arg_{k} \min G\left( {\varphi_{j} ,\phi_{k} } \right), $$

(8)

$$ S\left( {{\rm B}_{i} } \right) = \frac{1}{{|{\rm B}_{i} |}}\mathop \sum \limits_{j = 1}^{{\left| {{\rm B}_{i} } \right|}} F_{{{\rm B}_{i} }} \left( {\varphi_{j} ,\phi } \right). $$

(9)
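
To make the min-max objective in Eq. (7) more concrete, the sketch below estimates the value function from a batch of discriminator outputs. It is only an illustration under stated assumptions: `gan_value` is a hypothetical name, the expectations are replaced by sample means, and the discriminator outputs are assumed to lie strictly between 0 and 1.

```python
import math


def gan_value(d_real, d_fake):
    """Estimate E_x[log D(x)] + E_z[log(1 - D(G(z)))] from sample means.

    d_real: discriminator outputs D(x) on real samples.
    d_fake: discriminator outputs D(G(z)) on generated samples.
    """
    real_term = sum(math.log(d) for d in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - d) for d in d_fake) / len(d_fake)
    return real_term + fake_term


# A discriminator that cannot tell real from generated outputs 0.5 everywhere.
print(gan_value([0.5, 0.5], [0.5, 0.5]))
# A confident, correct discriminator yields a larger value.
print(gan_value([0.9, 0.8], [0.1, 0.2]))
```

The discriminator D tries to increase this value, while the generator G tries to decrease it by making D(G(z)) approach the outputs D gives for real data; when D outputs 0.5 everywhere, the value equals -log 4.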

A typical mathematical formulation of an AE is [103]:

$$ h = \sigma \left( {w_{x,h} x + b_{x,h} } \right). $$

(10)
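
Equation (10) is the encoder half of an AE. A minimal sketch follows, assuming σ is the logistic sigmoid and using plain Python lists for the weight matrix; `encode` and the example weights are hypothetical and only show how a 4-dimensional input is mapped to a 2-dimensional code.

```python
import math


def sigmoid(v):
    """Logistic function, assumed here to be the activation sigma in Eq. (10)."""
    return 1.0 / (1.0 + math.exp(-v))


def encode(x, w, b):
    """Compute h = sigma(W x + b) with W given as a list of rows."""
    return [
        sigmoid(sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i)
        for row, b_i in zip(w, b)
    ]


# Hypothetical weights: 4 inputs compressed to 2 hidden units.
w = [[0.5, -0.5, 0.25, 0.0],
     [0.0, 1.0, -1.0, 0.5]]
b = [0.0, 0.1]
h = encode([1.0, 0.0, 2.0, 1.0], w, b)
print(len(h), h)
```

In a full AE, a decoder would map h back to the input space and the weights would be trained to minimize the reconstruction error; that part is omitted here.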

A typical mathematical formulation of an RBM is [103]:

$$ E\left( {x,h} \right) = - h^{T} Wx - c^{T} x - b^{T} h, $$

(11)

$$ p\left( {x,h} \right) = \frac{1}{Z}\exp \left\{ { - E\left( {x,h} \right)} \right\}. $$

(12)

$$ P\left( {h_{j} |x} \right) = \frac{1}{{1 + \exp \left\{ { - b_{j} - W_{j} x} \right\}}}. $$

(13)
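
The link between Eqs. (11) to (13) can be checked numerically. The sketch below assumes binary hidden units and the usual sign convention in which the coupling term enters the energy negatively, E(x, h) = -hᵀWx - cᵀx - bᵀh, since that convention is what makes Eq. (13) follow from Eq. (12). All function names and the example parameters are hypothetical.

```python
import math
from itertools import product


def energy(x, h, W, b, c):
    """E(x, h) = -h^T W x - c^T x - b^T h (coupling sign is an assumption)."""
    coupling = sum(h[j] * W[j][i] * x[i] for j in range(len(h)) for i in range(len(x)))
    visible = sum(c[i] * x[i] for i in range(len(x)))
    hidden = sum(b[j] * h[j] for j in range(len(h)))
    return -coupling - visible - hidden


def p_hidden_on(j, x, W, b):
    """Closed form of Eq. (13): P(h_j = 1 | x) = 1 / (1 + exp(-b_j - W_j x))."""
    activation = b[j] + sum(W[j][i] * x[i] for i in range(len(x)))
    return 1.0 / (1.0 + math.exp(-activation))


def p_hidden_on_by_enumeration(j, x, W, b, c):
    """Same conditional, obtained by summing exp(-E) over all hidden states."""
    numerator = 0.0
    denominator = 0.0
    for h in product([0, 1], repeat=len(b)):
        weight = math.exp(-energy(x, h, W, b, c))  # unnormalized p(x, h), Eq. (12)
        denominator += weight
        if h[j] == 1:
            numerator += weight
    return numerator / denominator


W = [[0.5, -1.0], [2.0, 0.3]]  # hidden x visible
b = [0.1, -0.2]                # hidden biases
c = [0.0, 0.4]                 # visible biases
x = [1, 0]
print(p_hidden_on(0, x, W, b), p_hidden_on_by_enumeration(0, x, W, b, c))
```

The partition function Z cancels in the conditional, which is why the enumeration only needs the unnormalized weights.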

Transfer learning (TL) fine-tunes or retrains an existing DL model using new annotations. Tajbakhsh demonstrated that a pre-trained CNN with fine-tuning is superior to a CNN trained from scratch [105]. Fine-tuning is significantly cheaper than retraining. When the available training set is small, TL can bring a greater performance improvement. So far, most approaches have pre-trained on natural image data. In the future, it may be possible to design cross-domain data sets, for example by applying TL between CT, MRI, ultrasound, and PET.
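
The simplest form of TL can be sketched in a few lines: keep a pre-trained feature extractor frozen and train only a new output layer on the small target data set. The toy example below is an assumption-laden illustration, not the method of [105]; `pretrained_features` is a hypothetical stand-in for a frozen network, and the four data points are invented for the demonstration.

```python
import math


def pretrained_features(x):
    """Hypothetical frozen feature extractor; its parameters are never updated."""
    return [x[0] + x[1], x[0] - x[1]]


def train_head(samples, labels, lr=0.1, epochs=200):
    """Train only a new logistic-regression head on top of the frozen features."""
    w = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            f = pretrained_features(x)
            z = sum(wi * fi for wi, fi in zip(w, f)) + bias
            p = 1.0 / (1.0 + math.exp(-z))
            grad = p - y  # derivative of the log loss with respect to z
            w = [wi - lr * grad * fi for wi, fi in zip(w, f)]
            bias -= lr * grad
    return w, bias


def predict(x, w, bias):
    """Predicted probability of class 1 for input x."""
    f = pretrained_features(x)
    z = sum(wi * fi for wi, fi in zip(w, f)) + bias
    return 1.0 / (1.0 + math.exp(-z))


samples = [(2.0, 0.0), (3.0, 1.0), (0.0, 2.0), (1.0, 3.0)]
labels = [1, 1, 0, 0]
w, bias = train_head(samples, labels)
print([round(predict(x, w, bias), 3) for x in samples])
```

Full fine-tuning would also update the extractor's parameters, usually with a small learning rate; freezing them, as here, is the cheaper option when the target data set is very small.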

Active learning (AL) trains a model interactively: the system selects the samples it expects to be most informative, requests annotations for them, and updates itself based on this feedback. At present, no application of AL in the digestive system has been found, which may be due to the high inherent coupling between AL selection strategies and the model being trained. As a result, the data sets selected at later stages may not be conducive to model training [106].
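
One widely used AL selection strategy, not named in the text and shown here only as an assumed example, is uncertainty sampling: the unlabeled samples on which the current model is least certain are sent for annotation first. The function name `select_most_uncertain` is hypothetical, and a binary classifier with probability outputs is assumed.

```python
def select_most_uncertain(probabilities, k):
    """Return the indices of the k samples whose predicted probability is closest to 0.5."""
    ranked = sorted(range(len(probabilities)), key=lambda i: abs(probabilities[i] - 0.5))
    return ranked[:k]


# Predicted probabilities of "lesion present" for five unlabeled images.
predicted = [0.95, 0.52, 0.10, 0.45, 0.80]
print(select_most_uncertain(predicted, 2))  # [1, 3]
```

Because the ranking depends entirely on the current model's predictions, the selected samples are tied to that model, which illustrates the coupling between selection strategy and model mentioned above.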

6 Overview

6.1 Applications of DL in medicine

AI is becoming increasingly valuable in the early diagnosis of digestive tract diseases. DL systems can significantly reduce the workload of clinicians while maintaining high diagnostic accuracy and system robustness. As public data sets expand, more and more high-quality algorithms will be developed. However, large-sample prospective studies are needed to verify the effectiveness of these algorithms. Although DL has been extensively studied in the image processing of endoscopy, imaging, and pathology of gastrointestinal tract diseases, each auxiliary diagnostic method has its limitations. At present, there is still no CAD system that can comprehensively recognize the image data from different auxiliary examinations. This review summarizes the progress of DL in different auxiliary examinations, in the hope that their data can one day be integrated to improve diagnostic accuracy.

AI can help us better identify endoscopic images or pathological data from a single angle, but perhaps its greatest value is that it can help us break through traditional patterns of thinking, move beyond fixed diagnostic ideas, and open up a broader space for exploration. In the near future, AI may help us implement diagnosis and treatment more flexibly, integrate disciplines more thoroughly, and evaluate conditions more comprehensively. AI can create many new possibilities for our future.

Aslam used CNN analysis to study the characteristics of exhaled gas compounds in patients with gastric cancer, and the diagnostic accuracy for early and advanced gastric cancer reached 97.3% and 98.7%, respectively [107]. With the development of computer technology and the iteration of CNNs, computers will help us find more non-invasive examinations in the future.

Xiao led a prospective multi-center study that used a slit lamp to apply DL to the fundus and iris of patients with several common liver diseases, and it achieved excellent results in identifying liver cancer and chronic cirrhosis. In the future, ophthalmic imaging may be used as a tool for the early screening of liver and biliary diseases [108]. This project is innovative because linking two seemingly distant organs allows this kind of interdisciplinary computer-aided research to uncover previously unknown biological phenomena.

COVID-19 has become a serious public health problem, and companies are racing to develop drugs. Recently, Li's research team used DL models to predict drug-induced liver injury, thereby reducing the cost of clinical drug development and testing [109]. In the future, when facing unfamiliar, suddenly emerging diseases, we may also use DL technology: disease information is entered, and the computer assesses the patient's condition, treatment, and prognosis.

AI has shown its superiority in the early diagnosis of gastrointestinal diseases. However, if clinicians rely too much on AI, images of a specific condition may be repeatedly missed because of limitations of the algorithm or its data set. AI will help clinicians discover potential links between diseases and comprehensively assess the patient's condition and prognosis, but it also requires clinicians to continuously accept, learn, and improve this new technology.

In addition, Wong comprehensively evaluated the severity of non-alcoholic fatty liver disease based on clinical information, including electronic health records, liver biopsies, and liver images [110]. In the future, AI health assessment may not be limited to cross-sectional studies; it could instead collect patients’ dynamic data in more detail and analyze them to produce well-supported professional recommendations.

6.2 Characteristics and Challenges of DL in medicine

Medical image analysis has three main tasks: disease diagnosis, lesion detection, and lesion and organ segmentation. It also includes related tasks such as image reconstruction, image retrieval, and report generation. The digestive field emphasizes the ability to recognize abnormalities such as polyps and early cancer. Because medicine is a human-facing science, DL in medical image processing has its own characteristics and challenges compared with other CV scenarios.

Characteristics:

1. Physiological structures are often irregular and disordered, making it difficult to conceptualize them as matrices. There are multiple stages, such as precancerous lesions, between tumor and normal tissue. Medical judgment is subjective and may vary from doctor to doctor, so extensive expert annotation is required to reach a consensus.

2. Image recognition is readily disturbed by viewpoint, noise, background motion, and illumination changes [111]. In medicine, diagnosis often needs background reference information to achieve higher accuracy. The use of implicit information in biological systems has attracted great attention.

3. The compatibility of DL systems between hospitals needs special attention. Different light sources, resolutions, doctors' skills, and examination habits may affect the accuracy of the judgment.

4. Medicine values the prediction of causal effects so that curative effects can be evaluated in time. Genomics will be more widely used in the future, and DL will become a daily tool for analysis [112].

5. Medical images require high resolution, which makes image analysis costly and time-consuming. In the future, DL models could be trained using cloud computing, with instances deployed at different sites and trained on local data while sharing a common set of parameters, enabling the use of multiple GPUs at a reasonable cost while respecting the privacy of medical data.
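
Item 5 describes sites that train on local data while sharing parameters. One way to realize this, named here as an assumption because the text does not specify a method, is federated averaging: each site sends only its parameters, and a coordinator combines them weighted by the local sample count. The function name and the numbers below are hypothetical.

```python
def federated_average(site_params, site_sizes):
    """Average parameter vectors from several sites, weighted by local sample count."""
    total = sum(site_sizes)
    n_params = len(site_params[0])
    return [
        sum(params[i] * size for params, size in zip(site_params, site_sizes)) / total
        for i in range(n_params)
    ]


# Two hospitals with 100 and 300 local images; only parameters leave each site.
hospital_a = [1.0, 2.0]
hospital_b = [3.0, 6.0]
print(federated_average([hospital_a, hospital_b], [100, 300]))  # [2.5, 5.0]
```

The raw images never leave the hospital in this scheme, which is the property that makes it attractive for medical data privacy.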

Challenges:

1. Most DL applications are considered to be a "black box". Users find it hard to explain, understand, or correct how the model makes predictions. The system needs to explain its conclusions further to gain the trust of doctors and patients.

2. Where is the application boundary of AI? The abuse of DL may infringe personal privacy, disturb natural law, and violate ethics. For example, what should a doctor do when the AI decides that abandoning treatment is the best option?

3. In health care, minor errors can lead to catastrophic results. Further improving accuracy remains a constant challenge and goal for engineers.

4. In classification training for rare diseases, overfitting will occur if one class has many more samples than another. Computer vision techniques can address the overfitting problem. However, model complexity reduction and data augmentation focus only on the target task on a given data set and do not introduce new information into the DL model. Today, introducing information from beyond the given medical data set has become a promising approach to the problem of small medical data sets. In addition to broader collaboration, enhanced data extraction using unsupervised learning and the integration of different DL techniques are likely to mitigate this problem.
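
For the class imbalance described in item 4, one simple mitigation that the text does not discuss is to weight each class inversely to its frequency when computing the training loss. The sketch below is only an illustrative assumption; `balanced_class_weights` is a hypothetical name and the 90/10 split is invented.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# 90 images of a common finding and 10 images of a rare disease.
labels = ["common"] * 90 + ["rare"] * 10
weights = balanced_class_weights(labels)
print(weights["rare"])  # 5.0
```

Like the techniques listed in item 4, re-weighting adds no new information to the model; it only changes how much each existing sample counts during training.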

6.3 Restriction on AI’s clinical application

Currently, three key factors restrict the clinical application of AI: first, the compatibility of each system; second, the daily maintenance and fault handling of the DL system; third, legal liability. When a test result is wrong, errors rooted in the computer system are often difficult to explain from doctors’ experience alone, so who should bear responsibility in the clinical process? Therefore, more multi-center prospective studies should be conducted in the future, and relevant laws and regulations should be improved so that scientific and technological achievements can be translated into practical applications.

AI for digestive system imaging also has limitations of its own:

1. Digestive endoscopy plays a significant role in clinical diagnosis and treatment. Inadequate bowel preparation impairs image recognition and can cause debris to be misdiagnosed as a tumor.

2. Although AI can serve as a second set of eyes for endoscopists, misdiagnosis rates still need to be reduced.

3. The determination of tumor invasion depth depends on EUS, but its accuracy is limited. It is difficult to accurately distinguish the origin of tumors, which complicates the selection of surgical methods.

4. Blind spots are a major cause of missed diagnoses. DL systems that automatically flag blind spots need to be developed further.

5. The digestive system covers a large number of organs and places high demands on lesion localization in imaging. Currently, some systems can segment organs accurately, but localization within organs is not yet accurate enough.

6. During endoscopy, rapid determination of the nature of a polyp or tumor has high application value, but there is still a long way to go.

7 Conclusion

DL will be widely used in the medical field in the near future, especially for image recognition. CAD can significantly narrow the technical gap between physicians, reduce work pressure, and improve patients’ experience. However, there are many technical, ethical, and legal hurdles to overcome before AI is finally used in clinical practice.

References

Min, J.K., Kwak, M.S., Cha, J.M.: Overview of deep learning in gastrointestinal endoscopy. Gut Liver. 13(4), 388–393 (2019). https://doi.org/10.5009/gnl18384
Tizhoosh, H.R., Pantanowitz, L.: Artificial intelligence and digital pathology: challenges and opportunities. J. Pathol. Inform. (2018). https://doi.org/10.4103/jpi.jpi_53_18
Pannala, R., Krishnan, K., Melson, J., Parsi, M.A., Schulman, A.R., Sullivan, S., et al.: Artificial intelligence in gastrointestinal endoscopy. VideoGIE. 5(12), 598–613 (2020). https://doi.org/10.1016/j.vgie.2020.08.013
Jisu, H., Bo-Yong, P., Hyunjin, P.: Convolutional neural network classifier for distinguishing Barrett's esophagus and neoplasia endomicroscopy images. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2892–2895 (2017). https://doi.org/10.1109/EMBC.2017.8037461
Le Berre, C., Sandborn, W.J., Aridhi, S., Devignes, M., Fournier, L., Smaïl-Tabbone, M., et al.: Application of artificial intelligence to gastroenterology and hepatology. Gastroenterology 158(1), 76–94 (2020). https://doi.org/10.1053/j.gastro.2019.08.058
Wang, Y.K., Syu, H.Y., Chen, Y.H., Chung, C.S., Tseng, Y.S., Ho, S.Y., et al.: Endoscopic images by a Single-Shot multibox detector for the identification of early cancerous lesions in the esophagus: a pilot study. Cancers (Basel) (2021). https://doi.org/10.3390/cancers13020321
Struyvenberg, M.R., de Groof, A.J., van der Putten, J., van der Sommen, F., Baldaque-Silva, F., Omae, M., et al.: A computer-assisted algorithm for narrow-band imaging-based tissue characterization in Barrett’s esophagus. Gastrointest. Endosc. 93(1), 89–98 (2021). https://doi.org/10.1016/j.gie.2020.05.050
Zhang, Y., Li, F., Yuan, F., Zhang, K., Huo, L., Dong, Z., et al.: Diagnosing chronic atrophic gastritis by gastroscopy using artificial intelligence. Dig. Liver Dis. 52(5), 566–572 (2020). https://doi.org/10.1016/j.dld.2019.12.146
Saito, H., Aoki, T., Aoyama, K., Kato, Y., Tsuboi, A., Yamada, A., et al.: Automatic detection and classification of protruding lesions in wireless capsule endoscopy images based on a deep convolutional neural network. Gastrointest. Endosc. 92(1), 144–151 (2020). https://doi.org/10.1016/j.gie.2020.01.054
Zhou, G., Xiao, X., Tu, M., Liu, P., Yang, D., Liu, X., et al.: Computer aided detection for laterally spreading tumors and sessile serrated adenomas during colonoscopy. PLoS ONE 15(4), e0231880 (2020). https://doi.org/10.1371/journal.pone.0231880
Lee, J.Y., Jeong, J., Song, E.M., Ha, C., Lee, H.J., Koo, J.E., et al.: Real-time detection of colon polyps during colonoscopy using deep learning: systematic validation with four independent datasets. Sci. Rep. 10(1), 8379 (2020). https://doi.org/10.1038/s41598-020-65387-1
de Souza, L.J., Passos, L.A., Mendel, R., Ebigbo, A., Probst, A., Messmann, H., et al.: Assisting barrett’s esophagus identification using endoscopic data augmentation based on generative adversarial networks. Comput. Biol. Med. (2020). https://doi.org/10.1016/j.compbiomed.2020.104029
Wu, L., He, X., Liu, M., Xie, H., An, P., Zhang, J., et al.: Evaluation of the effects of an artificial intelligence system on endoscopy quality and preliminary testing of its performance in detecting early gastric cancer: a randomized controlled trial. Endoscopy (2021). https://doi.org/10.1055/a-1350-5583
Xu, M., Zhou, W., Wu, L., Zhang, J., Wang, J., Mu, G., et al.: Artificial intelligence in diagnosis of gastric precancerous conditions by image-enhanced endoscopy: a multi-center, diagnostic study (with video). Gastrointest. Endosc. (2021). https://doi.org/10.1016/j.gie.2021.03.013
Iakovidis, D.K., Georgakopoulos, S.V., Vasilakakis, M., Koulaouzidis, A., Plagianakos, V.P.: Detecting and locating gastrointestinal anomalies using deep learning and iterative cluster unification. IEEE Trans. Med. Imaging. 37(10), 2196–2210 (2018). https://doi.org/10.1109/TMI.2018.2837002
Mahmood, F., Chen, R., Durr, N.J.: Unsupervised reverse domain adaptation for synthetic medical images via adversarial training. IEEE Trans. Med. Imaging. 37(12), 2572–2581 (2018). https://doi.org/10.1109/TMI.2018.2842767
Ozyoruk, K.B., Gokceler, G.I., Bobrow, T.L., et al.: EndoSLAM dataset and an unsupervised monocular visual odometry and depth estimation approach for endoscopic videos. Med. Image Anal. 71, 102058 (2021). https://doi.org/10.1016/j.media.2021.102058
Itoh, H., Oda, M., Mori, Y., et al.: Unsupervised colonoscopic depth estimation by domain translations with a Lambertian-reflection keeping auxiliary task. Int. J. Comput. Assist. Radiol. Surg. 16(6), 989–1001 (2021). https://doi.org/10.1007/s11548-021-02398-x
Hwang, S.J., Park, S.J., Kim, G.M., Baek, J.H.: Unsupervised monocular depth estimation for colonoscope system using feedback network. Sensors (Basel). 21(8), 2691 (2021). https://doi.org/10.3390/s21082691
Uema, R., Hayashi, Y., Tashiro, T., Saiki, H., Kato, M., Amano, T., et al.: Use of a convolutional neural network for classifying microvessels of superficial esophageal squamous cell carcinomas. J. Gastroenterol. Hepatol. (2021). https://doi.org/10.1111/jgh.15479
Hu, H., Gong, L., Dong, D., Zhu, L., Wang, M., He, J., et al.: Identifying early gastric cancer under magnifying narrow-band images with deep learning: a multi-center study. Gastrointest. Endosc. 93(6), 1333-1341.e3 (2021). https://doi.org/10.1016/j.gie.2020.11.014
Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R.L., Torre, L.A., Jemal, A.: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68(6), 394–424 (2018). https://doi.org/10.3322/caac.21492
le Clercq, C.M., Bouwens, M.W., Rondagh, E.J., Bakker, C.M., Keulen, E.T., de Ridder, R.J., et al.: Postcolonoscopy colorectal cancers are preventable: a population-based study. Gut 63(6), 957–963 (2014). https://doi.org/10.1136/gutjnl-2013-304880
Bora, K., Bhuyan, M.K., Kasugai, K., Mallik, S., Zhao, Z.: Computational learning of features for automated colonic polyp classification. Sci. Rep. 11(1), 4347 (2021). https://doi.org/10.1038/s41598-021-83788-8
Lai, L.L., Blakely, A., Invernizzi, M., Lin, J., Kidambi, T., Melstrom, K.A., et al.: Separation of color channels from conventional colonoscopy images improves deep neural network detection of polyps. J. Biomed. Opt. (2021). https://doi.org/10.1117/1.JBO.26.1.015001
Seven, G., Silahtaroglu, G., Kochan, K., Ince, A.T., Arici, D.S., Senturk, H.: Use of artificial intelligence in the prediction of malignant potential of gastric gastrointestinal stromal tumors. Dig. Dis. Sci. (2021). https://doi.org/10.1007/s10620-021-06830-9
Yao, L., Zhang, J., Liu, J., Zhu, L., Ding, X., Chen, D., et al.: A deep learning-based system for bile duct annotation and station recognition in linear endoscopic ultrasound. EBioMedicine (2021). https://doi.org/10.1016/j.ebiom.2021.103238
Gutierrez, B.B., Arcadu, F., Thalhammer, A., Gamez, S.C., Feehan, O., Drawnel, F., et al.: Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data. Ther. Adv. Gastrointest. Endosc. (2021). https://doi.org/10.1177/2631774521990623
Udristoiu, A.L., Stefanescu, D., Gruionu, G., Gruionu, L.G., Iacob, A.V., Karstensen, J.G., et al.: Deep learning algorithm for the confirmation of mucosal healing in crohn’s disease, based on confocal laser endomicroscopy images. J. Gastrointestin. Liver Dis. 30(1), 59–65 (2021). https://doi.org/10.15403/jgld-3212
Tsuboi, A., Oka, S., Aoyama, K., Saito, H., Aoki, T., Yamada, A., et al.: Artificial intelligence using a convolutional neural network for automatic detection of small-bowel angioectasia in capsule endoscopy images. Dig. Endosc. 32(3), 382–390 (2020). https://doi.org/10.1111/den.13507
Caroppo, A., Leone, A., Siciliano, P.: Deep transfer learning approaches for bleeding detection in endoscopy images. Comput. Med. Imaging Graph (2021). https://doi.org/10.1016/j.compmedimag.2020.101852
Laiz, P., Vitria, J., Wenzek, H., Malagelada, C., Azpiroz, F., Segui, S.: WCE polyp detection with triplet based embeddings. Comput. Med. Imaging Graph (2020). https://doi.org/10.1016/j.compmedimag.2020.101794
Yamada, A., Niikura, R., Otani, K., Aoki, T., Koike, K.: Automatic detection of colorectal neoplasia in wireless colon capsule endoscopic images using a deep convolutional neural network. Endoscopy (2020). https://doi.org/10.1055/a-1266-1066
Klang, E., Barash, Y., Margalit, R.Y., Soffer, S., Shimon, O., Albshesh, A., et al.: Deep learning algorithms for automated detection of Crohn’s disease ulcers by video capsule endoscopy. Gastrointest. Endosc. 91(3), 606–613 (2020). https://doi.org/10.1016/j.gie.2019.11.012
Incetan, K., Celik, I.O., Obeid, A., Gokceler, G.I., Ozyoruk, K.B., Almalioglu, Y., et al.: VR-Caps: a virtual environment for capsule endoscopy. Med. Image Anal. (2021). https://doi.org/10.1016/j.media.2021.101990
Colli, A., Gana, J.C., Yap, J., Adams-Webber, T., Rashkovan, N., Ling, S.C., et al.: Platelet count, spleen length, and platelet count-to-spleen length ratio for the diagnosis of oesophageal varices in people with chronic liver disease or portal vein thrombosis. Cochrane Database Syst. Rev. (2017). https://doi.org/10.1002/14651858.CD008759.pub2
Lee, C.M., Lee, S.S., Choi, W.M., Kim, K.M., Sung, Y.S., Lee, S., et al.: An index based on deep learning-measured spleen volume on CT for the assessment of high-risk varix in B-viral compensated cirrhosis. Eur. Radiol. 31(5), 3355–3365 (2021). https://doi.org/10.1007/s00330-020-07430-3
Zhang, Y., Li, H., Du, J., Qin, J., Wang, T., Chen, Y., et al.: 3D multi-attention guided multi-task learning network for automatic gastric tumor segmentation and lymph node classification. IEEE Trans. Med. Imaging. 40(6), 1618–1631 (2021). https://doi.org/10.1109/TMI.2021.3062902
Tan, J.W., Wang, L., Chen, Y., Xi, W., Ji, J., Wang, L., et al.: Predicting chemotherapeutic response for far-advanced gastric cancer by radiomics with deep learning semi-automatic segmentation. J Cancer. 11(24), 7224–7236 (2020). https://doi.org/10.7150/jca.46704
Jiang, Y., Liang, X., Wang, W., Chen, C., Yuan, Q., Zhang, X., et al.: Non-invasive prediction of occult peritoneal metastasis in gastric cancer using deep learning. JAMA Netw. Open. 4(1), e2032269 (2021). https://doi.org/10.1001/jamanetworkopen.2020.32269
Watson, M.D., Lyman, W.B., Passeri, M.J., Murphy, K.J., Sarantou, J.P., Iannitti, D.A., et al.: Use of artificial intelligence deep learning to determine the malignant potential of pancreatic cystic neoplasms with preoperative computed tomography imaging. Am. Surg. 87(4), 602–607 (2021). https://doi.org/10.1177/0003134820953779
Huang, B., Lin, X., Shen, J., Chen, X., Chen, J., Li, Z., et al.: Accurate and feasible deep learning based semi-automatic segmentation in CT for radiomics analysis in pancreatic neuroendocrine neoplasms. IEEE J. Biomed. Health Inform. (2021). https://doi.org/10.1109/JBHI.2021.3070708
Liu, K.L., Wu, T., Chen, P.T., Tsai, Y.M., Roth, H., Wu, M.S., et al.: Deep learning to distinguish pancreatic cancer tissue from non-cancerous pancreatic tissue: a retrospective study with cross-racial external validation. Lancet Digit. Health. 2(6), e303–e313 (2020). https://doi.org/10.1016/S2589-7500(20)30078-9
Roth, H.R., Lu, L., Lay, N., Harrison, A.P., Farag, A., Sohn, A., et al.: Spatial aggregation of holistically-nested convolutional neural networks for automated pancreas localization and segmentation. Med. Image Anal. (2018). https://doi.org/10.1016/j.media.2018.01.006
Zhang, Y., Lv, X., Qiu, J., Zhang, B., Zhang, L., Fang, J., et al.: Deep learning with 3D convolutional neural network for non-invasive prediction of microvascular invasion in hepatocellular carcinoma. J. Magn. Reson. Imaging. 54(1), 134–143 (2021). https://doi.org/10.1002/jmri.27538
Hectors, S.J., Kennedy, P., Huang, K.H., Stocker, D., Carbonell, G., Greenspan, H., et al.: Fully automated prediction of liver fibrosis using deep learning analysis of gadoxetic acid-enhanced MRI. Eur. Radiol. 31(6), 3805–3814 (2021). https://doi.org/10.1007/s00330-020-07475-4
Jimenez-Pastor, A., Alberich-Bayarri, A., Lopez-Gonzalez, R., Marti-Aguado, D., Franca, M., Bachmann, R., et al.: Precise whole liver automatic segmentation and quantification of PDFF and R2* on MR images. Eur. Radiol. (2021). https://doi.org/10.1007/s00330-021-07838-5
Corral, J.E., Hussein, S., Kandel, P., Bolan, C.W., Bagci, U., Wallace, M.B.: Deep learning to classify intraductal papillary mucinous neoplasms using magnetic resonance imaging. Pancreas 48(6), 805–810 (2019). https://doi.org/10.1097/MPA.0000000000001327
Zhang, X.Y., Wang, L., Zhu, H.T., Li, Z.W., Ye, M., Li, X.T., et al.: Predicting rectal cancer response to neoadjuvant chemoradiotherapy using deep learning of diffusion kurtosis MRI. Radiol. 296(1), 56–64 (2020). https://doi.org/10.1148/radiol.2020190936
Chen, Y., Ruan, D., Xiao, J., Wang, L., Sun, B., Saouaf, R., et al.: Fully automated multiorgan segmentation in abdominal magnetic resonance imaging with deep neural networks. Med. Phys. 47(10), 4971–4982 (2020). https://doi.org/10.1002/mp.14429
Despres, J.P., Lemieux, I.: Abdominal obesity and metabolic syndrome. Nature 444(7121), 881–887 (2006). https://doi.org/10.1038/nature05488
Langner, T., Hedstrom, A., Morwald, K., Weghuber, D., Forslund, A., Bergsten, P., et al.: Fully convolutional networks for automated segmentation of abdominal adipose tissue depots in multi-center water-fat MRI. Magn. Reson. Med. 81(4), 2736–2745 (2019). https://doi.org/10.1002/mrm.27550
Michalak, A., Mosinska, P., Fichna, J.: Common links between metabolic syndrome and inflammatory bowel disease: current overview and future perspectives. Pharmacol. Rep. 68(4), 837–846 (2016). https://doi.org/10.1016/j.pharep.2016.04.016
Kwon, J., Lee, C., Heo, S., Kim, B., Hyun, C.K.: DSS-induced colitis is associated with adipose tissue dysfunction and disrupted hepatic lipid metabolism leading to hepatosteatosis and dyslipidemia in mice. Sci. Rep. 11(1), 5283 (2021). https://doi.org/10.1038/s41598-021-84761-1
Carreras-Torres, R., Ibanez-Sanz, G., Obon-Santacana, M., Duell, E.J., Moreno, V.: Identifying environmental risk factors for inflammatory bowel diseases: a Mendelian randomization study. Sci. Rep. 10(1), 19273 (2020). https://doi.org/10.1038/s41598-020-76361-2
Ben-Haim, S., Ell, P.: 18F-FDG PET and PET/CT in the evaluation of cancer treatment response. J. Nucl. Med. 50(1), 88–99 (2009). https://doi.org/10.2967/jnumed.108.054205
Boellaard, R.: Standards for PET image acquisition and quantitative data analysis. J. Nucl. Med. 50(Suppl 1), 11S-20S (2009). https://doi.org/10.2967/jnumed.108.057182
Häggström, I., Schmidtlein, C.R., Campanella, G., Fuchs, T.J.: DeepPET: a deep encoder-decoder network for directly solving the PET image reconstruction inverse problem. Med. Image Anal. 54, 253–262 (2019). https://doi.org/10.1016/j.media.2019.03.013
Sanaat, A., Arabi, H., Mainta, I., Garibotto, V., Zaidi, H.: Projection space implementation of deep learning-guided low-dose brain PET imaging improves performance over implementation in image space. J. Nucl. Med. 61(9), 1388–1396 (2020). https://doi.org/10.2967/jnumed.119.239327
Shiri, I., Arabi, H., Geramifar, P., et al.: Deep-JASC: joint attenuation and scatter correction in whole-body 18F-FDG PET using a deep residual network. Eur. J. Nucl. Med. Mol. Imaging. 47(11), 2533–2548 (2020). https://doi.org/10.1007/s00259-020-04852-5
Mostafapour, S., Gholamiankhah, F., Dadgar, H., Arabi, H., Zaidi, H.: Feasibility of deep learning-guided attenuation and scatter correction of whole-body 68Ga-PSMA PET studies in the image domain. Clin. Nucl. Med. 46(8), 609–615 (2021). https://doi.org/10.1097/RLU.0000000000003585
Ben-Cohen, A., Klang, E., Raskin, S.P., Soffer, S., Ben-Haim, S., Konen, E., Amitai, M.M., Greenspan, H.: Cross-modality synthesis from CT to PET using FCN and GAN networks for improved automated lesion detection. Eng. Appl. Artif. Intell. 78, 186–194 (2019). https://doi.org/10.1016/j.engappai.2018.11.013
Article Google Scholar
Wang, Y., Yu, B., Wang, L., et al.: 3D conditional generative adversarial networks for high-quality PET image estimation at low dose. Neuroimage 174, 550–562 (2018). https://doi.org/10.1016/j.neuroimage.2018.03.045
Byra, M., Han, A., Boehringer, A.S., Zhang, Y.N., O’Brien, W.J., Erdman, J.J., et al.: Liver fat assessment in multiview sonography using transfer learning with convolutional neural networks. J. Ultrasound Med. (2021). https://doi.org/10.1002/jum.15693
Yang, S., Lemke, C., Cox, B.F., Newton, I.P., Nathke, I., Cochran, S.: A learning-based microultrasound system for the detection of inflammation of the gastrointestinal tract. IEEE Trans. Med. Imaging. 40(1), 38–47 (2021). https://doi.org/10.1109/TMI.2020.3021560
Zhou, W., Yang, Y., Yu, C., Liu, J., Duan, X., Weng, Z., et al.: Ensembled deep learning model outperforms human experts in diagnosing biliary atresia from sonographic gallbladder images. Nat. Commun. 12(1), 1259 (2021). https://doi.org/10.1038/s41467-021-21466-z
Feng, X., Qi, X., Yang, L., Duan, X., Fang, B., Gongsang, Q., et al.: Human cystic and alveolar echinococcosis in the Tibet autonomous region (TAR), China. J. Helminthol. 89(6), 671–679 (2015). https://doi.org/10.1017/S0022149X15000656
Golemanov, B., Grigorov, N., Mitova, R., Genov, J., Vuchev, D., Tamarozzi, F., et al.: Efficacy and safety of PAIR for cystic echinococcosis: experience on a large series of patients from Bulgaria. Am. J. Trop. Med. Hyg. 84(1), 48–51 (2011). https://doi.org/10.4269/ajtmh.2011.10-0312
Dehkordi, A.B., Sanei, B., Yousefi, M., Sharafi, S.M., Safarnezhad, F., Jafari, R., et al.: Albendazole and treatment of hydatid cyst: review of the literature. Infect. Disord. Drug Targets 19(2), 101–104 (2019). https://doi.org/10.2174/1871526518666180629134511
Wu, M., Yan, C., Wang, X., Liu, Q., Liu, Z., Song, T.: Automatic classification of hepatic cystic echinococcosis using ultrasound images and deep learning. J. Ultrasound Med. (2021). https://doi.org/10.1002/jum.15691
Ryu, H., Shin, S.Y., Lee, J.Y., Lee, K.M., Kang, H.J., Yi, J.: Joint segmentation and classification of hepatic lesions in ultrasound images using deep learning. Eur. Radiol. (2021). https://doi.org/10.1007/s00330-021-07850-9
Wang, W., Chen, L.D., Lu, M.D., Liu, G.J., Shen, S.L., Xu, Z.F., et al.: Contrast-enhanced ultrasound features of histologically proven focal nodular hyperplasia: diagnostic performance compared with contrast-enhanced CT. Eur. Radiol. 23(9), 2546–2554 (2013). https://doi.org/10.1007/s00330-013-2849-3
Xie, X.H., Xu, H.X., Xie, X.Y., Lu, M.D., Kuang, M., Xu, Z.F., et al.: Differential diagnosis between benign and malignant gallbladder diseases with real-time contrast-enhanced ultrasound. Eur. Radiol. 20(1), 239–248 (2010). https://doi.org/10.1007/s00330-009-1538-8
Hu, H.T., Wang, W., Chen, L.D., Ruan, S.M., Chen, S.L., Li, X., et al.: Artificial intelligence assists identifying malignant versus benign liver lesions using contrast-enhanced ultrasound. J. Gastroenterol. Hepatol. (2021). https://doi.org/10.1111/jgh.15522
Wu, X., Li, J., Gassa, A., Buchner, D., Alakus, H., Dong, Q., et al.: Circulating tumor DNA as an emerging liquid biopsy biomarker for early diagnosis and therapeutic monitoring in hepatocellular carcinoma. Int. J. Biol. Sci. 16(9), 1551–1562 (2020). https://doi.org/10.7150/ijbs.44024
Hakim, A., Zhang, X., DeLisle, A., Oral, E.A., Dykas, D., Drzewiecki, K., et al.: Clinical utility of genomic analysis in adults with idiopathic liver disease. J. Hepatol. 70(6), 1214–1221 (2019). https://doi.org/10.1016/j.jhep.2019.01.036
Su, T.H., Wu, C.H., Kao, J.H.: Artificial intelligence in precision medicine in hepatology. J. Gastroenterol. Hepatol. 36(3), 569–580 (2021). https://doi.org/10.1111/jgh.15415
Martin, D.R., Hanson, J.A., Gullapalli, R.R., Schultz, F.A., Sethi, A., Clark, D.P.: A deep learning convolutional neural network can recognize common patterns of injury in gastric pathology. Arch. Pathol. Lab. Med. 144(3), 370–378 (2020). https://doi.org/10.5858/arpa.2019-0004-OA
Liu, S., Zhang, Y., Ju, Y., Li, Y., Kang, X., Yang, X., et al.: Establishment and clinical application of an artificial intelligence diagnostic platform for identifying rectal cancer tumor budding. Front. Oncol. 11, 626626 (2021). https://doi.org/10.3389/fonc.2021.626626
Klimov, S., Xue, Y., Gertych, A., Graham, R.P., Jiang, Y., Bhattarai, S., et al.: Predicting metastasis risk in pancreatic neuroendocrine tumors using deep learning image analysis. Front. Oncol. (2020). https://doi.org/10.3389/fonc.2020.593211
Govind, D., Jen, K.Y., Matsukuma, K., Gao, G., Olson, K.A., Gui, D., et al.: Improving the accuracy of gastrointestinal neuroendocrine tumor grading with deep learning. Sci. Rep. 10(1), 11064 (2020). https://doi.org/10.1038/s41598-020-67880-z
Wang, K.S., Yu, G., Xu, C., Meng, X.H., Zhou, J., Zheng, C., et al.: Accurate diagnosis of colorectal cancer based on histopathology images using artificial intelligence. BMC Med. 19(1), 76 (2021). https://doi.org/10.1186/s12916-021-01942-5
Ma, B., Guo, Y., Hu, W., Yuan, F., Zhu, Z., Yu, Y., et al.: Artificial intelligence-based multiclass classification of benign or malignant mucosal lesions of the stomach. Front. Pharmacol. (2020). https://doi.org/10.3389/fphar.2020.572372
Zhao, B., Huang, R., Lu, H., Mei, D., Bao, S., Xu, H., et al.: Risk of lymph node metastasis and prognostic outcome in early gastric cancer patients with mixed histologic type. Curr. Probl. Cancer. 44(6), 100579 (2020). https://doi.org/10.1016/j.currproblcancer.2020.100579
Pan, Y., Sun, Z., Wang, W., Yang, Z., Jia, J., Feng, X., et al.: Automatic detection of squamous cell carcinoma metastasis in esophageal lymph nodes using semantic segmentation. Clin. Transl. Med. 10(3), e129 (2020). https://doi.org/10.1002/ctm2.129
Hu, Y., Su, F., Dong, K., Wang, X., Zhao, X., Jiang, Y., et al.: Deep learning system for lymph node quantification and metastatic cancer identification from whole-slide pathology images. Gastric Cancer (2021). https://doi.org/10.1007/s10120-021-01158-9
Wang, X., Chen, Y., Gao, Y., Zhang, H., Guan, Z., Dong, Z., et al.: Predicting gastric cancer outcome from resected lymph node histopathology images using deep learning. Nat. Commun. 12(1), 1637 (2021). https://doi.org/10.1038/s41467-021-21674-7
Kwak, M.S., Lee, H.H., Yang, J.M., Cha, J.M., Jeon, J.W., Yoon, J.Y., et al.: Deep convolutional neural network-based lymph node metastasis prediction for colon cancer using histopathological images. Front. Oncol. (2020). https://doi.org/10.3389/fonc.2020.619803
Park, J., Jang, B.G., Kim, Y.W., Park, H., Kim, B.H., Kim, M.J., et al.: A prospective validation and observer performance study of a deep learning algorithm for pathologic diagnosis of gastric tumors in endoscopic biopsies. Clin. Cancer Res. 27(3), 719–728 (2021). https://doi.org/10.1158/1078-0432.CCR-20-3159
Shao, W., Han, Z., Cheng, J., et al.: Integrative analysis of pathological images and multi-dimensional genomic data for early-stage cancer prognosis. IEEE Trans. Med. Imaging. 39(1), 99–110 (2020). https://doi.org/10.1109/TMI.2019.2920608
Saillard, C., Schmauch, B., Laifa, O., et al.: Predicting survival after hepatocellular carcinoma resection using deep learning on histological slides. Hepatology 72(6), 2000–2013 (2020). https://doi.org/10.1002/hep.31207
Srinidhi, C.L., Ciga, O., Martel, A.L.: Deep neural network models for computational histopathology: a survey. Med. Image Anal. 67, 101813 (2021). https://doi.org/10.1016/j.media.2020.101813
Shao, W., Wang, T., Huang, Z., Han, Z., Zhang, J., Huang, K.: Weakly supervised deep ordinal cox model for survival prediction from whole-slide pathological images. IEEE Trans. Med. Imaging (2021). https://doi.org/10.1109/TMI.2021.3097319
Wang, S., Zhu, Y., Yu, L., et al.: RMDL: recalibrated multi-instance deep learning for whole slide gastric image classification. Med. Image Anal. 58, 101549 (2019). https://doi.org/10.1016/j.media.2019.101549
Jansen-Winkeln, B., Barberio, M., Chalopin, C., Schierle, K., Diana, M., Kohler, H., et al.: Feedforward artificial neural network-based colorectal cancer detection using hyperspectral imaging: a step towards automatic optical biopsy. Cancers (Basel) (2021). https://doi.org/10.3390/cancers13050967
Sato, D., Takamatsu, T., Umezawa, M., Kitagawa, Y., Maeda, K., Hosokawa, N., et al.: Distinction of surgically resected gastrointestinal stromal tumor by near-infrared hyperspectral imaging. Sci. Rep. 10(1), 21852 (2020). https://doi.org/10.1038/s41598-020-79021-7
Tu, Z., Li, H., Zhang, D., Dauwels, J., Li, B., Yuan, J.: Action-stage emphasized spatiotemporal VLAD for video action recognition. IEEE Trans. Image Process. 28(6), 2799–2812 (2019). https://doi.org/10.1109/TIP.2018.2890749
Chen, W., Sun, Q., Chen, X., Xie, G., Wu, H., Xu, C.: Deep learning methods for heart sounds classification: a systematic review. Entropy (Basel). 23(6), 667 (2021). https://doi.org/10.3390/e23060667
Xie, X., Niu, J., Liu, X., Chen, Z., Tang, S., Yu, S.: A survey on incorporating domain knowledge into deep learning for medical image analysis. Med. Image Anal. 69, 101985 (2021). https://doi.org/10.1016/j.media.2021.101985
Curreri, F., Patanè, L., Xibilia, M.G.: RNN- and LSTM-based soft sensors transferability for an industrial process. Sensors (Basel) 21(3), 823 (2021). https://doi.org/10.3390/s21030823
Chen, Y., Tu, Z., Ge, L., Zhang, D., Chen, R., Yuan, J.: SO-HandNet: self-organizing network for 3D hand pose estimation with semi-supervised learning. In: Proc. ICCV, pp. 6961–6970 (2019)
Yi, X., Walia, E., Babyn, P.: Generative adversarial network in medical imaging: a review. Med. Image Anal. 58, 101552 (2019). https://doi.org/10.1016/j.media.2019.101552
Litjens, G., Kooi, T., Bejnordi, B.E., et al.: A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017). https://doi.org/10.1016/j.media.2017.07.005
Zou, J., Huss, M., Abid, A., Mohammadi, P., Torkamani, A., Telenti, A.: A primer on deep learning in genomics. Nat. Genet. 51(1), 12–18 (2019). https://doi.org/10.1038/s41588-018-0295-5
Tajbakhsh, N., Shin, J.Y., Gurudu, S.R., et al.: Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans. Med. Imaging. 35(5), 1299–1312 (2016). https://doi.org/10.1109/TMI.2016.2535302
Budd, S., Robinson, E.C., Kainz, B.: A survey on active learning and human-in-the-loop deep learning for medical image analysis. Med. Image Anal. 71, 102062 (2021). https://doi.org/10.1016/j.media.2021.102062
Aslam, M.A., Xue, C., Chen, Y., Zhang, A., Liu, M., Wang, K., et al.: Breath analysis based early gastric cancer classification from deep stacked sparse autoencoder neural network. Sci. Rep. 11(1), 4014 (2021). https://doi.org/10.1038/s41598-021-83184-2
Xiao, W., Huang, X., Wang, J.H., Lin, D.R., Zhu, Y., Chen, C., et al.: Screening and identifying hepatobiliary diseases through deep learning using ocular images: a prospective, multicentre study. Lancet Digit. Health 3(2), e88–e97 (2021). https://doi.org/10.1016/S2589-7500(20)30288-0
Li, T., Tong, W., Roberts, R., Liu, Z., Thakkar, S.: DeepDILI: deep learning-powered drug-induced liver injury prediction using model-level representation. Chem. Res. Toxicol. 34(2), 550–565 (2021). https://doi.org/10.1021/acs.chemrestox.0c00374
Wong, G.L., Yuen, P.C., Ma, A.J., Chan, A.W., Leung, H.H., Wong, V.W.: Artificial intelligence in prediction of non-alcoholic fatty liver disease and fibrosis. J. Gastroenterol. Hepatol. 36(3), 543–550 (2021). https://doi.org/10.1111/jgh.15385
Tu, Z., Xie, W., Qin, Q., Poppe, R., Veltkamp, R.C., Li, B., et al.: Multi-stream CNN: Learning representations based on human-related regions for action recognition. Pattern Recognit. (2018). https://doi.org/10.1016/j.patcog.2018.01.020
Eraslan, G., Avsec, Ž, Gagneur, J., Theis, F.J.: Deep learning: new computational modelling techniques for genomics. Nat. Rev. Genet. 20(7), 389–403 (2019). https://doi.org/10.1038/s41576-019-0122-6

Funding

No.

Author information

Huangming Zhuang and Jixiang Zhang contributed equally to this work.

Authors and Affiliations

Gastroenterology Department, Renmin Hospital of Wuhan University, Wuhan, 430060, Hubei, China
Huangming Zhuang, Jixiang Zhang & Fei Liao

Contributions

JXZ and HMZ analyzed the data, prepared the tables, and drafted the manuscript. HMZ and FL designed the project and finalized the manuscript. All authors assisted with reference collection, reorganization, and partial data analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Fei Liao.

Ethics declarations

Conflict of interest

The authors declare no competing interest.

Availability of data and material

Available.

Consent for publication

Yes.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Zhuang, H., Zhang, J. & Liao, F. A systematic review on application of deep learning in digestive system image processing. Vis Comput 39, 2207–2222 (2023). https://doi.org/10.1007/s00371-021-02322-z

Accepted: 30 September 2021
Published: 31 October 2021
Issue Date: June 2023
DOI: https://doi.org/10.1007/s00371-021-02322-z
