Introduction

Esophageal cancer (EC) is the seventh most common cancer worldwide with estimated 604,100 new cases in 2020 (3.1% of all sites), ranking sixth in overall mortality with 544,076 new deaths (5.5% of all sites), the latter signifying that in 2020 is estimated responsible for 1:18 cancer deaths as per GLOBOCAN Global Cancer Statistics 2021 [1].

EC global incidence as well as mortality rates are estimated two- to three-fold higher in men compared with women. This gender variation is followed by a striking geographic variation in both sexes. More than 15-fold differences between world regions exist, with rates ranging from 1.5:100,000 in Western Africa to 18.2:100,000 in Eastern Asia in men, and 0.4:100,000 in Central America to 6.8:100,000 in Eastern Asia in women [1]. This geographic incidence also differs substantially between the two most common histologic EC subtypes: esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC) which vary significantly in terms of etiology too. For ESCC, the so-far established main risk-factors are tobacco and alcohol consumption while additional suspected risk-factors are low socioeconomic status with poor diet quality of low fruit/vegetable intake, severe vitamins and micronutrients deficiencies, high polycyclic aromatic hydrocarbons (PAHs) consumption [2], betel quid chewing on the Indian subcontinent and hot mate drinking in Southern America [3]. Gastroesophageal reflux disease (GERD), Barrett’s esophagus, excess body mass index (BMI) and tobacco have been verified as key risk-factors for EAC [4].

While ESCC is broadly in decline, EAC incidence rates are rising rapidly [5]. This concerning EAC epidemic rise warrants further investigation as this cannot be exclusively attributed to the prevalence of the well-known EAC risk-factors, as obesity, GERD and tobacco. Although these environmental factors are deemed strongly implicated in EAC aetiopathogenesis, only a proportion of exposed individuals develop EAC, suggesting that genetic footprint may also contribute to malignant transformation of esophageal epithelial cells, so its role needs therefore to be reevaluated [6, 7].

Additional challenges health organizations encounter is that despite the milestones achieved in diagnosis and multimodal treatment, EC has still poor overall prognosis, with 5-year survival rates between 15–25% [8]. Delayed diagnosis to advanced stages and disease resistance to chemotherapy may contribute to these poor survival outcomes [9, 10]. Hence, an imperative need urges to identify novel molecular agents that could facilitate the promotion of EC cellular sensitivity to chemotherapy or immunotherapy regimens, as well as biomarkers that would expedite earlier cancer detection [11].

Aberrant expression of long non-coding RNAs (lncRNAs) is now well-established as associated with cancer development [12]. Emerging epidemiological evidence suggests that single-nucleotide-polymorphisms (SNPs) in certain genes influence the pathophysiology of human carcinogenesis including EC [13, 14]. Particularly, HOTAIR SNPs variations (as rs920778, rs4759314, rs1899663, rs12826786, rs874945, rs7958904 and rs10783618) have demonstrated a close connection to the development and progression of malignancies including EC and gastric cancer (GC), by acting as potential cancer susceptibility loci [15, 16]. However, although cumulative research is indicating possible relationship between HOTAIR SNPs and cancer risk, the so-far obtained results have been controversial, inconclusive, restricted to a specific ethnicity or limited by small sample-size. Others such as POLR2E rs3787016 and HULC rs7763881 polymorphisms have been associated with decreased risk of EC [17] in Asian populations.

The association of various SNPs in lncRNAs in EC tissues has only scarcely been investigated in the west. Based on previous research, we aimed to explore the potential contribution of four lncRNAs’ polymorphisms: HOTAIR rs920778, LINC00951 rs11752942, POLR2E rs3787016 and HULC rs7763881 in esophageal carcinoma susceptibility in a western population.

Materials-methods

Study design

This is a tertiary referral hospital-based case-control study designed according to the ‘Strengthening the Reporting of Observational Studies in Epidemiology’ guidelines for reporting observational studies (Table 1) [18]. A research protocol was developed and strictly followed by all participating authors/researchers. This was submitted to the Institutional Review Board of Laiko General Hospital and Ethics Committee of School of Medicine-National and Kapodistrian University of Athens (NKUA), Greece and approval was obtained prior to study start (IRB no: 18.01.2018/24). All procedures performed were in accordance with the ethical standards of the Helsinki Declaration 1964 and later versions. Informed consent was obtained from all patients prior to enrollment.

Table 1 STROBE Statement—Checklist of items that should be included in reports of case-control studies

The study was conducted over a nine-year period with the recruitment phase set between 25/03/2014-25/09/2018 and follow-up phase with end-date set at 30/06/2023. Rationale for determining the recruitment start-date was that, as of March 2014 the dedicated esophagogastric surgical team began operating at Laiko Hospital. Rationale for setting the end-date was that from October 2018 and onwards, we changed our practice from open to minimally invasive approach, so this was an effort to minimize bias that this shift in operating technique could potentially introduce to our oncological outcomes or subsequent survival analysis.

Two independent authors (EB, MB) extracted the data from our prospectively-collected Upper GastroIntestinal (UGI) Cancer database encompassing data from theatres, surgical/medical records, electronic/paper notes from inpatient and outpatient visits, investigations performed in public and private sectors. Discrepancies in data extraction were resolved after consensus with a third independent author (AM). All six surgeons participating in the study (EB, MB, AM, AC, TL and AA) were responsible to prospectively recruit, maintain accurate log and update patients’ records on our UGI Cancer database, as well as to conduct patients’ follow-up as per protocol.

Most senior molecular biologist (MG) designed and supervised the genotyping experiments. Most senior pathologist (ACL) supervised the histopathology examination and reports as well as the appropriate FFPEs tissue samples’ selection for the genotyping. Most senior surgeon (TL) supervised all surgical operations performed during the recruitment period.

Patient selection: inclusion-exclusion Criteria

All consecutive adult patients who underwent surgery for histologically-confirmed malignancy involving the middle-third, lower-third part of the esophagus or the esophagogastric junction (EGJ) (Siewert I-III) [19] at the Department of UGI Surgery, Laiko General Hospital, School of Medicine-NKUA, Greece were deemed eligible for inclusion. Clinical and pathological staging, as well as all the definitions described in the present study follow the guidelines of the TNM staging system of the American Joint Committee on Cancer (AJCC), 8th edition [20]. All patients were risk-assessed and clinically staged with physical examination, computed tomography and gastroscopy. At the time of diagnosis, all were evaluated by the dedicated cancer Multidisciplinary Team at Laiko Hospital which formulated the appropriate multimodal treatment strategy as per international National Comprehensive Cancer Network (NCCN) [21] and European Society for Medical Oncology (ESMO) [22] guidelines.

Exclusion criteria were as follows: (a) non-adult subjects, (b) patients with cancer of the upper-third/cervical esophagus, (c) those submitted to emergency surgery, (d) esophagogastric malignancy family history.

Regarding our control group, community subjects were recruited from the Department of Molecular Biology, School of Medicine, NKUA, Greece. Cases and controls were unmatched; controls had no self-reported history of cancer at any site. Both were from European/Greek ancestry and resided in the geographical region of Greece.

Data extraction: primary-secondary variables of interest

Primary study endpoint was to ascertain the presence of four lncRNAs’ polymorphisms: HOTAIR rs920778, LINC00951 rs11752942, POLR2E rs3787016 and HULC rs7763881 in primary esophageal and EGJ tumors in a western population, as Greece. We additionally sought to investigate the correlation of the aforementioned genetic values with the clinical, pathological, and oncological outcomes to identify further associations of these genetic footprints with recurrence patterns as well as metastatic potential in esophageal carcinogenesis process in a western population. Secondary endpoints were to assess the incidence of these SNPs in EAC and ESCC patients (subgroup analysis by histological subtype) and furthermore in EGJ Siewert I/II compared with EGJ Siewert III (subgroup analysis by tumor location).

To this end, variables of interest extracted from our UGI Cancer Database were: (1) demographics comprising age, gender, preoperative health of surgical candidates based on the American Society of Anesthesiologists’ classification of physical health grading system (ASA I-V) [23]; nutritional status and BMI according to the European Society for Clinical Nutrition and Metabolism (ESPEN) guidelines [24], (2) histology, size, location of primary tumor, neoadjuvant chemo-radiotherapy if offered; (3) date/type of surgical operation, lymphadenectomy extent, surgical time (minutes), estimated blood loss (EBL, ml), length of hospital stay (LOS, days); (4) final histopathological characteristics as tumor size, location and extent, lymph node (LN) harvest and infiltration, histological type, grade and stage, lymphovascular invasion, neoadjuvant treatment effect, resection (R1-3) and circumferential resection margins (CRM, mm), as per World Health Organization (WHO) and College of American Pathologists (CAP) recommendations [25]; (5) (%) minor/major complications (90-days), in-hospital mortality (90-days), follow-up length (months), adjuvant treatment where applicable, date/type of recurrence, disease-free survival (DFS, months) and overall survival (OS, months).

Minor complications were defined as Grade < II and major as Grade > IIIa based on the Clavien-Dindo severity classification system [26]. Recurrence date was set as the date of first investigation documenting the recurrence/metastasis. DFS was defined as the period from operation date and first recurrence date. OS was defined as the period between operation date and patient’s death.

Sample collection and preparation for genetic analysis

Surgical tissue specimens for all enrolled patients were transferred after completion of surgical operation to the First Department of Pathology, School of Medicine, NKUA, Greece. After gross pathologic examination and marking of the margins, each specimen was formalin fixed and underwent tissue processing for paraffin embedding. Standard fixation methods to preserve nucleic acid integrity were used including 10% neutral-buffered formalin fixed for 24–72 h. Once paraffin embedded, the tissue samples were then sectioned with a microtome and placed on a glass slide to formulate microscopic slides ready to be viewed under the microscope by the pathologists. When necessary, the embedding process was reversed to get the paraffin wax out and allow for staining of the sections, as Hematoxylin and Eosin (H&E) staining. All tissue samples were reviewed by two independent pathologists. Most senior pathologist (ACL) examined all the microscopic slides for each specimen and selected the slide and its corresponding FFPE tissue block with the higher tumor burden in preparation for nucleic acid extraction.

Genotyping of HOTAIR rs920778, LINC00951 rs11752942, POLR2E rs3787016 and HULC rs7763881

The nominated FFPE tissue blocks with the highest tumor burden from all recruited patients were transferred to Molecular Biology Laboratory, School of Medicine, NKUA, Greece. The percentage of tumor cells in each sample was minimum 50%. One to two − 1 mm diameter punches were sampled from the FFPE blocks. The punches were deparaffinized, homogenized and proteinase K digested. Then, the genomic DNA/RNA extraction was performed using a commercial RNA Extraction Kit from FFPE Samples (NucleoZOL, Macherey-Nagel, Germany). LncRNAs genotypes were identified through the “polymerase chain reaction-restriction fragment length polymorphism” (PCR-RFLP) or allele specific PCR depending on the SNP. Most senior molecular biologist (MG) supervised the experiments as per published methodology [27].

Statistical analysis

Descriptive statistical analysis was conducted for all the encountered parameters, measuring the accumulated values. All variables are reported as means and medians with their corresponding standard deviations, ranges and proportions. We assessed the relationship between lncRNAs’ gene polymorphisms and EC, EAC-EGJ and ESCC cancer susceptibility by determining the genotype and allele frequencies of all cases and controls. Genotype frequencies were compared using the Fisher’s exact test with Yate’s continuity correction. Odds ratios (ORs) and 95% confidence intervals (95% CI) were calculated, using the approximation of Woolf. To summarize the ORs of the four polymorphisms we applied five genetic models: allele contrast, homozygous, heterogeneous, dominant and recessive models (AA, homozygotes for the common allele; AB, heterozygotes; BB, homozygotes for the rare allele). Correlations between SNPs and clinicopathological parameters were also statistically analyzed. Survival analysis was performed by using Kaplan-Meier curves and multivariate comparisons by using cox proportional hazards models. The probability p-values were two tailed and p < 0.05 was adopted as the statistically significant level. Censoring date was 30/06/2023. Statistical analysis was performed with R, version 4.0.4 (R Project for Statistical Computing).

Results

Study Population, Clinicopathological, Surgical and Oncological outcomes

All enrolled study subjects were adults of European/Greek ancestry divided into two groups incorporating FFPE tissue samples from N = 95 consecutive esophageal/EGJ cancer patients subjected to surgical treatment as a case-group and blood samples from n = 121 cancer-free community subjects as a control-group. The characteristics of the surgical-group are listed in Table 2.

Table 2 Demographics-Surgical/Oncological outcomes for the whole Esophageal Cancer (EC) cohort (N = 95 patients)

Mean age of the patients at time of surgery was 62.9 years with median ASA Class II (range I-IV), most were male (Ν = 86/95, 90.5%). Primary tumor location was at middle-thoracic esophagus, lower-thoracic esophagus, EGJ-Siewert I, EGJ-Siewert II and EGJ-Siewert III in 2/95, 8/95, 20/95, 44/95 and 21/95 patients respectively. As per AJCC-8th Edition, cancers crossing the EGJ with their epicenter in the proximal 2 cm of the stomach (EGJ-Siewert II) were staged and treated as EC, whereas cancers crossing the EGJ with their epicenter in the proximal 2 to 5 cm of the stomach (EGJ-Siewert III) were staged and treated as GC. As such, surgical operations performed were: Ivor Lewis esophagectomy with 2-field standard lymph-node dissection, McKeown esophagectomy with 3-field standard lymph-node dissection, for either ESCC (N = 6) or middle/lower EAC or EAC-EGJ Siewert I/II (N = 61). N = 21 patients with EAC-EGJ Siewert III adenocarcinoma were submitted to total extended gastrectomy. N = 7 EAC patients with small tumors extending borderline between EGJ Siewert II and III (with epicenter at 2 cm) were also submitted to total extended gastrectomy.

Within 90 days after surgery, post-operative recovery was uneventful in Ν = 48 (50.5%) patients. N = 23/95 (24.2%) patients developed Minor-Class II, whereas N = 24/95 (25.3%) developed Major-Class III-V complications including anastomotic leak in 7/95, conduit necrosis in 1/95 and tracheoesophageal fistula in 1/95 patients. Overall, in- and out-of-hospital 90-day mortality was 7.4% (N = 7/95).

In final histopathological examination, High Grade Dysplasia was noted in N = 2 (2.1%) patients with underlying Barret’s Esophagus, whereas invasive carcinomas encompassed Adenocarcinoma (N = 84, 88.4%), Adeno-squamous (N = 02, 2.1%), Squamous Cell Carcinoma (N = 06, 6.3%) and Mixed adeno-neuroendocrine carcinoma-MANEC (N = 01, 1.1%). Invasive tumors were well-differentiated in 2/95, moderately-differentiated in 36/95 and poorly-differentiated in 52/95 patients, while in N = 5 differentiation could not be assessed (Gx) or was not applicable (N/a).

In terms of residual disease, adequate resection margins were achieved in 86/95 (90.5%) and LN harvest > 15 nodes in 87/95 (91.6%) patients.

Long-term follow-up was completed in N = 92/95 patients (97%). Follow-up period ranged between 4 and 97 months with median 75 months for living and 20 months for deceased cases. During follow-up, two patients suffered myocardial infarction and one major UGI hemorrhage and passed away at 4th postoperative month. Additionally, one patient died because of myocardial infarction at 56th postoperative month and one died because of metachronous lung cancer disease progression at 74th postoperative month. N = 49/95 patients (51.6%) developed disease recurrence with N = 46/95 having passed away 4–87 months post-operatively. The remainder three patients developed lung metastasis (N = 2) and regional LN metastasis (N = 1) at 3, 17 and 36 months respectively. After treated with chemoradiotherapy and immunotherapy, all three are alive with stable disease (survival range 59–79 months). Thirty-one patients (32.6%) are alive and cancer-free with median survival of 77 months (range 58–97). In total, estimated median OS was 32.5 months (range: 4–97 months), while median DFS was 18.4 months (range: 2–97).

Allele frequencies and genotype distributions reflecting the association between HOTAIR, LINC00951, POLR2E and HULC polymorphisms and cancer risk in EC, EAC, EAC-EGJ and ESCC Populations

For HOTAIR (Table 3), the detection of rs920778, C > T (T/C) polymorphism was initially performed in 95 surgically treated EC patients and 121 healthy controls with an overall distribution not significantly different between the two groups. According to our statistical analysis, the CT genotype was found to be equally present in both EC and control groups whereas the TT was overrepresented in the EC group with OR = 2.960, yet without statistical significance (P = 0.1241). When correlated EC patients’ TNM Stage and LN Involvement (N Status), we detected no significant association. When conducted a subset analysis by histological type, we identified a borderline non-significant increased frequency of the TT (OR: 3.442, P = 0.0620) and T variants (OR: 1.542, P = 0.0627) in N = 84 EAC patients. We then performed a subset analysis based on tumor location which yielded a significant over-presentation of the TT genotype (OR: 4.177, P = 0.0382) in N = 61 EAC-EGJ Siewert I/II patients followed by a marginally not significant over-presentation of the T allele (OR: 1.630, P = 0.0690). In N = 21 EAC-EGJ Siewert III and N = 6 ESCC patients, no correlation was found between TT and T gene variants and cancer susceptibility.

Table 3 Allele frequencies and genotype distributions demonstrating the association between HOTAIR polymorphism and cancer risk in Esophageal Cancer (EC), Esophageal AdenoCarcinoma (EAC), EsophagoGastric Junction (EGJ) Adenocarcinoma and Esophageal Squamous Cell Carcinoma (ESCC) Populations-Bold values denote statistically significant associations

Regarding LINC00951 (Table 4), we explored the LINC00951 rs11752942, A > G (G/A) polymorphism prevalence in both cancer and healthy control groups. Overall, the distribution of LINC00951 rs11752942, A > G (G/A) genotype frequencies between EC patients and controls did not differ significantly. Based on our analysis, the AG genotype was equally distributed between the EC patients and community controls while the GG was not significantly less frequent in the surgical case group (OR: 0.5222, P = 0.2498). When assessing the relationship with patients’ TNM Stage and LN Involvement our analysis resulted in no association, although the GG genotype were detected more often in individuals in more advanced stages III-IV compared with the I-II subgroup (OR: 4.250, P = 0.2402) as observed also in the recurrence positive group compared with the disease free group (OR: 3.161, P = 0.4153). By performing subgroup analysis, we demonstrated that GG genotype may act as a protective factor in EAC (OR: 0.6026), EAC-EGJ Siewert I/II (OR: 0.3917) and ESCC (OR: 0.5758), whereas on the contrary it may pose a risk-factor in EAC-EGJ Siewert III patients (OR: 1.306). Nevertheless, none of the above associations were found statistically significant.

Table 4 Allele frequencies and genotype distributions demonstrating the association between LINC00951 polymorphism and cancer risk in Esophageal Cancer (EC), Esophageal AdenoCarcinoma (EAC), EsophagoGastric Junction (EGJ) Adenocarcinoma and Esophageal Squamous Cell Carcinoma (ESCC) Populations-Bold values denote statistically significant associations

We also evaluated the distribution of POLR2E rs3787016, T > C (C/T) polymorphism in surgical cancer and healthy control groups (Table 5). We detected both the C allele and the CC genotype less frequently in EC patients compared with the healthy controls yet marginally not significantly different (P = 0.0561, P = 0.0582 respectively). However, when evaluated gene frequencies based on the TNM stage, the CC variant was significantly underrepresented in individuals at more advanced stages of the disease: 2/30 patients in the III-IV Stage Subgroup compared with 7/14 patients in the I-II Stage Subgroup, OR = 0.1333 (95% CI: 0.02448–0.7263), P = 0.0209 indicating a possible protective role in disease burden genetic footprint. When assessing POLR2E rs3787016’s correlation with the LN Involvement or with the disease progression/recurrence risk, our univariate analysis demonstrated no statistically significant association, nonetheless the CC/TT genotypes were under-represented in the more advanced N > 1 stage compared with the N0 subgroup (OR = 0.2333, P = 0.0666) as well as in the recurrence positive group compared with the disease free group (OR = 0.7083, P = 0.7102). Subsequently our subgroup analysis assessing pure EAC population, detected the C allele significantly more frequent in the healthy controls compared with the EAC (OR: 0.5778, P = 0.0119) or with the EAC-EGJ Siewert I/II populations (OR: 0.6047, P = 0.0386). The CC genotype was also significantly more present in the healthy control population compared with either the EAC (OR: 0.2497, P = 0.0114) or the EAC-EGJ Siewert I/II populations (OR: 0.2194, P = 0.0220). While the C and the CC variants may hence represent a potential protective factor in esophageal adenocarcinoma/EGJ carcinogenesis pathway, no association was identified with the EAC Siewert III subgroup. By contrast, both C and CC variants were observed more frequently in the ESCC group indicating that may pose a risk-factor in ESCC aetiopathogenesis, yet the sample-size is too small (N = 6 patients) to extract safe results.

Table 5 Allele frequencies and genotype distributions demonstrating the association between POL2RE polymorphism and cancer risk in Esophageal Cancer (EC), Esophageal AdenoCarcinoma (EAC), EsophagoGastric Junction (EGJ) Adenocarcinoma and Esophageal Squamous Cell Carcinoma (ESCC) Populations-Bold values denote statistically significant associations

Finally, we investigated HULC rs7763881 incidence as per previous methodology principles (Table 6). Overall, the distribution of HULC rs7763881, A > C (C/A) among EC and control populations was not found different, with both AC and CC genotypes being equally present not only in EC patients but also in the healthy controls, same as the C allele. Concurrently, we identified no association of the HULC rs7763881 with any of the clinical variables in EC patients such as TNM stage and LN Involvement. The subset analysis by tumor histology and location yielded also no association with cancer susceptibility in none of the AC/AA, CC/AA and C/A genetic models with no significant variations among ESCC, EAC, EAC-EGJ Siewert I/II and EGJ Siewert III subpopulations.

Table 6 Allele frequencies and genotype distributions demonstrating the association between HULC polymorphism and cancer risk in Esophageal Cancer (EC), Esophageal AdenoCarcinoma (EAC), EsophagoGastric Junction (EGJ) Adenocarcinoma and Esophageal Squamous Cell Carcinoma (ESCC) Populations-Bold values denote statistically significant associations

Univariate and Multivariate Survival Analysis reflecting the association of HOTAIR, LINC00951, POLR2E and HULC polymorphisms with overall survival (OS) and Disease-Free Survival (DFS) in EC, EAC, EAC-EGJ and ESCC Populations

Univariate analysis of the independent variables did not reveal any significant predictors for death or recurrence in both whole EC population and subpopulations for none of the four SNPs of interest. Subsequent multivariate analysis included univariate predictors of age, stage, operation type, histological subtype, neo- and adjuvant chemotherapy, radiotherapy and resection status. When performed for the whole EC cohort (N = 95), multivariate analysis revealed significant worse OS associated with age (HR: 1.0356, P = 0.0299), stage III-IV (OR: 3.9017, P = 0.0017), ESCC (HR: 4.1507, P = 0.04951), whereas only stage III-IV was associated with worse DFS (HR: 4.0091, P = 0.0047). When performed for either full EC cohort or subpopulations, no risk association was demonstrated between OS or DFS and the occurrence of any of the studied SNPs gene variants (Online Resource 1: Tables 1, 2, 3, 4 and 5; Figs. 1, 2 and 3).

Discussion

Human genome sequencing has established that protein-coding genes account for 3% of DNA whilst over 80% of our genome is actively transcribed into a group of a noncoding RNA molecules (ncRNAs) without potentiality for protein-coding, including lncRNAs [28]. These transcripts play a pivotal role in a series of biological processes as regulating chromatin dynamics, genome packaging and neighboring protein gene expression, growth and differentiation and hence implicated in carcinogenesis [29]. SNPs are the most common genetic variation and these occurring in lncRNAs’ functional region largely affect their expression, structure, and function and thereby affect cancer prognosis [30] and susceptibility by promoting oncogenesis and influencing disease-recurrence risk [31]. As such, lncRNAs SNPs hold great potential not only as future therapeutic targets/agents but also as novel markers for predictive analysis of cancer risk, clinical outcome, prognosis, survival and drug resistance [32] or toxicity [33].

The development of EC is a multifactorial process and associations with genetic variants have already been identified in Asian studies [34]. Considering that the majority of published research investigating this association of lncRNAs SNPs in esophageal carcinogenesis has been performed in Eastern ethnicities where ESCC histological subtype predominates, we hypothesized that likewise these SNPs may also manifest differently in EC carriers of western ethnicity with dominant EAC prevalence instead. To test this hypothesis, we investigated the effects of four lncRNAs polymorphisms on EC, EAC, EAC-EGJ and ESCC cancer susceptibility in a western population as Greece. We further sought to compare underlying correlations between Siewert I/II and Siewert III EGJ adenocarcinoma to identify possible variations in their oncogenetic mechanisms.

HOX transcript antisense RNA (HOTAIR) is a 2158-nucleotide lncRNA transcribed from the homeobox C (HOXC) antisense strand genes cluster located in chromosome 12q13.12 [35]. A growing number of investigations are drawing attention to the relationship between HOTAIR’s SNPs and the risk of various cancer types but the results obtained so far have been equivocal. In 2014, Zhang et al [36] examined the relationship between HOTAIR’s SNPs and ESCC predisposition and concluded that, compared with the rs920778 CC genotype, the TT played a positive role in the ESCC risk within a Chinese population. In 2021, Xu et al. also suggested that the T allele was nominally significant related to GC susceptibility among Chinese population when compared with the rs920778 C allele [37]. In 2017, Ge’s meta-analysis [38] comprising 37,900 samples from 26 case-control studies identified significant statistical evidence between the rs920778 and cancer susceptibility. When stratified by cancer type, a significantly increased susceptibility to ESCC and GC was also uncovered. By contrast, Bayram et al [39] concluded that HOTAIR rs920778 did not contribute to GC incidence in Turkish population. Taking into account previous evidence, we performed a case-control study analyzing the distribution of HOTAIR rs920778 genotype frequencies in both EC and healthy controls which yielded not significant over-presentation of the TT genotype in our EC population (OR: 2.960) encompassing various histological subtypes. However, in the subsequent subgroup analysis by histological type a significantly increased susceptibility to EAC-EGJ Siewert I/II cancer in the Greek population was uncovered in the homozygous TT models (OR: 4.177, P = 0.0382), suggesting that HOTAIR rs920778, C > T (T/C) polymorphism may pose a risk-factor in the aetiopathogenesis of the EAC in the West in a similar pattern as shown by previous studies for the ESCC carcinogenesis mechanism in the East [40, 41].

LINC00951 is a lncRNA located in chromosome 6p21.2, informally studied as lincRNA-uc003opf.1. Among 52 SNPs, Wu et al’s [42] genotyping results demonstrated that LINC00951 rs11752942 A > G (G/A) was significantly associated with ESCC risk. Pan et al [43] meta-analysis also identified that HOTAIR rs920778 and LINC00951 rs11752942 were related to head and neck cancers’ incidence in Asia. Taking these into consideration, we conducted a case-control study to determine possible association between this polymorphism and EC/EAC/ESCC risk in Greek population. As opposed to previous studies of Asian background, the distribution of LINC00951 rs11752942, A > G (G/A) genotype frequencies between EC patients and controls did not differ significantly in neither our primary analysis including all histological subtypes and tumor locations nor in our subgroup analysis investigating EAC, ESCC and EGJ Siewert I/II and III subsets. This outcome may plausibly be explained by the fact that the EAC subtype manifests predominantly in our EC patients of Greek ethnicity compared to ESCC subtype with different genetic footprint which predominantly manifests in Eastern studies of Asian ancestry instead.

Although the literature presents conflicting notions, SNP rs3787016 (A > G or complementary T > C, C/T), localized in the fourth intron of RNA polymerase II subunit E (POLR2E) lncRNA gene may serve as a genetic risk-factor increasing predisposition to various cancer types [41] including prostate cancer in Chinese [44] or Iranian [45], gastric cancer [46] in Chinese, breast and cervical cancer in Chinese populations [47]. Conversely, POLR2E rs3787016 may serve as a genetic protective factor against esophageal cancer ESCC subtype as investigated by Kang et al. in Chinese/Han population [17] in 2015. Since no study has yet evaluated the role of POLR2E rs3787016 in the diagnosis, incidence and prognosis of EC/EAC in the west, we conducted a case-control study in a Greek/European ancestry population. Our primary analysis yielded that CC genotype (OR: 0.4188) and C allele (OR: 0.6667) carriers are observed more frequently in healthy community controls and when present in EC patients are associated with lower LN infiltration risk (OR: 0.2333) and lesser metastatic potential (OR: 0.7083), yet without statistical significance. However, when TNM stage was assessed, CC genotype was significantly reduced in more advanced stages (OR: 0.1333, P = 0.0209). Furthermore in our subgroup analysis, compared with the TT and CT genotypes, CC significantly reduced the risk of EAC (OR: 0.2497, P = 0.0114) as well as the risk of EAC-EGJ Siewert I/II (OR: 0.2194, P = 0.0220), whereas no association was detected with the EAC-EGJ Siewert III risk. C allele’s carriers were also significantly associated with lower EAC (P = 0.0119) and EAC-EGJ Siewert I/II (P = 0.0386) risk. In line with Kang’s study, we identified that POLR2E rs3787016 may also serve as a genetic protective factor against esophageal cancer EAC subtype predisposition similarly to the ESCC subtype.

Apart from investigating POLR2E rs3787016’s potential correlation with ESCC susceptibility, Kang also assessed the HULC rs7763881, A > C (C/A) incidence in the same Chinese/Han population [17] concluding that HULC rs7763881 was a protective factor against ESCC among male, younger patients. Hepatocellular carcinoma up-regulated lncRNA (HULC) gene is located in chromosome 6p24.3 with two exons and 1638 bp length. In a 2022 meta-analysis the authors resulted that rs7763881 was associated with a decreased hepatocellular, colorectal and esophageal cancer risk [48]. Given literature’s contradictory results with Hong et al [49] suggesting that the HULC rs7763881 is associated with increased GC susceptibility and Elhelaly et al. [50]suggesting association with breast cancer in Egyptians, we conducted our case-control study to explore its role in EC/EAC/ESCC genetic footprint in western ethnicity. Compared with the previous studies, both our primary and subgroup analysis by tumor histology and location demonstrated no cancer susceptibility association in any of the genetic models AC/AA, CC/AA and C/A alleles with no significant variations among ESCC, EAC, EAC-EGJ Siewert I/II or EGJ-Siewert III subpopulations. This could be explained by small sample-size or EAC subtype’s predominant prevalence in our cohort or may represent a true different genetic footprint needs to be confirmed by additional future studies in the west.

Certain limitations apply to this report. As a hospital-based case-control study with large majority of EC cases and healthy controls from Attica Region, inherent choice bias may have occurred. A variability in sample-size was present related to histological subtypes and tumor location included in the primary analysis, which we sought to overcome with the subsequent subgroup analyses. The statistical strength of this case-control study may also be somewhat limited by the sample-size, particularly concerning the statistical analyses of EGJ Siewert I/II, Siewert III and ESCC subsets where the smaller sizes may have impacted the credibility of the data. Therefore, throughout the present analysis we intentionally focused on interpreting data involving larger sample-size groups such as EC or EAC rather than EGJ Siewert III or ESCC subgroups. Finally despite this case-control study is retrospective by definition, all data were extracted from our prospectively-collected UGI cancer database following predetermined research protocol to ensure appropriate methodology. Additional study-strengths were the follow-up length with high case-ascertainment enabling us to perform our correlations between SNPs and clinicopathological data such as tumor progression, metastasis and overall survival.

Conclusion

Neither LINC00951 rs11752942 nor HULC rs7763881 polymorphisms were detected more frequently in surgically treated EC patients compared with healthy community subjects in our study. Regarding HOTAIR rs920778, our subgroup analysis findings indicate that TT genotype may serve as a potential therapeutic suppression target against EAC-EGJ Siewert I-II. We also identified that the presence of C allele, as well as CC genotype of POLR2E rs3787016 were detected more often in the control population, and when found in esophageal cancerous tissues were associated with earlier stages of the disease, as well as with minor lymph node involvement and lesser metastatic potential. Thus, our findings regarding POLR2E rs3787016 suggest that it may become a valuable biomarker to evaluate EC predisposition and predict response to treatment and prognosis. These results demonstrate that HOTAIR and POLR2E genetic variants may influence lncRNA regulation and as such, may explain a fraction of EC and EAC genetic basis. Clinical implications of these findings need to be verified with further prospective studies with larger sample-size.