Introduction

Cucurbitacins are steroids derived from the triterpene skeleton “cucurbitane”. They are well distributed in plants of the Cucurbitaceae family. Although cucurbitacins were originally isolated as bitter principles from plants of the Cucurbitaceae family, they are now known to occur in other plant families, such as Brassicaceae, Scrophulariaceae, Begoniaceae, Elaeocarpaceae, Datiscaceae, Desfontainiaceae, Polemoniaceae, Primulaceae, Rubiaceae, Sterculiaceae, Rosaceae, and Thymelaeaceae1. Occurring as either glycosylated or non-glycosylated molecules, these compounds are well known for their toxicity and biological activities, such as cytotoxicity2,3,4, anti-inflammatory activity5, anti-malarial activity6, hepatoprotective potential7, and other activities.

The family Cucurbitaceae, also known as the gourd family, consists of 965 plant species in 95 genera8, which mostly occur as annual vines. Several of these are consumed as vegetables and fruits. Most commonly consumed plants in this family are various pumpkins, gourds, calabash, cucumber, melon, and watermelon varieties. The literature in this field is expanding and various research groups have reported biologically active cucurbitacins from several plants such as Citrullus colocynthis, Momordica charantia and others9,10,11,12,13. Since cucurbitacins have various interesting biological activities and are widely distributed among the plants of Cucurbitaceae family, there is a need to establish metabolite distributions of important genera and species in this family.

Metabolic profile development requires prior identification of natural products for which HPLC-MS/MS is a fast and reliable method. It is often done without the use of chemically pure standards as the availability of a compound in question through synthesis, isolation or commercial sources is not always possible. However, the level of certainty in natural product identification through mass spectrometry varies. This depends upon whether the data was obtained in reference to purified standards or only an untargeted study was performed. Authentic identification through mass spectrometry is only possible when a purified compound is available. However, in case of a metabolomics study it is neither economical nor practically possible to have a large number of purified standards available. However, in cases where purified standards are not available, it is still possible to identify natural products through a sample based on MS/MS fragmentation data14.

Keeping in view the important bioactivities of cucurbitacins, quantitation of these compounds in various plants of the Cucurbitaceae family was carried out. Various reports on the quantitation of important cucurbitacins in plants such as the zucchini15, bottle gourd16 and bitter melon17 have appeared in the literature. A notable report is the quantitation and pharmacokinetics study of cucurbitacin IIa and cucurbitacin IIb from Hemsleya amabilis in rat plasma. These two compounds are considered major bioactive constituents in this plant and have promising antiproliferative activities18.

We present herein a comprehensive study focusing on the generation of metabolic profile of various plants of the Cucurbitaceae family, along with quantitation of four biologically active cucurbitacins in ten species belonging to six different genera of the Cucurbitaceae family, along with Cissampelos pareira belonging to the Menispermaceae family. C. pareira is also reported to possess antioxidant19, anti-inflammatory20, antiviral21, antidiabetic22, anticancer23,24 and other activities25. There are studies in literature that report of the profiling of Cucurbitaceae family plants, such as profiling of phenolics and other polar components in watermelon26 and zucchini27 through LC-MS/MS. The study presented herein should serve as a stepping stone for more detailed metabolomics studies on important plants of the Cucurbitaceae family.

Experimental

Chemicals and reagents

Analytes 1, 2 and 4 namely cucurbitacin E 2-O-β-D-glucopyranoside (1), cucurbitacin I 2-O-β-D-glucopyranoside (2) and 22-deoxocucurbitoside B (4) were isolated from methanolic extract of Citrullus colocynthis fruits. The crude methanolic extract was fractionated using dichloromethane (DCM) and ethyl acetate. Compound 1 was isolated from the DCM fraction, while compounds 24 were isolated from the ethyl acetate fraction. Cucurbitacin (3) was isolated from DCM extract of C. pareira. The DCM extract was fractioned using hexanes and ethyl acetate. The structures of analytes are shown in Fig. 1. The compounds were characterized based on comparison of their 1H- and 13C-NMR spectral data with the data reported in literature28,29. Details of isolation are provided with supplementary information along with necessary spectroscopic data.

Figure 1
figure 1

Chemical structures of analytes 14 quantitated in various plants.

Formic acid, purchased from Daejung (Daejung Chemicals & Metals Co. Ltd., Korea), was used as an additive for the mobile phase. Methanol for mobile phase was purchased from Merck (Merck KGaA, Darmstadt, Germany). Type I water (ISO 3696) for the mobile phase was obtained from a Barnstead™ GenPure™ ultrapure water system (Thermo Fisher Scientific Inc., USA).

Instrumentation and experimental conditions

HPLC-ESI-MS/MS analysis for natural product identification was performed on a Bruker maXis II™ HR-QTOF mass spectrometer (Bremen, Germany), coupled to a Dionex UltiMate™ 3000 series HPLC system (Thermo Fisher Scientific, Waltham, MA, USA) fitted with a binary RS pump, column thermostat, and auto-sampler. Sample chromatography was performed on a Macherey-Nagel Nucleodur® C18 Gravity column (3.0 × 100 mm, 1.8 µm), kept at 40 °C. 4-µL samples were injected while the mobile phase consisted of A (0.1% formic acid in H2O) and B (0.1% formic acid in MeOH). The mobile phase flow rate was set at 0.7 mL/min using a linear gradient of A and B starting at 10% B, increased to 90% B in 5.5 min, maintained at 90% for 1.5 min, and returned to 10% B in 1 min. The total run time was 10 min, including a 1 min holding time at the start and 1 min equilibration time at the end of the gradient.

Mass spectra were recorded using electrospray ionization employing the Bruker CaptiveSpray™ ion source. MS and MS/MS spectra were recorded separately both in positive and negative modes. Ion source parameters were set as follows (parameters for negative mode next to positive mode parameters): capillary voltage at 4500 V (−3500 V), end plate offset at 500 V, nebulizer gas 45.0 psi, drying gas at 12.0 L/min and drying gas temperature at 270 °C. All spectra were recorded in the mass range of m/z 100 to 2000, while the scan speed was set at 5 Hz for MS and 12 Hz for MS/MS spectra. Active exclusion feature of the instrument was used which enables the instrument to remove a precursor ion from further consideration after a set number of MS/MS spectra have been recorded for that particular precursor ion. The active exclusion number was set at 3, and the precursor reconsideration time was set at 30 s.

HPLC-MS/MS analysis for quantitation was performed on a Bruker amaZon speed ion trap mass spectrometer (Bremen, Germany), coupled to a Dionex UltiMate™ 3000 series HPLC system (Thermo Fisher Scientific, Waltham, MA, USA) fitted with a binary pump, column thermostat and auto-sampler. Chromatographic separation of analytes was achieved on a reverse-phase C18 column (Agilent Poroshell 120 EC-C18 3.0 × 50 mm, 2.7 µm), kept at 40 °C. 2-µL samples were injected while the flow rate was set at 0.5 mL/min. A linear gradient was used for analyte elution starting at 10% B, increased to 95% B in 3.5 min, maintained at 95% for 1.5 min, and returned to 10% B in 1 min. The column was equilibrated for 1 min at the end of the gradient. Total run time for analysis was 8 min.

Mass spectra were recorded using electrospray ionization under positive mode employing the Bruker CaptiveSpray™ ion source. Ion source parameters were set as follows: capillary voltage at 4500 V, end plate offset at 500 V, nebulizer gas 35.0 psi, drying gas at 8.0 L/min, and drying gas temperature at 250 °C. Mass spectra scan range was set at m/z 100 to 1000, while the number of spectral averages was set at 3. Ion charge control (ICC) was used for transferring a certain number of ions to the ion trap and set at 60,000, while accumulation time was set at 100 ms. Fragmentation time under collision-induced dissociation (CID) mode was set at 20 ms while fragmentation amplitude was optimized for each analyte to obtain the maximum abundance of fragment ions.

Method performance

All MS and MS/MS data were saved using both profile and line spectra to minimize the chance of instrumental noise being taken as a precursor ion. Mass spectra for all samples were recorded under both ionization modes (positive and negative) to counter check the authenticity of a molecular ion peak while active exclusion was used to minimize the chances of common contaminant peaks being placed under MS/MS fragmentation. Each sample was injected in triplicate.

The developed quantitation method was assessed for accuracy and precision. Accuracy (% bias) and precision (% RSD) were assessed by analyzing three different QC samples with six replicates for intra-day, and 12 replicates on two different days (6 replicates/day) for inter-day analysis. Excellent accuracy and precision (<5%) were found for the developed method. The accuracy of analysis was calculated using the expected concentration (CE) and the mean value of measured concentration (CM) by using the following relation: Accuracy (bias, %) = [(CE-CM)/CE]x100. Similarly, the relative standard deviation (% RSD) was used as an indicator of the analytical precision, and calculated from the standard deviation and mean value of measured concentrations by the following equation: Precision (RSD, %) = (Standard Deviation (SD)/CM)x100. Method performance was further evaluated through the analysis of fortified samples prepared by spiking additional amounts of analytes 1–4 at three levels of 50, 100, and 150 ng/mL, respectively, in the original sample solutions used for analysis. Details about method precision and validation along with calibration equations, LOD and LOQ values are provided with supplementary information (Supplementary Tables 1, 2 and 3).

Sample preparation

Shade-dried plant material (whole plants) were crushed in a blender. 1 g of each plant was weighed and extracted with 10 mL methanol through sonication for 20 min. Each sample was centrifuged for 15 min at 6000 rpm to settle large particles, and the supernatant was filtered through a 0.22 µm PTFE syringe-driven filter. 50 µL of the filtered extract was diluted to 1000 µL with methanol for LC-MS, and LC-MS/MS analysis.

For quantitation, 1 mg of each standard compound was weighed and dissolved into 1 mL methanol to prepare standard stock solutions. These solutions were diluted with 50:50 water: methanol in a serial manner to prepare ten calibrant solutions ranging between 50–2000 ng/mL. The analysis of plant samples was performed using diluted plant extract. 50 µL of filtered plant extract was diluted to 1500 µL with 50:50 water: methanol for LC-MS/MS analysis.

Spiked samples for method validation were prepared in a similar manner as the plant samples. 50 µL of filtered plant extract plus an amount of standard solution equivalent to spike concentrations of 50, 100, and 150 ng/mL was diluted to a final volume of 1500 µL with 50:50 water: methanol for three samples, and labelled as S1, S2 and S3, respectively.

Results and Discussion

LC-MS/MS optimization

The method for LC-MS/MS in the profiling study was optimized using a RP-C18 column in a way that the various sample components eluted in a 10 min runtime. No carryover was detected in the next blank sample run after the plant sample. Analysis were performed in both positive and negative ionization modes, and the mobile phase composition for both polarities was kept identical (0.1% formic acid in both solvents). However, to obtain a reasonable cycle time for the MS/MS analysis, the scan frequency of the instrument was kept at maximum (12 Hz), and active exclusion was used to avoid solvent contaminant peaks being placed under MS/MS fragmentation. Precursor reconsideration time was set at 0.5 min after careful examination of the peak widths. This ensured that no precursor ions are excluded from the analysis.

The mobile phase gradient for quantitative analysis was adjusted in order to elute all analytes in the shortest possible runtime and to have large enough differences in runtimes to avoid overlapping MRM transitions. Figure 2 shows that all analytes eluted from the column between 5.0–6.6 min of analysis with small peak widths, and without any observable peak tailing or fronting. The analysis was performed under positive ionization mode, and all of the analytes were observed as sodium adducts ([M + Na]+). Due to the large structures and presence of various oxygen atoms (as hydroxyls) in the structures, all analytes showed a good affinity towards the formation of sodium adducts. Ammonium adducts were also observed along with sodium adducts when the mobile phase composition was changed from 0.1% formic acid to 20 mM ammonium acetate. However, the use of ammonium acetate decreased the instrument sensitivity. 0.5% acetic acid was also used as a mobile phase, but this resulted in a lower sensitivity as compared to 0.1% formic acid in positive mode, whereas chloride adducts were obtained in negative mode. 0.1% formic acid in negative mode also resulted into the formation of formate adducts. However, it was observed that all analytes in negative mode with different mobile phase compositions showed smaller instrumental response as compared to sodium adducts in the positive mode. Therefore, it was concluded that 0.1% formic acid in positive mode was the best mobile phase for analysis.

Figure 2
figure 2

Extracted-ion chromatogram of standard analytes 14 analyzed by MRM.

The observed sodium adducts were subjected to MS/MS fragmentation analysis in the ion trap and the fragmentation amplitudes were tuned for each analyte. All analytes showed good fragment yields in the fragmentation amplitude range of 0.90–1.55 V. Table 1 summarizes optimized MRM parameters for analytes 14. A standard mixture of analytes was prepared at a concentration of 50 ng/mL and analyzed under optimized chromatographic and MRM conditions. Excellent chromatographic peak shapes with good intensities were observed (Fig. 2). Figure 3 shows extracted ion-chromatograms, and product ion spectra of analytes 14 in the standard mixture at a concentration of 50 ng/mL.

Table 1 Optimized MS/MS parameters for analytes 14.
Figure 3
figure 3

Representative MRM extracted ion chromatograms and product ion spectra of standard analytes 14 at 50 ng/mL.

Identification and quantitation of natural products

A total of 51 compounds were putatively identified based on their high-resolution masses and fragmentation data in positive and negative ionization modes. Their identification was performed using a library of natural products previously reported from the plant species included in this study. The library was custom-built as follows. Plant names (with all synonyms) were queried in the Dictionary of Natural Products (DNP) Ver. 26.2 (Dec 2017), and all resulting hits were used to build a library of natural products. All the samples were screened against the prepared library using Bruker Compass TargetAnalysis Ver. 1.3 software, which compares the mass errors (ppm) and isotopic patterns of the compounds in the library with the observed mass spectra and ranks the probable compounds based on match score. The samples were then analyzed again for MS/MS spectra of compounds which were found using TargetAnalysis. Entries with higher ppm errors (>10 ppm) were discarded, and no MS/MS data analysis was performed. It was observed that mass errors were below 2 ppm in most cases. The fragment ions in the MS/MS data were analyzed using in-silico fragmentation. Fragments were generated by manually dissecting the molecules at various possible sites and comparing the theoretical fragments with those obtained from the data. Details about the compounds identified in positive and negative ionization modes are presented in Table 2.

Table 2 Table of compounds detected in positive and negative ionization modes.

The identification of natural products was performed through the acquisition of full range mass spectra, and it was observed that most analytes, under the positive ionization source conditions, were observed as sodium adducts while a few were observed as protonated adducts. The formed sodium adducts were observed to be stable as they did not exhibit extensive fragmentation under CID conditions. Fragments with high abundances were generated through the loss of H2O or acetate group (if present), while other fragments were only seen in low abundances. In the negative ionization mode, mass spectra contained formate adducts, while deprotonated molecules ([M-H]) were also seen. It was observed that the formate adducts, upon fragmentation, resulted in the loss of formic acid and generated deprotonated molecules which exhibited further fragmentation behaviour, such as the loss of water and acetate group.

Based on the ion intensities observed, heat maps were generated for both ionization modes (positive and negative) to show a distribution of various plant metabolites across 6 genera and 10 species of the Cucurbitaceae family (Figs 4, 5). Heat maps were generated using GraphPad Prism 7 on a PC running Windows 7 SP1.

Figure 4
figure 4

Heat map of compounds identified in positive ionization mode.

Figure 5
figure 5

Heat map of compounds identified in negative ionization mode.

The developed quantitation method was applied for the detection and determination of analytes 14 in 10 different plants of the Cucurbitaceae family, along with Cissampelos pareira which belongs to the family Menispermaceae. This plant is rich in alkaloids and finds some uses in the Indian and Chinese medicine. Many alkaloids from this plant exhibit cytotoxic23,30, anti-inflammatory20, antiplasmodic activities and this is why it has gained some attention as a natural remedy for malaria in Kenya due to its antimalarial properties31. Although this plant is well-known for its alkaloidal content, our research group recently isolated cucurbitacins F and Q from this plant. The isolation of cucurbitacin F and Q is quite surprising from C. pareira. The structures of compounds were confirmed through 1H-NMR and 13C-NMR spectroscopy. The details of isolation for analytes 13, along with 1H- and 13C-NMR data provided with supplementary information. Due to the isolation of analyte 3 from C. pareira, it was important to include this plant in the list of profiling of cucurbitacins. The results of quantitation showed that analytes 14 in various plants (in this study) occur in a very large range of concentrations between 0.12–5153.6 mg/Kg. The results of quantitation study are summarized in Table 3.

Table 3 Quantitation of analytes 1–4 in Cucurbitaceae plants.

Analytes 14 were found to be present in significant quantities in Citrullus colocynthis which is well known for its cucurbitacin content. This plant exhibits various important bioactivities such as antidiabetic, anticancer, anti-inflammatory, etc.32,33. The fruits of C. colocynthis have been used traditionally in the Indo-Pak region for its antidiabetic properties34. The results of the current quantitation study concur with the traditional use C. colocynthis fruits as it contains high concentrations of cucurbitacins. Cucurbitacins E, I and Q have been shown to possess antitumor and antidiabetic activities35,36,37,38,39,40. C. pareira contains a substantial amount of cucurbitacin Q as found in our study and this concurs with the antiproliferative potential of the plant30,41. This does not signify that the anticancer potential of this plant is only due to the presence of a large amount of cucurbitacin Q as it requires further studies. Another interesting concurrence is the presence of analytes 14 in moderate amounts in other plants of the Cucurbitaceae family. These plants exhibit antidiabetic, anticancer, antibacterial, and other activities42,43,44.

Conclusion

The present study has putatively identified fifty-one compounds in ten species in six different genera of the Cucurbitaceae family and C. pareira of the Menispermaceae family using high-resolution masses and fragmentation data. Mass spectrometric data of the identified compounds was used to produce a distribution profile these compounds in the analyzed Cucurbitaceae plants. A quantitation method for four bioactive cucurbitacins in the Cucurbitaceae plants was also developed in the current study. The developed quantitation method is simple, rapid, and sensitive. The results of this study are useful for natural product chemistry, food quality control, herbal products standardization, and drug discovery and development.