Introduction

Polymer science generally involves uniform subunits that are polymerized from petroleum-derived building blocks, such as nylon and polyethylene, and does not usually focus on or produce polymers with complex chemical structures or mixtures with numerous and diverse macromolecular complexities. However, a vast majority of biomacromolecules in nature possess biochemically complex structures and thus present a significant challenge in the evaluation and characterization of the components of complexed biomacromolecules, such as land plant biomass and seaweed polysaccharides. Seaweed biomass, primarily composed of polysaccharides, has particularly low hydrophobic and crystalline polymers that make them distinct from cellulosic polymers that are widely put to practical uses.1, 2 Therefore, a new field in polymer science will develop as the challenges in characterizing and understanding the biochemical complexities of various seaweed natural products are solved.

The ocean area surrounding Japan comprises the sixth largest exclusive economic zone in the world and possesses a great variety of marine resources, including fisheries and mineral resources.3 Of these resources, the biodiversity in marine ecosystems based on seaweeds in the coastal regions of Japan provides valuable biomass resources and has tremendous benefits to serve human needs.4 The biomass from seaweeds inhabiting the coastal regions are composed of various beneficial polysaccharides, such as alginic acid, fucoidan and agarose.1, 5 Although a small portion of these beneficial polysaccharides have been used in both manufacturing and for pharmaceutical products, the vast majority remains unexploited for their potential industrial applications because of the paucity of evaluation and characterization techniques for the utility, applicability and availability of these natural products. Therefore, the development of a biopolymer analytical strategy designed to describe and assess the diverse characteristics of seaweed biomass is desirable.

The evaluation and characterization of compositional variations in seaweed biomass is important for developing a method to determine the biopolymer application and composition of seaweeds on the basis of their biomass characteristics. This evaluation and characterization is required to extract useful information from the seaweed biomass complexes obtained by comprehensively measuring these complicated biomacromolecules in untapped natural resources. From this viewpoint, a nuclear magnetic resonance (NMR)-based metabolomics approach is a powerful tool for comprehensively evaluating the metabolic profiles of biochemical complexes in natural ecosystems and has been used extensively to study metabolites from a wide range of biological systems in various environments.6, 7, 8, 9, 10, 11, 12, 13 In addition, an NMR-based metabolomics approach has been applied to biomass profiling for land plant cell wall components.14, 15 However, biomass profiling of water-soluble and water-insoluble (cell wall) components in diverse seaweeds has been largely unexplored in terms of using the comprehensive characterization afforded by an NMR-based metabolomics approach.

Here, the biomass profiling of biochemical complexes from a variety of seaweeds, including brown, green and red algae (16 species total), is described through the comprehensive characterization of water-soluble, water-insoluble and solid-state components. For the NMR analysis of water-insoluble components, an extraction process based on a DMSO-d6/Pyridine-d5 (4/1, v/v) solvent system was modified and optimized.15 In addition, the compositional variations in and diversity of the water-soluble, water-insoluble and solid-state components were evaluated and compared by both NMR and attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy, and subsequently subjected to multivariate statistical analysis.

Materials and methods

Sample collection

The seaweeds used in this study were collected on 11 August and 9 September 2010 from an intertidal area at Tenjin Island in Yokosuka City, Kanagawa, Japan (N35.222, E139.602). The collected samples were classified into brown, green and red algae (9, 4 and 3 species, respectively; Supplementary Table 1).

NMR spectroscopy

To evaluate the water-soluble components of seaweeds, lyophilized samples were crushed to a powder using an Automill machine (Tokken, Inc., Chiba, Japan) and suspended in a phosphate buffer solution (0.1 M K2HPO4/KH2PO4, pH 7.0) containing 90% deuterium oxide and 1 mM sodium 2,2-dimethyl-2-silapentane-5-sulfonate (DSS) as an internal standard for NMR spectroscopy. One-dimensional (1D) Watergate (WG)16 and Carr-Purcell-Meiboom-Gill (CPMG) spectra of these samples were acquired at 298 K on a 700 MHz Bruker Biospin NMR instrument (AV700) equipped with an inverse (proton coils closest to the sample) gradient 5 mm Cryo 1H/13C/15N probe (Bruker Biospin, Rheinstetten, Germany). The WG pulse sequence is an effective technique for suppressing only the water signal, while retaining signals from both macromolecules and small molecules in the sample and is widely used in many complicated pulse sequences.16 On the other hand, CPMG spectra were also acquired to attenuate the broad signals from the macromolecules (for example, polysaccharide, protein and lipoproteins) and enhance sharper signals, such as those from small molecules.17, 18, 19 This enables the identification and relative quantification of small molecules in both the aliphatic and aromatic regions of the spectra. Therefore, WG and CPMG pulse sequences were used to characterize, compare and evaluate the macromolecules and small molecules in the water-soluble components. The two-dimensional (2D) 1H–13C heteronuclear single quantum coherence (HSQC) method for NMR measurements has been described previously.20, 21, 22 Briefly, a total of 128 complex f1 (13C) and 2048 complex f2 (1H) points were recorded from 24 scans per f1 increment. The obtained spectral widths were 150 and 14 p.p.m. for f1 and f2, respectively, and the NMR spectra was processed using NMRPipe software (http://spin.niddk.nih.gov/NMRPipe/).23, 24 Peak assignments were made using the SpinAssign program on the PRIMe website (http://prime.psc.riken.jp/).25, 26, 27

To evaluate the water-insoluble seaweed components, lyophilized samples were powdered and processed by various procedures to identify the best extraction method (Supplementary Table 2), and the optimal-modified process was used for all further analyses. In the optimal process, the lyophilized samples were roughly crushed in a food blender and then powdered using an Automill machine. The powdered samples were pretreated twice with hexafluoroacetone (HFA) to eliminate some of the overlapping protein signals and evaporated. To remove any small molecules, the HFA-pretreated samples were cycled thrice through a 5min incubation with ultrapure water at 50 °C, followed by supernatant removal before incubating three more times in methanol at 50 °C for 5 min with subsequent supernatant removal. At this point, the samples were milled 36 times in a ball mill machine (Fritsch Japan Co., Ltd., Kanagawa, Japan) for 10 min each with 10 min intervals to effectively dissolve the cell wall components in the DMSO-d6/Pyridine-d5 solvent (4/1, v/v). The 1D and 2D NMR spectra of the extracted DMSO-d6/Pyridine-d5 samples were acquired at 298 K using a 700MHz Bruker Biospin instrument equipped as described above (Bruker Biospin); the central DMSO solvent peak was used as the internal reference (13C 40.03, 1H 2.582 p.p.m.). For the 1H–13C HSQC NMR measurements, a total of 256 complex f1 (13C) and 1024 complex f2 (1H) points were recorded using 32 scans per f1 increment. The obtained spectral widths were 150 and 14 p.p.m. for f1 and f2, respectively. Alginic acid, fucoidan, galactan, laminaran (Sigma-Aldrich Japan Co., Ltd., Tokyo, Japan) and agar (Wako Pure Chemical Industries, Ltd., Osaka, Japan) were analyzed as standard compounds for assigning the detected signals.

ATR-FTIR spectroscopy

The ATR-FTIR spectra (400–4000 cm−1) of seaweed samples lyophilized and powdered as described above were obtained using a Nicolet 6700 FT-IR spectrometer (Thermo Fisher Scientific Inc., Waltham, MA, USA) with KBr disks. A smart iTR accessory with a high-pressure clamp (Thermo Fisher Scientific Inc.) was used for the ATR-FTIR measurements and collected 10 scans each of the background and sample spectra. Both baseline and ATR corrections for the penetration depth and frequency variations were applied using the OMNIC software (Thermo Fisher Scientific Inc., Waltham, MA, USA) supplied with the equipment.

Multivariate statistical analysis

The 1H-NMR data were reduced by subdividing the spectra into sequential designated 0.04 p.p.m. regions between the 1H chemical shifts of 0.0–9.0. Each region was integrated and normalized to the DSS integral regions after excluding any water resonances. The ATR-FTIR spectral data were integrated to 1869 data points throughout the 400–4000 cm−1 spectral range using the OMNIC software supplied with the equipment. The 1H-NMR and ATR-FTIR spectral data were statistically evaluated using both principal component analysis (PCA) and hierarchical cluster analysis (HCA) on the ‘R’ software (http://www.R-project.org/). Both PCA and HCA were used to explore any biomass component clustering based on intrinsic biochemical similarities. PCA and HCA facilitate the simultaneous comparison of a large number of complex objects, such as those in NMR and ATR-FTIR spectra, and can be used to statistically summarize the signature of intersample variations and relationships within the multivariate data.6, 8, 10, 11, 12, 13, 18, 19 In PCA, the PCs are calculated so that each component is a linear combination of the original variables orthogonal to the other PCs and with each successive component describing progressively less data variation. The data were visualized in the form of PC score and loading plots. The PC score plots can yield the sample distribution based on the biochemical variation in the data set and highlight groupings within a data set. Each coordinate in a score plot represents an individual sample, and each coordinate in a loading plot represents one NMR or ATR-FTIR spectral data point related to the biomass components. Therefore, a loading plot provides information on the spectral position responsible for the coordinated sample clusters in the corresponding score plot. In HCA, the data were represented using a hierarchical or tree-like structure calculated via the agglomerative method. The resulting hierarchical dendrogram can provide information regarding the relationships between the biomass samples by grouping them on the basis of their similarities, which allows for a visual understanding of both the fine and coarse inclusive relations between clusters.

Results and Discussion

Evaluation of water-soluble components in seaweed biomass by NMR spectroscopy

To evaluate the water-soluble components in seaweed biomass, samples dissolved in D2O were analyzed by NMR using the WG and CPMG pulse sequence programs (Supplementary Figure 1). A comparison between the CPMG and WG spectra of red algae is likely to reveal a signal enhancement derived from the monomeric or oligomeric sugar components and amino acids in the spectra (Supplementary Figures 1i–k). In contrast, a comparison between the brown algae spectra is likely to highlight the signals at 3.87, 3.79, 3.75 and 3.67 p.p.m., possibly derived from mannitol (Supplementary Figures 1l–t). All spectral data were then processed by PCA to determine the individual characteristics of the samples because PCA can facilitate the simultaneous comparison of a large amount of NMR spectral data and can statistically identify both similarities and dissimilarities in the intersample variations within multivariate data. Through the PCA score plots of the 1H-NMR data matrices, the water-soluble component profiles of these seaweeds tended to cluster according to their taxonomical grouping (that is, green, red and brown algae; Figure 1a). Thus, each NMR spectrum of water-soluble components from the same taxonomical group had a relatively high similarity to other samples of that group. In other words, PCA was capable of revealing the type and proportions of the chemical constituents within the taxonomical groupings. To identify the largest differences in the NMR chemical shifts and chemical components that contribute to the intergroup variations, loading plots were analyzed to provide information on the spectral positions responsible for the clustered positions in the corresponding sample score plots. Some water-soluble components, such as mannitol, D-galactose and L-glutamate, contributed to the clustering and were labeled by the SpinAssign program according to the loading plots data (Figures 1b and 2a and Table 1). Mannitol was characteristic of brown algae, whereas certain sugars, such as D-galactose and D-fructose, and amino acids, such as L-glutamate and L-aspartate, were characteristic of red algae. As mannitol is known to be abundant in brown algae1 and D-galactose is an abundant constituent sugar in carrageenan and agar from red algae,1 the biomass profiling of the water-soluble components was able to identify individual characteristics among the various seaweeds. Surprisingly, the PCA results were indicative of clustering according to the differences between the taxonomic groups, whereas the general data seemed to define these separations according to the differences in pulse sequence data in the WG and CPMG spectra. Therefore, these results revealed that the NMR profiles of the water-soluble seaweed components possessed great diversity based on their taxonomic grouping and were unaffected by differences in the pulse sequence data.

Figure 1
figure 1

Profiles of the water-soluble seaweed components evaluated by PCA based on their 1H-NMR spectra. (a) PCA score plot computed from green algae CPMG (♦), green algae WG (◊), red algae CPMG (▪), red algae WG (□), brown algae CPMG (▴) and brown algae WG (▵) spectra; (b) PCA loading plot that provided information regarding the spectral regions responsible for the coordinate and cluster positions for the samples in the corresponding score plots; numbers (1–21) contributed chemical compounds determined by the SpinAssign program and listed in Table 1.

Table 1 List of the chemical compounds attributed to the separation based on the PCA and determined by the SpinAssign program

Evaluation of water-insoluble components in seaweed biomass by NMR spectroscopy

The method for extracting water-insoluble components from seaweed biomass was first developed by examining, modifying and optimizing a physical pretreatment and extraction process using Eisenia bicyclis (Supplementary Table 2). Although the number and intensity of the NMR signals for the test material pretreated using a ball mill machine was better than that for other pretreatment systems, such as using a food blender and Automill machine, the target biomass signals overlapped with the protein signals (Supplementary Figure 2). Therefore, the extraction process was further modified by removing the small molecules via water and methanol extractions and eliminating the protein signals via HFA extractions as described above (Supplementary Figure 3) before being deemed the optimal extraction process for the water-insoluble biomass components (Supplementary Figure 4).

Next, the biomass profiles of the extracted water-insoluble components prepared as described above and extracted with a DMSO-d6/Pyridine-d5 solvent (4/1, v/v) were evaluated using PCA score plots (Supplementary Figure 5). The water-insoluble component profiles for these various seaweeds tended to cluster according to their taxonomical differences, which indicated that the NMR spectra of components in the same taxonomic groups were similar relative to those from other taxonomic groups. Moreover, to obtain more detailed information regarding the chemical components in seaweed biomass, both the water-soluble and water-insoluble component profiles were evaluated using 2D HSQC NMR spectra (Figure 2 and Supplementary Tables 3 and 4). Standards of the detected compounds, such as laminaran and agar, were measured to assign the peaks detected in the water-insoluble HSQC NMR spectra. The distribution patterns of the detected peaks in the HSQC NMR spectra were characteristically different in regard to their taxonomical grouping (Figure 2), which was consistent with the PCA results. Moreover, the peaks detected in HSQC NMR spectra by the NMR-based metabolomics approach can be annotated as candidate chemical compounds; therefore, the detected peaks were further analyzed to determine any candidate compounds and are listed in Supplementary Tables 3 and 4. The NMR signals derived from laminaran (a major brown algae storage glucan) were detected as characteristic components for brown algae, whereas the signals derived from agar (a gelatinous substance derived from a polysaccharide that accumulates in the agarophyte red algae cell walls) were detected as characteristic components for red algae.1 Moreover, some signals derived from polysaccharides corresponding to the cell wall components in land plants, namely, xylopyranose and glucopyranose, were detected as characteristic components for green algae, which suggested that green algae had plant cell wall-like structures in their water-insoluble components. Taken together, this strategy for profiling both the water-soluble and water-insoluble components of seaweed biomass was capable of evaluating individual characteristics among various seaweeds.

Figure 2
figure 2

1H–13C HSQC spectra of both the water-soluble (a) and water-insoluble (b) component profiles for a variety of seaweeds. Superimposed 1H–13C HSQC spectra of green (green), red (red) and brown algae (black); numbers (1–109) represent the determined chemical shifts assigned by the SpinAssign program and are listed in Supplementary Table 3.

Evaluation of solid-state constituents in seaweed biomass by ATR-FTIR spectroscopy

ATR-FTIR measurements were performed on freeze-dried and powdered seaweed samples to evaluate their solid-state biomass constituents, and the resulting PCA score plots of the data matrices showed that the solid-state constituent profiles of these seaweeds were also likely to cluster according to their taxonomical grouping (Supplementary Figure 6). Therefore, this approach to profiling the constituents of solid and water-soluble and water-insoluble seaweed biomass was capable of identifying individual characteristics among various seaweeds.

Chemical profiling of water-soluble, water-insoluble and solid-state biomass from various seaweeds

The water-soluble, water-insoluble and solid-state biomass profiles of these seaweeds were compared via reanalysis using the HCA of the obtained NMR and ATR-FTIR spectral data (Figure 3 and Supplementary Figure 7). An HCA dendrogram allows for the visualization and understanding of both the fine and coarse inclusive relationships between clusters. For example, the HCA dendrogram of water-soluble biomass profiles was roughly clustered into four groups (Figure 3a) with two groups, which consisted of brown algae, clustered to the left and the other two groups clustered, which consisting of green and red algae, to the right. In this way, the HCA dendrograms of the profile components distinctively visualized their intergroup relations. Each type of profile clustered according to the differences between their green, red and brown algae characteristics. In particular, the profiles of the brown algae biomass were more uniquely characteristic than those of the green and red algae. Furthermore, the seaweed biomass profiles were largely different from those of land plants.14, 15 Thus, the chemical components of seaweed observed here might reflect the evolution of plants because the evolutionary distance of brown algae from land plants is greater than that of the other algae.28, 29 Taken together, these profiling strategies for describing the water-soluble, water-insoluble and solid-state components of seaweed were capable of evaluating the individual characteristics of different types of seaweed. This comprehensive approach will provide a strategy and technological platform for evaluating the compositional variations and diversity of complicated macromolecular mixtures of various seaweeds, and opens new avenues for understanding biomacromolecules in natural ecosystems.

Figure 3
figure 3

Water-soluble (a) and solid-state (b) biomass profiles for the seaweeds evaluated by HCA based on the 1H-NMR and ATR-FTIR spectra, respectively.

The biomass profiling of seaweeds has been largely unexplored in terms of the comprehensive characterization afforded by the NMR-based metabolomics approach. Therefore, this study was the first attempt to understand seaweed biomass profiles via comprehensive metabolomics characterizations with multivariate statistical analyses. The water-soluble, water-insoluble and solid-state components of seaweed biomass were selected to develop these strategies as an initial step in the chemical profiling of complex biochemical mixtures. The characterization of the water-soluble components via an NMR-based metabolomics approach has been used extensively to study metabolites in a wide range of biological systems in various environments,6, 7, 8, 9, 10, 11, 12, 13 and is capable of creating assignments using our database (that is, SpinAssign program at the PRIMe website).25, 26, 27 The water-insoluble components (including the cell wall components and polysaccharide storage) were important constituents in seaweeds; therefore, an extraction method before the NMR measurements was developed via the modification and optimization of physical pretreatments and extraction processes. As both the water-soluble and water-insoluble components were extractable, the solid-state constituents were also analyzed to obtain an intact biomass profile.

In this study, the ATR-FTIR measurements were used to evaluate the chemical profile of the solid-state biomass constituents. Such measurements enable samples to be examined directly in the solid state without further preparation; therefore, ATR-FTIR has the advantages of only requiring a small amount of the sample; rapid measurement times, which allow for a high throughput; and both easy and simple operation. High-throughput ATR-FTIR measurements combine with multivariate statistical analysis to become a powerful tool for evaluating and identifying individual characteristics of solid-state biomass constituents on the basis of their chemical profile patterns. However, the disadvantage of this approach is the difficulty of linking the representative signals or peaks to the individual characteristics and variations in any chemical compounds and structures. Therefore, further analysis, such as solid-state NMR measurements, would be useful to specify certain factors that contribute to these intersample variations and relationships by identifying and assigning the chemical and/or structural characteristics of the solid-state biomass constituents. Actually, the description of a chemical profiling strategy in this report is the first step in developing an analytical method and comprehensive approach for the characterization, evaluation and elucidation of the chemical and structural interactions and relationships between complex biomacromolecular mixtures in natural biomass. In the next step, the use of solid-state NMR measurements to evaluate the solid-state constituents will be considered and combined with existing methods to assess physicochemical properties, such as thermodynamic characterization, for the development of new methodologies in future works.

This study focused on the development of profiling techniques for evaluating the water-soluble, water-insoluble and solid-state components of seaweed biomass to address the challenges in determining, comparing and understanding their biochemical complexities. NMR measurements of the water-insoluble seaweed components were accomplished via the modification and optimization of a physical pretreatment and extraction process based on a DMSO-d6/Pyridine-d5 (4/1, v/v) solvent system. Evaluating all of the compositional profiles obtained via NMR and ATR-FTIR spectroscopy using multivariate statistical analysis, such as PCA and HCA, provided a procedural plan capable of distinguishing the individual characteristics of various seaweeds. The development of new methodologies incorporating this approach will help clarify the relationship between complex biomacromolecules in natural products, elucidate their supramolecular structures and functions in natural ecosystems, and provide a promising new field in polymer science.