Large scale study of anti-sense regulation by differential network analysis

Legeay, Marc; Aubourg, Sébastien; Renou, Jean-Pierre; Duval, Béatrice

doi:10.1186/s12918-018-0613-7

Volume 12 Supplement 5

Selected articles from the 5th International Work-Conference on Bioinformatics and Biomedical Engineering: systems biology

Research
Open access
Published: 20 November 2018

Large scale study of anti-sense regulation by differential network analysis

Marc Legeay^1,2,
Sébastien Aubourg²,
Jean-Pierre Renou² &
…
Béatrice Duval¹

BMC Systems Biology volume 12, Article number: 95 (2018) Cite this article

2313 Accesses
1 Citations
1 Altmetric
Metrics details

Abstract

Background

Systems biology aims to analyse regulation mechanisms into the cell. By mapping interactions observed in different situations, differential network analysis has shown its power to reveal specific cellular responses or specific dysfunctional regulations. In this work, we propose to explore on a large scale the role of natural anti-sense transcription on gene regulation mechanisms, and we focus our study on apple (Malus domestica) in the context of fruit ripening in cold storage.

Results

We present a differential functional analysis of the sense and anti-sense transcriptomic data that reveals functional terms linked to the ripening process. To develop our differential network analysis, we introduce our inference method of an Extended Core Network; this method is inspired by C3NET, but extends the notion of significant interactions. By comparing two extended core networks, one inferred with sense data and the other one inferred with sense and anti-sense data, our differential analysis is first performed on a local view and reveals AS-impacted genes, genes that have important interactions impacted by anti-sense transcription. The motifs surrounding AS-impacted genes gather transcripts with functions mostly consistent with the biological context of the data used and the method allows us to identify new actors involved in ripening and cold acclimation pathways and to decipher their interactions. Then from a more global view, we compute minimal sub-networks that connect the AS-impacted genes using Steiner trees. Those Steiner trees allow us to study the rewiring of the AS-impacted genes in the network with anti-sense actors.

Conclusion

Anti-sense transcription is usually ignored in transcriptomic studies. The large-scale differential analysis of apple data that we propose reveals that anti-sense regulation may have an important impact in several cellular stress response mechanisms. Our data mining process enables to highlight specific interactions that deserve further experimental investigations.

Background

Understanding cell regulation mechanism is a key issue in bioinformatics. In the last two decades, large-scale gene expression profiling has led to considerable advances on this topics. Many discoveries have been obtained by differential expression analyses. In medicine, molecular classification of diseases may be achieved by looking for differentially expressed genes when comparing microarray datasets from healthy and disease samples [1]. Even if useful as prognostic tools, these approaches do not provide explanations about the dysfunctional mechanisms that cause the disease. Therefore, as reviewed in [2], recent works move to differential networking in order to identify cell regulations that are altered in disease samples.

Differential network biology [3, 4] refers to a set of works that rely on differential network mapping to analyse interactions between components of a biological system. These works study the changes that can be observed in the interaction networks representing biological systems when different environmental conditions, different tissue types, different disease states or different species are considered.

One type of approach in network biology is to integrate static interaction knowledge with dynamic changes in gene expression or metabolic fluxes. In [5], the authors propose to screen a known molecular interaction network to identify active sub-networks; an active sub-network is a connected region of the network with an unexpected high level of differential expression over particular sets of conditions. The top-scoring active sub-networks help to uncover regulatory mechanisms that control the expression changes in these experiments. In [6], the integration of a static protein-protein interaction network with ageing-related gene expression data is used to identify cellular changes related to age. An interaction from the reference network is put in the network associated to age A if the two related proteins are expressed in the datasets relative to A. The topologies of 37 different age-specific networks are analysed. While the global network topology does not exhibit significant changes with age, the measures of local topology (node degree, clustering coefficient, graphlet degree, …) reveal a small set of proteins whose centrality values are correlated with age.

Another possible approach is to compare networks inferred from data. A lot of methods have been proposed in the literature for the task of building an interaction network from a collection of expression data [7–10]. In these reverse engineering methods, the network modelises gene interactions as static processes inferred from a set of similar samples. To decipher the cellular response to different situations, more recent studies propose to identify differential co-expression patterns by comparing several networks [11, 12]. The comparison may be carried on pair-wise gene co-expressions by quantifying the significant differences [13, 14]. The comparison may also identify sub-networks containing significant changes of regulation; in [15], the authors measure the preservation of network modules across a set of condition-specific networks. Thus differential network analysis proposes to identify significant interaction modifications when comparing networks involving the same set of actors but related to different conditions.

The present work aims to study the impact of anti-sense transcription on gene networks by a method inspired by differential network analysis. Anti-sense RNAs are endogenous RNA molecules whose partial or entire sequences exhibit complementarity to other transcripts. Their different functions are not completely known but several studies suggest that they play an important role in the regulation of gene expression [16]. Mechanisms of anti-sense regulation can affect positively or negatively the protein production through their impact on transcription [17], mRNA degradation/stability [18] or final translation [19]. More recent works have also shown that anti-sense RNA are involved in the chromatin architecture [20]. Among their different functions, anti-sense genes can trigger the post-transcriptional gene silencing: it is a RNA-degradation mechanism creating small interfering RNAs (siRNA). The anti-sense transcript and the sense transcript hybridize themselves to form a double strand RNA (dsRNA). For instance, in Arabidopsis thaliana, [21] found that the RPP5 defence gene was affected by this phenomena: its sense and anti-sense dsRNA are degraded in siRNA which presumably contributes to the degradation of the sense transcript in the absence of pathogen infection. A similar mechanism regarding the salt tolerance in Arabidopsis thaliana has been described [22]. The description of these regulations and the results of genome-wide approaches [23, 24] suggest that anti-sense genes and RNA are major actors of biological pathways and must be integrated in the methods inferring gene networks.

The present work deals with apple data because a previous study [24] reveals different interesting points about anti-sense transcription in this organism. The authors used a dedicated microarray chip and RNA-Seq of small RNAs to analyse anti-sense transcription in eight different organs of apple Malus domestica. In their analysis, the three following points are important results about anti-sense transcription. Firstly, their measures of sense and anti-sense transcription show that, when considering the sense genes expressed in at least one of the eight organs, a significant level of anti-sense expression is found for 65% of them. This observation is higher than the previous studies performed on Arabidopsis thaliana [25, 26] that identified only 30% of anti-sense expression. Secondly, the presence of short interfering RNAs is correlated with the anti-sense transcript expression. Thirdly, the levels of expression of anti-sense transcripts vary depending on both organs and Gene Ontology (GO) categories: genes related to fruits and seeds, and belonging to the "defense" GO class have higher levels.

The work described in this paper proposes a genome-wide analysis of apple transcriptomic data, with measures of anti-sense transcripts, in the context of fruit ripening and cold storage. To identify the impact of integrating anti-sense transcription in gene network inference, we propose to achieve an original differential network analysis where we compare two context-specific gene networks inferred from two sets of actors: the first set is only composed by sense transcripts, it represents a usual transcriptomic study; the second set is composed by both sense and anti-sense transcripts. By computing the major differences between these two networks, we aim to highlight interactions that are greatly impacted by anti-sense transcripts and that would be neglected in a classical gene expression analysis.

The gene network inference methods based on statistical measures lead to infer many interactions in the network. Some of them are noise or redundant, we call them false positive interactions. A lot of works have been done to minimize such false positive interactions [27, 28]. Altay and Emmert-Streib [8] proposes to study what they call the core part of the gene network. The core network represents the most trustworthy part of the network: it is computed by selecting only the most significant interaction for each gene. However, the constraint of selecting only one interaction per gene seems too restrictive. We propose to extend the core network by considering a small number of significant interactions for each gene. Therefore, our gene network inference method computes what we call an Extended Core Network (ECN). We use ECN in our differential analysis to discover the interactions of the core network that are impacted by the integration of the anti-sense transcripts. Our proposal is summarized by the workflow presented in Fig. 1. We then define the notion of AS-impacted genes to identify the genes whose network neighbourhood is drastically different when the anti-sense transcripts are considered. We also study the relationships between the AS-impacted genes and we try to explain how their neighbourhood is rearranged by computing Steiner trees.

The rest of the paper is organized as follows. In “Methods” section, we first present the biological samples and the differential functional analysis that we propose to study the impact of anti-sense data on functional enrichment tests. Then we introduce the Extended Core Network Inference algorithm and we evaluate it on artificial datasets. We also present our differential analysis of two core gene networks built on different sets of actors, with the associated definition of change motifs and AS-impacted genes. We finally explain the computation of Steiner trees to study the relationships between AS-impacted genes. In “Results and discussion” section, we first present the results of the differential functional analysis ; the outputs are functional terms clearly consistent with ripening in cold situation. We then present the results of our differential network study, with a detailed biological analysis that confirms the interest of taking into account anti-sense transcription in biological experiments.

Methods

Biological material

We study the anti-sense transcription in the context of apple fruit ripening thanks to microarray data. The AryANE v1.0 microarray is designed to detected 63,011 predicted sense genes and 63,011 complementary anti-sense sequences for the Malus domestica genome [29]. With this microarray, we can identify the sense and anti-sense expression for the whole predicted genome. We used data produced in order to study the evolution of fruit in cold storage condition for different varieties. For each 22 genotypes we have two samples: one at harvest time (H) and one 60 days after harvest time (60DAH) with fruits kept at 4°C. The data produced by the microarray are the intensity values of loci, a locus being associated to a gene. In order to analyse the microarray data, we use the quantile normalization [30, 31]. Then we selected differentially expressed transcripts (p-val < 1%) between harvest and 60 days later. Based on a previous study [24], we applied a further threshold of 1 log change between the two conditions. We identified 931 sense transcripts and 694 anti-sense transcripts differentially expressed between H and 60DAH. We found 200 couples (i.e. sense and anti-sense complementary transcripts) among the differentially expressed transcripts. We use all these 1,625 transcripts into our analysis.

Differential functional analysis

As we observe a significant number of anti-sense transcripts in the differentially expressed probes, we first try to investigate the biological functions concerned by these actors. For that, we perform a differential functional analysis [32] with the particularity to analyse on one hand the sense data, and on the other hand the sense and anti-sense data.

Functional enrichment is used in order to identify biological functions associated to a set of genes. It relies on an ontology that regroups and hierarchizes a set of terms associated to genes. An ontology is an acyclic oriented graph, linking terms with a subsumption relation. Terms are ordered from the most specific ones to the most generic one.

The Gene Ontology Consortium provides three independent Gene Ontologies (GO) : “molecular function”, “biological process” and “cellular component” [33]. In a Gene Ontology, the most generic term is the one named after the ontology. The biochemical activity of the gene’s product is stored in the “molecular function” GO. It only specifies the activity, not where nor when it happens. A “biological process” GO term refers to a process in which the gene is involved. A biological process is an association of several molecular function via chemical or physical transformations. The “cellular component” GO indicates where the product is active.

In order to identify biological functions, we will use the “biological process” Gene Ontology and thus we associate each gene with a GO term. As anti-sense transcripts are not annotated, we affect to each anti-sense the annotation of the corresponding sense gene. This decision is based on the fact that due to its sequence complementarity, an anti-sense transcript may interact with the corresponding sense transcript, or at least with a very close member of the gene family. In [29], an apple gene is annotated using predicted orthologs (closest homolog) from Arabidopsis thaliana [34], the most studied model plant genome. When an apple gene has no homolog, we associate the term “unknown biological processes” to the gene and its anti-sense.

GO slim gives a broad overview of the ontology content without the detail of the specific fine-grained terms which are not always known. The categories were chosen to provide a broad representation of the distribution of biological roles. An annotation file associating a GO slim and a GO term with each of the 126,022 apple genes has also been created.

Once we can associate genes with GO terms, we can perform the functional analysis in which we identify statistically over-represented GO terms in a set of genes. The test performed is a hypergeometric test. A GO term is considered as statistically over-represented if the p-value is below a given threshold, generally set to 0.05.

Many tools were developed to identify the statistically over-represented GO categories in a set of genes. Among them, the Cytoscape App named BiNGO [35] provides a visual representation of the ontology: one possible output of BiNGO is a graph with nodes representing over-represented GO categories and arcs representing the hierarchy between them. The graph also represents the proportion of genes associated to a GO category by the size of a node, and the colour of a node codes for the associated p-value of the over-representation. BiNGO uses a Bonferroni correction on the hypergeometric test results. This correction is needed to compare the p-values obtained for each GO categories.

Concerning the apple ripening condition that we study, we notice an important proportion of anti-sense transcripts that are differentially expressed: the set AS is composed of 694 different transcripts whereas the S set is composed of 931 different transcripts. Therefore it is relevant to question the role of these anti-sense actors in the ripening process under cold storage conditions. To explore this question, we propose a differential functional analysis.

We perform the differential functional analysis as follows. We compute the over-represented GO categories on the sense set (S), then we compute the over-represented GO categories on the sense and anti-sense set (SAS=S ∪ AS). The use of BiNGO to perform the test gives us two sub-graphs of the GO. We propose to compute the difference of these two sub-graphs. We define the revealed-by-AS terms as the functional terms of this difference: they represent the functional terms that are outputted from the ontological analysis only when the AS transcripts are included.

Extended core network inference method

Motivations

Gene network inference from transcriptomic data has been studied in several works [7, 9, 10]. Some inference methods propose to reconstruct pairwise gene interaction networks by using a statistical measure to infer the co-expression or co-regulation between two genes. This statistical criterion can be Pearson or Spearman correlation [36], or mutual information [8, 27] to detect non-linear relationships. A threshold of significance is used to keep relevant relations in the network. The inconvenient with these methods is that they predict many false positive interactions. Among these false positive interactions, we find interactions that are not biologically true, and non-direct interactions. An indirect interaction can be observed if, for example, two genes g₂ and g₃ are under the regulation of a third gene g₁; in such a situation, the statistical criterion value between g₂ and g₃ is probably high and thus the method will infer an indirect interaction between g₂ and g₃. With these indirect interactions, the network becomes more complex and difficult to interpret. Therefore some pruning methods have been proposed to eliminate them [27]. In [8], the authors deal with this problem by proposing the Conservative Causal Core Network (C3NET) that computes the core of a gene network. The core of the network contains only the strongest interactions of the gene network: for each gene, it selects the best interaction (i.e. with the maximal mutual information). The aim of our method is to identify significant changes in the interactions when anti-sense actors are integrated by comparing two inferred networks. Therefore, we can work with the core of a gene network. However, the C3NET definition of a core network may be too restrictive since several mutual information values may be close to the maximal. This is why we propose a gene network inference method named Extended Core Network (ECN) that considers for each gene the most significant interactions.

Algorithm

Extended Core Network estimates the gene connections using mutual information. We first copula-transform [37, 38] the data to have a better estimation of the mutual information. The mutual information M[i,j] between genes i and j is estimated with the same estimator used in C3NET:

$$M[\!i,j] = \frac{1}{2}\log\left(\frac{\sigma_{I}^{2} \sigma_{J}^{2}}{|C|}\right)$$

where $\sigma _{I}^{2}$ and $\sigma _{J}^{2}$ are the variance of the expression vectors I and J of genes i and j respectively, and |C| is the determinant of the covariance matrix.

The statistical significance of pairwise mutual information is test by re-sampling methods, as C3NET or ARACNE [27]. Values that are not significant are set to 0 before applying the inference algorithm.

The Extended Core Network (ECN) algorithm infers a gene network represented by an adjacency matrix. Given a set of genes G, the first step of the algorithm is to create the zero matrix representing the fact that no genes are connected. The second step aims to identify the neighbourhood of each gene, composed of all the genes for which the mutual information value is maximal. In order to compute the maximal values of the mutual information, we use an accepting rate r. Given a gene g∈G, g^′∈G is a neighbour of g if the mutual information between g and g^′ is close to the best mutual information value of g with respect to the accepting rate r. The accepting rate must be between 0 and 1 where 0 means that we accept nothing but the best neighbour and 1 means that we accept every neighbour with a significant interaction. With an accepting rate fixed at 0, ECN works approximately as C3NET does, the difference is that C3NET selects only one interaction whereas ECN accepts all the interactions in case of two identical best values. An accepting rate fixed at 1 means that we define the core network by selecting all the significant mutual information values of the matrix. This is why the preprocessing step of the mutual information matrix sets all non-significant values to 0.

The mutual information between g₁ and g₂ is the same as the mutual information between g₂ and g₁. So C3NET outputs a symmetrical adjacency matrix, leading in an undirected graph. However we want to compare two core networks and be able to identify the interactions that changed when the anti-sense transcripts are added into the data. This is why our algorithm outputs a asymmetrical adjacency matrix, allowing us to identify the significant modifications in the neighbourhood of a gene.

The complexity of the ECN algorithm is $\mathcal {O}(n^{2})$, where n is the number of genes; it is the same as C3NET complexity [8].

Evaluation on artificial datasets

In order to compare our Extended Core Network algorithm with C3NET, we use simulated data. The assessment of a network inference method requires a reference biological network; for some organisms like E. coli [39, 40] and S. cerevisiae [41], gene networks of reasonable size are considered as established knowledge and are used as benchmarks in the following way. From the reference network, a sub-network of the desired size is extracted; then artificial expression datasets are produced by simulating the activity of the chosen sub-network; the inference method is applied on these simulated data and produces a learned network that can be compared to the original network.

We generate our simulated data with sub-networks of E. coli and S. cerevisiae thanks to the SynTREN [42] generator and simulator. The sub-networks are randomly generated using the neighbor addition method which creates a connected network of n=200 genes. We use SynTREN to create a dataset X of p=100 samples by simulating the activity of the genes from the selected network.

To evaluate the error rate of each inference method, we compare the inferred edges with the edges in the true gene network. This comparison gives us the $\text {precision} = \frac {\text {true positive}}{\text {true positive} + \text {false positive}}$ and $\text {recall} = \frac {\text {true positive}}{\text {true positive}+\text {false negative}}$ of each method. The precision and recall are used to compute the error measure named F₁ score such that $F_{1} = 2 \cdot \frac {\text {presicion} \cdot \text {recall}}{\text {precision} + \text {recall}}$.

The inference methods are tested in S=500 simulations. A simulation k∈[ 1..S] is run with a specific dataset X_k formed by n genes and $j \in \left [\!\frac {p}{2}..p\right ]$ samples randomly selected from the dataset X generated by SynTREN.

We compare different accepting rates of Extended Core Network with C3NET on two datasets, one from E. coli and another from S. cerevisiae. We test ECN with accepting rates ranging from 20 to 100% with a 10% step and from 0 to 20% with a 1% step.

Figure 2 shows the box plots of the F₁ scores obtained by the simulations. To enable the comparison with C3NET, ECN is used to produce an undirected network. The ECN_0 method corresponds to the Extended Core Network with a 0% accepting rate. The difference between ECN_0 and C3NET occurs when at least two genes have the best mutual information with one gene. We can observe that ECN has a higher F₁ score than C3NET on E. coli (Fig. 2a and b) and yeast (Fig. 2c and d) when the accepting rate is low. As expected, the F₁ score of ECN depends on the values of the accepting rate. For low values, a rate increase improves the score, but when the accepting rate exceeds a certain value, the F₁ score decreases. As explained before, a high accepting rate entails an increased number of false positives, and we observe that it greatly impacts the F₁ score. We can see this phenomena in Fig. 2a and c once the accepting rate is higher than 10% and 20% respectively, the F₁ scores drop down to 0. From empirical observations on several simulated datasets, we can notice that an accepting rate between 5 and 10% is a good rate. In the Fig. 2b and d, we can observe that the rate of 7% (ECN_0.07) is the best rate value on those specific data.

Differential network analysis

In this section we present the methodology that we propose in order to decipher the impact of anti-sense transcription on gene co-expression network. This methodology relies on the comparison of two extended core networks that reveals AS-impacted genes and change motifs that we define below. By computing Steiner trees, we also observe how the interactions between AS-impacted genes are reorganized in the network containing sense and anti-sense data.

AS-impacted genes and change motifs

We propose a differential network analysis where the extended core network inferred from the sense only data (S) is compared to the extended core network inferred from the sense and anti-sense data (SAS). We focus our analysis on a sense gene that has no sense neighbour in the SAS extended core network. If a sense node has no sense neighbour in the SAS network, it means that the interactions that exist between this gene and others in the S network are not strong enough to be present in the SAS core network. We say that this gene is an AS-impacted gene. An AS-impacted gene is observed when the mutual information of this gene with an anti-sense transcript is so strong that, even with the accepting rate, the mutual information between this gene and the other sense transcripts are not significant enough. By representing both S and SAS network in the same graph, we can define a change motif as a sub-graph formed by an AS-impacted node, with all its direct neighbours in the S and the SAS networks [32, 43]. We can identify the change motifs directly in the adjacency matrices of both networks. Change motifs allow us to identify local changes on interactions when the anti-sense actors are integrated in the network inference. The information contained in the change motifs shows the importance of taking anti-sense into account because it reveals the interactions between sense and anti-sense transcripts. Figure 3 represents a change motif designed from the AS-impacted node S1; the outer links of S1 in the S network ($S1 \multimap S2$ and $S1 \multimap S4$) are different than the outer link of S1 in the SAS network ($S1 \multimap AS3$).

A change motif surrounding an AS-impacted gene extracts from the data a small set of genes and anti-sense transcripts that deserves a further study. We shall report in the “Results and discussion” section the detailed biological analysis that we have done on these motifs.

Steiner trees to compute the rewiring of AS-impacted sub-graphs

Change motifs allow us to have a local view of the impact of the integration of anti-sense actors to the inference method. We now want to have a more global view on this impact. The AS-impacted genes are present all over the S network, but some of them are connected, forming an AS-impacted sub-graph. Those AS-impacted sub-graphs represent a part of the network strongly impacted by anti-sense actors. In order to analyse this impact, we study the rearrangement of the AS-impacted gene interactions in the SAS network. In other words, we look for indirect interactions in the SAS network between AS-impacted genes that interacts directly in the S network. This analysis is performed thanks to the Steiner tree problem explained below.

Given an undirected graph G=(V,E) with a set of vertices V and a set of edges E, and given a subset of vertices S⊆V, the minimal Steiner tree problem is to find a sub-graph G^′ of G such that S is contained in G^′, all the vertices are connected by a path, and the number of edges of G^′ is minimal. For a weighted graph, the aim is to minimize the total weight of the edges of G^′. The Steiner tree problem is used to solve several problems, such as extracting information from a large database of molecular interactions [44]. For example, for a given set of proteins, the Steiner tree problem may be applied to the interactome graph with these proteins as terminal nodes to compute a minimal set of relations connecting all these proteins. An experimental study used the Steiner tree problem on a large human protein-protein interaction network [45].

The minimum Steiner tree problem is a NP-complete combinatorial optimization problem [46] and several heuristic methods have been designed to use it with large graphs. In our workflow, we use the shortest paths based approximation [45] that computes the Steiner tree ST step by step. The first node added to the Steiner tree ST is one of the terminal node. Then, the shortest paths between all the remaining terminal nodes and the nodes of ST are computed, and the closest terminal node is connected to the Steiner tree ST with the shortest path. Once all the terminal nodes are in the Steiner tree ST, the resulting tree is pruned thanks to the minimum spanning tree method.

SteinerNet is a R package that allows to compute the minimum Steiner tree using one exact algorithm, or find a solution using four different approximate algorithms [45]. The shortest paths based approximation is one of the heuristic developed in SteinerNet that can be used on large graphs such as the protein-protein or gene networks. The SteinerNet package was last updated in 2013; then as the dependencies were obsolete, it was removed from the CRAN repository. We updated the package to the R version 3.2.0 and we will be pleased to share this updated package upon request.

In order to analyse the rewiring of AS-impacted genes, we solve the minimum Steiner tree problem with the subset of vertices composed by a AS-impacted sub-graph. AS-impacted sub-graphs have a particularity: all the genes have a direct connection in S, but because the genes are AS-impacted, all those connections are not represented in the sense and anti-sense network SAS. Thus it is interesting to wonder how these nodes rewired in SAS. If we can find a minimal Steiner tree for an AS-impacted sub-graph, the tree show the relationships between the AS-impacted genes of the sub-graph in the sense and anti-sense network. It shows how anti-sense actors intervene in the relationships between connected AS-impacted genes. Several exploitations of the Steiner trees can be done. In one hand, the Steiner tree can be visualized to help focus on interesting interactions. In an other hand, we can perform a functional analysis to identify the main biological functions impacted by anti-sense transcription.

Results and discussion

Differential functional analysis

We performed two functional analyses that, for a p-value threshold of 0.05, identified GO categories for sense data in one hand and for sense and anti-sense data in an other hand. This differential functional analysis provided 104 revealed-by-AS terms. The list of the revealed-by-AS terms associated with their p-value is given in Additional file 1. Among those terms, we can see terms related to cell wall (i.e. “cell wall organization or biogenesis”, p-value: 0.004), cold response (i.e. “response to cold”, p-value: 0.035) and osmotic response (i.e. “water transport”, p-value: 0.001). Those biological functions, related to response to stress and fruit ripening, are consistent with our experimental context since fruits are stored in cold chambers while the ripening continues (the different processes involved in this experiment are more thoroughly explained below). Without the anti-sense data, terms related to such conditions are not in the result of the functional enrichment analysis.

Differential network analysis

The number of change motifs depends on the accepting rate used in the Extended Core Network method, because this rate determines the neighbourhood size of nodes, and consequently of AS-impacted nodes. We identified 308 change motifs in the 60DAH experiment with an accepting rate of 5%. The number of change motifs is equal to the number of AS-impacted genes, and it represents about 30% of sense actors.

We performed the differential network analysis on the 60DAH experiment. Figure 4 shows the graphical result of this analysis. We can see the repartition of the AS-impacted genes (orange triangles) and note that they are spread in the network which means that AS transcription impacts is not restricted to a specific set of genes. The graph was drawn using Cytoscape [47] and features of Cytoscape may be used to mine this graph.

Enrichment analysis of change motifs

We identified 308 change motifs in the 60DAH experiment. Each change motif contains an AS-impacted gene and its direct neighbors in the S network and in the SAS network. For this analysis, each motif is enlarged by also considering the direct neighbours of its anti-sense actors; as the anti-sense actors impact the central gene of the motif, we want to consider the other interactions that they may have in the SAS network. In the following, the expression “change motif” will refer to an enlarged change motif. As a change motif is a gene set produced by our analysis work-flow, it is interesting to know if some biological functions are over-represented by this set. The change motifs are formed on average of 5 genes. Due to this small set size, we perform an enrichment analysis of each change motif using the apple GO slim, which contains 14 categories. Additional file 2 presents the results of this enrichment analysis.

The main post-harvest factors that influence apple softening include temperature, atmosphere, relative humidity, calcium treatment, and ethylene [48]. Artificial prevention of ripening process (and keeping quality) is the main goal of controlled atmosphere storage (low oxygen and high carbon dioxide) and/or cold storage (low temperatures). For cold storage, apples during post-harvest time are stored at 0-4°C. Low temperatures influence the post-harvest biology of apple fruits. Chilling stress is a physiological disorder that limits the storage of chilling sensitive fruits at low, but non-freezing temperatures [49]. Low temperatures disrupt the balance of reactive oxygen species (ROS), leading to its accumulation and oxidative stress [50]. Therefore, in the particular case of the specific transcriptome of apple after two months of cold storage, it is expected to find genes involved in: ripening/cell wall modifications, cold signaling/response to cold stress and response to oxidative stress.

We identified 308 motifs as AS-impacted in the transcriptome of the two months storage set (60DAH). Taking in account the homologies between apple and Arabidopsis genes when available, 72 of them displayed a significant enrichment in the GO slim categories such as “developmental processes”, “response to abiotic or biotic stimulus”, “response to stress” and “transport” (see Additional file 3).

The 291 apple genes involved in the 72 enriched motifs correspond to 209 putative Arabidopsis orthologs. Interestingly, a closer identification of putative biological functions based on the TAIR annotations [34] allowed to relate no less than 127 of them (i.e. 61%) to the above cited processes, in summary: ripening, cold and oxidative stress. They were found in 88% of the motifs (see Additional file 3), indicating that the majority of the genes belonging to change motifs can be directly related to the biological process of fruit post-harvest conservation at low temperature.

Besides, 81 apple genes lack any information on their biological function and are not considered in this analysis. Arabidopsis orthologs of some of them have been cited in deregulated gene lists of various conditions of stress response, which is not inconsistent per se, but not highlighted in this study. The consistency of the AS impacted motifs with the apple maturation process under cold temperature can be easily highlighted through the three following examples.

The motif #1 (Fig. 5a) contains 4 genes: MDP0000250286 is encoding for a putative superoxide dismutase responding to cold stress [51], MDP0000120044 is similar to the Cyt P450 monoxygenase CYP714A1 involved in gibberellic acid pathway [52] and MDP0000251669 to a thioredoxin [53], all of them involved in the oxidative stress response. Lastly, MDP0000813397 is encoding a brassinosteroid signaling kinase [54], brassinosteroid being a plant hormone family involved in cold stress response [55]. In this first example, the members of the motif are all directly related to the cold stress response which occurs during the storage process.

Motif #2 (Fig. 5b) contains two genes involved in the response to cold, MDP0000205588, encoding an osmotic fatty acid desaturase [56] and MDP0000249561, the BZIP44 transcription factor also putatively involved in the control of fruit maturation through cell wall loosening [57, 58]; and two genes involved in response to oxidative stress: MDP0000309881, similar to the hypoxia-induced gene domain 1 [59], and MDP0000920069, encoding a respiratory burst oxidase protein F [60] also responsive to ethylene and abscisic acid. Within this motif, only the gene MDP0000252890 encoding a rubisco subunit 3B, could not be directly related to the studied process according to scientific literature.

Out of the eight genes of the Motif #3 (Fig. 5c), five of them are perfectly consistent with the fruit ripening process under cold temperature: MDP0000250138 is an ortholog of BIN2, a member of the ATSK (shaggy-like kinase) family which acts in the cross-talk between auxin and brassinosteroid signaling pathways [61], MDP0000797759, a glycine-rich RNA binding protein which increases stress tolerance under conditions of low temperature [62], MDP0000151721 is an ortholog of Fro1 involved in cold acclimation and osmotic stress response [63], MDP0000263744, a xyloglucan endotransglycosylase involved in cell wall modification and cold acclimation [64], and finally MDP0000917574, an aldehyde dehydrogenase 1A which is a critical gene of the phenylpropanoid pathway involved in the production of antioxidant components and the response to biotic and abiotic stresses [65].

In summary, these results show that the motifs obtained by taking in account the AS transcriptome in the network inference highlights new actors. In this case we clearly show that biological functions of these new actors can be related to the studied biological question. Therefore the consideration of these new actors, either sense transcripts, or anti-sense transcripts supposed to act as regulators of the corresponding coding genes, sheds a new light of the putative regulation networks underlying the studied processes.

Moreover, a synthetic view of the whole set of AS-impacted motifs can provide new avenues of work to point pathways or genes whose importance could have been underestimated without this input. In this case it is noteworthy that at least 11 occurrences of genes related to the brassinoids pathway appear in the set of 72 motifs. Brassinoids have been reported to be involved in a range of developmental processes, such as stem and root growth, floral initiation, and the development of flowers and fruits [55, 66]. Studies also revealed that brassinoids can confer resistance of plants to various abiotic and biotic stresses, including cold stress [67, 68]. Li et al. [69] even reported that brassinolide mediates tolerance of plants to abiotic stress in general and cold stress in particular. Brassinoids have also been reported as involved in grape berry ripening [70] and early fruit development in cucumber [71], but their implication has not been reported yet in apple development and maturation. The present study has shown that several genes involved in the brassinoid pathway might play a role in the apple maturation and conservation at low temperature.

AS-impacted sub-graphs

In the 60DAH experiment we identify 29 AS-impacted sub-graphs, when considering only graphs with at least three connected AS-impacted nodes. For instance, Fig. 6 shows a Steiner tree containing the AS-impacted gene of the change motif #3. This Steiner tree connect all the AS-impacted genes (drawn in orange triangle-shaped nodes) thanks to sense (blue) and anti-sense (purple) nodes from the SAS network. Red links are interactions from the S network impacted by anti-sense transcription; they do not appear in the SAS network (nor in the Steiner tree). With this visualization, we can see that the core of the network has been highly modified by the integration of anti-sense actors, however AS-impacted genes that were direct neighbors in the S network are still connected but in an indirect manner. The Steiner tree from Fig. 6 is composed by 26 AS-impacted genes and 82 Steiner nodes. We perform a functional enrichment of AS-impacted genes in one hand and the whole genes of the Steiner tree in an other hand using the GO slim. The 26 AS-impacted genes are tagged with two GO slim categories: “other metabolic processes” (p-value: 0.030) and “transcription, DNA-dependent” (p-value: 0.023). The functional enrichment of the 108 genes of the Steiner tree are enriched by the “response to abiotic or biotic stimulus” GO slim category with a p-value of 0.048, which is consistent with our experimental context.

Conclusion

The original proposition of our work is to analyse the impact of anti-sense transcription on a large scale. To achieve this goal, sense and anti-sense transcripts are treated in the same way in our gene network inference method. By considering the recent ideas of differential network analysis, our main proposal is to compare network inferred from different sets of data: the sense data in one hand, and both the sense and anti-sense data in an other hand. To compare the inferred networks, we first propose the Extended Core Network inference method. Secondly we define the differential network analysis that we performed with the Extended Core Network method. The comparison of two networks inferred on different sets of actors shows that the integration of the anti-sense data clearly modifies the network topology. We define the AS-impacted genes and the change motifs, that identify those changes in the network topology. The biological analysis of change motifs on apple data highlights interesting actors and emphasizes the interest to take anti-sense data into account in transcriptomic analysis. The differential network analysis identifies subsets of new actors in the context of maturation and conservation of the fruit that will be explored in further studies. We also propose to study the AS-impacted sub-graphs to provide a more global view of the impact of anti-sense transcription on biological functions.

Abbreviations

AS:: Anti-sense
C3NET:: Conservative causal core network
ECN:: Extended core network
GO:: Gene ontology
RNA:: Ribonucleic acid
S:: Sense
SAS:: Sense and anti-sense

References

Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science. 1999; 286(5439):531–7.
Article CAS Google Scholar
de la Fuente A. From ’differential expression’ to ’differential networking’ – identification of dysfunctional regulatory networks in diseases. Trends Genet. 2010; 26(7):326–33.
Article CAS Google Scholar
Sharan R, Ideker T. Modeling cellular machinery through biological network comparison. Nat Biotechnol. 2006; 24(4):427–33.
Article CAS Google Scholar
Ideker T, Krogan NJ. Differential network biology. Mol Syst Biol. 2012; 8:565.
Article Google Scholar
Ideker T, Ozier O, Schwikowski B, Siegel AF. Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics. 2002; 18(suppl_1):233.
Article Google Scholar
Faisal FE, Milenković T. Dynamic networks reveal key players in aging. Bioinformatics. 2014; 30(12):1721.
Article CAS Google Scholar
Bansal M, Belcastro V, Ambesi-Impiombato A, di Bernardo D. How to infer gene networks from expression profiles. Mol Syst Biol. 2007; 3(1):78.
PubMed PubMed Central Google Scholar
Altay G, Emmert-Streib F. Inferring the conservative causal core of gene regulatory networks. BMC Syst Biol. 2010; 4(1):132.
Article Google Scholar
Marbach D, Costello JC, Küffner R, Vega NM, Prill RJ, Camacho DM, Allison KR, The DREAM5 Consortium, Kellis M, Collins JJ, Stolovitzky G. Wisdom of crowds for robust gene network inference. Nat Methods. 2012; 9(8):796–804.
Article CAS Google Scholar
Friedel S, Usadel B, von Wiren N, Sreenivasulu N. Reverse Engineering: A Key Component of Systems Biology to Unravel Global Abiotic Stress Cross-Talk. Front Plant Sci. 2012; 3:294.
Article Google Scholar
Barabási A-L, Gulbahce N, Loscalzo J. Network Medicine: A Network-based Approach to Human Disease. Nat Rev Genet. 2011; 12(1):56–68.
Article Google Scholar
Altay G, Asim M, Markowetz F, Neal DE. Differential C3net reveals disease networks of direct physical interactions. BMC Bioinformatics. 2011; 12(1):296.
Article Google Scholar
Bockmayr M, Klauschen F, Györffy B, Denkert C, Budczies J. New network topology approaches reveal differential correlation patterns in breast cancer. BMC Syst Biol. 2013; 7:78.
Article Google Scholar
Warsow G, Greber B, Falk SSI, Harder C, Siatkowski M, Schordan S, Som A, Endlich N, Schöler H, Repsilber D, Endlich K, Füllen G. Expressence - revealing the essence of differential experimental data in the context of an interaction/regulation net-work. BMC Syst Biol. 2010; 4:164.
Article Google Scholar
Langfelder P, Luo R, Oldham MC, Horvath S. Is my network module preserved and reproducible?PLOS Comput Biol. 2011; 7(1):1–29.
Article Google Scholar
Pelechano V, Steinmetz LM. Gene regulation by antisense transcription. Nat Rev Genet. 2013; 14(12):880–93.
Article CAS Google Scholar
Gelfand B, Mead J, Bruning A, Apostolopoulos N, Tadigotla V, Nagaraj V, Sengupta AM, Vershon AK. Regulated antisense transcription controls expression of cell-type-specific genes in yeast. Mol Cell Biol. 2011; 31(8):1701–9.
Article CAS Google Scholar
Faghihi MA, Zhang M, Huang J, Modarresi F, Van der Brug MP, Nalls MA, Cookson MR, St-Laurent G, Wahlestedt C. Evidence for natural antisense transcript-mediated inhibition of microRNA function. Genome Biol. 2010; 11:56.
Article Google Scholar
Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, Pesce E, Ferrer I, Collavin L, Santoro C, Forrest ARR, Carninci P, Biffo S, Stupka E, Gustincich S. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012; 491(7424):454–7.
Article CAS Google Scholar
Murray SC, Haenni S, Howe FS, Fischl H, Chocian K, Nair A, Mellor J. Sense and antisense transcription are associated with distinct chromatin architectures across genes. Nucleic Acids Re. 2015; 43(16):7823–37.
Article CAS Google Scholar
Yi H, Richards EJ. A Cluster of Disease Resistance Genes in Arabidopsis Is Coordinately Regulated by Transcriptional Activation and RNA Silencing. Plant Cell. 2007; 19(9):2929–39.
Article CAS Google Scholar
Borsani O, Zhu J, Verslues PE, Sunkar R, Zhu J-K. Endogenous siRNAs Derived from a Pair of Natural cis-Antisense Transcripts Regulate Salt Tolerance in Arabidopsis. Cell. 2005; 123(7):1279–91.
Article CAS Google Scholar
Wang H, Chung PJ, Liu J, Jang I-C, Kean M, Xu J, Chua N-H. Genome-wide identification of long noncoding natural antisense transcripts and their responses to light in Arabidopsis. Genome Res. 2014:165555–113.
Celton J-M, Gaillard S, Bruneau M, Pelletier S, Aubourg S, Martin-Magniette M-L, Navarro L, Laurens F, Renou J-P. Widespread anti-sense transcription in apple is correlated with siRNA production and indicates a large potential for transcriptional and/or post-transcriptional control. New Phytol. 2014; 203(1):287–99.
Article CAS Google Scholar
Meyers BC, Vu TH, Tej SS, Ghazal H, Matvienko M, Agrawal V, Ning J, Haudenschild CD. Analysis of the transcriptional complexity of Arabidopsis thaliana by massively parallel signature sequencing. Nat Biotechnol. 2004; 22(8):1006–11.
Article CAS Google Scholar
Stolc V, Samanta MP, Tongprasit W, Sethi H, Liang S, Nelson DC, Hegeman A, Nelson C, Rancour D, Bednarek S, Ulrich EL, Zhao Q, Wrobel RL, Newman CS, Fox BG, Phillips GN, Markley JL, Sussman MR. Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays. Proc Natl Acad Sci U S A. 2005; 102(12):4453–8.
Article CAS Google Scholar
Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Favera RD, Califano A. ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context. BMC Bioinformatics. 2006; 7(Suppl 1):7.
Article Google Scholar
Zhang X, Liu K, Liu Z-P, Duval B, Richer J-M, Zhao X-M, Hao J-K, Chen L. NARROMI: a noise and redundancy reduction technique improves accuracy of gene regulatory network inference. Bioinformatics. 2013; 29(1):106–13.
Article Google Scholar
Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D, Salvi S, Pindo M, Baldi P, Castelletti S, Cavaiuolo M, Coppola G, Costa F, Cova V, Dal Ri A, Goremykin V, Komjanc M, Longhi S, Magnago P, Malacarne G, Malnoy M, Micheletti D, Moretto M, Perazzolli M, Si-Ammour A, Vezzulli S, Zini E, Eldredge G, Fitzgerald LM, Gutin N, Lanchbury J, Macalma T, Mitchell JT, Reid J, Wardell B, Kodira C, Chen Z, Desany B, Niazi F, Palmer M, Koepke T, Jiwan D, Schaeffer S, Krishnan V, Wu C, Chu VT, King ST, Vick J, Tao Q, Mraz A, Stormo A, Stormo K, Bogden R, Ederle D, Stella A, Vecchietti A, Kater MM, Masiero S, Lasserre P, Lespinasse Y, Allan AC, Bus V, Chagné D, Crowhurst RN, Gleave AP, Lavezzo E, Fawcett JA, Proost S, Rouzé P, Sterck L, Toppo S, Lazzari B, Hellens RP, Durel C-E, Gutin A, Bumgarner RE, Gardiner SE, Skolnick M, Egholm M, Van de Peer Y, Salamini F, Viola R. The genome of the domesticated apple (Malus × domestica Borkh,). Nat Genet. 2010; 42(10):833–9.
Article CAS Google Scholar
Bullard JH, Purdom E, Hansen KD, Dudoit S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics. 2010; 11:94.
Article Google Scholar
Qiu X, Wu H, Hu R. The impact of quantile and rank normalization procedures on the testing power of gene differential expression analysis. BMC Bioinformatics. 2013; 14:124.
Article Google Scholar
Legeay M, Duval B, Renou J-P. Differential Functional Analysis and Change Motifs in Gene Networks to Explore the Role of Anti-sense Transcription In: Bourgeois A, Skums P, Wan X, Zelikovsky A, editors. Bioinformatics Research and Applications. Lecture Notes in Computer Science. Switzerland: Springer: 2016. p. 117–126.
Google Scholar
The Gene Ontology Consortium: Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017; 45(D1):331–8.
The Arabidopsis Information Resource (TAIR). https://www.arabidopsis.org/.. Accessed 18 July 2017.
Maere S, Heymans K, Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks. Bioinformatics. 2005; 21(16):3448–9.
Article CAS Google Scholar
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008; 9(1):559.
Article Google Scholar
Steuer R, Kurths J, Daub CO, Weise J, Selbig J. The mutual information: Detecting and evaluating dependencies between variables. Bioinformatics. 2002; 18(suppl 2):231–40.
Article Google Scholar
Kurt Z, Aydin N, Altay G. A comprehensive comparison of association estimators for gene network inference algorithms. Bioinformatics. 2014; 30(15):2142–9.
Article CAS Google Scholar
Shen-Orr SS, Milo R, Mangan S, Alon U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet. 2002; 31(1):64–68.
Article CAS Google Scholar
Ma H-W, Kumar B, Ditges U, Gunzer F, Buer J, Zeng A-P. An extended transcriptional regulatory network of Escherichia coli and analysis of its hierarchical structure and network motifs. Nucleic Acids Res. 2004; 32(22):6643–9.
Article CAS Google Scholar
Guelzim N, Bottani S, Bourgine P, Képès F. Topological and causal structure of the yeast transcriptional regulatory network. Nat Genet. 2002; 31(1):60–63.
Article CAS Google Scholar
Bulcke TVd, Leemput KV, Naudts B, Remortel Pv, Ma H, Verschoren A, Moor BD, Marchal K. SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms. BMC Bioinformatics. 2006; 7(1):43.
Article Google Scholar
Legeay M, Duval B, Renou J-P. Inference and differential analysis of extended core networks: A way to study anti-sense regulation In: Tian T, Jiang Q, Liu Y, Burrage K, Song J, Wang Y, Hu X, Morishita S, Zhu Q, Wang G, editors. IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016, Shenzhen, China, December 15-18, 2016. IEEE Computer Society: 2016. p. 284–7. https://doi.org/10.1109/BIBM.2016.7822532.
Dittrich MT, Klau GW, Rosenwald A, Dandekar T, Müller T. Identifying functional modules in protein–protein interaction networks: an integrated exact approach. Bioinformatics. 2008; 24(13):223–31.
Article Google Scholar
Sadeghi A, Fröhlich H. Steiner tree methods for optimal sub-network identification: an empirical study. BMC Bioinformatics. 2013; 14:144.
Article Google Scholar
Karp RM. Reducibility among Combinatorial Problems. In: Complexity of Computer Computations. The IBM Research Symposia Series. Boston: Springer: 1972. p. 85–103.
Google Scholar
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003; 13(11):2498–504.
Article CAS Google Scholar
Johnston JW, Hewett EW, Hertog MLATM. Postharvest softening of apple (Malus domestica) fruit: A review. New Zealand J Crop Hortic Sci. 2002; 30(3):145–60.
Article Google Scholar
Yang Q, Rao J, Yi S, Meng K, Wu J, Hou Y. Antioxidant enzyme activity and chilling injury during low-temperature storage of Kiwifruit cv. Hongyang exposed to gradual postharvest cooling. Hortic Environ Biotechnol. 2012; 53(6):505–12.
Article CAS Google Scholar
Chaparzadeh N, Yavari B. Antioxidant responses of Golden delicious apple under cold storage conditions. Iran J Plant Physiol. 2013; 4(1):907–15.
Google Scholar
Juszczak I, Cvetkovic J, Zuther E, Hincha DK, Baier M. Natural Variation of Cold Deacclimation Correlates with Variation of Cold-Acclimation of the Plastid Antioxidant System in Arabidopsis thaliana Accessions. Front Plant Sci. 2016; 7:305.
Article Google Scholar
Zhang Y, Zhang B, Yan D, Dong W, Yang W, Li Q, Zeng L, Wang J, Wang L, Hicks LM, He Z. Two Arabidopsis cytochrome P450 monooxygenases, CYP714a1 and CYP714a2, function redundantly in plant development through gibberellin deactivation. Plant J Cell Mol Biol. 2011; 67(2):342–53.
Article CAS Google Scholar
Issakidis-Bourguet E, Mouaheb N, Meyer Y, Miginiac-Maslow M. Heterologous complementation of yeast reveals a new putative function for chloroplast m-type thioredoxin. Plant J Cell Mol Biol. 2001; 25(2):127–35.
Article CAS Google Scholar
Tang W, Kim T-W, Oses-Prieto JA, Sun Y, Deng Z, Zhu S, Wang R, Burlingame AL, Wang Z-Y. BSKs mediate signal transduction from the receptor kinase BRI1 in Arabidopsis. Science (New York, N.Y.) 2008; 321(5888):557–60.
Article CAS Google Scholar
Clouse SD, Sasse JM. BRASSINOSTEROIDS: Essential Regulators of Plant Growth and Development. Annu Rev Plant Physiol Plant Mol Biol. 1998; 49:427–51.
Article CAS Google Scholar
Shi Y, An L, Li X, Huang C, Chen G. The octadecanoid signaling pathway participates in the chilling-induced transcription of ω-3 fatty acid desaturases in Arabidopsis. Plant Physiol Biochem. 2011; 49(2):208–15.
Article CAS Google Scholar
Iglesias-Fernández R, Barrero-Sicilia C, Carrillo-Barral N, Oñate-Sánchez L, Carbonero P. Arabidopsis thaliana bZIP44: a transcription factor affecting seed germination and expression of the mannanase-encoding gene AtMAN7. Plant J Cell Mol Biol. 2013; 74(5):767–80.
Article Google Scholar
Weltmeier F, Rahmani F, Ehlert A, Dietrich K, Schütze K, Wang X, Chaban C, Hanson J, Teige M, Harter K, Vicente-Carbajosa J, Smeekens S, Dröge-Laser W. Expression patterns within the Arabidopsis C/S1 bZIP transcription factor network: availability of heterodimerization partners controls gene expression during stress response and development. Plant Mol Bio. 2009; 69(1-2):107–19.
Article CAS Google Scholar
Hwang S-T, Li H, Alavilli H, Lee B-H, Choi D. Molecular and physiological characterization of AtHIGD1 in Arabidopsis. Biochem Biophys Res Commun. 2017; 487(4):881–86.
Article CAS Google Scholar
Liu B, Sun L, Ma L, Hao F-S. Both AtrbohD and AtrbohF are essential for mediating responses to oxygen deficiency in Arabidopsis. Plant Cell Rep. 2017; 36(6):947–57.
Article CAS Google Scholar
Li H, Ye K, Shi Y, Cheng J, Zhang X, Yang S. BZR1 Positively Regulates Freezing Tolerance via CBF-Dependent and CBF-Independent Pathways in Arabidopsis. Mol Plant. 2017; 10(4):545–59.
Article CAS Google Scholar
Kim JY, Kim WY, Kwak KJ, Oh SH, Han YS, Kang H. Glycine-rich RNA-binding proteins are functionally conserved in Arabidopsis thaliana and Oryza sativa during cold adaptation process. J Exp Bot. 2010; 61(9):2317–25.
Article CAS Google Scholar
Lee B-h, Lee H, Xiong L, Zhu J-K. A Mitochondrial Complex I Defect Impairs Cold-Regulated Nuclear Gene Expression. Plant Cell. 2002; 14(6):1235–51.
Article CAS Google Scholar
Oono Y, Seki M, Satou M, Iida K, Akiyama K, Sakurai T, Fujita M, Yamaguchi-Shinozaki K, Shinozaki K. Monitoring expression profiles of Arabidopsis genes during cold acclimation and deacclimation using DNA microarrays. Funct Integr Genom. 2006; 6(3):212–34.
Article CAS Google Scholar
Nair RB, Bastress KL, Ruegger MO, Denault JW, Chapple C. The Arabidopsis thaliana REDUCED EPIDERMAL FLUORESCENCE1 gene encodes an aldehyde dehydrogenase involved in ferulic acid and sinapic acid biosynthesis. Plant Cell. 2004; 16(2):544–54.
Article CAS Google Scholar
Sasse JM. Physiological Actions of Brassinosteroids: An Update. J Plant Growth Regul. 2003; 22(4):276–88.
Article CAS Google Scholar
Krishna P. Brassinosteroid-Mediated Stress Responses. J Plant Growth Regul. 2003; 22(4):289–97.
Article CAS Google Scholar
Bajguz A, Hayat S. Effects of brassinosteroids on the plant responses to environmental stresses. Plant Physiol Biochem. 2009; 47(1):1–8.
Article CAS Google Scholar
Li B, Zhang C, Cao B, Qin G, Wang W, Tian S. Brassinolide enhances cold stress tolerance of fruit by regulating plasma membrane proteins and lipids. Amino Acids. 2012; 43(6):2469–80.
Article CAS Google Scholar
Symons GM, Davies C, Shavrukov Y, Dry IB, Reid JB, Thomas MR. Grapes on Steroids, Brassinosteroids Are Involved in Grape Berry Ripening. Plant Physiol. 2006; 140(1):150–8.
Article CAS Google Scholar
Fu FQ, Mao WH, Shi K, Zhou YH, Asami T, Yu JQ. A role of brassinosteroids in early fruit development in cucumber. J Exp Bot. 2008; 59(9):2299–308.
Article CAS Google Scholar

Download references

Acknowledgements

We thank Maryline Bruneau-Cournol (IRHS-UMR1345) for microarray data production, Sandra Pelletier (IRHS-UMR1345) for microarray design, data normalization and differential analyses, UEH-INRA 0449 for plant production and maintenance, and the ANAN platform of the SFR QuaSaV for microarray facilities access.

Funding

This work was supported by the French Pays-de-la-Loire Region via the funding of the GRIOTE project. Publications costs were funding by the GRIOTE project.

Availability of data and materials

The microarray data have been submitted to the Gene Expression Omnibus data base (https://www.ncbi.nlm.nih.gov/geo/) under the accession numbers GSE59947, GSE64079, and GSE101716 (the last one will be publicly accessible, but now requires the following token: wtobuqokfnepbkj). The design of the microarray chip is IRHS_ARyANE_v1 [110411_Mdom_JPR_exp ] identified by the accession number GPL16374. Additional files 4 and 5 are the normalized datasets used in the H and 60DAH experiments, respectively.

About this supplement

This article has been published as part of BMC Systems Biology Volume 12 Supplement 5, 2018: Selected articles from the 5th International Work-Conference on Bioinformatics and Biomedical Engineering: systems biology. The full contents of the supplement are available online at https://bmcsystbiol.biomedcentral.com/articles/supplements/volume-12-supplement-5.

Author information

Authors and Affiliations

LERIA, Université d’Angers, 2 bd Lavoisier, Angers, 49045, France
Marc Legeay & Béatrice Duval
IRHS, Agrocampus-Ouest, INRA, Université d’Angers, SFR 4207 QuaSaV, Beaucouzé, 49071, France
Marc Legeay, Sébastien Aubourg & Jean-Pierre Renou

Authors

Marc Legeay
View author publications
You can also search for this author in PubMed Google Scholar
Sébastien Aubourg
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Pierre Renou
View author publications
You can also search for this author in PubMed Google Scholar
Béatrice Duval
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ML contributed to the conception and the development of the algorithms, manuscript preparation, and analysis of the results. JPR contributed to biological analysis of the results and manuscript preparation. SA contributed to biological analysis of the results and manuscript preparation. BD contributed to the conception of the algorithms, manuscript preparation, and analysis of the results. All authors drafted the manuscript, revised it critically, read and approved the final version.

Corresponding author

Correspondence to Béatrice Duval.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1

List of the GO categories revealed by anti-sense data. List of the 104 revealed-by-AS terms. Column A: Identifier of the GO category. Column B: significant p-value. Column C: significant corrected p-value below 0.05, the correction used is the Benjamini & Hochberg False Discovery Rate (FDR) correction (Revealed-by-AS terms are ordered according to this criteria). Column D: Number of apple genes from the sample tagged with the GO category. Column E: Number of apple genes from the genome tagged with the GO category. Column F: Number of apple genes in the sample. Column G: Number of apple genes in the genome. Column H: Name of the GO category. Column I: Apple genes from the sample tagged with the GO category. Column J: Arabidopsis orthologs of the apple genes from the sample tagged with the GO category; here an Arabidopsis ortholog with an anti-sense apple gene is identified by the “_AS” prefix. (ODS 57 kb)

Additional file 2

Change motifs enrichment. Results of the enrichment analysis of the 308 change motifs from the 60DAH experiment. Columns A to D: apple genes, nodes of the change motif. Column E: GO slim category significantly over-represented by the genes present in the change motif (“No enrichment” if there is no significantly over-represented category). Column F: significant p-value below 0.05 (-1 if no enrichment). Column G: Apple genes from the sample tagged with the GO slim category. Column H: Number of apple genes in the genome. Column I: Number of apple genes in the sample. Column J: Number of apple genes from the genome tagged with the GO slim category. Column K: Number of apple genes from the sample tagged with the GO slim category. Columns L to O: Arabidopsis orthologs of the apple genes of the change motif. Column P: Arabidopsis orthologs of the apple genes from the sample tagged with the GO category. (ODS 34 kb)

Additional file 3

Functional annotation of change motifs significantly enriched according GO slim classification. For each enriched change motif from Additional file 2, we associate functional keywords from the TAIR annotation. Columns A to D: apple genes, nodes of the change motif. Column E: GO slim category significantly over-represented by the genes present in the change motif. Column F: significant p-value below 0.05 (change motifs are ordered according to this criteria). Column G: Arabidopsis orthologs tagged by the GO slim term associated with the change motif. Columns H to N: Functional keywords from Arabidopsis orthologs (TAIR annotations), keywords consistent with the biological context (fruit ripening in abiotic stress condition) are highlighted in orange. In columns H to N, we use the following abbreviations: ABA: abscissic acid; SA: salicylic acid; JA: jasmonate acid; ROS: reactive oxygen species. The three change motifs described in the manuscript (Fig. 5) are highlighted in blue. (ODS 23 kb)

Additional file 4

H experiment — http://www.info.univ-angers.fr/%7elegeay/AF4%5fHarvest.zip. Normalized data (without background suppression) of the H (Harvest) experiment. The file contains the normalized expression data of the 126,022 apple genes (rows) of the 22 samples (columns). The normalization method used is the quantile normalization. (TXT 1 kb)

Additional file 5

60DAH experiment — http://www.info.univ-angers.fr/%7elegeay/AF5%5f60DAH.zip. Normalized data (without background suppression) of the 60DAH (60 Days After Harvest) experiment. The file contains the normalized expression data of the 126,022 apple genes (rows) of the 22 samples (columns). The normalization method used is the quantile normalization. (TXT 1 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Legeay, M., Aubourg, S., Renou, JP. et al. Large scale study of anti-sense regulation by differential network analysis. BMC Syst Biol 12 (Suppl 5), 95 (2018). https://doi.org/10.1186/s12918-018-0613-7

Download citation

Published: 20 November 2018
DOI: https://doi.org/10.1186/s12918-018-0613-7

Selected articles from the 5th International Work-Conference on Bioinformatics and Biomedical Engineering: systems biology

Large scale study of anti-sense regulation by differential network analysis

Abstract

Background

Results

Conclusion

Background

Methods

Biological material

Differential functional analysis

Extended core network inference method

Motivations

Algorithm

Evaluation on artificial datasets

Differential network analysis

AS-impacted genes and change motifs

Steiner trees to compute the rewiring of AS-impacted sub-graphs

Results and discussion

Differential functional analysis

Differential network analysis

Enrichment analysis of change motifs

AS-impacted sub-graphs

Conclusion

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

About this supplement

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1

Additional file 2

Additional file 3

Additional file 4

Additional file 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Systems Biology

Contact us