Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks

Tanja Muetze; Ivan H. Goenawan; Heather L. Wiencko; Manuel Bernal-Llinares; Kenneth Bryan; David J. Lynn

doi:10.12688/f1000research.9118.2

Home Browse Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Software Tool Article

Revised

Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks

[version 2; peer review: 2 approved]

Tanja Muetze¹^*, Ivan H. Goenawan¹^*, Heather L. Wiencko², Manuel Bernal-Llinares¹, Kenneth Bryan¹, David J. Lynn^1,3

Tanja Muetze¹^*, Ivan H. Goenawan¹^*, [...] Heather L. Wiencko², Manuel Bernal-Llinares¹, Kenneth Bryan¹, David J. Lynn^1,3

^* Equal contributors

PUBLISHED 30 Aug 2016

Author details Author details

¹ EMBL Australia Biomedical Informatics Group, Infection & Immunity Theme, South Australian Medical and Health Research Institute, Adelaide, Australia
² Animal and Bioscience Research Department, Animal and Grassland Research and Innovation Centre, Teagasc, Meath, Ireland
³ School of Medicine, Flinders University, Bedford Park, Australia

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Cytoscape gateway.

Abstract

Highly connected nodes (hubs) in biological networks are topologically important to the structure of the network and have also been shown to be preferentially associated with a range of phenotypes of interest. The relative importance of a hub node, however, can change depending on the biological context. Here, we report a Cytoscape app, the Contextual Hub Analysis Tool (CHAT), which enables users to easily construct and visualize a network of interactions from a gene or protein list of interest, integrate contextual information, such as gene expression or mass spectrometry data, and identify hub nodes that are more highly connected to contextual nodes (e.g. genes or proteins that are differentially expressed) than expected by chance. In a case study, we use CHAT to construct a network of genes that are differentially expressed in Dengue fever, a viral infection. CHAT was used to identify and compare contextual and degree-based hubs in this network. The top 20 degree-based hubs were enriched in pathways related to the cell cycle and cancer, which is likely due to the fact that proteins involved in these processes tend to be highly connected in general. In comparison, the top 20 contextual hubs were enriched in pathways commonly observed in a viral infection including pathways related to the immune response to viral infection. This analysis shows that such contextual hubs are considerably more biologically relevant than degree-based hubs and that analyses which rely on the identification of hubs solely based on their connectivity may be biased towards nodes that are highly connected in general rather than in the specific context of interest.

Availability: CHAT is available for Cytoscape 3.0+ and can be installed via the Cytoscape App Store (http://apps.cytoscape.org/apps/chat).

Keywords

Network analysis, hypergeometric test, hubs, gene expression data, contextual hub analysis, CHAT

Corresponding author: David J. Lynn

Competing interests: No competing interests were disclosed.

Grant information: The research leading to these results received funding from the European Union Seventh Framework Programme (FP7/2007-2013) PRIMES project under grant agreement number FP7-HEALTH-2011-278568. The Lynn Group is also supported by EMBL Australia.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2016 Muetze T et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

How to cite: Muetze T, Goenawan IH, Wiencko HL et al. Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks [version 2; peer review: 2 approved]. F1000Research 2016, 5:1745 (https://doi.org/10.12688/f1000research.9118.2) First published: 19 Jul 2016, 5:1745 (https://doi.org/10.12688/f1000research.9118.1) Latest published: 30 Aug 2016, 5:1745 (https://doi.org/10.12688/f1000research.9118.2)

Revised Amendments from Version 1

Incorporated Sandra Orchard's and Pablo Porras's suggestions to add a reference, to clarify that interactions between input list genes or proteins are considered in the network creation and calculations and to elucidate that CHAT can not only be applied to gene lists but also works for protein lists.

See the authors' detailed response to the review by Sandra Orchard and Pablo Porras Millán

Introduction

Network analysis has emerged as a powerful approach to elucidate biological and disease processes¹. Biological networks (and many other types of networks) have been shown to have a power law distribution of node connectivity, with most nodes having few connections and a few nodes being highly connected². The identification of such highly connected nodes, termed hubs, is often of interest as hubs have been shown to be topologically and functionally important. The deletion of genes encoding hub proteins, for example, has been shown to correlate with lethality in yeast (the centrality-lethality rule)³. Hubs have also been found to be preferentially targeted by both bacterial and viral pathogens⁴ and may be master regulators of biological processes⁵. Biological networks, such as the human interactome, however, are not static entities⁶, and the extent to which a node acts as a hub can change depending on the biological context e.g. the network present in a specific cell type at a particular point in time^7,8. Integrating contextual information, such as gene or protein expression data, with standard network analysis can provide insight into what are the most relevant network features in a particular study or context^9–11.

Cytoscape has a number of applications to identify hubs in networks including cytoHubba¹², APID2Net¹³, PinnacleZ¹⁴, NetworkAnalyzer^15,16 and CentiScaPe¹⁷, however, only the latter two are compatible with Cytoscape 3+. All of the applications available to date identify hubs based on node connectivity (degree) in a network of interest. To construct a network, users frequently query interaction databases to identify the interactors of a list of genes of interest, e.g. differentially expressed genes, and then identify the high degree nodes in this network. This approach to constructing a network is useful because it identifies a more fully connected network for analysis than would be the case if one restricted interactions to only those that occur between nodes in the gene list. Analysis of these networks can, for example, identify subnetworks that are enriched in (but do not exclusively consist of) differentially expressed genes, or identify non-differentially expressed nodes that are topologically important in the network, both of which would otherwise not be identified. Identifying hubs in these networks, however, is biased towards identifying nodes that are highly connected in general such as promiscuous, ubiquitous or well-studied nodes, because nodes with many interactions in the query database have a higher probability of being included in the network by chance alone. Analysis of these degree-based hubs, for example identifying what biological processes or pathways these nodes are enriched in, tells us little about the experimental context of interest and more about the properties of highly connected nodes in general. A more appropriate analysis is to determine which nodes interact with relevant nodes in the network (which we term contextual nodes) more than is statistically expected.

Here, we introduce the Contextual Hub Analysis Tool (CHAT), a Cytoscape App that identifies hub nodes that interact with more "contextual" nodes (e.g. differentially expressed genes or proteins) than statistically expected in networks integrated with user-supplied contextual data (e.g. gene expression data). We term these nodes contextual hubs. We show that such contextual hubs are considerably more relevant than degree-based hubs to the specific experimental context under investigation. As such, these nodes are promising candidates for further functional validation studies and potentially represent important points in the network for drug targeting.

Methods

Implementation

CHAT was written in Java 8 as an Open Services Gateway Initiative (OSGi) bundle for Cytoscape 3.0+¹⁸. It adds a “CHAT” option in the “Apps” menu that launches a popup window, which allows users to adjust different network initialization parameters. CHAT prompts users to input a list of gene identifiers (the supported ID types are dependent on the database selected by the user) and any associated contextual data, e.g. gene expression data associated with the genes. While the focus of this paper is on genes, CHAT can equally be applied to proteins. The OK button triggers Cytoscape’s TaskManager to run a task that initiates the network construction and adds a tab to the results panel that provides functionality to further modify and analyze the network. To create the network, CHAT finds all the first neighbor interactors of the user-provided genes (or their encoded products). Interaction data is retrieved from one of the databases included in the PSICQUIC registry¹⁹, which the user can select. Note that interactions between the first neighbors are considered by CHAT but these are not included in the network visualization for clarity reasons. Once the network has been constructed, CHAT performs a hypergeometric test on each node in the network to identify nodes that interact with contextual nodes more than expected by chance. The probability that a given hub has k or more contextual interactors among its n interactors is given by the hypergeometric distribution:

$p (X \geq k) = \sum_{x = k}^{n} \frac{(\begin{array}{l} K \\ x \end{array}) (\begin{array}{c} N - K \\ n - x \end{array})}{(\begin{array}{c} N \\ n \end{array})}$

Where N is the number of genes with at least one interaction in the database queried and K is the number of contextually relevant nodes provided by the user (with at least one interaction in the database queried). Overrepresentation analysis heavily depends on the choice of background dataset for the determination of N. To estimate the background frequency K/N, CHAT provides access to interaction data from databases available in the PSICQUIC registry. Databases with less than 10,000 interactions are excluded. The number of genes in the user-selected database that have at least one interaction (of the specified type) in which both interactors match the user-selected criteria for constructing the network (species, interaction type and ID type) determine the node population size N. Self-interactions are disregarded. Interactions between input genes and between their first neighbors are considered in the CHAT analysis. P-values calculated by CHAT are automatically corrected for multiple testing using the Benjamini-Hochberg procedure²⁰, a method widely used in bioinformatics to avoid high false discovery rates. The Bonferroni approach is widely considered to be too strict²¹.

A right click on a node brings up an option to activate the “Node Analyzer” mode, which allows the user to analyze the connectivity pattern of individual hubs of interest. Using this function will display the node analyzer table on the results panel and all nodes except the selected node and its interactors will be hidden in the network visualization. The execution time of CHAT varies between a few seconds and a few minutes based on the number of user-supplied (contextual) genes, the size of the chosen database and its connection speed as well as the user-selected network layout. These factors also influence memory consumption.

Operation

The identification of the top contextual hubs consists of three primary steps: 1) input of a user-supplied gene list and contextual data, 2) network construction and statistical analysis to identify nodes that preferentially interact with contextual nodes and 3) visualization of the top contextual hubs and their interactions and comparison to the top degree-based hubs. To construct a network using CHAT, the user must provide a list of gene identifiers and associated numerical or categorical attributes in the text box in tab-delimited format, or upload the data as a csv or tab-delimited file via the upload button (Figure 1) (.csv or .txt file types). The user can then specify which genes in the uploaded list are contextually important based on the user-provided contextual data (e.g. genes with > 2 fold-change in expression). The user then selects one of the databases in the PSICQUIC registry to query, and specifies the relevant species, ID type and interaction type for the query. The user can then choose to visualize the network using any of the layout algorithms available in Cytoscape. Clicking the OK button creates the network and a new tab in the results panel, which allows the user to visualize the network and to analyze the results further (Figure 2). The results panel is split into several parts. In the first part, the parameters used to generate the network (database, species, id type and interaction type(s)) are displayed. The second panel allows the user to compare the top contextual hubs and the top degree-based hubs at the click of a button. By default, node size and node color are proportional to the node’s corrected p-value calculated by CHAT, such that the smaller the p-value (i.e. more statistically significant), the larger the node size and the darker the red coloring of the node. The user can customize the color scheme, however. In contrast, if the users selects “Show degree hubs”, the visualization changes and the node size and coloring will now be proportional to each node’s degree in the selected database. By default, CHAT displays the top 20 contextual hubs but the user can adjust this by using the slider provided. To investigate a single node in detail the user can employ CHAT’s “Node Analyzer” by right clicking on a node. This will limit the network view to show only the selected node and its interactors and will display a table at the bottom of the results panel tab with information on the node’s name, p-value and its interactors.

Figure 1. CHAT network analysis.

To construct a network using CHAT, the user provides a list of gene identifiers and associated numerical or categorical attributes relevant in the context of interest.

Figure 2. Network visualization.

CHAT provides a number of options to customize the network visualization.

Use case

ENSG00000154099	57.81999969
ENSG00000092295	27.93000031
ENSG00000108691	27.23999977
ENSG00000164825	25.62999916
ENSG00000163666	23.94000053
ENSG00000108700	22.17000008
ENSG00000133101	22.04999924
ENSG00000104951	21.42000008
ENSG00000178965	21.29000092
ENSG00000107593	20.45000076
ENSG00000169245	18.17000008
ENSG00000125355	17.62999916
ENSG00000149564	16.77000046
ENSG00000151364	11.81999969
ENSG00000243649	11.35000038
ENSG00000185339	11.31999969
ENSG00000166278	10.89999962
ENSG00000255221	10.56000042
ENSG00000088827	10.44999981
ENSG00000166920	10.23999977
ENSG00000167601	10.17000008
ENSG00000196141	9.140000343
ENSG00000078098	8.840000153
ENSG00000117266	8.729999542
ENSG00000185745	8.56000042
ENSG00000198785	8.050000191
ENSG00000159189	7.880000114
ENSG00000162772	7.519999981
ENSG00000173369	7.349999905
ENSG00000108387	7.170000076
ENSG00000078081	7.150000095
ENSG00000149131	7.099999905
ENSG00000162614	7.019999981
ENSG00000134326	6.96999979
ENSG00000020577	6.960000038
ENSG00000169248	6.889999866
ENSG00000006075	6.809999943
ENSG00000197272	6.789999962
ENSG00000136689	6.78000021
ENSG00000137757	6.730000019
ENSG00000187608	6.610000134
ENSG00000184270	6.599999905
ENSG00000110492	6.480000019
ENSG00000142687	6.460000038
ENSG00000173801	6.420000076
ENSG00000106785	6.320000172
ENSG00000161640	6.300000191
ENSG00000108771	6.289999962
ENSG00000137726	6.050000191
ENSG00000079385	6.039999962
ENSG00000115155	5.96999979
ENSG00000185338	5.96999979
ENSG00000145555	5.789999962
ENSG00000119917	5.760000229
ENSG00000100342	5.730000019
ENSG00000198829	5.71999979
ENSG00000165997	5.699999809
ENSG00000205362	5.590000153
ENSG00000108679	5.579999924
ENSG00000165949	5.550000191
ENSG00000125148	5.519999981
ENSG00000173372	5.480000019
ENSG00000174600	5.429999828
ENSG00000168062	5.429999828
ENSG00000188290	5.360000134
ENSG00000213533	5.349999905
ENSG00000152766	5.349999905
ENSG00000171729	5.309999943
ENSG00000054598	5.289999962
ENSG00000136960	5.28000021
ENSG00000120217	5.260000229
ENSG00000131203	5.210000038
ENSG00000139572	5.199999809
ENSG00000038945	5.179999828
ENSG00000111912	5.150000095
ENSG00000185507	5.050000191
ENSG00000111335	5.03000021
ENSG00000125144	5.019999981
ENSG00000115159	5
ENSG00000132669	5
ENSG00000136514	4.940000057
ENSG00000134321	4.889999866
ENSG00000143344	4.820000172
ENSG00000123610	4.789999962
ENSG00000135114	4.75
ENSG00000184260	4.670000076
ENSG00000158373	4.630000114
ENSG00000178175	4.53000021
ENSG00000134809	4.5
ENSG00000135047	4.46999979
ENSG00000171631	4.429999828
ENSG00000121858	4.380000114
ENSG00000143891	4.380000114
ENSG00000115267	4.360000134
ENSG00000170866	4.269999981
ENSG00000122643	4.269999981
ENSG00000111331	4.230000019
ENSG00000137959	4.179999828
ENSG00000244617	4.159999847
ENSG00000185880	4.150000095
ENSG00000163568	4.059999943
ENSG00000138642	4.050000191
ENSG00000165682	4.050000191
ENSG00000121236	4.050000191
ENSG00000168306	4.050000191
ENSG00000101986	4.039999962
ENSG00000155363	3.960000038
ENSG00000178685	3.940000057
ENSG00000136231	3.920000076
ENSG00000188313	3.900000095
ENSG00000119915	3.900000095
ENSG00000130489	3.900000095
ENSG00000184678	3.880000114
ENSG00000139410	3.869999886
ENSG00000168026	3.859999895
ENSG00000068079	3.839999914
ENSG00000134627	3.839999914
ENSG00000196664	3.809999943
ENSG00000120539	3.789999962
ENSG00000172159	3.779999971
ENSG00000165029	3.769999981
ENSG00000168961	3.75
ENSG00000107201	3.74000001
ENSG00000159228	3.730000019
ENSG00000124762	3.720000029
ENSG00000117228	3.710000038
ENSG00000116691	3.710000038
ENSG00000175518	3.700000048
ENSG00000095739	3.660000086
ENSG00000146859	3.630000114
ENSG00000198848	3.630000114
ENSG00000089127	3.619999886
ENSG00000171860	3.609999895
ENSG00000143367	3.599999905
ENSG00000112053	3.599999905
ENSG00000204103	3.569999933
ENSG00000246705	3.559999943
ENSG00000144035	3.529999971
ENSG00000188820	3.529999971
ENSG00000141837	3.519999981
ENSG00000183347	3.50999999
ENSG00000138646	3.49000001
ENSG00000076067	3.49000001
ENSG00000179921	3.460000038
ENSG00000163823	3.420000076
ENSG00000124256	3.420000076
ENSG00000254521	3.359999895
ENSG00000177409	3.359999895
ENSG00000197249	3.349999905
ENSG00000117010	3.339999914
ENSG00000119922	3.329999924
ENSG00000132530	3.289999962
ENSG00000129538	3.289999962
ENSG00000107317	3.24000001
ENSG00000187116	3.210000038
ENSG00000002549	3.200000048
ENSG00000138119	3.190000057
ENSG00000129757	3.180000067
ENSG00000116663	3.160000086
ENSG00000136816	3.160000086
ENSG00000221963	3.150000095
ENSG00000144655	3.140000105
ENSG00000253958	3.130000114
ENSG00000055332	3.119999886
ENSG00000198719	3.109999895
ENSG00000140464	3.079999924
ENSG00000059378	3.069999933
ENSG00000258227	3.069999933
ENSG00000106605	3.069999933
ENSG00000183486	3.049999952
ENSG00000134470	3.039999962
ENSG00000111911	3.029999971
ENSG00000146425	3.029999971
ENSG00000112773	3.029999971
ENSG00000111181	3.019999981
ENSG00000116016	3.019999981
ENSG00000145287	3.00999999
ENSG00000100298	3
ENSG00000010030	2.99000001
ENSG00000126262	2.99000001
ENSG00000035720	2.980000019
ENSG00000113070	2.980000019
ENSG00000158104	2.970000029
ENSG00000137628	2.970000029
ENSG00000112137	2.960000038
ENSG00000131979	2.960000038
ENSG00000137965	2.960000038
ENSG00000130589	2.950000048
ENSG00000139832	2.940000057
ENSG00000116514	2.930000067
ENSG00000205413	2.920000076
ENSG00000067066	2.900000095
ENSG00000115415	2.890000105
ENSG00000168394	2.880000114
ENSG00000148926	2.880000114
ENSG00000152061	2.869999886
ENSG00000170439	2.859999895
ENSG00000170835	2.839999914
ENSG00000104147	2.829999924
ENSG00000168003	2.819999933
This is a portion of the data; to view all the data, please download the file.

Dataset 1.Use case data.

462 genes that have been reported to be up-regulated during Dengue fever infection

As a demonstration of its potential utility and as validation, CHAT was used to construct a network using a dataset of 462 genes that have been reported to be up-regulated during Dengue fever, a mosquito-borne viral infection²² (Ensembl gene IDs for these 462 genes are provided in Dataset 1). These 462 genes represent the contextual data for this case study. CHAT was used to construct a network of these genes and their first neighbor interactors using interaction data that was sourced from InnateDB^23,24 via the PSICQUIC web service (InnateDB-All). A network of 4,910 nodes was generated. CHAT was then used to identify the top 20 conventional hub nodes (based solely on degree) and the top 20 contextual hub nodes in the network (Figure 3). No nodes were in common in the two top 20 lists. InnateDB pathway analysis^23,24 revealed that the top 20 degree-based hubs were enriched in pathways related to the cell cycle and cancer (Supplementary Table 1), which is likely due to the fact that proteins involved in these processes tend to be highly connected in general. In comparison to degree-based hubs, the top 20 contextual hubs were statistically enriched in pathways related to the immune response to viral infection, such as the interferon signaling pathway; the Retinoic acid inducible gene-I (RIG-I) pathway; the Toll-like receptor (TLR) pathway; and the Janus kinase (JAK) - Signal Transducer and Activator of Transcription (STAT) pathway (Supplementary Table 2). All of these pathways have been shown to play key roles in the host response to Dengue infection^25,26. Indeed, many of the top 20 contextual hubs (but not degree-based hubs) were well-known transcription factors involved in the host interferon response including STAT1, STAT2 and the interferon regulatory factors (IRFs); IRF1, 3, 8 and 9, which is a key cellular response to viral infection including Dengue^27,28. Another gene identified in the contextual hub analysis but not the degree-based analysis was interferon-stimulated gene 15 (ISG15). Cells in which ISG15 has been silenced have been shown to have significantly higher Dengue viral loads²⁹. The results of the pathway analysis were reinforced by a Gene Ontology analysis using innatedb.com^23,24, which identified terms including cytokine-mediated signaling pathway, type I interferon signaling pathway, and innate immune response among the top 10 enriched terms (FDR < 0.05) for the contextual hubs but not the degree-based hubs (Supplementary Table 3 and Supplementary Table 4).

Figure 3. Visualization of a Dengue gene expression dataset.

A CHAT network visualization comparing contextual hubs (A) to degree-based hubs (B) in a network constructed using InnateDB^23,24.

Conclusion

Through the integration of contextual information, such as gene or protein expression, contextual hub analysis as implemented in CHAT can identify context-specific hubs more relevant to the biological context under study, such as disease, treatment or cellular state. As shown in the above case study, these hubs are of more functional relevance than genes found through analysis based on degree only. Given the current emphasis on the importance of considering the network model of biological pathways and the ever-increasing abundance of high-throughput data, CHAT provides a valuable addition to the biologists’ computational toolkit in using a network-based approach to help prioritize genes of interest for further investigation or drug discovery. In the future, CHAT can be extended to include the contextual analysis of other network features such as network bottlenecks.

Data availability

F1000Research: Dataset 1. Use case data: 462 genes that have been reported to be up-regulated during Dengue fever infection, 10.5256/f1000research.9118.d128126³⁰

Software availability

Software available from: http://apps.cytoscape.org/apps/chat

Latest source code: https://bitbucket.org/dynetteam/chat

Archived source code at time of publication: http://www.dx.doi.org/10.5281/zenodo.56496³¹

Manual/Tutorial: https://bitbucket.org/dynetteam/chat/downloads

License: Lesser GNU Public License 3.0

Author contributions

TM and IHG jointly developed the App under the supervision of KB and DJL. TM and DJL wrote the paper with contributions from the other authors. HLW developed an earlier unpublished Python version of CHAT that inspired the development of this App. MBL provided computational and systems support for the project. DJL conceived of the idea, supervised the App’s development and co-wrote the paper.

Competing interests

No competing interests were disclosed.

Grant information

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Supplementary material

Supplementary Table 1. Pathway analysis of the top 20 degree-based hubs.

Supplementary Table 2. Pathway analysis of the top 20 contextual hubs.

Supplementary Table 3. Gene ontology terms overrepresented among the top 20 contextual hubs in the Dengue fever network.

Supplementary Table 4. Gene ontology terms overrepresented among the top 20 degree-based hubs in the Dengue fever network.

Faculty Opinions recommended

References

1. Barabasi AL, Gulbahce N, Loscalzo J: Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011; 12(1): 56–68. PubMed Abstract | Publisher Full Text | Free Full Text
2. Barabasi AL, Oltvai ZN: Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004; 5(2): 101–113. PubMed Abstract | Publisher Full Text
3. Jeong H, Mason SP, Barabási AL, et al.: Lethality and centrality in protein networks. Nature. 2001; 411(6833): 41–42. PubMed Abstract | Publisher Full Text
4. Dyer MD, Murali TM, Sobral BW: The landscape of human proteins interacting with viruses and other pathogens. PLoS Pathog. 2008; 4(2): e32. PubMed Abstract | Publisher Full Text | Free Full Text
5. Borneman AR, Leigh-Bell JA, Yu H, et al.: Target hub proteins serve as master regulators of development in yeast. Genes Dev. 2006; 20(4): 435–448. PubMed Abstract | Publisher Full Text | Free Full Text
6. Przytycka TM, Singh M, Slonim DK: Toward the dynamic interactome: It’s about time. Brief Bioinform. 2010; 11(1): 15–29. PubMed Abstract | Publisher Full Text | Free Full Text
7. Rachlin J, Cohen DD, Cantor C, et al.: Biological context networks: a mosaic view of the interactome. Mol Syst Biol. 2006; 2: 66. PubMed Abstract | Publisher Full Text | Free Full Text
8. Agarwal S, Deane CM, Porter MA, et al.: Revisiting date and party hubs: Novel approaches to role assignment in protein interaction networks. PLoS Comput Biol. 2010; 6(6): e1000817. PubMed Abstract | Publisher Full Text | Free Full Text
9. Gao S, Wang X: Identification of highly synchronized subnetworks from gene expression data. BMC Bioinformatics. 2013; 14(Suppl 9): S5. PubMed Abstract | Publisher Full Text | Free Full Text
10. Zinman GE, Naiman S, O'Dee DM, et al.: ModuleBlast: identifying activated sub-networks within and across species. Nucleic Acids Res. 2015; 43(3): e20. PubMed Abstract | Publisher Full Text | Free Full Text
11. Soul J, Hardingham TE, Boot-Handford RP, et al.: PhenomeExpress: a refined network analysis of expression datasets by inclusion of known disease phenotypes. Sci Rep. 2015; 5: 8117. PubMed Abstract | Publisher Full Text | Free Full Text
12. Chin CH, Chen SH, Wu HH, et al.: cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol. 2014; 8(Suppl 4): S11. PubMed Abstract | Publisher Full Text | Free Full Text
13. Hernandez-Toro J, Prieto C, De Las Rivas J: APID2NET: Unified interactome graphic analyzer. Bioinformatics. 2007; 23(18): 2495–2497. PubMed Abstract | Publisher Full Text
14. Chuang HY, Lee E, Liu YT, et al.: Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007; 3: 140. PubMed Abstract | Publisher Full Text | Free Full Text
15. Assenov Y, Ramírez F, Schelhorn SE, et al.: Computing topological parameters of biological networks. Bioinformatics. 2008; 24(2): 282–284. PubMed Abstract | Publisher Full Text
16. Doncheva NT, Assenov Y, Domingues FS, et al.: Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc. 2012; 7(4): 670–85. PubMed Abstract | Publisher Full Text
17. Scardoni G, Tosadori G, Faizan M, et al.: Biological network analysis with CentiScaPe: centralities and experimental dataset integration [version 2; referees: 2 approved]. F1000Research. 2014; 3: 139. PubMed Abstract | Publisher Full Text | Free Full Text
18. Shannon P, Markiel A, Ozier O, et al.: Cytoscape: A software Environment for integrated models of biomolecular interaction networks. Genome Res. 2003; 13(11): 2498–2504. PubMed Abstract | Publisher Full Text | Free Full Text
19. Aranda B, Blankenburg H, Kerrien S, et al.: PSICQUIC and PSISCORE: accessing and scoring molecular interactions. Nat Methods. 2011; 8(7): 528–529. PubMed Abstract | Publisher Full Text | Free Full Text
20. Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995; 57(1): 289–300. Reference Source
21. Noble WS: How does multiple testing correction work? Nat Biotechnol. 2009; 27(12): 1135–1137. PubMed Abstract | Publisher Full Text | Free Full Text
22. Hoang LT, Lynn DJ, Henn M, et al.: The early whole-blood transcriptional signature of dengue virus and features associated with progression to dengue shock syndrome in Vietnamese children and young adults. J Virol. 2010; 84(24): 12982–94. PubMed Abstract | Publisher Full Text | Free Full Text
23. Breuer K, Foroushani AK, Laird MR, et al.: InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res. 2013; 41(Database issue): D1228–1233. PubMed Abstract | Publisher Full Text | Free Full Text
24. del-Toro N, Dumousseau M, Orchard S, et al.: A new reference implementation of the PSICQUIC web service. Nucleic Acids Res. 2013; 41(Web Server issue): W601–6. PubMed Abstract | Publisher Full Text | Free Full Text
25. Nasirudeen AM, Wong HH, Thien P, et al.: RIG-I, MDA5 and TLR3 synergistically play an important role in restriction of dengue virus infection. PLoS Negl Trop Dis. 2011; 5(1): e926. PubMed Abstract | Publisher Full Text | Free Full Text
26. Souza-Neto JA, Sim S, Dimopoulos G: An evolutionary conserved function of the JAK-STAT pathway in anti-dengue defense. Proc Natl Acad Sci U S A. 2009; 106(42): 17841–6. PubMed Abstract | Publisher Full Text | Free Full Text
27. De La Cruz Hernández SI, Puerta-Guardo H, Flores-Aguilar H, et al.: A strong interferon response correlates with a milder dengue clinical condition. J Clin Virol. 2014; 60(3): 196–199. PubMed Abstract | Publisher Full Text
28. Morrison J, García-Sastre A: STAT2 signaling and dengue virus infection. JAKSTAT. 2014; 3(1): e27715. PubMed Abstract | Publisher Full Text | Free Full Text
29. Dai J, Pan W, Wang P: ISG15 facilitates cellular antiviral response to dengue and west nile virus infection in vitro. Virol J. 2011; 8: 468. PubMed Abstract | Publisher Full Text | Free Full Text
30. Muetze T, Goenawan IH, Wiencko HL, et al.: Dataset 1 in: Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks. F1000Research. 2016. Data Source
31. Muetze T, Goenawan IH, Wiencko HL, et al.: Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks. Zenodo. 2016. Data Source

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 19 Jul 2016

Author details Author details

Competing interests

No competing interests were disclosed.

Grant information

The research leading to these results received funding from the European Union Seventh Framework Programme (FP7/2007-2013) PRIMES project under grant agreement number FP7-HEALTH-2011-278568. The Lynn Group is also supported by EMBL Australia.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2

Revised

Published: 30 Aug 2016, 5:1745

https://doi.org/10.12688/f1000research.9118.2

version 1

Published: 19 Jul 2016, 5:1745

https://doi.org/10.12688/f1000research.9118.1

© 2016 Muetze T et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Muetze T, Goenawan IH, Wiencko HL et al. Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks [version 2; peer review: 2 approved] F1000Research 2016, 5:1745 (https://doi.org/10.12688/f1000research.9118.2)

NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 2

VERSION 2

PUBLISHED 30 Aug 2016

Revised

Views

Reviewer Report 01 Nov 2016

Christopher K. Tuggle, Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, USA

Haibo Liu, Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, USA

Approved

https://doi.org/10.5256/f1000research.10240.r16856

This Cytoscape app “CHAT” is valuable given its improvement over conventional biological network analysis methods by considering the context of network analysis. It is a good addition to the Cytoscape toolkits. However, we suggest some modifications that might make the ... Continue reading

In the “Operation” section, “The identification of the top contextual hubs consists of three primary steps: 1) input of a user-supplied gene list and contextual data, 2) network construction and statistical analysis to identify nodes that preferentially interact with contextual nodes and 3) visualization of the top contextual hubs and their interactions and comparison to the top degree-based hubs.” should include four steps to be consistent with the following statements: 1) input a gene list, 2) database selection, 3) network construction and statistical analysis, 4) visualization of the top conceptual hubs and their interactions and comparison to the top degree-based hubs.
The “implementation” and “Operation” sections in the Methods are somewhat redundant. For example, “The OK button triggers Cytoscape’s TaskManager to run a task that initiates the network construction and adds a tab to the results panel that provides functionality to further modify and analyze the network” and “A right click on a node brings up an option to activate the “Node Analyzer”mode”. It will be more appropriate if both these sentences are moved to the “Operation” section. The “Implementation” section should focus on the description of functionalities provide by CHAT and the application interfaces (APIs) are implemented, while how to conduct network analysis should be in the “Operation” part.

Suggested Modifications to future versions of CHAT

One question is the background selection criteria. As the authors mentioned, this step is very important and directly affects the later statistical test results. An alternative/better background might be the set of genes/proteins expressed under the condition used to create the original dataset, such as genes expressed in given the cell type, tissue, or treatment. This might eliminate the irrelevant nodes and reduce the number of tests needed.
In the current CHAT version, only the first-order neighbor-interactors are allowed to be considered which is generally most important and might be enough in most situations. But when the resulting network is small, the user might not be able to perform further analysis. So if the option of higher-order interactors is provided, the tool will be more versatile.
The authors mentioned that only databases in the PSICQUIC registry with at least 10,000 interactions are included. This is kind of arbitrary. While it is appreciated that other DB available in Cytoscape can be used with this tool, the usefulness of the databases should be determined case by case. So we suggest the author provide the user all the available choices of interaction databases.
We suggest a different method might be more appropriate for multiple testing correction.

The authors state that “a method widely used in bioinformatics to avoid high false discovery rates. The Bonferroni approach is widely considered to be too strict.” This could be reworded as “a method widely used in bioinformatics to avoid high false discovery rates, instead of the Bonferroni approach which is widely considered to be too strict.”
However, the author assumed the genes in the network are independent by using the “BH” method. To be more realistic, we suggest the “BY” method for correction of the multiple testing. See Benjamini and Yekutieli: The control of the false discovery rate in multiple testing under dependency Ann. Statist. 2001; 29 (1165-1188)

Competing Interests: No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Respond or Comment

Version 1

VERSION 1

PUBLISHED 19 Jul 2016

Views

Reviewer Report 09 Aug 2016

Sandra Orchard, Wellcome Trust Genome Campus, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK

Pablo Porras Millán, Wellcome Trust Genome Campus, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK

Approved

https://doi.org/10.5256/f1000research.9812.r15065

This is a well written technical paper, clearly outlining a new Cytoscape App in terms that would make it easy for a new user, with some familiarity with Cytoscape, to download, install and use. The ability to generate contextual hubs is currently not possible with existing Cytoscape Apps, so this is a valuable addition to the collection. A couple of queries and some minor points for correction:

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.
Can this application be made to work with an existing network?
The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.
It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

References

1. del-Toro N, Dumousseau M, Orchard S, Jimenez RC, et al.: A new reference implementation of the PSICQUIC web service.Nucleic Acids Res. 2013; 41 (Web Server issue): W601-6 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 30 Aug 2016

Tanja Muetze, EMBL Australia Biomedical Informatics Group, Infection & Immunity Theme, South Australian Medical and Health Research Institute, Adelaide, Australia

30 Aug 2016

Author Response

Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented ... Continue reading Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.

This observation is incorrect. CHAT does include interactions between members of the uploaded list. Perhaps what you mean is that interactions between the first neighbors of the uploaded list of genes/proteins are not considered? These are actually also considered in CHAT’s calculations – they are just excluded from the visualization to avoid unhelpful "hairball" visualizations. We have now clarified this point in the paper. We are not sure why a user would want to exclude first neighbors – this information is needed in CHAT to identify the contextual hubs.

Can this application be made to work with an existing network?

We agree that this would be a very nice feature, unfortunately it is actually very difficult to do with the current design of CHAT. CHAT identifies hub nodes that interact with more "contextual" nodes than statistically expected using a hypergeometric test. This test is reliant on calculating, N, the number of genes with at least one interaction in the database queried to estimate the background expectation. This parameter would not be known or easily estimated in a user-supplied network as CHAT wouldn't know what database or the data in the database when the network was constructed by the user. One of the nice features of CHAT is that the network is constructed using the latest data available via a PSICQUIC query.

The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.

Good point. We have now edited the text to clarify that protein as well as gene ids can be used to construct the network in CHAT.

It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

We have now added the suggested reference to the paper.

Again, we thank you for comments and suggestions.

Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.

This observation is incorrect. CHAT does include interactions between members of the uploaded list. Perhaps what you mean is that interactions between the first neighbors of the uploaded list of genes/proteins are not considered? These are actually also considered in CHAT’s calculations – they are just excluded from the visualization to avoid unhelpful "hairball" visualizations. We have now clarified this point in the paper. We are not sure why a user would want to exclude first neighbors – this information is needed in CHAT to identify the contextual hubs.

Can this application be made to work with an existing network?

We agree that this would be a very nice feature, unfortunately it is actually very difficult to do with the current design of CHAT. CHAT identifies hub nodes that interact with more "contextual" nodes than statistically expected using a hypergeometric test. This test is reliant on calculating, N, the number of genes with at least one interaction in the database queried to estimate the background expectation. This parameter would not be known or easily estimated in a user-supplied network as CHAT wouldn't know what database or the data in the database when the network was constructed by the user. One of the nice features of CHAT is that the network is constructed using the latest data available via a PSICQUIC query.

The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.

Good point. We have now edited the text to clarify that protein as well as gene ids can be used to construct the network in CHAT.

It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

We have now added the suggested reference to the paper.

Again, we thank you for comments and suggestions.
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 30 Aug 2016

Tanja Muetze, EMBL Australia Biomedical Informatics Group, Infection & Immunity Theme, South Australian Medical and Health Research Institute, Adelaide, Australia

30 Aug 2016

Author Response

Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented ... Continue reading Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.

This observation is incorrect. CHAT does include interactions between members of the uploaded list. Perhaps what you mean is that interactions between the first neighbors of the uploaded list of genes/proteins are not considered? These are actually also considered in CHAT’s calculations – they are just excluded from the visualization to avoid unhelpful "hairball" visualizations. We have now clarified this point in the paper. We are not sure why a user would want to exclude first neighbors – this information is needed in CHAT to identify the contextual hubs.

Can this application be made to work with an existing network?

We agree that this would be a very nice feature, unfortunately it is actually very difficult to do with the current design of CHAT. CHAT identifies hub nodes that interact with more "contextual" nodes than statistically expected using a hypergeometric test. This test is reliant on calculating, N, the number of genes with at least one interaction in the database queried to estimate the background expectation. This parameter would not be known or easily estimated in a user-supplied network as CHAT wouldn't know what database or the data in the database when the network was constructed by the user. One of the nice features of CHAT is that the network is constructed using the latest data available via a PSICQUIC query.

The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.

Good point. We have now edited the text to clarify that protein as well as gene ids can be used to construct the network in CHAT.

It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

We have now added the suggested reference to the paper.

Again, we thank you for comments and suggestions.

Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.

This observation is incorrect. CHAT does include interactions between members of the uploaded list. Perhaps what you mean is that interactions between the first neighbors of the uploaded list of genes/proteins are not considered? These are actually also considered in CHAT’s calculations – they are just excluded from the visualization to avoid unhelpful "hairball" visualizations. We have now clarified this point in the paper. We are not sure why a user would want to exclude first neighbors – this information is needed in CHAT to identify the contextual hubs.

Can this application be made to work with an existing network?

We agree that this would be a very nice feature, unfortunately it is actually very difficult to do with the current design of CHAT. CHAT identifies hub nodes that interact with more "contextual" nodes than statistically expected using a hypergeometric test. This test is reliant on calculating, N, the number of genes with at least one interaction in the database queried to estimate the background expectation. This parameter would not be known or easily estimated in a user-supplied network as CHAT wouldn't know what database or the data in the database when the network was constructed by the user. One of the nice features of CHAT is that the network is constructed using the latest data available via a PSICQUIC query.

The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.

Good point. We have now edited the text to clarify that protein as well as gene ids can be used to construct the network in CHAT.

It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

We have now added the suggested reference to the paper.

Again, we thank you for comments and suggestions.
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 19 Jul 2016

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 2 (revision) 30 Aug 16		read
Version 1 19 Jul 16	read

Sandra Orchard, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK

Pablo Porras Millán, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK
Christopher K. Tuggle, Iowa State University, Ames, USA

Haibo Liu, Iowa State University, Ames, USA

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

19 Views

01 Nov 2016 | for Version 2

Christopher K. Tuggle, Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, USA

Haibo Liu, Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, USA

19 Views Cite this report Responses(0)

Approved

In the “Operation” section, “The identification of the top contextual hubs consists of three primary steps: 1) input of a user-supplied gene list and contextual data, 2) network construction and statistical analysis to identify nodes that preferentially interact with contextual nodes and 3) visualization of the top contextual hubs and their interactions and comparison to the top degree-based hubs.” should include four steps to be consistent with the following statements: 1) input a gene list, 2) database selection, 3) network construction and statistical analysis, 4) visualization of the top conceptual hubs and their interactions and comparison to the top degree-based hubs.
The “implementation” and “Operation” sections in the Methods are somewhat redundant. For example, “The OK button triggers Cytoscape’s TaskManager to run a task that initiates the network construction and adds a tab to the results panel that provides functionality to further modify and analyze the network” and “A right click on a node brings up an option to activate the “Node Analyzer”mode”. It will be more appropriate if both these sentences are moved to the “Operation” section. The “Implementation” section should focus on the description of functionalities provide by CHAT and the application interfaces (APIs) are implemented, while how to conduct network analysis should be in the “Operation” part.

Suggested Modifications to future versions of CHAT

One question is the background selection criteria. As the authors mentioned, this step is very important and directly affects the later statistical test results. An alternative/better background might be the set of genes/proteins expressed under the condition used to create the original dataset, such as genes expressed in given the cell type, tissue, or treatment. This might eliminate the irrelevant nodes and reduce the number of tests needed.
In the current CHAT version, only the first-order neighbor-interactors are allowed to be considered which is generally most important and might be enough in most situations. But when the resulting network is small, the user might not be able to perform further analysis. So if the option of higher-order interactors is provided, the tool will be more versatile.
The authors mentioned that only databases in the PSICQUIC registry with at least 10,000 interactions are included. This is kind of arbitrary. While it is appreciated that other DB available in Cytoscape can be used with this tool, the usefulness of the databases should be determined case by case. So we suggest the author provide the user all the available choices of interaction databases.
We suggest a different method might be more appropriate for multiple testing correction.

Competing Interests

No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

28 Views

09 Aug 2016 | for Version 1

Sandra Orchard, Wellcome Trust Genome Campus, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK

Pablo Porras Millán, Wellcome Trust Genome Campus, European Molecular Biology Laboratory-European Bioinformatics Institute, Hinxton, UK

28 Views Cite this report Responses(1)

Approved

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.
Can this application be made to work with an existing network?
The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.
It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

References

Competing Interests

No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

30 Aug 2016

Tanja Muetze, EMBL Australia Biomedical Informatics Group, Infection & Immunity Theme, South Australian Medical and Health Research Institute, Adelaide, Australia

Thank you very much for your thoughtful review. Below we have addressed each of the points raised.

The application searches for first-neighbour interactions of molecules in the list presented to it. It did not appear to search for interaction between members of the list, which should not affect the contextual nodes selection much, but will alter the degree-based hubs. This should be commented on, or the documentation made clearer if we are incorrect with this observation. To bypass this problem and make the user more aware of this limitation, the tool should be able to provide more control over how the network is constructed, for example providing the option to exclude first neighbours.

This observation is incorrect. CHAT does include interactions between members of the uploaded list. Perhaps what you mean is that interactions between the first neighbors of the uploaded list of genes/proteins are not considered? These are actually also considered in CHAT’s calculations – they are just excluded from the visualization to avoid unhelpful "hairball" visualizations. We have now clarified this point in the paper. We are not sure why a user would want to exclude first neighbors – this information is needed in CHAT to identify the contextual hubs.

Can this application be made to work with an existing network?

We agree that this would be a very nice feature, unfortunately it is actually very difficult to do with the current design of CHAT. CHAT identifies hub nodes that interact with more "contextual" nodes than statistically expected using a hypergeometric test. This test is reliant on calculating, N, the number of genes with at least one interaction in the database queried to estimate the background expectation. This parameter would not be known or easily estimated in a user-supplied network as CHAT wouldn't know what database or the data in the database when the network was constructed by the user. One of the nice features of CHAT is that the network is constructed using the latest data available via a PSICQUIC query.

The text is entirely gene-centric and may leave an inexperienced use under the impression is is only usable for gene-expression data whereas it is equally useful for the analysis of proteomic data and works with UniProtKB identifiers. Whilst I realise this is apparent to anyone who downloads the app, it may well be worth adding a sentence to both the Summary or Introduction of this paper, and also the description in the App store just to make this very clear to naive users.

Good point. We have now edited the text to clarify that protein as well as gene ids can be used to construct the network in CHAT.

It may also be worth adding the reference to the 2013 PSICQUIC paper as well as the original as I personally find it more informative and again, may be helpful to the inexperienced user.

We have now added the suggested reference to the paper.

Again, we thank you for comments and suggestions.

View more View less

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

Click here to access the data.

Downloaded data do not display as expected? Download the data (12.92KB)

[1] 1. Barabasi AL, Gulbahce N, Loscalzo J: Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011; 12(1): 56–68. PubMed Abstract | Publisher Full Text | Free Full Text

[2] 2. Barabasi AL, Oltvai ZN: Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004; 5(2): 101–113. PubMed Abstract | Publisher Full Text

[3] 3. Jeong H, Mason SP, Barabási AL, et al.: Lethality and centrality in protein networks. Nature. 2001; 411(6833): 41–42. PubMed Abstract | Publisher Full Text

[4] 4. Dyer MD, Murali TM, Sobral BW: The landscape of human proteins interacting with viruses and other pathogens. PLoS Pathog. 2008; 4(2): e32. PubMed Abstract | Publisher Full Text | Free Full Text

[5] 5. Borneman AR, Leigh-Bell JA, Yu H, et al.: Target hub proteins serve as master regulators of development in yeast. Genes Dev. 2006; 20(4): 435–448. PubMed Abstract | Publisher Full Text | Free Full Text

[6] 6. Przytycka TM, Singh M, Slonim DK: Toward the dynamic interactome: It’s about time. Brief Bioinform. 2010; 11(1): 15–29. PubMed Abstract | Publisher Full Text | Free Full Text

[7] 7. Rachlin J, Cohen DD, Cantor C, et al.: Biological context networks: a mosaic view of the interactome. Mol Syst Biol. 2006; 2: 66. PubMed Abstract | Publisher Full Text | Free Full Text

[8] 8. Agarwal S, Deane CM, Porter MA, et al.: Revisiting date and party hubs: Novel approaches to role assignment in protein interaction networks. PLoS Comput Biol. 2010; 6(6): e1000817. PubMed Abstract | Publisher Full Text | Free Full Text

[9] 9. Gao S, Wang X: Identification of highly synchronized subnetworks from gene expression data. BMC Bioinformatics. 2013; 14(Suppl 9): S5. PubMed Abstract | Publisher Full Text | Free Full Text

[10] 10. Zinman GE, Naiman S, O'Dee DM, et al.: ModuleBlast: identifying activated sub-networks within and across species. Nucleic Acids Res. 2015; 43(3): e20. PubMed Abstract | Publisher Full Text | Free Full Text

[11] 11. Soul J, Hardingham TE, Boot-Handford RP, et al.: PhenomeExpress: a refined network analysis of expression datasets by inclusion of known disease phenotypes. Sci Rep. 2015; 5: 8117. PubMed Abstract | Publisher Full Text | Free Full Text

[12] 12. Chin CH, Chen SH, Wu HH, et al.: cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol. 2014; 8(Suppl 4): S11. PubMed Abstract | Publisher Full Text | Free Full Text

[13] 13. Hernandez-Toro J, Prieto C, De Las Rivas J: APID2NET: Unified interactome graphic analyzer. Bioinformatics. 2007; 23(18): 2495–2497. PubMed Abstract | Publisher Full Text

[14] 14. Chuang HY, Lee E, Liu YT, et al.: Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007; 3: 140. PubMed Abstract | Publisher Full Text | Free Full Text

[15] 15. Assenov Y, Ramírez F, Schelhorn SE, et al.: Computing topological parameters of biological networks. Bioinformatics. 2008; 24(2): 282–284. PubMed Abstract | Publisher Full Text

[16] 16. Doncheva NT, Assenov Y, Domingues FS, et al.: Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc. 2012; 7(4): 670–85. PubMed Abstract | Publisher Full Text

[17] 17. Scardoni G, Tosadori G, Faizan M, et al.: Biological network analysis with CentiScaPe: centralities and experimental dataset integration [version 2; referees: 2 approved]. F1000Research. 2014; 3: 139. PubMed Abstract | Publisher Full Text | Free Full Text

[18] 18. Shannon P, Markiel A, Ozier O, et al.: Cytoscape: A software Environment for integrated models of biomolecular interaction networks. Genome Res. 2003; 13(11): 2498–2504. PubMed Abstract | Publisher Full Text | Free Full Text

[19] 19. Aranda B, Blankenburg H, Kerrien S, et al.: PSICQUIC and PSISCORE: accessing and scoring molecular interactions. Nat Methods. 2011; 8(7): 528–529. PubMed Abstract | Publisher Full Text | Free Full Text

[20] 20. Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995; 57(1): 289–300. Reference Source

[21] 21. Noble WS: How does multiple testing correction work? Nat Biotechnol. 2009; 27(12): 1135–1137. PubMed Abstract | Publisher Full Text | Free Full Text

[22] 22. Hoang LT, Lynn DJ, Henn M, et al.: The early whole-blood transcriptional signature of dengue virus and features associated with progression to dengue shock syndrome in Vietnamese children and young adults. J Virol. 2010; 84(24): 12982–94. PubMed Abstract | Publisher Full Text | Free Full Text

[23] 23. Breuer K, Foroushani AK, Laird MR, et al.: InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res. 2013; 41(Database issue): D1228–1233. PubMed Abstract | Publisher Full Text | Free Full Text

[24] 24. del-Toro N, Dumousseau M, Orchard S, et al.: A new reference implementation of the PSICQUIC web service. Nucleic Acids Res. 2013; 41(Web Server issue): W601–6. PubMed Abstract | Publisher Full Text | Free Full Text

[25] 25. Nasirudeen AM, Wong HH, Thien P, et al.: RIG-I, MDA5 and TLR3 synergistically play an important role in restriction of dengue virus infection. PLoS Negl Trop Dis. 2011; 5(1): e926. PubMed Abstract | Publisher Full Text | Free Full Text

[26] 26. Souza-Neto JA, Sim S, Dimopoulos G: An evolutionary conserved function of the JAK-STAT pathway in anti-dengue defense. Proc Natl Acad Sci U S A. 2009; 106(42): 17841–6. PubMed Abstract | Publisher Full Text | Free Full Text

[27] 27. De La Cruz Hernández SI, Puerta-Guardo H, Flores-Aguilar H, et al.: A strong interferon response correlates with a milder dengue clinical condition. J Clin Virol. 2014; 60(3): 196–199. PubMed Abstract | Publisher Full Text

[28] 28. Morrison J, García-Sastre A: STAT2 signaling and dengue virus infection. JAKSTAT. 2014; 3(1): e27715. PubMed Abstract | Publisher Full Text | Free Full Text

[29] 29. Dai J, Pan W, Wang P: ISG15 facilitates cellular antiviral response to dengue and west nile virus infection in vitro. Virol J. 2011; 8: 468. PubMed Abstract | Publisher Full Text | Free Full Text

[30] 30. Muetze T, Goenawan IH, Wiencko HL, et al.: Dataset 1 in: Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks. F1000Research. 2016. Data Source

[31] 31. Muetze T, Goenawan IH, Wiencko HL, et al.: Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks. Zenodo. 2016. Data Source

Contextual Hub Analysis Tool (CHAT): A Cytoscape app for identifying contextually relevant hubs in biological networks

Abstract

Keywords

Revised Amendments from Version 1

Introduction

Methods

Implementation

Operation

Figure 1. CHAT network analysis.

Figure 2. Network visualization.

Use case

Figure 3. Visualization of a Dengue gene expression dataset.

Conclusion

Data availability

Software availability

Author contributions

Competing interests

Grant information

Supplementary material

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

The problem

How to fix it

Competing Interests Policy

Stay Updated