Tools for visualization and analysis of molecular networks, pathways, and -omics data

Jose M Villaveces; Prasanna Koti; Bianca H Habermann

doi:10.2147/AABC.S63534

Back to Journals » Advances and Applications in Bioinformatics and Chemistry » Volume 8

Review

Tools for visualization and analysis of molecular networks, pathways, and -omics data

Authors Villaveces J, Koti P, Habermann B

Received 27 January 2015

Accepted for publication 1 April 2015

Published 4 June 2015 Volume 2015:8 Pages 11—22

DOI https://doi.org/10.2147/AABC.S63534

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Dr Juan Fernandez-Recio

Download Article [PDF]

Jose M Villaveces, Prasanna Koti, Bianca H Habermann

Max Planck Institute of Biochemistry, Research Group Computational Biology, Martinsried, Germany

Abstract: Biological pathways have become the standard way to represent the coordinated reactions and actions of a series of molecules in a cell. A series of interconnected pathways is referred to as a biological network, which denotes a more holistic view on the entanglement of cellular reactions. Biological pathways and networks are not only an appropriate approach to visualize molecular reactions. They have also become one leading method in -omics data analysis and visualization. Here, we review a set of pathway and network visualization and analysis methods and take a look at potential future developments in the field.

Keywords: biological networks, reactions, proteins, genes, signaling, protein-protein interactions, organisms

Introduction

The study of biological pathways is a key to understand the different processes inside a cell: proteins exert their function not in isolation but in a tightly controlled network of interactions and reactions. Activation of a pathway typically leads to a change of state in the cell: as examples, a set of genes is transcribed, new molecules are assembled, molecules are secreted, or new signals are recognized. Pathways come in different flavors, depending on their functions in the cell – the three main types are metabolic pathways, gene regulatory pathways, and signaling pathways.

Biological pathways are generally derived from experimental work in cell culture or model organisms. Many different studies are collected and integrated over time by initiatives like KEGG,¹ Reactome,² PANTHER pathway,³ BioCyc,⁴ or the user-curated WikiPathways⁵ (Table 1). The KEGG pathway database was first published in the year 1997 and has since been updated regularly. It is to our knowledge the largest pathway resource, with metabolic and signaling pathways from a pleiotropy of organisms. Reactome has pathways for a selected number of organisms, representing mostly the commonly used model organisms. BioCyc has a large collections of metabolic and regulatory pathways for ~5,500 organisms, with components such as EcoCyc and MetaCyc,⁶ or HumanCyc. PANTHER pathway is a rather small resource, with currently 176 primarily signaling pathways. PANTHER pathway, however, encourages user-curation and user-creation of novel pathways. WikiPathways works under the same principles: users can contribute to the open-source pathway resource by suggesting new pathways to be added or by creating novel pathways. WikiPathways offers a good selection of organisms. Its largest organism-specific pathway resource is the one for Homo sapiens, with currently 633 available pathways.

Table 1 Pathway resources
Abbreviations: BioPAX, Biological Pathway Exchange; KGML, KEGG Markup Language; PSI-MI, Proteomics Standards Initiative Molecular Interaction; SBML, Systems Biology Markup Language; NCI, National Cancer Institute; INOH, Integrating Network Objects with Hierarchies; PharmGKB, Pharmacogenomics Knowledge Base; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Defining novel pathways is however a very labor- and time-intensive task. Furthermore, we cannot assume that currently existing pathway definitions are complete: over time, it has rather become clear that pathways are much more complicated than initially believed. They form a tightly regulated network of reactions, with many cross-reactions between our simpler definitions of individual biological pathways.

A possible solution to circumvent incomplete pathway definitions is the usage of curated protein–protein interaction networks. Biological networks typically contain data from protein interaction studies in addition to curated biological pathways. They are believed to represent a more complete view of the complex, biological network within a cell. Integrated biological network resources include, for instance, Pathway Commons,⁷ ConsensusPathDB,⁸ or similar meta-databases. However, interaction networks do typically not provide information on known biological pathways. Therefore, either the network has to be enriched for biological pathways or its structure has to be analyzed for more tightly connected node clusters.

Pathways are a fundamental part of interpreting -omics data, as they provide the biological context for a given observation. Hardly any -omics study is published nowadays without integrating the data with pathway or network information. A general application of biological pathways or networks in -omics data analysis is to integrate them with numerical data from an -omics screen. To select relevant pathways for visualization, users commonly perform pathway enrichment analysis on their hit lists.

The integrative analysis of -omics data, however, increases the complexity, since it adds different dimensions to the data. Moreover, integrating -omics data boosts data size. With growing complexity emerges the problem on how to make the data digestible for the human mind. Visual reprocessing is still one of the most forward ways to comprehend highly complex data, and thus the importance of scientific visualization, since it enhances our understanding of biological data.

A vast variety of different pathway and network visualization and analysis tools exist. In this review, we introduce a few concepts and tools that are in our view useful for analyzing and visualizing pathway and molecular network data. We further discuss general problems and future directions of biological data visualization.

Pathway-based visualization and analysis

In principle, two categories of pathway visualizations exist: first, those that allow a simple and static representation of pathways (such as static images or pdf files), which many pathway resources already offer. Though practical for providing pathway information “at a glance”, their use is limited for -omics data analysis.

The second type of pathway visualization allows users to integrate different data types with pathways by rendering user data on top of pathway maps (Table 2). The KEGG pathway web interface (http://www.genome.jp/kegg/tool/map_pathway2.html),⁹ provides the possibility to color code individual nodes within a pathway. The TrED database,¹⁰ for instance, has implemented the KEGG Mapper in its database to visualize metabolic pathways.

Table 2 Pathway and network visualization and analysis software
Abbreviations: KEGG, Kyoto Encyclopedia of Genes and Genomes; BioJS, BioJavaScript; Reactome FI, Reactome Functional Interaction.

The KEGG-based Pathway Visualization Tool for Complex Omics Data¹¹ is another practical, user-friendly method to visualize expression data on KEGG pathways. Its functionality is more complex than the pathway viewer offered by KEGG. Proteins, as well as substrates, can be shaded according to a numerical value for more than one condition. The reader might turn to the website of this visualization tool, http://www.g-language.org/data/marray/,¹¹ where a comprehensive analysis of expression data in Escherichia coli using the software has been performed and which gives a very good impression on the software’s potential.

MEGU is a web-based visualization application that allows users to map multiple layers of -omics data simultaneously.¹² MEGU, in addition, provides integrated pathways by combination of several individual pathways. The MEGU website also demonstrates its usability on a comprehensive analysis of E. coli expression data. Visualization is, in this case, more plastic; however it is also more crowded, as several gene names are sometimes provided per node.

BioCyc⁴ offers a collection of pathways in a variety of organisms for analyzing and visualizing metabolic pathways. It allows the user to overlay experimental data on a pathway map by uploading a file or importing datasets from different databases. BioCyc offers in general a comprehensive tool set for genome analysis and comparison, which goes beyond most of the other pathway resources.

Pathway Projector (http://ws.g-language.org/g4/),¹³ another web-based visualization software, allows users to explore the network of biological pathways in a cell: the user can zoom into the network, render pathways with experimental data, or predict the enriched pathways for a set of user-provided genes. Pathway Projector was, for instance, used to visualize metabolic changes upon nitrogen deficiency of the oil-rich algae Pseudochoricystis ellipsoidea.¹⁴

Finally, with KEGGViewer,¹⁵ users can integrate expression data with KEGG pathways and create a web-based visualization using the open-source JavaScript library BioJS.¹⁶ KEGGViewer is purely web based. Visualization of differential expression can be done for a series of experiments or time points, as the tool enables users to loop over several uploaded conditions. It is so far the only web-based viewer that allows an animation of differential expression events overlaid on pathways.

Further examples include the stand-alone application PathVisio¹⁷ and the web front end of Reactome (http://www.reactome.org/PathwayBrowser/#TOOL=AT),² where the user can upload expression data and color a target pathway accordingly. Furthermore, PathVisio offers the possibility to fully customize the looks of a given pathway. PathVisio was, for instance, applied by Fijten et al¹⁸ to construct a customized, liver-specific ligand-activated nuclear receptor pathway. The R-package Pathview¹⁹ finally has a broader approach and creates pathway visualizations from additional data types like genomic variation, literature record, and metabolite level. Pathview has, for instance, been applied by Arthur et al,²⁰ for pathway mapping or by Gorvel et al,²¹ to render pathways using experimental data. Pathview depends on KEGG for pathway information. PathVisio works with WikiPathways, and for human, also with Reactome data.

Besides permitting users to make queries based on formatted text files, the web tool iPath²² and iPath2²³ allows users to define visual properties like color or transparency for various nodes, similar to PathVisio. iPath/iPath2 has been extensively used for customized pathway rendering and visualization of experimental data and can conveniently be employed to visualize differential usage of pathways in a global pathway map (for typical applications of iPath2, see, for instance, Navid and Almaas²⁴ and Erickson et al²⁵). Wisecaver et al²⁶ used iPath2 for comparative pathway analysis, and Creek et al²⁷ used the software to model metabolic data obtained by liquid chromatography–mass spectrometry. Dutilh et al²⁸ used iPath2 for interpreting microbial Genome-Wide Association Studies (GWAS) studies, and Guzman et al²⁹ employed it to map microRNA sequencing data on metabolic maps.

To demonstrate the usability of some of the above-mentioned tools, we selected four software packages that can render expression data on pathways from KEGG, WikiPathways, or Reactome (Figure 1). We used RNA-seq data of colon cancer patients and compared expression values from control and paracancerous and cancer tissue.³⁰ The resulting log 2 fold changes were overlaid on the Glycolysis pathway.

Figure 1 Visualization of differential expression data on the Glycolysis pathway using different software packages.
Notes: (A) KEGG Mapper and KEGGViewer were used to render -omics data on the KEGG Glycolysis pathway. Upregulated genes are shown in yellow, and downregulated genes are highlighted in blue. (B) PathVisio was used to visualize -omics data on the Glycolysis pathway from WikiPathways. Upregulated genes are colored in red shades, and downregulated genes are shown in green shading. PathVisio is able to combine two datasets on the same pathway map. (C) The Reactome analysis toolbox was used to visualize -omics data on the Glycolysis pathway. Upregulated genes are highlighted in yellow, and downregulated genes are highlighted in blue.

In Figure 1A, we used KEGG Mapper and KEGGViewer as web-based visualization tools for KEGG pathways. KEGG Mapper creates a static image, whereas KEGGViewer allows users to loop over different conditions, making it potentially easier to spot differential events between several conditions or a time series. KEGG Mapper on the other hand visualizes the directionality of reactions, which are omitted in KEGGViewer. For metabolic pathways, the labels in KEGG Mapper are by default EC numbers, while KEGGViewer provides gene names. However, the user can interactively use the pathway map in KEGG Mapper to retrieve detailed information on an individual gene by clicking on its node.

The WikiPathways version of the Glycolysis pathway was created using PathVisio (Figure 1B). PathVisio is a very powerful method to visualize differential events in several conditions on the same map. It keeps directionality of reactions and is able to combine more than one pathway, whereby the edges (= reactions) from different pathways are color coded.

Finally, we used the Reactome analysis tool to visualize differential expression of our gene set on the Glycolysis pathway (Figure 1C). Reactome gives quite a different picture of the pathway, embedding it within its neighbors. This extended visualization can be very useful in providing a more complete picture of differential expression events. It, however, can also be distracting, if the pathway map gets too large, and therefore, too crowded.

The handling of KEGG Mapper, as well as Reactome, was very easy and straightforward. The input format is simple, and KEGG Mapper accepts several identifiers; Reactome even does identifier conversion on the fly. Reactome moreover offers a variety of additional information on genes, such as the expression of a gene in different tissues. Both tools are easy to use for beginners and thus represent a good starting point for rendering expression data on pathways. PathVisio and KEGGViewer require more sophisticated handling. PathVisio is a desktop application, and KEGGViewer requires some knowledge of JSON (JavaScript Object Notation) and JavaScript. While the learning curves for the latter programs are steeper, both tools offer useful functionalities, such as looping over conditions (KEGGViewer) or mapping several conditions on the same pathway map (PathVisio).

In general, each pathway visualization tool has advantages and disadvantages. While we give examples only for a very small collection of available pathway mapping tools, we recommend to try out several methods and to regularly look for novel developments in the field of pathway visualization.

It is noteworthy to mention that many software packages for in silico metabolic modeling also allow users to visualize and analyze molecular pathways and networks with -omics data. These include, for instance, SEED (http://seed-viewer.theseed.org/),³¹ and the COBRA project.³² SEED has a web-based user interface and allows to read in SEED models in Cytoscape using the plugin CytoSEED.³³ COBRA can be used via Matlab³⁴ or Python.³⁵ We predict again a steep learning curve, as the tools are tailored for metabolic modeling applications. Depending on the complexity of the research question, however, such tools might offer more functionality.

Molecular networks analysis

Other than pathways, which contain information on directionality of reactions, as well as on small compounds, protein interaction networks are a collection of nodes (proteins) connected to each other (via edges). Biological interaction networks are typically created for one organism and contain information on all known biological interactions within the cell, whereby genetic and physical interactions are collected separately. Biological networks, though being less detailed in information than pathways, are nevertheless useful for analyzing and visualizing -omics data.

The most popular tools for network analysis are Cytoscape³⁶ and Gephi.³⁷ Both are open source and aid network exploration, manipulation, and analysis by providing filtering, layout, coloring, and clustering capabilities.

Cytoscape’s network analysis package³⁸ and Gephi facilitate network analysis by calculating complex network parameters like average clustering coefficients, shortest paths, and node degrees, as well as centrality measures like stress centrality,^39,40 betweenness centrality,³⁹ and closeness centrality.⁴¹ The calculated parameters can be visualized as histograms and mapped as visualization attributes on the network in the form of node size and color.

Cytoscape and Gephi are able to handle diverse datasets, which can be biological or not. However, Cytoscape tends to be biased toward biological data analysis. It is able to fetch data from I) molecular interaction databases, II) Biomart, and III) different ontologies out of the box. Additionally, Cytoscape is extendible via community developed plugins⁴² that range from Gene Ontology enrichment⁴³ to clustering to pathway visualization and analysis. For instance, the Cytoscape plugins WikiPathways,⁴⁴ and CyKEGGParser⁴⁵ parse and visualize WikiPathways and KEGG pathways, respectively. KGMLReader and KEGGScape⁴⁶ are plugins provided by KEGG for pathway import and visualization in Cytoscape versions 2 and 3, respectively. Similarly, the Reactome FI (Functional Interaction) plugin⁴⁷ finds pathways and network patterns related to cancer and other diseases using Reactome data. Its handling is very easy and comparable to the web-based visualization application of Reactome. CluePedia⁴⁸ is a search tool for new markers potentially linked to known pathways. It is one of the tools that offer the potential to extend our existing pathway definitions.

Gephi, on the other hand, tends to focus on network visualization. It is based on the principle What You See Is What You Get. It is therefore able to generate powerful visualizations. Like Cytoscape, Gephi can be extended via plugins.

To illustrate the visualization capabilities of both, Cytoscape and Gephi, we collected molecular interactions from VirHostNet 2.0,⁴⁹ a knowledge base dedicated to network-based exploration of molecular virus–host interactions. We created a network of 7,318 nodes and 296,659 edges composed of human and influenza A virus proteins. We further created a subnetwork with influenza A interactors and their first neighbors, which resulted in a network of 3,362 nodes and 19,352 edges. We applied several force-directed layout algorithms to those networks (Figure 2). We chose force-directed layouts as they rely on the structure of the network and thus do not require domain-specific information. These layout types furthermore are visually more appealing as they reduce edge crossing and reveal symmetries within the graph. To visualize a large network is generally problematic. We found the Fruchterman Reingold layout as the most appealing, as it provides some form of structure for very large networks. All chosen layout options are separating the small network in several, more tightly connected clusters, with the Spring layout performing best with respect to visual separation of subclusters.

Figure 2 Network visualization using Gephi and Cytoscape using different layout algorithms.
Notes: The entire human and influenza A interactomes are shown, as well as the first neighbors of the influenza A proteins. Influenza A proteins are colored in orange, and human nodes are shown in green. The size of the nodes indicates the connectivity of the individual proteins.

While Cytoscape and Gephi are popular tools, there is a broad spectrum of alternatives available. For example, sigma.js⁵⁰ and Cytoscape.js⁵¹ are JavaScript libraries for network visualization on the web. Additionally, software packages such as iGraph⁵² and NetworkX⁵³ further support network analysis and visualization.

Pathway standardization

Pathway resources are annotated and curated separately. It is due to this fact that heterogeneous data models, formats, and application programming interfaces (APIs) are used. This makes the development of pathway visualization and analysis tools as well as the aggregation of pathway data an arduous task. Fortunately, the community using the pathway resources came together and launched the Biological Pathway Exchange (BioPAX) project⁵⁴ with the aim to define a standard for sharing pathway information.

BioPAX supports I) metabolic and signaling pathways, II) molecular and genetic interactions, and III) gene regulation data. It encapsulates the semantics related to pathways by using controlled vocabularies such as the Gene Ontology⁵⁵ and the Proteomics Standards Initiative Molecular Interaction⁵⁶ vocabularies, as well as its own community-defined constructs.

Several tools have been developed to read and process BioPAX files. These include libraries like paxtools,⁵⁷ rBiopaxParser,⁵⁸ and BioPAX-pattern,⁵⁹ and visualization and analysis tools like Cytoscape and cyPath2,⁶⁰ among others.

However useful, the primary goal of BioPAX is not to satisfy data visualization but to coordinate its forces with the Systems Biology Graphical Notation (SBGN, http://www.sbgn.org/),⁶¹ community and to ensure that BioPAX can be visualized through SBGN.

SBGN is a community-developed, standard graphical language for unambiguously representing diverse biochemical and cellular events. One of the key aspects of SBGN is to minimize the effort needed to understand a pathway. It does so by limiting the number of symbols and reusing them when possible. Data providers such as Reactome and PANTHER pathway support SBGN. Additional tools supporting SBGN are listed in http://www.sbgn.org/SBGN_Software.

However, the current standards are still maturing and so not fully adopted. For example, KEGG provides BioPAX-formatted pathways through their FTP server only. Additionally, KEGG does not follow the SBGN specification but rather uses its own graphical representation format KEGG Markup Language (http://www.kegg.jp/kegg/xml/).

Application programming interfaces

An API is a particular set of guidelines and specifications that software applications can follow to communicate with each other. It provides an interface between different applications facilitating their interaction. The combination of different APIs allows developers to create new applications, visualizations, and services^9,44,46,62 with use cases beyond the scope of a given data type or data source.

Most pathway data sources^{1–3,5,7,44,63} provide data access methods such as I) return relevant information about a given pathway (list of participants, list of complexes, etc), II) search for a keyword or pathway in a given database, III) convert identifiers from one format to another, and IV) fetch pathway maps as plain bitmap images or, in some cases, as XML files.

Additionally, a few APIs provide methods for data analysis. For example, through the PANTHER pathway API, it is possible to run an overrepresentation test: by comparing a list of genes to a reference list, the statistical over- or underrepresentation of selected categories is determined (eg, function, process, cellular location, protein class, or pathway). Similarly, Reactome recently developed an API devoted solely to data analysis.

With regard to the retrieval of molecular interaction data, the Proteomics Standard Initiative Common QUery InterfaCe⁶⁴ provides a standardized web service to access multiple interaction resources such as IntAct⁶⁵ and STRING.⁶⁶

Discussion

The amount of data produced by the different -omics fields is rapidly growing, mainly due to the constant development of high-throughput methods. Given the fast rate at which vast amounts of data are produced, the need for new data visualization and analysis methods is undisputed. There are, however, a series of challenges that need to be overcome.

Standard formats allow biological data to be easily exchanged, manipulated, aggregated, and analyzed. Different communities, like the Proteomics Standard Initiative⁶⁷ and BioPAX, are addressing that need, for proteomics and biological pathways, respectively. However, data integration is still challenging. For instance, mapping between different identifiers such as protein, gene, transcript, or clone is perhaps the most common operation a bioinformatician has to perform. Though there are several tools for mapping identifiers,^68–70 significant manual effort is still required. Further development of tools and models able to link and aggregate datasets from various sources and types is crucial to enable detailed analysis and rich visualizations.

Traditionally, networks or graphs are visualized in node-link diagrams, where nodes represent entities and edges represent relationships. Such a representation is intuitive and works well for small networks, but as soon as the network size increases, so does the visualization complexity. Different techniques have been developed to try and enhance such visualizations: node filtering, edge bundling,^71,72 different layout algorithms,^72–75 and edge lenses are currently available in the toolbox. However, the traditional node-link visualization is limited when applied to large networks.

Alternative network visualizations such as matrix diagrams^74,75 and 3D models^76–79 are exiting steps towards a hairball-free visualization and may lead to much more interesting visualization techniques than the traditional node-link diagrams.

Matrix-based visualization

Matrix diagrams^80,81 are based on the adjacency matrix of a given graph. The main advantage of this kind of visualization is that line crossings are impossible, which leads to a clear visualization. However, the effectiveness of a matrix diagram is heavily dependent on the order of rows and columns. Therefore, patterns may be hidden due to a nonoptimal clustering or ordering algorithm.

3D visualization

Adding an additional dimension enhances the understanding of the topology of a large network. It increases the possibilities of layout elements avoiding undesirable visualization objects, like line crossings. 3D visualizations have been used to analyze biological networks^72,73 (aided by BioLayout Express3D) as well as biological pathways.^78,79

Virtual reality

Virtual reality offers the possibility of fully exploring complex datasets by using powerful interactive visualizations. For example, Redgraph⁸² allows users to visualize semantic social networks in a virtual reality-based environment supported by the Computer-Assisted Virtual Environment⁸³ (CAVE). Users can interact with the visualization in order to contextualize graph nodes by clicking on them in order to fetch relevant data.

Other interesting CAVE visualizations permit users to see the world at the molecular level. For example, the gold particle visualization explores the self-assembly of a ligated gold nanoparticle and proteins inside an ionic solution, while another initiative explores a glass fissure computed in nanoscale simulation. Similar approaches could be directed toward the exploration and understanding of diverse biological datasets, such as pathways, gene regulatory networks, and molecular interactions.

Acknowledgments

JMV and PK were supported by BMBF-Project 0315759 (The Virtual Liver Network, http://www.virtual-liver.de/). This work was supported by the Max Planck Society.

Author contributions

PK analyzed the RNA-seq data. JMV and BHH wrote this manuscript. All authors contributed toward data analysis, drafting and revising the paper and agree to be accountable for all aspects of the work.

Disclosure

The authors report no conflicts of interest in this work.

References

1.	Goto S, Bono H, Ogata H, et al. Organizing and computing metabolic pathway data in terms of binary relations. Pac Symp Biocomput. 1997: 175–186.
2.	Joshi-Tope G, Gillespie M, Vastrik I, et al. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 2005;33(Database issue):D428–D432.
3.	Mi H, Thomas P. PANTHER pathway: an ontology-based pathway database coupled with data analysis tools. Methods Mol Biol. 2009;563: 123–140.
4.	Caspi R, Altman T, Billington R, et al. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2014;42(Database issue):D459–D471.
5.	Pico AR, Kelder T, van Iersel MP, Hanspers K, Conklin BR, Evelo C. WikiPathways: pathway editing for the people. PLoS Biol. 2008;6(7):e184.
6.	Karp PD, Riley M, Saier M, Paulsen IT, Paley SM, Pellegrini-Toole A. The EcoCyc and MetaCyc databases. Nucleic Acids Res. 2000;28(1):56–59.
7.	Cerami EG, Gross BE, Demir E, et al. Pathway commons, a web resource for biological pathway data. Nucleic Acids Res. 2011;39(Database issue):D685–D690.
8.	Kamburov A, Wierling C, Lehrach H, Herwig R. ConsensusPathDB – a database for integrating human functional interaction networks. Nucleic Acids Res. 2009;37(Database issue):D623–D628.
9.	Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012;40(Database issue):D109–D114.
10.	Yang J, Chen L, Wang L, Zhang W, Liu T, Jin Q. TrED: the Trichophyton rubrum expression database. BMC Genomics. 2007;8(1):250.
11.	Arakawa K, Kono N, Yamada Y, Mori H, Tomita M. KEGG-based pathway visualization tool for complex omics data. In Silico Biol. 2005; 5(4):419–423.
12.	Kono N, Arakawa K, Tomita M. MEGU: pathway mapping web-service based on KEGG and SVG. In Silico Biol. 2006;6(6):621–625.
13.	Kono N, Arakawa K, Ogawa R, et al. Pathway projector: web-based zoomable pathway browser using KEGG atlas and Google Maps API. PLoS One. 2009;4(11):e7710.
14.	Ito T, Tanaka M, Shinkawa H, et al. Metabolic and morphological changes of an oil accumulating trebouxiophycean alga in nitrogen-deficient conditions. Metabolomics. 2013;9(Suppl 1):178–187.
15.	Villaveces JM, Jimenez RC, Habermann BH. KEGGViewer, a BioJS component to visualize KEGG pathways. F1000Res. 2014;3:43.
16.	Gómez J, García LJ, Salazar GA, et al. BioJS: an open source JavaScript framework for biological data visualization. Bioinformatics. 2013;29(8):1103–1104.
17.	van Iersel MP, Kelder T, Pico AR, et al. Presenting and exploring biological pathways with PathVisio. BMC Bioinformatics. 2008;9:399.
18.	Fijten RR, Jennen DG, van Delft JH. Pathways for ligand activated nuclear receptors to unravel the genomic responses induced by hepatotoxicants. Curr Drug Metab. 2013;14(10):1022–1028.
19.	Luo W, Brouwer C. Pathview: an R/Bioconductor package for pathway-based data integration and visualization. Bioinformatics. 2013;29(14):1830–1831.
20.	Arthur JC, Gharaibeh RZ, Mühlbauer M, et al. Microbial genomic analysis reveals the essential role of inflammation in bacteria-induced colorectal cancer. Nat Commun. 2014;5:4724.
21.	Gorvel L, Textoris J, Banchereau R, et al. Intracellular bacteria interfere with dendritic cell functions: role of the type I interferon pathway. PLoS One. 2014;9(6):e99420.
22.	Letunic I, Yamada T, Kanehisa M, Bork P. iPath: interactive exploration of biochemical pathways and networks. Trends Biochem Sci. 2008; 33(3):101–103.
23.	Yamada T, Letunic I, Okuda S, Kanehisa M, Bork P. iPath2.0: interactive pathway explorer. Nucleic Acids Res. 2011;39 (Web Server issue):W412–W415.
24.	Navid A, Almaas E. Genome-level transcription data of Yersinia pestis analyzed with a new metabolic constraint-based approach. BMC Syst Biol. 2012;6(1):150.
25.	Erickson AR, Cantarel BL, Lamendella R, et al. Integrated metagenomics/ metaproteomics reveals human host-microbiota signatures of Crohn’s disease. PLoS One. 2012;7(11):e49138.
26.	Wisecaver JH, Slot JC, Rokas A. The evolution of fungal metabolic pathways. PLoS Genet. 2014;10(12):e1004816.
27.	Creek DJ, Chokkathukalam A, Jankevics A, Burgess KE, Breitling R, Barrett MP. Stable isotope-assisted metabolomics for network-wide metabolic pathway elucidation. Anal Chem. 2012;84(20):8442–8447.
28.	Dutilh BE, Backus L, Edwards RA, Wels M, Bayjanov JR, van Hijum SA. Explaining microbial phenotypes on a genomic scale: GWAS for microbes. Brief Funct Genomics. 2013;12(4):366–380.
29.	Guzman F, Almerão MP, Körbes AP, Loss-Morais G, Margis R. Identification of microRNAs from Eugenia uniflora by high-throughput sequencing and bioinformatics analysis. PLoS One. 2012;7(11):e49811.
30.	Wu Y, Wang X, Wu F, et al. Transcriptome profiling of the cancer, adjacent non-tumor and distant normal tissues from a colorectal cancer patient by deep sequencing. PloS One. 2012;7(8):e41001.
31.	Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010;28(9):977–982. Accessed March 31, 2015.
32.	The openCOBRA Project [homepage on the Internet]. Available from: http://opencobra.sourceforge.net/openCOBRA/Welcome.html. Accessed.
33.	DeJongh M, Bockstege B, Frybarger P, Hazekamp N, Kammeraad J, McGeehan T. CytoSEED: a Cytoscape plugin for viewing, manipulating and analyzing metabolic models created by the Model SEED. Bioinformatics. 2012;28(6):891–892.
34.	Schellenberger J, Que R, Fleming RM, et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc. 2011;6(9):1290–1307.
35.	Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: constraints-based reconstruction and analysis for python. BMC Syst Biol. 2013;7(1):74.
36.	Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–2504.
37.	Bastian M, Heymann S, Jacomy M. Gephi: an open source software for exploring and manipulating networks. In: International AAAI Conference on Weblogs and Social Media; Palo Alto, California; 2009.
38.	Doncheva NT, Assenov Y, Domingues FS, Albrecht M. Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc. 2012;7(4):670–685.
39.	Brandes U. A faster algorithm for betweenness centrality. J Math Sociol. 2001;25(2):163–177.
40.	Shimbel A. Structural parameters of communication networks. Bull Math Biol. 1953;15(4):501–507.
41.	Newman MEJ. A measure of betweenness centrality based on random walks. Soc Networks. 2005;27(1):39–54.
42.	Saito R, Smoot ME, Ono K, et al. A travel guide to Cytoscape plugins. Nat Methods. 2012;9(11):1069–1076.
43.	Maere S, Heymans K, Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005;21(16):3448–3449.
44.	Kutmon M, Lotia S, Evelo CT, Pico AR. WikiPathways App for Cytoscape: making biological pathways amenable to network analysis and visualization. F1000Res. 2014;3:152.
45.	Nersisyan L, Samsonyan R, Arakelyan A. CyKEGGParser: tailoring KEGG pathways to fit into systems biology analysis workflows. F1000Res. 2014;3:145.
46.	Nishida K, Ono K, Kanaya S, Takahashi K. KEGGscape: a Cytoscape app for pathway data integration. F1000Res. 2014;3:144.
47.	Wu G, Stein L. A network module-based method for identifying cancer prognostic signatures. Genome Biol. 2012;13(12):R112.
48.	Bindea G, Galon J, Mlecnik B. CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics. 2013;29(5):661–663.
49.	Guirimand T, Delmotte S, Navratil V. VirHostNet 2.0: surfing on the web of virus/host molecular interactions data. Nucleic Acids Res. 2015;43(Database issue):D583–D587.
50.	Sigma.js [homepage on the Internet]. Available from: http://sigmajs.org/. Accessed March 31, 2015.
51.	Cytoscape.js [homepage on the Internet]. Available from: https://github.com/cytoscape/cytoscape.js. Accessed March 31, 2015.
52.	Csardi G, Nepusz T. The igraph software package for complex network research. Inter Journal. 2006;Complex Systems:1695.
53.	Hagberg AA, Schult DA, Swart PJ. Exploring network structure, dynamics, and function using NetworkX. In: Proceedings of the 7th Python in Science Conference;2008;Pasadena, CA USA:11–15.
54.	Demir E, Cary MP, Paley S, et al. The BioPAX community standard for pathway data sharing. Nat Biotechnol. 2010;28(9):935–942.
55.	Ashburner M, Ball CA, Blake JA, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–29.
56.	Kerrien S, Orchard S, Montecchi-Palazzi L, et al. Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol. 2007;5:44.
57.	Demir E, Babur O, Rodchenkov I, et al. Using biological pathway data with paxtools. PLoS Comput Biol. 2013;9(9):e1003194.
58.	Kramer F, Bayerlová M, Klemm F, Bleckmann A, Beissbarth T. rBiopaxParser – an R package to parse, modify and visualize BioPAX data. Bioinformatics. 2013;29(4):520–522.
59.	Babur O, Aksoy BA, Rodchenkov I, Sümer SO, Sander C, Demir E. Pattern search in BioPAX models. Bioinformatics. 2014;30(1):139–140.
60.	Cytoscape App Store [homepage on the Internet]. Available from: http://apps.cytoscape.org/apps/cypath2. Accessed .
61.	Le Novère N, Hucka M, Mi H, et al. The systems biology graphical notation. Nat Biotechnol. 2009;27(8):735–741. Accessed March 31, 2015.
62.	Wu G, Feng X, Stein L. A human functional protein interaction network and its application to cancer data analysis. Genome Biol. 2010; 11(5):R53.
63.	Schaefer CF, Anthony K, Krupa S, et al. PID: the pathway interaction database. Nucleic Acids Res. 2009;37(Database issue):D674–D679.
64.	Aranda B, Blankenburg H, Kerrien S, et al. PSICQUIC and PSISCORE: accessing and scoring molecular interactions. Nat Methods. 2011;8(7):528–529.
65.	Aranda B, Achuthan P, Alam-Faruque Y, et al. The IntAct molecular interaction database in 2010. Nucleic Acids Res. 2010;38(Database issue):D525–D531.
66.	von Mering C, Jensen LJ, Snel B, et al. STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic Acids Res. 2005;33(Database issue):D433–D437.
67.	Omenn GS, States DJ, Adamski M, et al. Overview of the HUPO plasma proteome project: results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database. Proteomics. 2005; 5(13):3226–3245.
68.	Côté RG, Jones P, Martens L, et al. The protein identifier cross-referencing (PICR) service: reconciling protein identifiers across multiple source databases. BMC Bioinformatics. 2007;8:401.
69.	UniProt Consortium. The universal protein resource (UniProt) in 2010. Nucleic Acids Res. 2010;38(Database):D142–D148.
70.	Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.
71.	Zhou H, Xu P, Yuan X, Qu H. Edge bundling in information visualization. Tsinghua Sci Technol. 2013;18(2):145–156.
72.	Holten D, van Wijk JJ. Force-directed edge bundling for graph visualization. Comput Graph Forum. 2009;28(3):983–990.
73.	Yee K-P, Fisher D, Dhamija R, Hearst M. Animated exploration of dynamic graphs with radial layout. In: IEEE Infovis Symposium; 2001; San Diego:43–50.
74.	Dwyer T. Scalable, versatile and simple constrained graph layout. Comput Graph Forum. 2009;1:991–998.
75.	Krzywinski M, Birol I, Jones SJ, Marra MA. Hive plots – rational approach to visualizing networks. Brief Bioinform. 2012;13(5):627–644.
76.	Freeman TC, Goldovsky L, Brosch M, et al. Construction, visualisation, and clustering of transcription networks from microarray expression data. PLoS Comput Biol. 2007;3(10):2032–2042.
77.	Theocharidis A, van Dongen S, Enright AJ, Freeman TC. Network visualization and analysis of gene expression data using BioLayout Express(3D). Nat Protoc. 2009;4(10):1535–1550.
78.	Qeli E, Wiechert W, Freisieben B. 3D visualization and animation of metabolic networks. In: 18th European Simulation; 2004; Magdeburg.
79.	Rojdestvenski I. Metabolic pathways in three dimensions. Bioinformatics. 2003;19(18):2436–2441.
80.	Chen C-H, Härdle WK, Unwin A. Handbook of Data Visualization. Berlin, HD: Springer Science and Business Media; 2007.
81.	Bae J, Watson B. Developing and evaluating quilts for the depiction of large layered graphs. IEEE Trans Vis Comput Graph. 2011;17(12):2268–2275.
82.	Halpin H, Zielinski DJ, Brady R, Kelly G. Exploring semantic social networks using virtual reality. In: Sheth A, Staab S, Dean M, et al., editors. The Semantic Web – ISWC 2008. Berlin, HD: Springer; 2008: 599–614.
83.	Cruz-Neira C, Sandin DJ, DeFanti TA. Surround-Screen Projection-Based Virtual Reality: The Design and Implementation of the CAVE. New York, NY: ACM; 1993:135–142.
84.	Yamamoto S, Sakai N, Nakamura H, Fukagawa H, Fukuda K, Takagi T. INOH: ontology-based highly structured database of signal transduction pathways. Database (Oxford). 2011;2011:bar052.
85.	Kandasamy K, Mohan SS, Raju R, et al. NetPath: a public resource of curated signal transduction pathways. Genome Biol. 2010;11(1):R3.
86.	Whirl-Carrillo M, McDonagh EM, Hebert JM, et al. Pharmacogenomics knowledge for personalized medicine. Clin Pharmacol Ther. 2012; 92(4):414–417.

Creative Commons License © 2015 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.

Download Article [PDF]