Abstract
Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout







Similar content being viewed by others
References
Kislinger, T. et al. Global survey of organ and organelle protein expression in mouse: combined proteomic and transcriptomic profiling. Cell 125, 173–186 (2006).
Galbraith, D.W. & Birnbaum, K. Global studies of cell type-specific gene expression in plants. Annu. Rev. Plant Biol. 57, 451–475 (2006).
Butcher, E.C., Berg, E.L. & Kunkel, E.J. Systems biology in drug discovery. Nat. Biotechnol. 22, 1253–1259 (2004).
Bader, G.D. et al. Functional genomics and proteomics: charting a multidimensional map of the yeast cell. Trends Cell Biol. 13, 344–356 (2003).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Doniger, S.W. et al. MAPPFinder: using gene ontology and GenMAPP to create a global gene-expression profile from microarray data. Genome Biol. 4, R7 (2003).
Maere, S., Heymans, K. & Kuiper, M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21, 3448–3449 (2005).
Zeeberg, B.R. et al. GoMiner: a resource for biological interpretation of genomic and proteomic data. Genome Biol. 4, R28 (2003).
Quackenbush, J. Weighing our measures of gene expression. Mol. Syst. Biol. 2, 63 (2006).
Sprinzak, E., Sattath, S. & Margalit, H. How reliable are experimental protein–protein interaction data? J. Mol. Biol. 327, 919–923 (2003).
D'Haeseleer, P. & Church, G.M. Estimating and improving protein interaction error rates. Proc. IEEE Comput. Syst. Bioinform. Conf. 216–223 (2004).
Singhal, M. & Domico, K. CABIN: collective analysis of biological interaction networks. Comput. Biol. Chem. 31, 222–225 (2007).
Barrett, T. & Edgar, R. Gene expression omnibus: microarray data storage, submission, retrieval, and analysis. Methods Enzymol. 411, 352–369 (2006).
Parkinson, H. et al. ArrayExpress—a public database of microarray experiments and gene expression profiles. Nucleic Acids Res. 35, D747–D750 (2007).
Hermjakob, H. et al. IntAct: an open source molecular interaction database. Nucleic Acids Res. 32, D452–D455 (2004).
Peri, S. et al. Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res. 32, D497–D501 (2004).
Zanzoni, A. et al. MINT: a Molecular INTeraction database. FEBS Lett. 513, 135–140 (2002).
Bader, G.D., Cary, M.P. & Sander, C. Pathguide: a pathway resource list. Nucleic Acids Res. 34, D504–D506 (2006).
Johnson, D.S., Mortazavi, A., Myers, R.M. & Wold, B. Genome-wide mapping of in vivo protein–DNA interactions. Science 316, 1497–1502 (2007).
Krallinger, M. & Valencia, A. Text-mining and information-retrieval services for molecular biology. Genome Biol. 6, 224 (2005).
Vailaya, A. et al. An architecture for biological information extraction and representation. Bioinformatics 21, 430–438 (2005).
Mishra, G.R. et al. Human protein reference database—2006 update. Nucleic Acids Res. 34, D411–D414 (2006).
Breitkreutz, B.J., Stark, C. & Tyers, M. Osprey: a network visualization system. Genome Biol. 4, R22 (2003).
Hu, Z. et al. VisANT: data-integrating visual framework for biological networks and modules. Nucleic Acids Res. 33, W352–W357 (2005).
Funahashi, A., Morohashi, M., Kitano, H. & Tanimura, N. CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. Biosilico 1, 159–162 (2004).
Dahlquist, K.D., Salomonis, N., Vranizan, K., Lawlor, S.C. & Conklin, B.R. GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat. Genet. 31, 19–20 (2002).
Aragues, R., Jaeggi, D. & Oliva, B. PIANA: protein interactions and network analysis. Bioinformatics 22, 1015–1017 (2006).
Iragne, F., Nikolski, M., Mathieu, B., Auber, D. & Sherman, D. ProViz: protein interaction visualization and exploration. Bioinformatics 21, 272–274 (2005).
Goldovsky, L., Cases, I., Enright, A.J. & Ouzounis, C.A. BioLayout(Java): versatile network visualisation of structural and functional relationships. Appl. Bioinformatics 4, 71–74 (2005).
Demir, E. et al. PATIKA: an integrated visual environment for collaborative construction and analysis of cellular pathways. Bioinformatics 18, 996–1003 (2002).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Avila-Campillo, I., Drew, K., Lin, J., Reiss, D.J. & Bonneau, R. BioNetBuilder: automatic integration of biological networks. Bioinformatics 23, 392–393 (2007).
Cerami, E.G., Bader, G.D., Gross, B.E. & Sander, C. cPath: open source software for collecting, storing, and querying biological pathways. BMC Bioinformatics 7, 497 (2006).
Jayapandian, M. et al. Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together. Nucleic Acids Res. 35, D566–D571 (2007).
Salwinski, L. & Eisenberg, D. The MiSink Plugin: Cytoscape as a graphical interface to the database of interacting proteins. (2007).
Garcia, O. et al. GOlorize: a Cytoscape plug-in for network visualization with gene ontology-based layout and coloring. Bioinformatics 23, 394–396 (2007).
Barsky, A., Gardy, J.L., Hancock, R.E. & Munzner, T. Cerebral: a Cytoscape plugin for layout of and interaction with biological networks using subcellular localization annotation 23, 1040–1042 (2007).
Yip, K.Y., Yu, H., Kim, P.M., Schultz, M. & Gerstein, M. The tYNA platform for comparative interactomics: a web tool for managing, comparing and mining multiple networks. Bioinformatics 22, 2968–2970 (2006).
Ferro, A. et al. NetMatch: a Cytoscape plugin for searching biological networks. Bioinformatics 23, 910–912 (2007).
Vlasblom, J. et al. GenePro: a Cytoscape plug-in for advanced visualization and analysis of interaction networks. Bioinformatics 22, 2178–2179 (2006).
Luo, F. et al. Modular organization of protein interaction networks. Bioinformatics 23, 207–214 (2007).
Ideker, T., Ozier, O., Schwikowski, B. & Siegel, A.F. Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics 18 (Suppl 1): S233–S240 (2002).
Bader, G.D. & Hogue, C.W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 4, 2 (2003).
Albrecht, M., Huthmacher, C., Tosatto, S.C. & Lengauer, T. Decomposing protein networks into domain–domain interactions. Bioinformatics 21 (Suppl 2): ii220–ii221 (2005).
Tong, A.H. et al. Global mapping of the yeast genetic interaction network. Science, 303, 808–813 (2004).
Pan, X. et al. A DNA integrity network in the yeast Saccharomyces cerevisiae. Cell 124, 1069–1081 (2006).
Kelley, R. & Ideker, T. Systematic interpretation of genetic interactions using protein networks. Nat. Biotechnol. 23, 561–566 (2005).
Drees, B.L. et al. Derivation of genetic interaction networks from quantitative phenotype data. Genome Biol. 6, R38 (2005).
Gilchrist, M. et al. Systems biology approaches identify ATF3 as a negative regulator of Toll-like receptor 4. Nature 441, 173–178 (2006).
Yeang, C.H. et al. Validation and refinement of gene-regulatory pathways on a network of physical interactions. Genome Biol. 6, R62 (2005).
Krogan, N.J. et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440, 637–643 (2006).
Rhodes, D.R. et al. Probabilistic model of the human protein-protein interaction network. Nat. Biotechnol. 23, 951–959 (2005).
Gutierrez, R.A. et al. Qualitative network models and genome-wide expression data define carbon/nitrogen-responsive molecular machines in Arabidopsis. Genome Biol. 8, R7 (2007).
Gunsalus, K.C. et al. Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis. Nature 436, 861–865 (2005).
Sharan, R. et al. Conserved patterns of protein interaction in multiple species. Proc. Natl. Acad. Sci. USA 102, 1974–1979 (2005).
King, J.Y. et al. Pathway analysis of coronary atherosclerosis. Physiol. Genomics 23, 103–118 (2005).
Ideker, T. et al. Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. Science 292, 929–934 (2001).
Wixon, J. & Kell, D. The Kyoto encyclopedia of genes and genomes—KEGG. Yeast 17, 48–55 (2000).
Joshi-Tope, G. et al. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 33, D428–D432 (2005).
Stromback, L., Jakoniene, V., Tan, H. & Lambrix, P. Representing, storing and accessing molecular interaction data: a review of models and tools. Brief Bioinform. 7, 331–338 (2006).
Wuchty, S., Barabasi, A.L. & Ferdig, M.T. Stable evolutionary signal in a yeast protein interaction network. BMC Evol. Biol. 6, 8 (2006).
Yu, H., Kim, P.M., Sprecher, E., Trifonov, V. & Gerstein, M. The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics. PLoS Comput. Biol. 3, e59 (2007).
Ghaemmaghami, S. et al. Global analysis of protein expression in yeast. Nature 425, 737–741 (2003).
Margolin, A.A. et al. Reverse engineering cellular networks. Nat. Protoc. 1, 662–671 (2006).
Fu, J., Swertz, M.A., Keurentjes, J.J. & Jansen, R.C. MetaNetwork: a computational protocol for the genetic study of metabolic networks. Nat. Protoc. 2, 685–694 (2007).
Eyre, T. et al. The HUGO gene nomenclature database, 2006 updates. Nucleic Acids Res. 1, D319–D321 (2006).
Di Battista, G., Eades, P., Tamassia, R. & Tollis, I. Graph Drawing: Algorithms for the Visualization of Graphs (Prentice-Hall, Upper Saddle River, NJ, USA, 1999).
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat. Genet. 25, 25–29 (2000).
Krishnamurthy, N., Brown, D.P., Kirshner, D. & Sjolander, K. PhyloFacts: an online structural phylogenomic encyclopedia for protein functional and structural classification. Genome Biol. 7, R83 (2006).
Deluca, T.F. et al. Roundup: a multi-genome repository of orthologs and evolutionary distances. Bioinformatics 22, 2044–2046 (2006).
Hermjakob, H. et al. The HUPO PSI's molecular interaction format—a community standard for the representation of protein interaction data. Nat. Biotechnol. 22, 177–183 (2004).
de Lichtenberg, U., Jensen, L.J., Brunak, S. & Bork, P. Dynamic complex formation during the yeast cell cycle. Science 307, 724–727 (2005).
Irizarry, R.A. et al. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 31, e15 (2003).
Acknowledgements
Many research groups have developed plugins to Cytoscape and provided them for download free of charge from http://www.cytoscape.org/. These plugins represent key contributions to the overall utility of Cytoscape, and we gratefully thank the authors for their contributions. Cytoscape is developed through an ongoing collaboration between the University of California at San Diego, the University of Toronto, the Institute for Systems Biology, Memorial Sloan-Kettering Cancer Center, Institut Pasteur, Agilent Technologies and the University of California at San Francisco. Many developers have contributed to Cytoscape, and we gratefully acknowledge the contributions of former developers including Nada Amin, Mark Anderson, Richard Bonneau, Larissa Kamenkovich, Andrew Markiel, Owen Ozier, Paul Shannon, Robert Sheridan and Jonathan Wang. We thank Tero Aittikalio and Cricket Sloan for assistance with the manuscript. Funding for Cytoscape is provided by the US National Institute of General Medical Sciences of the National Institutes of Health under award number GM070743-01. Corporate funding is provided through a contract from Unilever PLC. Cytoscape contributions by G.D.B. were funded in part by Genome Canada. The BiNGO plugin was developed at the Department of Plant Systems Biology at the University of Ghent, with partial funding through the Research Foundation Flanders in Belgium.
Author information
Authors and Affiliations
Contributions
Cytoscape development is a large community effort, with principal efforts under the direction of A.A. at Agilent Technologies; I.S. and L.H. at the Institute for Systems Biology; C.S. at Memorial Sloan Kettering Cancer Center; B.S. at Institut Pasteur; G.J.W. at Unilever; T.I. at University of California, San Diego; B.R.C. at University of California, San Francisco; and G.D.B. at University of Toronto. Every author has made significant contributions to the software, without which this protocol would not be possible. Specific contributions are as follows. E.C., B.G., G.D.B. and C.S. developed the cPath plugin. A.K., A.V. and M.C. developed the Agilent Literature Search plugin. S.M., R.I. and M.K. developed the BiNGO plugin. M.S., T.I. and R.K. developed the jActiveModules plugin. V.P., G.D.B. and C.S. developed the MCODE plugin. N.L., R.C., I.A.C., S.K., S.L., M.S., K.O. and P.-L.W. developed much of the improvements to the Cytoscape v2.5 core, including automatic loading of plugins and the VizMapper. K.H., J.M. and A.R.P. were instrumental in the usability analysis, specification and prototyping of many of these features. This protocol was written by M.S.C., M.S., A.K., S.M., C.W., G.D.B. and T.I.
Corresponding author
Supplementary information
Supplementary Data
Sample Data for performing the protocol as a tutorial (ZIP 972 kb)
Rights and permissions
About this article
Cite this article
Cline, M., Smoot, M., Cerami, E. et al. Integration of biological networks and gene expression data using Cytoscape. Nat Protoc 2, 2366–2382 (2007). https://doi.org/10.1038/nprot.2007.324
Published:
Issue Date:
DOI: https://doi.org/10.1038/nprot.2007.324