Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression Data

  1. Philip M. Kim1 and
  2. Bruce Tidor2,3,4
  1. 1 Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
  2. 2 Biological Engineering Division, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
  3. 3 Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA

Abstract

The availability of parallel, high-throughput biological experiments that simultaneously monitor thousands of cellular observables provides an opportunity for investigating cellular behavior in a highly quantitative manner at multiple levels of resolution. One challenge to more fully exploit new experimental advances is the need to develop algorithms to provide an analysis at each of the relevant levels of detail. Here, the data analysis method non-negative matrix factorization (NMF) has been applied to the analysis of gene array experiments. Whereas current algorithms identify relationships on the basis of large-scale similarity between expression patterns, NMF is a recently developed machine learning technique capable of recognizing similarity between subportions of the data corresponding to localized features in expression space. A large data set consisting of 300 genome-wide expression measurements of yeast was used as sample data to illustrate the performance of the new approach. Local features detected are shown to map well to functional cellular subsystems. Functional relationships predicted by the new analysis are compared with those predicted using standard approaches; validation using bioinformatic databases suggests predictions using the new approach may be up to twice as accurate as some conventional approaches.

Footnotes

  • [Supplemental material is available online at www.genome.org.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.903503.

  • 4 Corresponding author. E-MAIL tidor{at}mit.edu; FAX (617)252-1816.

    • Accepted March 24, 2003.
    • Received October 11, 2002.
| Table of Contents

Preprint Server