|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal of Lipid Research, Vol. 47, 1887-1890, September 2006
A thematic review series: systems biology approaches to metabolic and cardiovascular disorders
Department of Medicine/Division of Cardiology, BH-307 Center for the Health Sciences, University of California Los Angeles, Los Angeles, CA 90095-1679
1 To whom correspondence should be addressed. e-mail: jlusis{at}mednet.ucla.edu
The next series of the JLR's thematic review articles concerns systems-level approaches to cardiovascular and metabolic traits. "Systems-level" means a kind of biologic analysis that looks beyond individual genes or proteins or lipids to the ensemble of multiple elements of a system. Some examples of biologic systems are the transcripts in a cell (a "transcriptome"), the proteins of an organelle (a "proteome"), and the metabolites in a liver (a "metabolome"). Systems-level approaches have been stimulated by the genome project, by the development of experimental techniques that can simultaneously interrogate many elements of a system (such as expression microarrays), and by advances in computational methods for analyzing large data sets (1). A key concept in systems biology is that of "emergent properties," important features of biologic systems that can best be identified by examining the system as a whole. A simple example of an emergent property is illustrated in Fig. 1 . In this experiment, the levels of the transcripts in livers of mice from an intercross between two common inbred strains were determined using expression microarrays. The entire data set was then tested to identify transcripts whose levels correlate with one another. One of many such sets of highly correlated transcripts from this experiment is the set of cholesterol biosynthetic enzymes, as shown. In this case, the relationships of these enzymes were already known through the painstaking, decades-long studies of many biochemists. However, even if this information was not known, our global experiment would immediately group all of the enzymes together, implying a functional relationship. Jim Weiss's review in this series will discuss a particularly striking emergent property in metabolism. Thus, both glycolysis and oxidative phosphorylation are capable, under the right conditions, of developing self-sustained oscillations. Such oscillations are observed only when the individual enzymes are coupled into a "network" with other metabolic enzymes to create positive and negative feedback loops. Clearly, in this example, studies of the individual components of the system would not provide mechanistic understanding of the overall dynamics of the system.
A particularly important goal of systems biology is to construct "networks," sets of genes or proteins or metabolites that act in concert in a common biologic process (2). These networks can be experimentally identified by classical methods (e.g., the pathways of cholesterol homeostasis), but systems-level approaches such as expression arrays, chIP-chip studies, and whole-genome yeast two-hybrid experiments can efficiently test millions of possible interactions. Topologically, networks consist of elements (termed "nodes" in network nomenclature) that exhibit functional connections ("edges") (Fig. 2 ). The connections can be identified using coregulation (as in Fig. 1), physical interactions (as in protein complexes), or metabolic relationships (e.g., the intermediates in glycolysis). "Undirected" networks are simply nodes connected by edges, with no causal direction, whereas "directed" networks have edges with a given direction (Fig. 2). An important emergent property observed for most biologic networks is scale-free topology, in which some nodes have many edges (these nodes are termed "hubs") but most nodes have few edges.
A convenient way of organizing the data sets generated using various global platforms is as a series of orthogonal stages, ordered according to the sequential stages of gene expression (genome, transcriptome, proteome) followed by the metabolome (the set of metabolites) and the phenome (the set of physiologic or disease parameters of interest) (Fig. 3 ). Each of these data sets can be used to construct networks or derive other useful information, but none alone tells the whole story. For example, analyses of the transcriptome will miss crucial aspects of protein realization and cellular signaling. Thus, the intersection of these orthogonal data sets is an important challenge. One particularly useful application of such genomic integration is for the identification of genes and pathways contributing to common diseases. Bayesian inference is a particularly useful statistical approach for the integration of information from heterogeneous data sets, assigning a probability to the predictions rather than only a binary classification (3, 4).
How important will systems-level approaches be? For well over 50 years, since chemists developed the tools for studies of biological molecules, research in biology has focused on individual genes, painstakingly connecting them to other genes and to functions. Almost the entire body of mechanistic understanding of cellular and developmental biology has been built up in this reductionistic way. Not surprisingly, many biologists are skeptical about the new systems biology that seeks a global view of all of the elements and their interactions and their responses to environmental changes. The goal of this series of reviews is to illustrate how systems-based approaches can complement traditional approaches to the biology of lipids and diseases involving lipids. The most elegant systems biology studies to date have been in model organisms, such as bacteria, yeast, Caenorhabditis elegans, and flies (57). However, a number of studies involving various global data sets for mammals (primarily mouse and human) (Table 1 ) have already contributed importantly to our understanding of metabolic and cardiovascular diseases.
A particularly important goal of systems biology is to identify genes involved in diseases. The genes underlying approximately half of the Mendelian disorders listed in McKusick's Online Mendelian Inheritance in Man have now been identified (3), but the search for genes and pathways involved in common complex diseases, such as the metabolic syndrome, diabetes, and cardiovascular diseases, is only beginning. The first review of this series, by Karen Reue and Laurent Vergnes (UCLA), deals with the integration of genomic resources, particularly the genome sequences of human, mouse, and other model organisms, for the identification and functional analysis of genes involved in lipid metabolism. This review outlines the various genomic approaches to this problem, including studies with model organisms, systematic gene modification projects, and comparative genomics. For example, a number of new genes involved in lipid metabolism have recently been identified based on sequence similarities with known genes and their functions determined using mice with gene modifications (8). In the second article in the series, Andrew Watson (UCLA) will review "lipidomics," a systems-based study of all lipids, the molecules with which they interact, and their functions. Lipidomics can be considered a branch of metabolomics. Lipids, of course, play crucial structural roles in biologic systems, but the appreciation of their key regulatory functions in inflammation and chronic diseases is greatly expanding. For example, certain recently identified oxidation products of phospholipids appear to play a key role in atherosclerosis (9), and one such oxidized lipid was shown to significantly perturb the expression of >1,000 genes in human endothelial cells (10). It is noteworthy that the National Institutes of Health has established a large collaborative effort, the LIPID MAPS consortium, to identify, characterize, and quantitate all the lipids in specific cells. This includes a bioinformatics focus to organize and integrate the data, consisting of rapidly expanding lists of lipids (presently over 8,000), lipid related proteins, and lipid related changes in the transcriptome (11, 12). As discussed above, a major application of genome-wide data sets will be to model networks. One important characteristic of networks is "topology," the overall architecture of the interactions between nodes. A second is "dynamics," the temporal interactions involved in perturbations of the network or the maintenance of homeostasis. Interaction maps represent possible networks, but not all edges will be present at the same time in a particular cell, just as edges will differ between cell types and subcellular locations. Studies in yeast, for example, have shown that environmental responses involve multiple temporal stages and surprisingly large-scale topological changes (13). An understanding of the dynamic nature of a network will thus require comprehensive time-course data sets with tools for quantifying kinetic parameters such as concentrations, interaction strengths, and fluxes (14) (Fig. 2). In the third review, Jim Weiss (UCLA) will review studies of the dynamics of cardiac and smooth muscle metabolism (15). These studies illustrate how systems-level approaches can reveal emergent properties of a system that lead to testable hypotheses. Thus, studies of cardiac metabolism indicate that, in a normal state, energy supply and demand are closely balanced, but under stress (as in postischemic injury), this coordination can be lost, leading to cell-wide oscillations in ATP levels. There have been a number of successes in the identification of genes underlying common diseases, and there will likely be hundreds of additional genes identified in the next few years. But how will this information be used in the identification of targets and the development of safe, efficient treatments? In the fourth review, Eric Schadt (Merck) will discuss how systems-based approaches can be used to define targets for therapeutic intervention (16). He will first describe how directed networks can be constructed using data from molecular profiling, genotyping, and clinical studies. He will then discuss how such networks can be intersected with orthogonal experimental data, such as data from transgenic mice or small interfering RNA experiments in cultured cells, to identify candidates for drug intervention. Proteomics involves the systematic analysis of proteins and their interactions. Although the technologies for examining proteins on a genome-wide basis are less powerful than the microarray technologies for RNA expression, they are already providing important complementary information. Peipei Ping and Thomas Drake (UCLA) will review the various levels of proteomic investigation as they apply to metabolic and vascular diseases, including organelle topography, protein-protein interaction networks, protein interactions with DNA and lipids, protein processing, and disease marker discovery. One important application of proteomics is the identification of disease markers (17, 18). In the final review, Erwin Kurland (State University of New York at Stony Brook) will focus on metabolomics, the systematic profiling of metabolites using nuclear magnetic resonance, tandem mass spectrometry, calorimetry, and stable isotopes. Topics relevant to lipid metabolism, such as energy partitioning and metabolite flux between tissues, will be described. This review will emphasize how metabolomics can help clarify very complex disorders such as diabetes that involve multiple tissues [for example, (19)].
The author expresses his gratitude to the investigators mentioned in this article for participating in this project. They are all leaders in their respective areas of research, and the author is confident that their reviews will generate interest in this promising new approach in biology. Manuscript received July 17, 2006 REFERENCES
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||