Hello. Sign in to personalize your visit. New user? Register now.  
GEN Updates in Biotechnology - Next-Generation Sequencing
Journal of Computational Biology
Continuous Representations of Time-Series Gene Expression Data

To cite this paper:
Ziv Bar-Joseph, Georg K. Gerber, David K. Gifford, Tommi S. Jaakkola, Itamar Simon. Journal of Computational Biology. June 1, 2003, 10(3-4): 341-356. doi:10.1089/10665270360688057.

Full Text PDF: • HiRes for printing (297.2 KB) • PDF Plus w/ links (266.6 KB)


Ziv Bar-Joseph
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
Georg K. Gerber
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
David K. Gifford
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
Tommi S. Jaakkola
MIT Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA 02139
Itamar Simon
Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.

Free first page

This paper was cited by:

Expression profiling analysis for genes related to meat quality and carcass traits during postnatal development of backfat in two pig breeds
MingZhou Li, Li Zhu, XueWei Li, SuRong Shuai, XiaoKun Teng, HuaSheng Xiao, Qiang Li, Lei Chen, YuJiao Guo, JinYong Wang
Science in China Series C: Life Sciences. Sep 2008, Vol. 51, No. 8: 718-733
CrossRef
Discovering Statistically Significant Periodic Gene Expression
Jie Chen, Kuang-Chao Chang
International Statistical Review. Sep 2008, Vol. 76, No. 2: 228-246
CrossRef
Gene profiling for determining pluripotent genes in a time course microarray experiment
J. Tuke, G. F. V. Glonek, P. J. Solomon
Biostatistics. Jul 2008
CrossRef
Influence of mRNA decay rates on the computational prediction of transcription rate profiles from gene expression profiles
Chi-Fang Chin, Arthur Chun-Chieh Shih, Kuo-Chin Fan
Journal of Biosciences. Jan 2008, Vol. 32, No. S3: 1251-1262
CrossRef
Bayesian State Space Models for Inferring and Predicting Temporal Gene Expression Profiles
Yulan Liang, Arpad Kelemen
Biometrical Journal. Jul 2007, Vol. 49, No. 6: 801-814
CrossRef
Reconstructing dynamic regulatory maps
Jason Ernst, Oded Vainas, Christopher T Harbison, Itamar Simon, Ziv Bar-Joseph
Molecular Systems Biology. Feb 2007, Vol. 3
CrossRef
Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives
S.  Déjean, P. G. P. Martin, A.  Baccini, P.  Besse
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11
CrossRef
Analysis of Gene Coexpression by B-Spline Based CoD Estimation
Huai  Li, Yu  Sun, Ming  Zhan
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11
CrossRef
Significance analysis of time-series transcriptomic data: A methodology that enables the identification and further exploration of the differentially expressed genes at each time-point
Bhaskar Dutta, Robert Snyder, Maria I. Klapa
Biotechnology and Bioengineering. 2007, Vol. 98, No. 3: 668
CrossRef
The Wavelet-Based Cluster Analysis for Temporal Gene Expression Data
J. Z. Song, K. M. Duan, T. Ware, M. Surette
EURASIP Journal on Bioinformatics and Systems Biology. 2007, Vol. 2007: 1
CrossRef
A Bayesian Mixture Model for Partitioning Gene Expression Data
Chuan Zhou, Jon Wakefield
Biometrics. 2005, Vol. 0, No. 0: 051215022406003
CrossRef
Combined static and dynamic analysis for determining the quality of time-series expression profiles
Itamar Simon, Zahava Siegfried, Jason Ernst, Ziv Bar-Joseph
Nature Biotechnology. 2005, Vol. 23, No. 12: 1503
CrossRef
All papers
Previous Next