Future Medicine Logo
Welcome Guest user | Log in | Athens Login | Register
Journals
Future medicine series
Aging Health
Biomarkers in Medicine
Nanomedicine
Pediatric Health
Personalized Medicine
Pharmacogenomics
Regenerative Medicine
Therapy
Women's Health
Resources
Register
For Authors
For Librarians
For Advertisers
Services
Subscriptions/Pricing
Reprints
Advertising
Press Releases/News
Help
Downloads/Links
2008/9 Catalogue
Library Recommendation
 

Summary
June 2006, Vol. 7, No. 4, Pages 543-550
(doi:10.2217/14622416.7.4.543)

Assessing performance of prediction rules in machine learning
Rory Martin1 & Kai Yu2,3
1Millennium Pharmaceuticals, Cambridge MA 02139, USA.
2Washington University, St. Louis, MO 63110, USA
3Current address: National Cancer Institute, Division of Cancer Epidemiology and Genetics, Bethesda MD 20892, USA
† Author for correspondence



Introduction: An important goal in machine learning is to assess the degree to which prediction rules are robust and replicable, since these rules are used for decision making and for planning follow-up studies. This requires an estimate of a prediction rule's true error rate, a statistic that can be estimated by resampling data. However, there are many possible approaches depending upon whether we draw observations with or without replacement, or sample once, repeatedly, or not at all, and the pros and cons of each are often unclear. This study illustrates and compares different methods for estimating true error with the aim of providing practical guidance to users of machine learning techniques. Methods: We conducted Monte Carlo simulation studies using four different error estimators: bootstrap, split sample, resubstitution and a direct estimate of true error. Here, 'split sample' refers to a single random partition of the data into a pair of training and test samples, a popular scheme. We used stochastic gradient boosting as a learning algorithm, and considered data from two studies for which the underlying data mechanism was known to be complex: a library of 6000 tripeptide substrates collected for analysis of proteasome inhibition as part of anticancer drug design, and a cardiovascular study involving 600 subjects receiving antiplatelet treatment for acute coronary syndrome. Results: There were important differences in the performance of the various error estimators examined. Error estimators for split sample and resubstitution, while being the most transparent in action and the simplest to apply, did not quantify the performance of prediction rules as accurately as the bootstrap. This was true for both types of study data, despite their highly different nature. Conclusions: The robustness and reliability of decisions based on analysis of genomics data could, in many cases, be improved by following best practices for prediction error estimation. For this, techniques such as bootstrap should be considered.

Cited by

Rory Martin, David Rose, Kai Yu, Scott Barros. (2006) Toxicogenomics strategies for predicting drug toxicity. Pharmacogenomics 7:7, 1003-1016
Online publication date: 1-Oct-2006.
Summary | Full Text | PDF (252 KB) | PDF Plus (260 KB) 
Full Text PDF (187.524 KB) PDF Plus (203.888 KB)
 

Prev. Article | Next Article
View/Print PDF (188 KB)
View PDF Plus (204 KB)
Add to favorites
Email to a friend
TOC Alert | Citation Alert What is RSS?

Quick Links
 • Alert me when:
New articles cite this article
 • Download to citation manager
 • Related articles found in:
Future Medicine
 • View Most Downloaded Articles
 
 
Quick Search
for 
Authors:
Rory Martin
Kai Yu
Keywords:
bootstrap
machine learning
Monte Carlo simulation
prediction rule
split sample
stochastic gradient boosting
true error