Statistical Learning for Biomedical Data
Part of Practical Guides to Biostatistics and Epidemiology
- Authors:
- James D. Malley, National Institutes of Health, Maryland
- Karen G. Malley, Malley Research Programming, Maryland
- Sinisa Pajevic, National Institutes of Health, Maryland
- Date Published: March 2011
- availability: Available
- format: Paperback
- isbn: 9780521699099
Paperback
Other available formats:
Hardback, eBook
Looking for an examination copy?
This title is not currently available for examination. However, if you are interested in the title for your course we can consider offering an examination copy. To register your interest please contact collegesales@cambridge.org providing details of the course you are teaching.
-
This book is for anyone who has biomedical data and needs to identify variables that predict an outcome, for two-group outcomes such as tumor/not-tumor, survival/death, or response from treatment. Statistical learning machines are ideally suited to these types of prediction problems, especially if the variables being studied may not meet the assumptions of traditional techniques. Learning machines come from the world of probability and computer science but are not yet widely used in biomedical research. This introduction brings learning machine techniques to the biomedical world in an accessible way, explaining the underlying principles in nontechnical language and using extensive examples and figures. The authors connect these new methods to familiar techniques by showing how to use the learning machine models to generate smaller, more easily interpretable traditional models. Coverage includes single decision trees, multiple-tree techniques such as Random Forests™, neural nets, support vector machines, nearest neighbors and boosting.
Read more- Free open-source computer code is available online
- Brings valuable new ideas from probability and computer science into the biomedical world to provide more accurate predictions
- Plain-language approach makes the techniques more accessible
Reviews & endorsements
"Some important concepts are explained well including the difference between supervised and unsupervised learning, and describing when specific methods work well and when they don't. They also list twenty canonical questions and point the reader to the sections in the book where these questions are answered. They provide many important examples from biomedical research and illustrate the methods to solve these problems along with the pitfalls of some of them. ... Overall, I think this is a good reference source for biomedical researchers involved in data mining or classification, but the reader should beware of the arguments that are loosely explained."
Michael R. Chernick, SignificanceSee more reviews"The book is well written and provides nice graphics and numerous applications... the book is good for its intended audience, the users of biomedical data."
Michael R. Chernick, Technometrics"While biomedical applications of the statistical learning machines described in this book are becoming more apparent, they are not widely practiced. This book provides an excellent overview for the neophyte as to the nuts and bolts of certain statistical learning machines and the major issues involved in development and evaluation of specific machines. The authors display a nice dance between exuberance and caution; they do not attempt to advance any particular machine learning approach, instead emphasizing the processes and the need to use several approaches depending on data context."
Wendy J. Mack, University of Southern California, Los Angeles for American Journal of EpidemiologyCustomer reviews
Review was not posted due to profanity
×Product details
- Date Published: March 2011
- format: Paperback
- isbn: 9780521699099
- length: 298 pages
- dimensions: 245 x 175 x 11 mm
- weight: 0.6kg
- contains: 47 b/w illus. 25 tables
- availability: Available
Table of Contents
Preface
Acknowledgements
Part I. Introduction:
1. Prologue
2. The landscape of learning machines
3. A mangle of machines
4. Three examples and several machines
Part II. A Machine Toolkit:
5. Logistic regression
6. A single decision tree
7. Random forests – trees everywhere
Part III. Analysis Fundamentals:
8. Merely two variables
9. More than two variables
10. Resampling methods
11. Error analysis and model validation
Part IV. Machine Strategies:
12. Ensemble methods – let's take a vote
13. Summary and conclusions
References
Index.Instructors have used or reviewed this title for the following courses
- Analysis of Categorical Data
- Applied Epidemiology
- Fundamentals of Biostatistics
- Statistics for Health Sciences
Sorry, this resource is locked
Please register or sign in to request access. If you are having problems accessing these resources please email lecturers@cambridge.org
Register Sign in» Proceed
You are now leaving the Cambridge University Press website. Your eBook purchase and download will be completed by our partner www.ebooks.com. Please see the permission section of the www.ebooks.com catalogue page for details of the print & copy limits on our eBooks.
Continue ×Are you sure you want to delete your account?
This cannot be undone.
Thank you for your feedback which will help us improve our service.
If you requested a response, we will make sure to get back to you shortly.
×