Search results for Statistics and Probability

Part IV - Biomarker Discovery via Multistage Signal Enhancement and Identification of Essential Patterns
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 207-218
- Chapter

14 - Multistage Signal Enhancement
from Part IV - Biomarker Discovery via Multistage Signal Enhancement and Identification of Essential Patterns
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 209-212
- Chapter
Summary

Chapters 14 and 15 describe a method for the identification of parsimonious and robust multivariate biomarkers that may also have the best chance for plausible biological interpretation. The method is based on multistage signal enhancement and identification of essential patterns. Chapter 14 covers the first logical part of this method – the multistage signal enhancement approach leading to the identification of a pool of potentially important variables.

6 - Basic Regression Methods
from Part II - Regression Methods for Estimation
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 101-113
- Chapter
Summary

Chapter 6 starts with the description of multiple regression. Even if it is unlikely for multiple regression to be used as the primary method for multivariate biomarker discovery based on high-dimensional data, presenting this classical method provides the necessary background for regression analysis and highlights the weaknesses of multiple regression, which will be addressed by the subsequently presented methods. This chapter also presents partial least squares regression (PLSR), which by performing supervised dimensionality reduction addresses some weaknesses of multiple regression; however, by not performing any feature selection, PLSR does not reduce noise that is typically abundant in high-dimensional data.

13 - Neural Networks and Deep Learning
from Part III - Classification Methods
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 191-206
- Chapter
Summary

Chapter 13 discusses neural networks and deep learning; included is a presentation of deep convolutional networks that seem to have a great potential in the classification of medical images.

5 - Multivariate Feature Selection
from Part I - Framework for Multivariate Biomarker Discovery
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 76-98
- Chapter
Summary

Chapter 5 is dedicated to the most important part of predictive modeling for biomarker discovery based on high-dimensional data – multivariate feature selection. When dealing with sparse biomedical data whose dimensionality is much higher than the number of training observations, the crucial issue is to overcome the curse of dimensionality by using methods capable of elevating signal (predictive information) above the overwhelming noise. One way of doing this is to perform many (hundreds or thousands of) parallel feature selection experiments based on different random subsamples of the original training data and then aggregate their results (for example, by analyzing the distribution of variables among the results of those parallel experiments). Two designs of such parallel feature selection experiments are discussed in detail: one based on recursive feature elimination, and the other on stepwise hybrid selection with T². The chapter also includes descriptions of three evolutionary feature selection algorithms: simulated annealing, genetic algorithms, and particle swarm optimization.
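
The subsample-and-aggregate idea described in this summary can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the chapter's method: the selector used here (ranking variables by the absolute difference of class means) is a simple univariate placeholder standing in for the multivariate selectors the chapter discusses, and the function names, the 80% subsample fraction, the 200 experiments, and the synthetic data are all hypothetical.

```python
import random
from collections import Counter


def top_k_by_mean_difference(X, y, k):
    """Placeholder selector (an assumption, not the book's algorithm):
    rank variables by |mean(class 1) - mean(class 0)| and keep the top k."""
    n_vars = len(X[0])
    scores = []
    for j in range(n_vars):
        g1 = [row[j] for row, label in zip(X, y) if label == 1]
        g0 = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(g1) / len(g1) - sum(g0) / len(g0)))
    return sorted(range(n_vars), key=lambda j: scores[j], reverse=True)[:k]


def selection_frequencies(X, y, k, n_experiments=200, fraction=0.8, seed=0):
    """Run many feature selection experiments on random stratified subsamples
    and return, for each variable, the fraction of experiments selecting it."""
    rng = random.Random(seed)
    idx1 = [i for i, label in enumerate(y) if label == 1]
    idx0 = [i for i, label in enumerate(y) if label == 0]
    counts = Counter()
    for _ in range(n_experiments):
        # Subsample each class without replacement to keep class proportions.
        sample = (rng.sample(idx1, max(1, int(fraction * len(idx1))))
                  + rng.sample(idx0, max(1, int(fraction * len(idx0)))))
        Xs = [X[i] for i in sample]
        ys = [y[i] for i in sample]
        counts.update(top_k_by_mean_difference(Xs, ys, k))
    return {j: counts[j] / n_experiments for j in range(n_vars_of(X))}


def n_vars_of(X):
    """Number of variables (columns) in a row-major data matrix."""
    return len(X[0])


# Synthetic data: 40 observations, 30 variables; only variables 0 and 1
# carry signal (their class-1 mean is shifted), the rest are pure noise.
data_rng = random.Random(1)
y = [i % 2 for i in range(40)]
X = [[data_rng.gauss(3.0 * label if j < 2 else 0.0, 1.0) for j in range(30)]
     for label in y]

freq = selection_frequencies(X, y, k=2)
stable = sorted(j for j, f in freq.items() if f > 0.5)
print("Variables selected in more than half of the experiments:", stable)
```

Variables that are selected in a large share of the experiments are candidates for further consideration, while variables that appear only sporadically are more likely to reflect noise in a particular subsample.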

11 - Classification with Support Vector Machines
from Part III - Classification Methods
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 158-173
- Chapter
Summary

Chapter 11 presents classification with support vector machines – details of the algorithms for linear and nonlinear SVMs. Discussed are also kernel functions, hyperparameters, variable importance measures, and cost-sensitive SVMs.

References
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 267-272
- Chapter

3 - Predictive Modeling for Biomarker Discovery
from Part I - Framework for Multivariate Biomarker Discovery
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 25-43
- Chapter
Summary

Chapter 3 provides an overview of all elements of the predictive modeling process, from the selection of training and test data sets, parallel multivariate feature selection experiments and deciding on an optimal multivariate biomarker, to building, tuning, validating, and testing predictive models implementing the optimal biomarker. Discussed are also such topics as bias-variance tradeoff, segmentation models, and committees of predictive models.

Part III - Classification Methods
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 147-206
- Chapter

Part I - Framework for Multivariate Biomarker Discovery
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 1-98
- Chapter

1 - Introduction
from Part I - Framework for Multivariate Biomarker Discovery
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 3-15
- Chapter
Summary

Chapter 1 focuses on terminology and basic concepts of the area, and places multivariate biomarker discovery in the context of biomarker studies and personalized medicine. For ease of reference, included are also short descriptions of some of the terms and concepts introduced and discussed in various parts of the book.

Contents
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp vii-xii
- Chapter

10 - Classification with Random Forests
from Part III - Classification Methods
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 149-157
- Chapter
Summary

Chapter 10 covers the random forests algorithm for classification. Presented are also the impurity metrics applicable to splitting nodes in classification trees (Gini, entropy, and misclassification impurity), as well as permutation-based and impurity-based variable importance measures.
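
For reference, the three node impurity metrics named in this summary have standard textbook definitions, which can be written as a minimal Python sketch. The function names and the example labels are hypothetical and are not taken from the book.

```python
import math
from collections import Counter


def class_proportions(labels):
    """Proportion of each class present among the labels in a tree node."""
    counts = Counter(labels)
    n = len(labels)
    return [c / n for c in counts.values()]


def gini(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    return 1.0 - sum(p * p for p in class_proportions(labels))


def entropy(labels):
    """Entropy impurity in bits: sum of -p * log2(p) over classes."""
    return sum(-p * math.log2(p) for p in class_proportions(labels))


def misclassification(labels):
    """Misclassification impurity: 1 - proportion of the majority class."""
    return 1.0 - max(class_proportions(labels))


# A maximally mixed two-class node versus a pure node.
mixed = ["tumor"] * 5 + ["normal"] * 5
pure = ["tumor"] * 10
for name, node in (("mixed", mixed), ("pure", pure)):
    print(name, gini(node), entropy(node), misclassification(node))
```

All three metrics are zero for a pure node and reach their maximum when the classes are equally represented, which is why a split is preferred when it reduces the impurity of the child nodes relative to the parent.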

Part V - Multivariate Biomarker Discovery Studies
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 219-266
- Chapter

12 - Discriminant Analysis
from Part III - Classification Methods
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 174-190
- Chapter
Summary

Chapter 12 presents discriminant analysis – a classical (and powerful) supervised learning approach for classification. Discussed are Fisher’s discriminant analysis, as well as Gaussian linear, quadratic, and regularized discriminant analysis. The chapter concludes with a discussion of partial least squares discriminant analysis, which is still popular in some application areas, even if its application to high-dimensional data is likely to result in solutions that are suboptimal in terms of predictive abilities and interpretability (alternative approaches are recommended).

17 - Biomarker Discovery Study 2
from Part V - Multivariate Biomarker Discovery Studies
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 241-266
- Chapter
Summary

Chapter 17 describes the second real-life study, whose goal is the identification of multivariate biomarkers for liver cancer. This study implements parallel recursive feature elimination experiments coupled with random forests and support vector machines. Included are also considerations for rebalancing class proportions. Three multivariate biomarkers for liver cancer have been identified. The study has been performed in an R environment, and R scripts for all of its steps are provided.

16 - Biomarker Discovery Study 1
from Part V - Multivariate Biomarker Discovery Studies
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 221-240
- Chapter
Summary

Chapter 16 presents the first of the two real-life multivariate biomarker discovery studies included in the book. The goal of this study – which implements the method presented in Chapters 14 and 15 – is to identify the essential gene expression patterns and a multivariate biomarker common to multiple types of cancer. This study is based on the TCGA RNA-Seq data of 3,528 patients and 20,530 gene expression variables; the data represent five tumor types from five different tissues. A parsimonious multivariate biomarker (consisting of ten genes) with high sensitivity and specificity has been identified.

9 - Support Vector Regression
from Part II - Regression Methods for Estimation
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 136-146
- Chapter
Summary

Chapter 9 presents support vector regression (SVR), a relatively new supervised learning algorithm for predictive regression modeling, which – like random forests for regression – may also outperform the least-squares-based methods. Discussed are the ε-insensitive loss used by SVR, the ε-tube concept, and algorithms for linear and nonlinear SVRs.
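
The ε-insensitive loss mentioned in this summary has a standard definition: residuals that fall inside a tube of half-width ε around the prediction cost nothing, and residuals outside it are penalized linearly by the amount they exceed ε. A minimal Python sketch follows; the function name and the example values are hypothetical.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1):
    """Standard epsilon-insensitive loss: max(0, |y_true - y_pred| - epsilon)."""
    return max(0.0, abs(y_true - y_pred) - epsilon)


# A residual inside the epsilon-tube incurs no loss; one outside it is
# penalized only by the part that exceeds epsilon.
inside = epsilon_insensitive_loss(1.0, 1.05, epsilon=0.1)
outside = epsilon_insensitive_loss(1.0, 1.5, epsilon=0.1)
print(inside, outside)
```

Because observations inside the tube contribute no loss, they do not influence the fitted function, which contrasts with least squares, where every nonzero residual contributes.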

4 - Evaluation of Predictive Models
from Part I - Framework for Multivariate Biomarker Discovery
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp 44-75
- Chapter
Summary

Chapter 4 provides detailed coverage of methods for the evaluation of predictive models: the methods applicable to regression models implementing estimation biomarkers, as well as methods evaluating binary and multiclass classification models. Discussion of resampling techniques is accompanied by accentuating the danger of information leakage and by emphasizing the paramount importance of avoiding internal validation. Discussion of metrics for the evaluation of classification biomarkers includes the issue of proper and improper interpretation of sensitivity and specificity, illustrated by an example of a screening biomarker targeting a population with low prevalence of the tested disease. For such biomarkers, positive predictive value may be unacceptably low even when the biomarker has a very high specificity and sensitivity. Discussed in this chapter are also misclassification costs and incorporating them into cost-sensitive classification.
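
The point about low positive predictive value at low prevalence follows directly from Bayes' rule and can be checked with a short calculation. The sketch below uses hypothetical numbers (99% sensitivity, 99% specificity, 0.1% prevalence) chosen only for illustration; they are not the example used in the book.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV by Bayes' rule: P(disease | positive test result)."""
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# Hypothetical screening scenario: an accurate test, but a rare disease.
ppv = positive_predictive_value(sensitivity=0.99, specificity=0.99,
                                prevalence=0.001)
print(f"PPV = {ppv:.3f}")  # prints "PPV = 0.090"
```

Under these assumed numbers, only about 9% of positive results correspond to actual disease, because the healthy majority generates roughly ten times as many false positives as the diseased minority generates true positives.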

Acknowledgments
Darius M. Dziuda, Central Connecticut State University
Book:

Multivariate Biomarker Discovery

Published online:

30 May 2024

Print publication:

06 June 2024, pp xvii-xviii
- Chapter
