Multinomial Logistic Factor Regression for Multi-source Functional Block-wise Missing Data

Published online by Cambridge University Press:  01 January 2025

Xiuli Du*
Affiliation:
Nanjing Normal University
Xiaohu Jiang
Affiliation:
Nanjing Normal University
Jinguan Lin
Affiliation:
Nanjing Audit University
*Correspondence should be made to Xiuli Du, College of Mathematical Sciences, Nanjing Normal University, Nanjing 210023, China. Email: duxiuli@njnu.edu.cn

Abstract

Multi-source functional block-wise missing data have become increasingly common in medical care with the rapid development of big data and medical technology, so there is an urgent need for efficient dimension reduction methods that extract the information important for classification from such data. However, most existing methods for classification problems consider high-dimensional data as covariates. In this paper, we propose a novel multinomial imputed-factor logistic regression model with multi-source functional block-wise missing data as covariates. Our main contribution is to establish two multinomial factor regression models that use the imputed multi-source functional principal component scores and the imputed canonical scores as covariates, respectively, where the missing factors are imputed by both the conditional mean imputation and the multiple block-wise imputation approaches. Specifically, univariate FPCA is first carried out on the observable data of each data source to obtain the univariate principal component scores and the eigenfunctions. Then the block-wise missing univariate principal component scores, rather than the block-wise missing functional data themselves, are imputed by the conditional mean imputation method and the multiple block-wise imputation method, respectively. Next, based on the imputed univariate factors, the multi-source principal component scores are constructed from the relationship between the multi-source and the univariate principal component scores; at the same time, the canonical scores are obtained by multiple-set canonical correlation analysis. Finally, the multinomial imputed-factor logistic regression model is established with the multi-source principal component scores or the canonical scores as factors. Numerical simulations and a real data analysis on ADNI data show that the proposed method works well.
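The imputation step can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the univariate principal component scores are jointly Gaussian, that their mean and covariance are estimated from the fully observed subjects only, and that a missing block shows up as NaN entries in a score matrix. The function name `conditional_mean_impute` and the toy data are hypothetical.

```python
import numpy as np

def conditional_mean_impute(scores):
    """Fill NaN score entries with a Gaussian conditional mean.

    Assumption (not taken from the paper): the mean and covariance of the
    scores are estimated from the rows that have no missing entries.
    """
    scores = np.asarray(scores, dtype=float)
    complete = ~np.isnan(scores).any(axis=1)
    mu = scores[complete].mean(axis=0)
    sigma = np.cov(scores[complete], rowvar=False)

    imputed = scores.copy()
    for i in np.where(~complete)[0]:
        miss = np.isnan(scores[i])
        obs = ~miss
        if not obs.any():
            # Nothing observed for this subject: fall back to the mean.
            imputed[i] = mu
            continue
        s_oo = sigma[np.ix_(obs, obs)]
        s_mo = sigma[np.ix_(miss, obs)]
        # E[x_m | x_o] = mu_m + S_mo S_oo^{-1} (x_o - mu_o)
        resid = scores[i, obs] - mu[obs]
        imputed[i, miss] = mu[miss] + s_mo @ np.linalg.solve(s_oo, resid)
    return imputed

# Toy example: column 0 is a score from source 1, column 1 a score from
# source 2; the last subject is missing the whole source-2 block.
toy_scores = np.array([[1.0, 2.0],
                       [2.0, 4.0],
                       [3.0, 6.0],
                       [4.0, np.nan]])
print(conditional_mean_impute(toy_scores))
```

In the toy data the second score is exactly twice the first among complete subjects, so the missing entry is filled with 8. The imputed score matrix would then feed the later steps (constructing multi-source or canonical scores and fitting the multinomial logistic model), which this sketch does not cover.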

Information

Type
Theory & Methods
Copyright
Copyright © 2023 The Author(s) under exclusive licence to The Psychometric Society

Supplementary material

Du et al. supplementary material 1 (File, 1.5 MB)
Du et al. supplementary material 2 (File, 537.5 KB)