ISSN 1006-298X      CN 32-1425/R

导航

肾脏病与透析肾移植杂志 ›› 2021, Vol. 30 ›› Issue (5): 476-479.DOI: 10.3969/j.jssn.1006-298X2021.5.016

• • 上一篇    下一篇

基于多重填补的广义线性模型在肾脏疾病研究中的应用

  

  • 出版日期:2021-10-28 发布日期:2021-10-28

A generalized linear model based on multiple imputation in kidney disease research

  • Online:2021-10-28 Published:2021-10-28

摘要: 在临床数据的统计分析中,我们常会遇到数据缺失。根据数据的缺失情况,使用妥当的方法处理缺失值关系着统计推断的可靠性和准确性。本文对数据缺失的模式、比例和常见的处理方法进行了简单的总结,对基于R语言程序包mice的多重填补的流程和内容进行了详细的梳理,并以急性肾损伤的影响因素分析为例,展示和对比基于多重填补数据集、缺失数据集和原始数据集的广义线性模型的结果。结果表明,多重填补可以很好地对定量变量、二分类变量和有序多分类变量进行填补,基于多重填补数据集的模型结果稳健、可靠。


Abstract: In the statistical analysis of clinical data, we often encounter the situation of missing data. According to the condition of missing data, proper methods to deal with missing values are related to the reliability and accuracy of statistical inference. In this paper, the model of missing data, proportion of missing values, and the common processing method were briefly summarized; the process and content of multiple imputation based on R package mice were shown in detail. Finally, an analysis of the influencing factors of acute kidney injury was conducted to illustrate multiple imputation, and the results of generalized linear models based on multiple imputation data sets, missing data sets, and original data sets were presented and compared. The results showed that multiple imputation did well in the imputation of quantitative variables, dichotomous variables and ordered multiclassification variables, and the result of the model based on multiple imputation data sets was robust and reliable.