Integrative Analysis for Subgroup and Sparsity Recovery

主讲人: 严晓东

严晓东,香港理工大学与云南大学联合培养博士,山东大学副教授,山东大学未来学者。研究领域为数据科学技术、统计学习、计量经济等。论文发表在Journal of Econometrics、Journal of the American Statistical Association、Statistica Sinica、Journal of Multivariate Analysis、Computational Statistics & Data Analysis等期刊上。目前主持国家自然科学基金、山东省自然科学基金、山东省社科规划项目基金等。

主持人: 方匡南

In modern economic studies, the population heterogeneity of multiple stratifications and the high dimensionality of the predictors pose a major challenge. In this study, we introduce an integrative procedure that can be used to explore the information regarding group and sparsity structures for high-dimensional and heterogeneous stratified models. Further, we propose $K$-regression modeling as a hybrid of complex and simple models exhibiting arbitrary dependence on the stratification features, but linear dependence on other variables. $K$-regression models preeminently exhibit the following features:(i) they are essentially non-parametric with respect to the stratified feature, and parametric linearly effects in other variables with potentially integrative pattern because the effects and the corresponding sparsity structures can be the same for the stratifications in common groups but vary across different groups; (ii) the devised $K$-regression algorithm can automatically integrate the stratifications pertaining to common regression model and simultaneously estimate the corresponding effects simultaneously; (iii) the proposal quickly recovers the subpopulation and sparsity structure of the $K$-regression models within massive and high-dimensional stratifications; (iv) the resulting estimators exhibit two-layer oracle properties, i.e., the oracle estimator obtained using the known group and sparsity structures is the local minimizer of the objective function with high probability. The stratification-specific bootstrap (SSB) sampling scheme was developed to improve the integration accuracy. Furthermore, the simulation studies provide supportive evidence that the newly proposed method performs appropriately in case of finite samples; a real data example has been provided for illustration.

时间: 2021-05-26(Wednesday)16:40-18:00
地点: 经济楼N302
主办单位: 厦门大学经济学院、王亚南经济研究院
承办单位: 厦门大学经济学院、王亚南经济研究院
类型: 系列讲座