research findings
Filters
搜索

Zhang, S., Guo, B., Dong, A., He, J., Xu, Z., & Chen, S. X. (2017). Cautionary tales on air-quality improvement in Beijing. Proceedings of the Royal Society A, 473, 20170457.

The official air-quality statistic reported that Beijing had a 9.9% decline in the annual concentration of PM2.5 in 2016. While this statistic offered some relief for the inhabitants of the capital, we present several analyses based on Beijing’s PM2.5 data of the past 4 years at 36 monitoring sites along with meteorological data of the past 7 years. The analyses reveal the air pollution situation in 2016 was not as rosy as the 9.9% decline would convey, and improvement if any was rather uncertain. The paper also provides an assessment on the city’s PM2.5 situation in the past 4 years.

Read More »

Chang, J., Shao, Q. M., & Zhou, W. X. (2016). Cramér-type moderate deviations for Studentized two-sample U-statistics with applications. The Annals of Statistics, 44, 1931-1956.

Two-sample U -statistics are widely used in a broad range of applications, including those in the fields of biostatistics and econometrics. In this paper, we establish sharp Cramér-type moderate deviation theorems for Studentized two-sample U-statistics in a general framework, including the two-sample t-statistic and Studentized Mann–Whitney test statistic as prototypical exam- ples. In particular, a refined moderate deviation theorem with second-order accuracy is established for the two-sample t-statistic. These results extend the applicability of the existing statistical methodologies from the one-sample t-statistic to more general nonlinear statistics. Applications to two-sample large-scale multiple testing problems with false discovery rate control and the regularized bootstrap method are also discussed.

Read More »

Chang, J., Tang, C. Y., & Wu, Y. (2016). Local independence feature screening for nonparametric and semiparametric models by marginal empirical likelihood. Annals of statistics, 44, 515.

We consider an independence feature screening technique for identifying explanatory variables that locally contribute to the response variable in high- dimensional regression analysis. Without requiring a specific parametric form of the underlying data model, our approach accommodates a wide spectrum of nonparametric and semiparametric model families. To detect the local con- tributions of explanatory variables, our approach constructs empirical likeli- hood locally in conjunction with marginal nonparametric regressions. Since our approach actually requires no estimation, it is advantageous in scenarios such as the single-index models where even specification and identification of a marginal model is an issue. By automatically incorporating the level of variation of the nonparametric regression and directly assessing the strength of data evidence supporting local contribution from each explanatory vari- able, our approach provides a unique perspective for solving feature screen- ing problems. Theoretical analysis shows that our approach can handle data dimensionality growing exponentially with the sample size. With extensive theoretical illustrations and numerical examples, we show that the local inde- pendence screening approach performs promisingly.

Read More »

He, J., & Chen, S. X. (2016). Testing super-diagonal structure in high dimensional covariance matrices. Journal of Econometrics, 194, 283-297.

The covariance matrices are essential quantities in econometric and statistical applications including port- folio allocation, asset pricing and factor analysis. Testing the entire covariance under high dimensionality endures large variability and causes a dilution of the signal-to-noise ratio and hence a reduction in the power. We consider a more powerful test procedure that focuses on testing along the super-diagonals of the high dimensional covariance matrix, which can infer more accurately on the structure of the co- variance. We show that the test is powerful in detecting sparse signals and parametric structures in the covariance. The properties of the test are demonstrated by theoretical analyses, simulation and empirical studies.

Read More »

Zhou, W., & Ling, C. (2016). Higher-order expansions of powered extremes of normal samples. Statistics & Probability Letters, 111, 12-17.

In this paper, higher-order expansions for distributions and densities of powered extremes of standard normal random sequences are established under an optimal choice of normalized constants. Our findings refine the related results in Hall (1980). Furthermore, it is shown that the rate of convergence of distributions/densities of normalized extremes depends in principle on the power index.

Read More »

Chang, J., Guo, B., & Yao, Q. (2015). High dimensional stochastic regression with latent factors, endogeneity and nonlinearity. Journal of Econometrics, 189, 297-312.

We consider a multivariate time series model which represents a high dimensional vector process as a sum of three terms: a linear regression of some observed regressors, a linear combination of some latent and serially correlated factors, and a vector white noise. We investigate the inference without imposing stationary conditions on the target multivariate time series, the regressors and the underlying factors. Furthermore we deal with the endogeneity that there exist correlations between the observed regressors and the unobserved factors. We also consider the model with nonlinear regression term which can be approximated by a linear regression function with a large number of regressors. The convergence rates for the estimators of regression coefficients, the number of factors, factor loading space and factors are established under the settings when the dimension of time series and the number of regressors may both tend to infinity together with the sample size. The proposed method is illustrated with both simulated and real data examples.

Read More »

Chang, J., & Hall, P. (2015). Double-bootstrap methods that use a single double-bootstrap simulation. Biometrika, 102, 203-214.

We show that, when the double bootstrap is used to improve performance of bootstrap meth- ods for bias correction, techniques based on using a single double-bootstrap sample for each single-bootstrap sample can produce third-order accuracy for much less computational expense than is required by conventional double-bootstrap methods. However, this improved level of per- formance is not available for the single double-bootstrap methods that have been suggested to construct confidence intervals or distribution estimators.

Read More »

Chang, J., Chen, S. X., & Chen, X. (2015). High dimensional generalized empirical likelihood for moment restrictions with dependent data. Journal of Econometrics, 185, 283-304.

This paper considers the maximum generalized empirical likelihood (GEL) estimation and inference on parameters identified by high dimensional moment restrictions with weakly dependent data when the dimensions of the moment restrictions and the parameters diverge along with the sample size. The consistency with rates and the asymptotic normality of the GEL estimator are obtained by properly restricting the growth rates of the dimensions of the parameters and the moment restrictions, as well as the degree of data dependence. It is shown that even in the high dimensional time series setting, the GEL ratio can still behave like a chi-square random variable asymptotically. A consistent test for the over- identification is proposed. A penalized GEL method is also provided for estimation under sparsity setting.

Read More »

Chang, J., Tang, C. Y., & Wu, Y. (2013). Marginal empirical likelihood and sure independence feature screening. Annals of statistics, 41. 2123-2148.

We study a marginal empirical likelihood approach in scenarios when the number of variables grows exponentially with the sample size. The marginal empirical likelihood ratios as functions of the parameters of interest are sys- tematically examined, and we find that the marginal empirical likelihood ratio evaluated at zero can be used to differentiate whether an explanatory vari- able is contributing to a response variable or not. Based on this finding, we propose a unified feature screening procedure for linear models and the gen- eralized linear models. Different from most existing feature screening ap- proaches that rely on the magnitudes of some marginal estimators to identify true signals, the proposed screening approach is capable of further incorpo- rating the level of uncertainties of such estimators. Such a merit inherits the self-studentization property of the empirical likelihood approach, and extends the insights of existing feature screening methods. Moreover, we show that our screening approach is less restrictive to distributional assumptions, and can be conveniently adapted to be applied in a broad range of scenarios such as models specified using general moment conditions. Our theoretical results and extensive numerical examples by simulations and data analysis demon- strate the merits of the marginal empirical likelihood approach.

Read More »

Chang, J., & Chen, S. X. (2011). On the approximate maximum likelihood estimation for diffusion processes. Annals of statistics, 39, 2820-2851.

The transition density of a diffusion process does not admit an explicit expression in general, which prevents the full maximum likelihood estimation (MLE) based on discretely observed sample paths. Aït-Sahalia [J. Finance 54 (1999) 1361–1395; Econometrica 70 (2002) 223–262] proposed asymptotic expansions to the transition densities of diffusion processes, which lead to an approximate maximum likelihood estimation (AMLE) for parameters. Built on Aït-Sahalia’s [Econometrica 70 (2002) 223–262; Ann. Statist. 36 (2008) 906–937] proposal and analysis on the AMLE, we establish the consistency and convergence rate of the AMLE, which reveal the roles played by the number of terms used in the asymptotic density expansions and the sampling interval between successive observations. We find conditions under which the AMLE has the same asymptotic distribution as that of the full MLE. A first order approximation to the Fisher information matrix is proposed.

Read More »