About Me

I am a final-year Ph.D. student in Econometrics and Statistics at the University of Chicago Booth School of Business, where I am advised by Prof. Dacheng Xiu. Prior to UChicago, I obtained a B.S. in Statistics from the School of the Gifted Young, University of Science and Technology of China in 2020. My research lies in the intersection of econometrics, statistics, and machine learning, with a focus on addressing the methodological and theoretical challenges of applying machine learning techniques in economic contexts.

Email: zshen10@chicagobooth.edu

Download CV

Research Interest

Machine Learning, High Dimensional Statistics, Time-series Econometrics, Financial Econometrics

Working Paper

Deep Autoencoders for Nonlinear Factor Models: Theory and Applications (with Dacheng Xiu)
Working paper

Abstract: Autoencoders are neural networks widely used in unsupervised learning for dimensionality reduction and feature extraction. This paper provides non-asymptotic guarantees for deep autoencoders within a nonlinear factor model, showing they can effectively extract latent components with errors that diminish with increasing dimensionality and sample size. The extracted factors converge to the true latent factors, up to a functional transformation. We extend these results to supervised autoencoders, supporting their use in factor-augmented prediction and structured matrix completion. Finally, we illustrate the practical value of autoencoders in macroeconomic forecasting, asset return prediction, and noise reduction for causal analysis.


Can Machines Learn Weak Signals? (with Dacheng Xiu)
Working paper, winner of the 2024 Bates-White Prize for the Best Paper at SoFiE Annual Conference.

Abstract: In high-dimensional regression scenarios with low signal-to-noise ratios, we assess the predictive performance of several prevalent machine learning algorithms. Theoretical insights show Ridge regression’s superiority in exploiting weak signals, surpassing a zero benchmark. In contrast, Lasso fails to exceed this baseline, indicating its learning limitations. Simulations reveal that Random Forest generally outperforms Gradient Boosted Regression Trees when signals are weak. Moreover, Neural Networks with L_2-regularization excel in capturing nonlinear functions of weak signals. Our empirical analysis across six economic datasets suggests that the weakness of signals, not necessarily the absence of sparsity, may be Lasso’s major limitation in economic predictions.


Recurrent Neural Networks Meet Time Series: A Theoretical Perspective (with Xiao Chen, Yu Chen, and Dacheng Xiu)
Work in progress

Abstract: Recurrent Neural Networks (RNNs) are a class of artificial neural networks designed to model sequential data, including text, speech, and time series. This paper investigates the internal mechanisms of RNNs and provides theoretical guarantees within the framework of time series models. Specifically, we analyze a nonlinear autoregressive and moving-average model and derive a statistical error bound for RNN prediction performance. Our analysis demonstrates that RNNs excel at capturing nonlinear dependencies in noise, offering a distinct advantage over traditional nonparametric methods such as feed-forward neural networks.


Boosting or Bagging? Navigating Weak Signals (with Dacheng Xiu and Mingjie Zhang)
Work in progress

Abstract: This study investigates the predictive performance of Bagging and Boosting, two widely used ensemble techniques, in high-dimensional settings characterized by weak signals. Using a novel theoretical framework, we evaluate and compare their effectiveness. Our analysis demonstrates that Bagging excels at capturing weak signals, consistently outperforming the naive benchmark predictor. In contrast, Boosting often underperforms in these environments. Building on these insights, we propose modifications to the traditional random forest algorithm, enhancing its effectiveness for prediction tasks involving weak signals.

Published Paper

Modeling Tail Index With Autoregressive Conditional Pareto Model (with Yu Chen and Ruxin Shi)
Journal of Business & Economic Statistics, Volume 40, 2022

Abstract: We propose an autoregressive conditional Pareto (AcP) model based on the dynamic peaks over threshold method to model a dynamic tail index in the financial markets. Unlike the score-based approach which is widely used in many articles, we use an exponential function to model the tail index process for its intuitiveness and interpretability. Probabilistic properties of the AcP model and the statistical properties of its parameter estimators of maximum likelihood are studied in this article. Real data are used to show the advantages of AcP, especially, compared to the estimation volatility of GARCH model, the result of AcP is more sensitive to turmoil. The estimated tail index of AcP can accurately reflect the risk of the stock and may even play an early warning role to the turmoil of stock market. We also calculate the tail connectedness based on the estimated tail index of AcP and show that tail connectedness increases during period of turmoil, which is consistent with the result of the score-based approach.