Expectile Regression with Reciprocal Lassopenalty
Abstract
Expectile regression (ER) has emerged as a valuable alternative to quantile regression, offering robust modeling of the conditional distribution of response variables across diverse fields such as economics, medicine, and ecology. While traditional penalized ER models like SCAD, elastic-net, and adaptive Lasso improve variable selection, they often struggle with achieving optimal sparsity and precision. Existing penalties typically exhibit continuity and symmetry, which may not sufficiently penalize near-zero coefficients in high-dimensional data, limiting their effectiveness in variable selection. This study proposes a novel penalized expectile regression method using the reciprocal Lasso (rLasso) penalty, which introduces discontinuity at the origin and infinite shrinkage as coefficients approach zero. Simulation studies across various settings and error distributions demonstrate that ER-rLasso consistently achieves the lowest root mean square error (RMSE) and false positive rate (FPR), outperforming established ER models. In real-data analysis involving prostate cancer, ER-rLasso showed superior predictive accuracy across all tested expectile levels. The distinct penalization structure of rLasso, being non-symmetric and non-convex, introduces a more aggressive variable selection mechanism that enhances model sparsity and interpretability.
References
S. Yang, "Censored median regression using weighted empirical survival and hazard functions," J. Am. Stat. Assoc., vol. 94, no. 445, pp. 137–145, 1999.
E. Waldmann, F. Sobotka, and T. Kneib, "Bayesian regularisation in geoadditive expectile regression," Stat. Comput., vol. 27, pp. 1539–1553, 2017.
B. Abdous and B. Remillard, "Relating quantiles and expectiles under weighted-symmetry," Ann. Inst. Stat. Math., vol. 47, no. 2, pp. 371–384, 1995.
R. Alhamzawi, K. Yu, V. Vinciotti, and A. Tucker, "Prior elicitation for mixed quantile regression with an allometric model," Environmetrics, vol. 22, no. 7, pp. 911–920, 2011.
P. Artzner, F. Delbaen, J.-M. Eber, and D. Heath, "Coherent measures of risk," Math. Finance, vol. 9, no. 3, pp. 203–228, 1999.
Q. Yao and H. Tong, "Asymmetric least squares regression estimation: a nonparametric approach," J. Nonparametric Stat., vol. 6, no. 2–3, pp. 273–292, 1996.
R. Alhamzawi and H. T. M. Ali, "Bayesian tobit quantile regression with penalty," Commun. Stat.-Simul. Comput., vol. 47, no. 6, pp. 1739–1750, 2018.
R. Alhamzawi and H. Mallick, "Bayesian reciprocal lasso quantile regression," Commun. Stat.-Simul. Comput., vol. 51, no. 11, pp. 6479–6494, 2022.
R. Alhamzawi, "Bayesian model selection in ordinal quantile regression," Comput. Stat. Data Anal., vol. 103, pp. 68–78, 2016.
A. Alcaraz and G. Ciuperca, "Automatic selection by penalized asymmetric lq-norm in a high-dimensional model with grouped variables," Statistics, vol. 57, no. 5, pp. 1202–1238, 2023.
P. C. Austin, J. V. Tu, P. A. Daly, and D. A. Alter, "The use of quantile regression in health care research: a case study examining gender differences in the timeliness of thrombolytic therapy," Stat. Med., vol. 24, no. 5, pp. 791–816, 2005.
L. S. Waltrup, F. Sobotka, T. Kneib, and G. Kauermann, "Expectile and quantile regression—David and Goliath?" Stat. Model., vol. 15, no. 5, pp. 433–456, 2015.
A. Daouia, S. Girard, and G. Stupfler, "Tail expectile process and risk assessment," Bernoulli, vol. 26, no. 1, pp. 531–556, 2020.
F. Bellini and E. D. Bernardino, "Risk management with expectiles," Eur. J. Finance, vol. 23, no. 6, pp. 487–506, 2017.
B. Efron, "Regression percentiles using asymmetric squared error loss," Stat. Sin., vol. 1, no. 1, pp. 93–125, 1991.
A. Daouia, S. Girard, and G. Stupfler, "Estimation of tail risk based on extreme expectiles," J. R. Stat. Soc. Ser. B Stat. Methodol., vol. 80, no. 2, pp. 263–292, 2018.
Q. Song, "An overview of reciprocal l1-regularization for high dimensional regression data," Wiley Interdiscip. Rev. Comput. Stat., vol. 10, no. 1, p. e1416, 2018.
Q. Xu, X. Ding, C. Jiang, K. Yu, and L. Shi, "An elastic-net penalized expectile regression with applications," J. Appl. Stat., vol. 48, no. 12, pp. 2205–2230, 2021.
B. S. Cade, J. W. Terrell, and M. T. Porath, "Estimating fish body condition with quantile regression," North Am. J. Fish. Manag., vol. 28, pp. 349–359, 2008.
P. Artzner, F. Delbaen, J.-M. Eber, and D. Heath, "Coherent measures of risk," Math. Finance, vol. 9, no. 3, pp. 203–228, 1999.
S. K. Schnabel and P. H. Eilers, "An analysis of life expectancy and economic production using expectile frontier zones," Demogr. Res., vol. 21, pp. 109–134, 2009.
B. S. Cade and R. N. Barry, "A gentle introduction to quantile regression for ecologists," Front. Ecol. Environ., vol. 1, pp. 412–420, 2003.
Q. Song and F. Liang, "High-dimensional variable selection with reciprocal l1-regularization," J. Am. Stat. Assoc., vol. 110, no. 512, pp. 1607–1620, 2015.
R. Koenker and G. Bassett, "Regression quantiles," Econom. J. Econom. Soc., vol. 46, no. 1, pp. 33–50, 1978.
M. C. Jones, "Expectiles and m-quantiles are quantiles," Stat. Probab. Lett., vol. 20, no. 2, pp. 149–153, 1994.
Y. Gu and H. Zou, "High-dimensional generalizations of asymmetric least squares regression and their applications," Ann. Stat., vol. 44, no. 6, pp. 2661–2694, 2016.
Y. Li and J. Zhu, "L1-norm quantile regression," J. Comput. Graph. Stat., vol. 17, no. 1, pp. 163–185, 2008.
J. Hartog, P. T. Pereira, and J. A. Vieira, "Changing returns to education in Portugal during the 1980s and early 1990s: OLS and quantile regression estimators," Appl. Econ., vol. 33, no. 8, pp. 1021–1037, 2001.
Y. Huang and J. Chen, "Bayesian quantile regression-based nonlinear mixed-effects joint models for time-to-event and longitudinal data with multiple features," Stat. Med., vol. 35, no. 30, pp. 5666–5685, 2016.
L. H. Hashim et al., "An Application Comparison of Two Poisson Models on Zero Count Data," J. Phys. Conf. Ser., vol. 1818, p. 012165, 2021.
N. Wirsik, F. Otto-Sobotka, and I. Pigeot, "Modeling physical activity data using l0-penalized expectile regression," Biom. J., vol. 61, no. 6, pp. 1371–1384, 2019.
L. H. Hashim et al., "An Application Comparison of Two Negative Binomial Models on Rainfall Count Data," J. Phys. Conf. Ser., vol. 1818, p. 012100, 2021.
L. Liao, C. Park, and H. Choi, "Penalized expectile regression: an alternative to penalized quantile regression," Ann. Inst. Stat. Math., vol. 71, no. 2, pp. 409–438, 2019.
E. Paul and H. Mallick, "Unified reciprocal lasso estimation via least squares approximation," Commun. Stat.-Simul. Comput., pp. 1–11, 2022.
H. Mallick, R. Alhamzawi, E. Paul, and V. Svetnik, "The reciprocal Bayesian lasso," Stat. Med., vol. 40, no. 22, pp. 4830–4849, 2021.
A. B. Rejeb and M. Arfaoui, "Financial market interdependencies: A quantile regression analysis of volatility spillover," Res. Int. Bus. Finance, vol. 36, pp. 140–157, 2016.
X. Liu, Z. Tan, Y. Wu, and Y. Zhou, "The financial risk measurement evar based on dtarch models," Entropy, vol. 25, no. 8, p. 1204, 2023.
B. Melly, "Decomposition of differences in distribution using quantile regression," Labour Econ., vol. 12, no. 4
W. K. Newey and J. L. Powell, "Asymmetric least squares estimation and testing," Econom. J. Econom. Soc., vol. 55, no. 4, pp. 819–847, 1987.
J. Lin, Y. Huang, and S. Ma, "Composite expectile regression with gene-environment interaction," ArXiv Prepr. ArXiv220801461, 2022.
V. Picheny, H. Moss, L. Torossian, and N. Durrande, "Bayesian quantile and expectile optimisation," in Uncertainty in Artificial Intelligence, PMLR, 2022, pp. 1623–1633.
T. A. Stamey et al., "Prostate specific antigen in the diagnosis and treatment of adenocarcinoma of the prostate. II. Radical prostatectomy treated patients," J. Urol., vol. 141, no. 5, pp. 1076–1083, 1989.
Y. Wei, A. Pere, R. Koenker, and X. He, "Quantile regression methods for reference growth charts," Stat. Med., vol. 25, no. 8, pp. 1369–1382, 2006.
M. Shin, A. Bhattacharya, and V. E. Johnson, "Scalable bayesian variable selection using nonlocal prior densities in ultrahigh-dimensional settings," Stat. Sin., vol. 28, no. 2, p. 1053, 2018.
L. Schulze Waltrup and G. Kauermann, "Smooth expectiles for panel data using penalized splines," Stat. Comput., vol. 27, pp. 271–282, 2017.