Publications – KLLAB.org

Journal Papers

📢 View details on Zotero 🎏 Subscribe with RSS

Zhiru Wang, Yu Pang, Hongxia Wang, Feng Li, Martin Skitmore, Tingting Liu and Cheryl Desha (2026). “Impact of Transportation Network Development on Service Resilience: Insights from Beijing”. Journal of Transportation Engineering, Part A: Systems, Vol. 152 (2), pp. 04025134. American Society of Civil Engineers
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Urban rail transit (URT) networks can evolve substantially over time, with an increasing number and complexity of interactions between network components. In such networks, seemingly small disruptions can trigger cascading network service failures, affecting commuter comfort and experience. This study seeks to address the resilience of URT networks to service disruptions by understanding network evolution for resilience planning. Specifically, it develops the Network Resilience Assessment Framework (NRAF) to assess whether URT network structures are inherently resilient to service disruptions over time. The NRAF was applied to Beijing Subway data to evaluate the correlation between network evolution and service resilience. Key network evolution metrics, such as station line degree, clustering, and scaling factors, were analyzed to determine their impact on service resilience. The findings indicate that service resilience is correlated with the network’s evolution. Both a high station line degree and clustering contribute to absorbing the impacts of service disruptions, while lower scaling factors are associated with higher annual average service resilience. The results suggest that promoting URT networking is an effective strategy for mitigating the impact of service disruptions. The study provides evidence that enhancing resilience can be achieved by fostering a more densely connected network and by decentralizing and reducing hierarchical differences at the global network level. The analysis reveals that the evolution of URT networking does not exhibit the characteristics of adverse effects diffusion, as highlighted in some previous studies. Instead, our findings provide a complementary perspective, emphasizing the role of decentralization and enhanced local connectivity in improving network resilience. These findings have immediate positive implications for Beijing URT and other evolving networks worldwide.
BibTeX:
```
@article{WangZ2026ImpactTransportation,
  author = {Wang, Zhiru and Pang, Yu and Wang, Hongxia and Li, Feng and Skitmore, Martin and Liu, Tingting and Desha, Cheryl},
  title = {Impact of Transportation Network Development on Service Resilience: Insights from Beijing},
  journal = {Journal of Transportation Engineering, Part A: Systems},
  publisher = {American Society of Civil Engineers},
  year = {2026},
  volume = {152},
  number = {2},
  pages = {04025134},
  url = {https://ascelibrary.org/doi/10.1061/JTEPBS.TEENG-8850},
  doi = {10.1061/JTEPBS.TEENG-8850}
}
```
Quan Wen, Yanrong Zeng, Fotios Petropoulos and Yanfei Kang (2026). “Coherent Forecasts for Tourism Demand with Automated Immutability Constraints”. Tourism Management, Vol. 113 pp. 105342.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: This study tackles key challenges in tourism demand forecasting within a hierarchical time series framework. To ensure coherence across aggregation levels and improve forecasting performance, we incorporate immutability constraints that preserve forecasts for strategically important nodes. Two automated selection methods are proposed to identify such nodes: (i) a clustering-based approach that ensures dispersion across levels, and (ii) a penalized optimization approach that selects immutable nodes based on data-driven criteria. Through Monte Carlo simulations, and two empirical applications, we demonstrate that the proposed methods improve forecast accuracy, robustness and flexibility while preserving interpretability. The framework is model-agnostic with respect to base forecasts and provides tourism managers with a scalable, data-driven tool to focus on critical segments, improve resource allocation, and support strategic planning in tourism management.
BibTeX:
```
@article{WenQ2026CoherentForecasts,
  author = {Wen, Quan and Zeng, Yanrong and Petropoulos, Fotios and Kang, Yanfei},
  title = {Coherent Forecasts for Tourism Demand with Automated Immutability Constraints},
  journal = {Tourism Management},
  year = {2026},
  volume = {113},
  pages = {105342},
  url = {https://www.sciencedirect.com/science/article/pii/S0261517725002122},
  doi = {10.1016/j.tourman.2025.105342}
}
```
王雯 and 李丰 (2025). “基于分段组合VARX模型的中国出境游客数量预测”. 经济管理学刊, Vol. 4 pp. 255-284.
[Abstract] [BibTeX] [DOI]

Abstract: 本文对结构性变化的旅游需求进行研究,基于带有外生变量的向量自回归(VARX)模型,提出了一种分段组合预测的方法。与既有研究普遍采用的基于完整数据集构建组合预测模型不同,本文创新性地将时间因素纳入组合预测考量,通过将不同时间段的变量视为独立的单元,构建出分段时间序列数据集的组合预测模型。该方法以游客的网络搜索行为作为外生变量用于预测旅游人数,并捕捉这些外生变量在不同时间节点上对旅游人数产生的差异化影响,特别是在新冠疫情等突发冲击下的动态变化。实证结果显示,VARX模型的分段组合在预测中国出境旅游人数时展现出更高的准确性,其预测精度因考虑了外生变量在不同时间段的特异性影响而得以提升。事后分析进一步显示,特别是针对2024年中国出境旅游趋势的外样本预测结果,随着新冠疫情影响的逐渐消退及全球旅游市场的逐步复苏,中国出境旅游人数将呈现积极向上的增长态势。这一结论与现有公开文献中的趋势分析相吻合,进一步印证了本文预测方法的实践应用价值。
BibTeX:
```
@article{WangWen2025VARX,
  author = {王雯 and 李丰},
  title = {基于分段组合VARX模型的中国出境游客数量预测},
  journal = {经济管理学刊},
  year = {2025},
  volume = {4},
  pages = {255--284},
  doi = {10.20180/j.qjem.2025.01.09}
}
```
Christoph Bergmeir, Frits de Nijs, Evgenii Genov, Abishek Sriramulu, Mahdi Abolghasemi, Richard Bean, John Betts, Quang Bui, Nam Trong Dinh, Nils Einecke, Rasul Esmaeilbeigi, Scott Ferraro, Priya Galketiya, Robert Glasgow, Rakshitha Godahewa, Yanfei Kang, Steffen Limmer, Luis Magdalena, Pablo Montero-Manso, Daniel Peralta, Yogesh Pipada Sunil Kumar, Alejandro Rosales-Pérez, Julian Ruddick, Akylas Stratigakos, Peter Stuckey, Guido Tack, Isaac Triguero and Rui Yuan (2025). “Predict+Optimize Problem in Renewable Energy Scheduling”. IEEE Access, Vol. 13 pp. 60064-60087.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Predict+Optimize frameworks integrate forecasting and optimization to address real-world challenges such as renewable energy scheduling, where variability and uncertainty are critical factors. This paper benchmarks solutions from the IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling, focusing on forecasting renewable production and demand and optimizing energy cost. The competition attracted 49 participants in total. The top-ranked method employed stochastic optimization using LightGBM ensembles, and achieved at least a 2% reduction in energy costs compared to deterministic approaches, demonstrating that the most accurate point forecast does not necessarily guarantee the best performance in downstream optimization. The published data and problem setting establish a benchmark for further research into integrated forecasting-optimization methods for energy systems, highlighting the importance of considering forecast uncertainty in optimization models to achieve cost-effective and reliable energy management. The novelty of this work lies in its comprehensive evaluation of Predict+Optimize methodologies applied to a real-world renewable energy scheduling problem, providing insights into the scalability, generalizability, and effectiveness of the proposed solutions. Potential applications extend beyond energy systems to any domain requiring integrated forecasting and optimization, such as supply chain management, transportation planning, and financial portfolio optimization.
BibTeX:
```
@article{BergmeirC2025Predict+OptimizeProblem,
  author = {Bergmeir, Christoph and de Nijs, Frits and Genov, Evgenii and Sriramulu, Abishek and Abolghasemi, Mahdi and Bean, Richard and Betts, John and Bui, Quang and Dinh, Nam Trong and Einecke, Nils and Esmaeilbeigi, Rasul and Ferraro, Scott and Galketiya, Priya and Glasgow, Robert and Godahewa, Rakshitha and Kang, Yanfei and Limmer, Steffen and Magdalena, Luis and Montero-Manso, Pablo and Peralta, Daniel and Kumar, Yogesh Pipada Sunil and Rosales-Pérez, Alejandro and Ruddick, Julian and Stratigakos, Akylas and Stuckey, Peter and Tack, Guido and Triguero, Isaac and Yuan, Rui},
  title = {Predict+Optimize Problem in Renewable Energy Scheduling},
  journal = {IEEE Access},
  year = {2025},
  volume = {13},
  pages = {60064--60087},
  url = {https://ieeexplore.ieee.org/abstract/document/10942601/authors},
  doi = {10.1109/ACCESS.2025.3555393}
}
```
Yuan Gao, Rui Pan, Feng Li, Riquan Zhang and Hansheng Wang (2025). “Grid Point Approximation for Distributed Nonparametric Smoothing and Prediction”. Journal of Computational and Graphical Statistics, Vol. 34 (3), pp. 824-836. ASA Website
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Kernel smoothing is a widely used nonparametric method in modern statistical analysis. The problem of efficiently conducting kernel smoothing for a massive dataset on a distributed system is a problem of great importance. In this work, we find that the popularly used one-shot type estimator is highly inefficient for prediction purposes. To this end, we propose a novel grid point approximation (GPA) method, which has the following advantages. First, the resulting GPA estimator is as statistically efficient as the global estimator under mild conditions. Second, it requires no communication and is extremely efficient in terms of computation for prediction. Third, it is applicable to the case where the data are not randomly distributed across different machines. To select a suitable bandwidth, two novel bandwidth selectors are further developed and theoretically supported. Extensive numerical studies are conducted to corroborate our theoretical findings. Two real data examples are also provided to demonstrate the usefulness of our GPA method. Supplementary materials for this article are available online.
BibTeX:
```
@article{GaoY2025GridPoint,
  author = {Gao, Yuan and Pan, Rui and Li, Feng and Zhang, Riquan and Wang, Hansheng},
  title = {Grid Point Approximation for Distributed Nonparametric Smoothing and Prediction},
  journal = {Journal of Computational and Graphical Statistics},
  publisher = {ASA Website},
  year = {2025},
  volume = {34},
  number = {3},
  pages = {824--836},
  url = {https://doi.org/10.1080/10618600.2024.2409817},
  doi = {10.1080/10618600.2024.2409817}
}
```
Zhe Pan, Haocheng Xian, Feng Li, Ziyao Wang, Zihan Li, Yu Huang, Wenqing Liu, Yiman Li, Fan Li, Jinyuan Wang, Haichao Chen, Yilan Wu, Yueyuan Xu, Gangyue Wu, Yang Zhang, Lvfu He, Jianping Zhang, Fangxia Zhang, Xuehan Qian, Xiuhong Zhang, Lianhong Zhou, Yanqing Feng, Li Li, Xiangui He, Xun Xu, Jin Yang, Xiyuan Zhou, Dan Zhu, Chenwei Pan, Marcus Ang, Seang-Mei Saw, Yingfeng Zheng, Mingguang He, Jost B. Jonas, Neil M. Bressler, Ching-Yu Cheng, Yih-Chung Tham, Chun Zhang, Ya Xing Wang and Tien Yin Wong (2025). “Myopia and High Myopia Trends in Chinese Children and Adolescents over 25 Years: A Nationwide Study with Projections to 2050”. The Lancet Regional Health – Western Pacific, Vol. 59 pp. 101577.
[Abstract] [BibTeX] [DOI]

Abstract: Background The global rise in myopia, particularly in Asia, presents significant public health challenges. Analyzing trends and forecasting impacts are critical for developing strategies to mitigate this burden. Methods We conducted the largest study to date on myopia and high myopia prevalence in Chinese children and adolescents aged 7–18 years, analyzing data from 5,095,256 individuals across 119 studies from 1998 to 2022. Data variability between cycloplegia and non-cycloplegia measurements was addressed using a distance-based model averaging calibration. Aggregated prevalence and age-specific urban-rural trends were estimated using thin plate spline regression, with projections to 2050 derived from time series modeling. Findings Myopia prevalence plateaued in 2006 in urban areas and in 2013 in rural areas, with the urban-rural prevalence gap narrowing since 2015 (urban/rural ratio below 1.3 for all ages). By 2050, myopia prevalence is projected to stabilize at 27.1% (95% CI: 10.0–44.4%) for ages 7–9 and 81.5% (74.7–88.3%) for ages 16–18 in urban areas, and at 20.1% (8.6–31.7%) and 74.1% (63.2–84.8%), respectively, in rural areas. High myopia prevalence among adolescents aged 16–18 is expected to rise from 7.3% in 2001 to 22.1% by 2050. Prevalence correlated significantly with the Human Development Index (P < 0.001). Interpretation Despite stabilization in overall myopia prevalence, the continued rise in high myopia underscores the need for targeted control measures. Projections emphasize the importance of addressing regional disparities and prioritizing public health interventions. Funding National Natural Science Foundation of China (#82271086; #82388101; #72495123).
BibTeX:
```
@article{PanZ2025MyopiaHigh,
  author = {Pan, Zhe and Xian, Haocheng and Li, Feng and Wang, Ziyao and Li, Zihan and Huang, Yu and Liu, Wenqing and Li, Yiman and Li, Fan and Wang, Jinyuan and Chen, Haichao and Wu, Yilan and Xu, Yueyuan and Wu, Gangyue and Zhang, Yang and He, Lvfu and Zhang, Jianping and Zhang, Fangxia and Qian, Xuehan and Zhang, Xiuhong and Zhou, Lianhong and Feng, Yanqing and Li, Li and He, Xiangui and Xu, Xun and Yang, Jin and Zhou, Xiyuan and Zhu, Dan and Pan, Chenwei and Ang, Marcus and Saw, Seang-Mei and Zheng, Yingfeng and He, Mingguang and Jonas, Jost B. and Bressler, Neil M. and Cheng, Ching-Yu and Tham, Yih-Chung and Zhang, Chun and Wang, Ya Xing and Wong, Tien Yin},
  title = {Myopia and High Myopia Trends in Chinese Children and Adolescents over 25 Years: A Nationwide Study with Projections to 2050},
  journal = {The Lancet Regional Health - Western Pacific},
  year = {2025},
  volume = {59},
  pages = {101577},
  doi = {10.1016/j.lanwpc.2025.101577}
}
```
Yiming Zhong, Yinuo Ren, Guangyao Cao, Feng Li and Haobo Qi (2025). “Optimal Starting Point for Time Series Forecasting”. Expert Systems with Applications, Vol. 273 pp. 126798.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Recent advances on time series forecasting mainly focus on improving the forecasting models themselves. However, when the time series data suffer from potential structural breaks or concept drifts, the forecasting performance might be significantly reduced. In this paper, we introduce a novel approach called Optimal Starting Point Time Series Forecast (OSP-TSP) for optimal forecasting, which can be combined with existing time series forecasting models. By adjusting the sequence length via leveraging the XGBoost and LightGBM models, the proposed approach can determine the optimal starting point (OSP) of the time series and then enhance the prediction performances of the base forecasting models. To illustrate the effectiveness of the proposed approach, comprehensive empirical analysis have been conducted on the M4 dataset and other real world datasets. Empirical results indicate that predictions based on the OSP-TSP approach consistently outperform those using the complete time series dataset. Moreover, comparison results reveals that combining our approach with existing forecasting models can achieve better prediction accuracy, which also reflect the advantages of the proposed approach.
BibTeX:
```
@article{ZhongY2025OptimalStarting,
  author = {Zhong, Yiming and Ren, Yinuo and Cao, Guangyao and Li, Feng and Qi, Haobo},
  title = {Optimal Starting Point for Time Series Forecasting},
  journal = {Expert Systems with Applications},
  year = {2025},
  volume = {273},
  pages = {126798},
  url = {https://arxiv.org/abs/2409.16843},
  doi = {10.1016/j.eswa.2025.126798}
}
```
Han Wang, Wen Wang, Feng Li, Yanfei Kang and Han Li (2025). “Catastrophe Duration and Loss Prediction via Natural Language Processing”. Variance, Vol. 18
[Abstract] [BibTeX] [Preprint]

Abstract: Textual information from online news is more timely than insurance claim data during catastrophes, and there is value in using this information to achieve earlier damage estimates. In this paper, we use text-based information to predict the duration and severity of catastrophes. We construct text vectors through Word2Vec and BERT models, using Random Forest, LightGBM, and XGBoost as different learners, all of which show more satisfactory prediction results. This new approach is informative in providing timely warnings of the severity of a catastrophe, which can aid decision-making and support appropriate responses.
BibTeX:
```
@article{WangH2024CatastropheDuration,
  author = {Wang, Han and Wang, Wen and Li, Feng and Kang, Yanfei and Li, Han},
  title = {Catastrophe Duration and Loss Prediction via Natural Language Processing},
  journal = {Variance},
  year = {2025},
  volume = {18},
  url = {https://variancejournal.org/article/133589-catastrophe-duration-and-loss-prediction-via-natural-language-processing}
}
```
Yuqin Huang, Feng Li, Tong Li and Tse-Chun Lin (2024). “Local Information Advantage and Stock Returns: Evidence from Social Media”. Contemporary Accounting Research, Vol. 41 (2), pp. 1089-1119.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: We examine the information asymmetry between local and nonlocal investors with a large dataset of stock message board postings. We document that abnormal relative postings of a firm, i.e., unusual changes in the volume of postings from local versus nonlocal investors, capture locals’ information advantage. This measure positively predicts firms’ short-term stock returns as well as those of peer firms in the same city. Sentiment analysis shows that posting activities primarily reflect good news, potentially due to social transmission bias and short-sales constraints. We identify the information driving return predictability through content-based analysis. Abnormal relative postings also lead analysts’ forecast revisions. Overall, investors’ interactions on social media contain valuable geography-based private information.
BibTeX:
```
@article{HuangY2024LocalInformation,
  author = {Huang, Yuqin and Li, Feng and Li, Tong and Lin, Tse-Chun},
  title = {Local Information Advantage and Stock Returns: Evidence from Social Media},
  journal = {Contemporary Accounting Research},
  year = {2024},
  volume = {41},
  number = {2},
  pages = {1089--1119},
  url = {http://doi.org/10.2139/ssrn.2501937},
  doi = {10.1111/1911-3846.12935}
}
```
Feng Li (2024). “Book Review of Causality: Models, Reasoning, and Inference, Judea Pearl. (Second Edition). (2009)”. International Journal of Forecasting, Vol. 40 (1), pp. 423-425.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: With the big popularity and success of Judea Pearl’s original causality book, this review covers the main topics updated in the second edition in 2009 and illustrates an easy-to-follow causal inference strategy in a forecast scenario. It further discusses some potential benefits and challenges for causal inference with time series forecasting when modeling the counterfactuals, estimating the uncertainty and incorporating prior knowledge to estimate causal effects in different forecasting scenarios.
BibTeX:
```
@article{LiF2024ForecasterReview,
  author = {Li, Feng},
  title = {Book Review of Causality: Models, Reasoning, and Inference, Judea Pearl. (Second Edition). (2009)},
  journal = {International Journal of Forecasting},
  year = {2024},
  volume = {40},
  number = {1},
  pages = {423--425},
  url = {http://arxiv.org/abs/2308.05451},
  doi = {10.1016/j.ijforecast.2023.08.005}
}
```
Bohan Zhang, Anastasios Panagiotelis and Yanfei Kang (2024). “Discrete Forecast Reconciliation”. European Journal of Operational Research,
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: This paper presents a formal framework and proposes algorithms to extend forecast reconciliation to discrete-valued data to extend forecast reconciliation to discrete-valued data, including low counts. A novel method is introduced based on recasting the optimisation of scoring rules as an assignment problem, which is solved using quadratic programming. The proposed framework produces coherent joint probabilistic forecasts for count hierarchical time series. Two discrete reconciliation algorithms are also proposed and compared against generalisations of the top-down and bottom-up approaches for count data. Two simulation experiments and two empirical examples are conducted to validate that the proposed reconciliation algorithms improve forecast accuracy. The empirical applications are forecasting criminal offences in Washington D.C. and product unit sales in the M5 dataset. Compared to benchmarks, the proposed framework shows superior performance in both simulations and empirical studies.
BibTeX:
```
@article{ZhangB2024DiscreteForecast,
  author = {Zhang, Bohan and Panagiotelis, Anastasios and Kang, Yanfei},
  title = {Discrete Forecast Reconciliation},
  journal = {European Journal of Operational Research},
  year = {2024},
  url = {https://www.sciencedirect.com/science/article/pii/S0377221724003801},
  doi = {10.1016/j.ejor.2024.05.024}
}
```
Shengjie Wang, Yanfei Kang and Fotios Petropoulos (2024). “Combining Probabilistic Forecasts of Intermittent Demand”. European Journal of Operational Research, Vol. 315 (3), pp. 1038-1048.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: In recent decades, new methods and approaches have been developed for forecasting intermittent demand series. However, the majority of research has focused on point forecasting, with little exploration into probabilistic intermittent demand forecasting. This is despite the fact that probabilistic forecasting is crucial for effective decision-making under uncertainty and inventory management. Additionally, most literature on this topic has focused solely on forecasting performance and has overlooked the inventory implications, which are directly relevant to intermittent demand. To address these gaps, this study aims to construct probabilistic forecasting combinations for intermittent demand while considering both forecasting accuracy and inventory control utility in obtaining combinations and evaluating forecasts. Our empirical findings demonstrate that combinations perform better than individual approaches for forecasting intermittent demand, but there is a trade-off between forecasting and inventory performance.
BibTeX:
```
@article{WangS2024CombiningProbabilistic,
  author = {Wang, Shengjie and Kang, Yanfei and Petropoulos, Fotios},
  title = {Combining Probabilistic Forecasts of Intermittent Demand},
  journal = {European Journal of Operational Research},
  year = {2024},
  volume = {315},
  number = {3},
  pages = {1038--1048},
  url = {https://arxiv.org/abs/2304.03092},
  doi = {10.1016/j.ejor.2024.01.032}
}
```
Spyros Makridakis, Fotios Petropoulos and Yanfei Kang (2023). “Large Language Models: Their Success and Impact”. Forecasting, Vol. 5 (3), pp. 536-549. Multidisciplinary Digital Publishing Institute
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: ChatGPT, a state-of-the-art large language model (LLM), is revolutionizing the AI field by exhibiting humanlike skills in a range of tasks that include understanding and answering natural language questions, translating languages, writing code, passing professional exams, and even composing poetry, among its other abilities. ChatGPT has gained an immense popularity since its launch, amassing 100 million active monthly users in just two months, thereby establishing itself as the fastest-growing consumer application to date. This paper discusses the reasons for its success as well as the future prospects of similar large language models (LLMs), with an emphasis on their potential impact on forecasting, a specialized and domain-specific field. This is achieved by first comparing the correctness of the answers of the standard ChatGPT and a custom one, trained using published papers from a subfield of forecasting where the answers to the questions asked are known, allowing us to determine their correctness compared to those of the two ChatGPT versions. Then, we also compare the responses of the two versions on how judgmental adjustments to the statistical/ML forecasts should be applied by firms to improve their accuracy. The paper concludes by considering the future of LLMs and their impact on all aspects of our life and work, as well as on the field of forecasting specifically. Finally, the conclusion section is generated by ChatGPT, which was provided with a condensed version of this paper and asked to write a four-paragraph conclusion.
BibTeX:
```
@article{MakridakisS2023LargeLanguage,
  author = {Makridakis, Spyros and Petropoulos, Fotios and Kang, Yanfei},
  title = {Large Language Models: Their Success and Impact},
  journal = {Forecasting},
  publisher = {Multidisciplinary Digital Publishing Institute},
  year = {2023},
  volume = {5},
  number = {3},
  pages = {536--549},
  url = {https://www.mdpi.com/2571-9394/5/3/30},
  doi = {10.3390/forecast5030030}
}
```
Guanyu Zhang, Feng Li and Yanfei Kang (2023). “Probabilistic Forecast Reconciliation with Kullback-Leibler Divergence Regularization”. In 2023 IEEE International Conference on Data Mining Workshops (ICDMW). pp. 601-607.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: As the popularity of hierarchical point forecast reconciliation methods increases, there is a growing interest in probabilistic forecast reconciliation. Many studies have utilized machine learning or deep learning techniques to implement probabilistic forecasting reconciliation and have made notable progress. However, these methods treat the reconciliation step as a fixed and hard post-processing step, leading to a trade-off between accuracy and coherency. In this paper, we propose a new approach for probabilistic forecast reconciliation. Unlike existing approaches, our proposed approach fuses the prediction step and reconciliation step into a deep learning framework, making the reconciliation step more flexible and soft by introducing the Kullback-Leibler divergence regularization term into the loss function. The approach is evaluated using three hierarchical time series datasets, which shows the advantages of our approach over other probabilistic forecast reconciliation methods.
BibTeX:
```
@inproceedings{ZhangG2023ProbabilisticForecast,
  author = {Zhang, Guanyu and Li, Feng and Kang, Yanfei},
  title = {Probabilistic Forecast Reconciliation with Kullback-Leibler Divergence Regularization},
  booktitle = {2023 IEEE International Conference on Data Mining Workshops (ICDMW)},
  year = {2023},
  pages = {601--607},
  url = {https://arxiv.org/abs/2311.12279},
  doi = {10.1109/ICDMW60847.2023.00084}
}
```
Yinuo Ren, Feng Li, Yanfei Kang and Jue Wang (2023). “Infinite Forecast Combinations Based on Dirichlet Process”. In 2023 IEEE International Conference on Data Mining Workshops (ICDMW). pp. 579-587.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecast combination integrates information from various sources by consolidating multiple forecast results from the target time series. Instead of the need to select a single optimal forecasting model, this paper introduces a deep learning ensemble forecasting model based on the Dirichlet process. Initially, the learning rate is sampled with three basis distributions as hyperparameters to convert the infinite mixture into a finite one. All checkpoints are collected to establish a deep learning sub-model pool, and weight adjustment and diversity strategies are developed during the combination process. The main advantage of this method is its ability to generate the required base learners through a single training process, utilizing the decaying strategy to tackle the challenge posed by the stochastic nature of gradient descent in determining the optimal learning rate. To ensure the method’s generalizability and competitiveness, this paper conducts an empirical analysis using the weekly dataset from the M4 competition and explores sensitivity to the number of models to be combined. The results demonstrate that the ensemble model proposed offers substantial improvements in prediction accuracy and stability compared to a single benchmark model.
BibTeX:
```
@inproceedings{RenY2023InfiniteForecast,
  author = {Ren, Yinuo and Li, Feng and Kang, Yanfei and Wang, Jue},
  title = {Infinite Forecast Combinations Based on Dirichlet Process},
  booktitle = {2023 IEEE International Conference on Data Mining Workshops (ICDMW)},
  year = {2023},
  pages = {579--587},
  url = {https://arxiv.org/abs/2311.12379},
  doi = {10.1109/ICDMW60847.2023.00081}
}
```
Li Li, Yanfei Kang, Fotios Petropoulos and Feng Li (2023). “Feature-Based Intermittent Demand Forecast Combinations: Accuracy and Inventory Implications”. International Journal of Production Research, Vol. 61 (22), pp. 7557-7572.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Intermittent demand forecasting is a ubiquitous and challenging problem in production systems and supply chain management. In recent years, there has been a growing focus on developing forecasting approaches for intermittent demand from academic and practical perspectives. However, limited attention has been given to forecast combination methods, which have achieved competitive performance in forecasting fast-moving time series. The current study aims to examine the empirical outcomes of some existing forecast combination methods and propose a generalized feature-based framework for intermittent demand forecasting. The proposed framework has been shown to improve the accuracy of point and quantile forecasts based on two real data sets. Further, some analysis of features, forecasting pools and computational efficiency is also provided. The findings indicate the intelligibility and flexibility of the proposed approach in intermittent demand forecasting and offer insights regarding inventory decisions.
BibTeX:
```
@article{LiL2023FeaturebasedIntermittent,
  author = {Li, Li and Kang, Yanfei and Petropoulos, Fotios and Li, Feng},
  title = {Feature-Based Intermittent Demand Forecast Combinations: Accuracy and Inventory Implications},
  journal = {International Journal of Production Research},
  year = {2023},
  volume = {61},
  number = {22},
  pages = {7557--7572},
  url = {https://arxiv.org/abs/2204.08283},
  doi = {10.1080/00207543.2022.2153941}
}
```
Xiaoqian Wang, Rob J. Hyndman, Feng Li and Yanfei Kang (2023). “Forecast Combinations: An over 50-Year Review”. International Journal of Forecasting, Vol. 39 (4), pp. 1518-1547.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecast combinations have flourished remarkably in the forecasting community and, in recent years, have become part of mainstream forecasting research and activities. Combining multiple forecasts produced for a target time series is now widely used to improve accuracy through the integration of information gleaned from different sources, thereby avoiding the need to identify a single “best” forecast. Combination schemes have evolved from simple combination methods without estimation to sophisticated techniques involving time-varying weights, nonlinear combinations, correlations among components, and cross-learning. They include combining point forecasts and combining probabilistic forecasts. This paper provides an up-to-date review of the extensive literature on forecast combinations and a reference to available open-source software implementations. We discuss the potential and limitations of various methods and highlight how these ideas have developed over time. Some crucial issues concerning the utility of forecast combinations are also surveyed. Finally, we conclude with current research gaps and potential insights for future research.
BibTeX:
```
@article{WangX2023ForecastCombinations,
  author = {Wang, Xiaoqian and Hyndman, Rob J. and Li, Feng and Kang, Yanfei},
  title = {Forecast Combinations: An over 50-Year Review},
  journal = {International Journal of Forecasting},
  year = {2023},
  volume = {39},
  number = {4},
  pages = {1518--1547},
  url = {https://arxiv.org/abs/2205.04216},
  doi = {10.1016/j.ijforecast.2022.11.005}
}
```
Li Li, Yanfei Kang and Feng Li (2023). “Bayesian Forecast Combination Using Time-Varying Features”. International Journal of Forecasting, Vol. 39 (3), pp. 1287-1302.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: In this work, we propose a novel framework for density forecast combination by constructing time-varying weights based on time-varying features. Our framework estimates weights in the forecast combination via Bayesian log predictive scores, in which the optimal forecast combination is determined by time series features from historical information. In particular, we use an automatic Bayesian variable selection method to identify the importance of different features. To this end, our approach has better interpretability compared to other black-box forecasting combination schemes. We apply our framework to stock market data and M3 competition data. Based on our structure, a simple maximum-a-posteriori scheme outperforms benchmark methods, and Bayesian variable selection can further enhance the accuracy for both point forecasts and density forecasts.
BibTeX:
```
@article{LiL2023BayesianForecast,
  author = {Li, Li and Kang, Yanfei and Li, Feng},
  title = {Bayesian Forecast Combination Using Time-Varying Features},
  journal = {International Journal of Forecasting},
  year = {2023},
  volume = {39},
  number = {3},
  pages = {1287--1302},
  url = {https://arxiv.org/abs/2108.02082},
  doi = {10.1016/j.ijforecast.2022.06.002}
}
```
Xiaoqian Wang, Yanfei Kang, Rob J. Hyndman and Feng Li (2023). “Distributed ARIMA Models for Ultra-Long Time Series”. International Journal of Forecasting, Vol. 39 (3), pp. 1163-1184.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Providing forecasts for ultra-long time series plays a vital role in various activities, such as investment decisions, industrial production arrangements, and farm management. This paper develops a novel distributed forecasting framework to tackle the challenges of forecasting ultra-long time series using the industry-standard MapReduce framework. The proposed model combination approach retains the local time dependency. It utilizes a straightforward splitting across samples to facilitate distributed forecasting by combining the local estimators of time series models delivered from worker nodes and minimizing a global loss function. Instead of unrealistically assuming the data generating process (DGP) of an ultra-long time series stays invariant, we only make assumptions on the DGP of subseries spanning shorter time periods. We investigate the performance of the proposed approach with AutoRegressive Integrated Moving Average (ARIMA) models using the real data application as well as numerical simulations. Our approach improves forecasting accuracy and computational efficiency in point forecasts and prediction intervals, especially for longer forecast horizons, compared to directly fitting the whole data with ARIMA models. Moreover, we explore some potential factors that may affect the forecasting performance of our approach.
BibTeX:
```
@article{WangX2023DistributedARIMA,
  author = {Wang, Xiaoqian and Kang, Yanfei and Hyndman, Rob J. and Li, Feng},
  title = {Distributed ARIMA Models for Ultra-Long Time Series},
  journal = {International Journal of Forecasting},
  year = {2023},
  volume = {39},
  number = {3},
  pages = {1163--1184},
  url = {https://arxiv.org/abs/2007.09577},
  doi = {10.1016/j.ijforecast.2022.05.001}
}
```
Bohan Zhang, Yanfei Kang, Anastasios Panagiotelis and Feng Li (2023). “Optimal Reconciliation with Immutable Forecasts”. European Journal of Operational Research, Vol. 308 (1), pp. 650-660.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: The practical importance of coherent forecasts in hierarchical forecasting has inspired many studies on forecast reconciliation. Under this approach, so-called base forecasts are produced for every series in the hierarchy and are subsequently adjusted to be coherent in a second reconciliation step. Reconciliation methods have been shown to improve forecast accuracy, but will, in general, adjust the base forecast of every series. However, in an operational context, it is sometimes necessary or beneficial to keep forecasts of some variables unchanged after forecast reconciliation. In this paper, we formulate reconciliation methodology that keeps forecasts of a pre-specified subset of variables unchanged or “immutable”. In contrast to existing approaches, these immutable forecasts need not all come from the same level of a hierarchy, and our method can also be applied to grouped hierarchies. We prove that our approach preserves unbiasedness in base forecasts. Our method can also account for correlations between base forecasting errors and ensure non-negativity of forecasts. We also perform empirical experiments, including an application to sales of a large scale online retailer, to assess the impacts of our proposed methodology.
BibTeX:
```
@article{ZhangB2023OptimalReconciliation,
  author = {Zhang, Bohan and Kang, Yanfei and Panagiotelis, Anastasios and Li, Feng},
  title = {Optimal Reconciliation with Immutable Forecasts},
  journal = {European Journal of Operational Research},
  year = {2023},
  volume = {308},
  number = {1},
  pages = {650--660},
  url = {http://arxiv.org/abs/2204.09231},
  doi = {10.1016/j.ejor.2022.11.035}
}
```
Yun Bai, Ganglin Tian, Yanfei Kang and Suling Jia (2023). “A Hybrid Ensemble Method with Negative Correlation Learning for Regression”. Machine Learning,
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Hybrid ensemble, an essential branch of ensembles, has flourished in the regression field, with studies confirming diversity’s importance. However, previous ensembles consider diversity in the sub-model training stage, with limited improvement compared to single models. In contrast, this study automatically selects and weights sub-models from a heterogeneous model pool. It solves an optimization problem using an interior-point filtering linear-search algorithm. The objective function innovatively incorporates negative correlation learning as a penalty term, with which a diverse model subset can be selected. The best sub-models from each model class are selected to build the NCL ensemble, which performance is better than the simple average and other state-of-the-art weighting methods. It is also possible to improve the NCL ensemble with a regularization term in the objective function. In practice, it is difficult to conclude the optimal sub-model for a dataset prior due to the model uncertainty. Regardless, our method would achieve comparable accuracy as the potential optimal sub-models. In conclusion, the value of this study lies in its ease of use and effectiveness, allowing the hybrid ensemble to embrace diversity and accuracy.
BibTeX:
```
@article{BaiY2023HybridEnsemble,
  author = {Bai, Yun and Tian, Ganglin and Kang, Yanfei and Jia, Suling},
  title = {A Hybrid Ensemble Method with Negative Correlation Learning for Regression},
  journal = {Machine Learning},
  year = {2023},
  url = {https://arxiv.org/abs/2104.02317},
  doi = {10.1007/s10994-023-06364-3}
}
```
Li Li, Feng Li and Yanfei Kang (2023). “Forecasting Large Collections of Time Series: Feature-Based Methods”. In Forecasting with Artificial Intelligence: Theory and Applications. Cham pp. 251-276. Springer Nature Switzerland
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: In economics and many other forecasting domains, the real world problems are too complex for a single model that assumes a specific data generation process. The forecasting performance of different methods changesChange(s) depending on the nature of the time series. When forecasting large collections of time series, two lines of approaches have been developed using time series features, namely feature-based model selection and feature-based model combination. This chapter discusses the state-of-the-art feature-based methods, with reference to open-source software implementationsImplementation.
BibTeX:
```
@incollection{LiL2023ForecastingLarge,
  author = {Li, Li and Li, Feng and Kang, Yanfei},
  editor = {Hamoudia, Mohsen and Makridakis, Spyros and Spiliotis, Evangelos},
  title = {Forecasting Large Collections of Time Series: Feature-Based Methods},
  booktitle = {Forecasting with Artificial Intelligence: Theory and Applications},
  publisher = {Springer Nature Switzerland},
  year = {2023},
  pages = {251--276},
  url = {http://arxiv.org/abs/2309.13807},
  doi = {10.1007/978-3-031-35879-1_10}
}
```
Rui Pan, Tunan Ren, Baishan Guo, Feng Li, Guodong Li and Hansheng Wang (2022). “A Note on Distributed Quantile Regression by Pilot Sampling and One-Step Updating”. Journal of Business & Economic Statistics, Vol. 40 (4), pp. 1691-1700.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Quantile regression is a method of fundamental importance. How to efficiently conduct quantile regression for a large dataset on a distributed system is of great importance. We show that the popularly used one-shot estimation is statistically inefficient if data are not randomly distributed across different workers. To fix the problem, a novel one-step estimation method is developed with the following nice properties. First, the algorithm is communication efficient. That is the communication cost demanded is practically acceptable. Second, the resulting estimator is statistically efficient. That is its asymptotic covariance is the same as that of the global estimator. Third, the estimator is robust against data distribution. That is its consistency is guaranteed even if data are not randomly distributed across different workers. Numerical experiments are provided to corroborate our findings. A real example is also presented for illustration.
BibTeX:
```
@article{PanR2022NoteDistributed,
  author = {Pan, Rui and Ren, Tunan and Guo, Baishan and Li, Feng and Li, Guodong and Wang, Hansheng},
  title = {A Note on Distributed Quantile Regression by Pilot Sampling and One-Step Updating},
  journal = {Journal of Business &amp; Economic Statistics},
  year = {2022},
  volume = {40},
  number = {4},
  pages = {1691--1700},
  url = {https://www.researchgate.net/publication/354770486},
  doi = {10.1080/07350015.2021.1961789}
}
```
Xiaoqian Wang, Yanfei Kang, Fotios Petropoulos and Feng Li (2022). “The Uncertainty Estimation of Feature-Based Forecast Combinations”. Journal of the Operational Research Society, Vol. 73 (5), pp. 979-993.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecasting is an indispensable element of operational research (OR) and an important aid to planning. The accurate estimation of the forecast uncertainty facilitates several operations management activities, predominantly in supporting decisions in inventory and supply chain management and effectively setting safety stocks. In this paper, we introduce a feature-based framework, which links the relationship between time series features and the interval forecasting performance into providing reliable interval forecasts. We propose an optimal threshold ratio searching algorithm and a new weight determination mechanism for selecting an appropriate subset of models and assigning combination weights for each time series tailored to the observed features. We evaluate our approach using a large set of time series from the M4 competition. Our experiments show that our approach significantly outperforms a wide range of benchmark models, both in terms of point forecasts as well as prediction intervals.
BibTeX:
```
@article{WangX2022UncertaintyEstimation,
  author = {Wang, Xiaoqian and Kang, Yanfei and Petropoulos, Fotios and Li, Feng},
  title = {The Uncertainty Estimation of Feature-Based Forecast Combinations},
  journal = {Journal of the Operational Research Society},
  year = {2022},
  volume = {73},
  number = {5},
  pages = {979--993},
  url = {https://arxiv.org/abs/1908.02891},
  doi = {10.1080/01605682.2021.1880297}
}
```
Zhiru Wang, Yu Pang, Mingxin Gan, Martin Skitmore and Feng Li (2022). “Escalator Accident Mechanism Analysis and Injury Prediction Approaches in Heavy Capacity Metro Rail Transit Stations”. Safety Science, Vol. 154 pp. 105850.
[Abstract] [BibTeX] [DOI]

Abstract: The semi-open character with high passenger flow in Metro Rail Transport Stations (MRTS) makes safety management of human-electromechanical interaction escalator systems more complex. Safety management should not consider only single failures, but also the complex interactions in the system. This study applies task driven behavior theory and system theory to reveal a generic framework of the MRTS escalator accident mechanism and uses Lasso-Logistic Regression (LLR) for escalator injury prediction. Escalator accidents in the Beijing MRTS are used as a case study to estimate the applicability of the methodologies. The main results affirm that the application of System-Theoretical Process Analysis (STPA) and Task Driven Accident Process Analysis (TDAPA) to the generic escalator accident mechanism reveals non-failure state task driven passenger behaviors and constraints on safety that are not addressed in previous studies. The results also confirm that LLR is able to predict escalator accidents where there is a relatively large number of variables with limited observations. Additionally, increasing the amount of data improves the prediction accuracy for all three types of injuries in the case study, suggesting the LLR model has good extrapolation ability. The results can be applied in MRTS as instruments for both escalator accident investigation and accident prevention.
BibTeX:
```
@article{WangZ2022EscalatorAccident,
  author = {Wang, Zhiru and Pang, Yu and Gan, Mingxin and Skitmore, Martin and Li, Feng},
  title = {Escalator Accident Mechanism Analysis and Injury Prediction Approaches in Heavy Capacity Metro Rail Transit Stations},
  journal = {Safety Science},
  year = {2022},
  volume = {154},
  pages = {105850},
  doi = {10.1016/j.ssci.2022.105850}
}
```
Matthias Anderer and Feng Li (2022). “Hierarchical Forecasting with a Top-down Alignment of Independent-Level Forecasts”. International Journal of Forecasting, Vol. 38 (4), pp. 1405-1414.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Hierarchical forecasting with intermittent time series is a challenge in both research and empirical studies. Extensive research focuses on improving the accuracy of each hierarchy, especially the intermittent time series at bottom levels. Then, hierarchical reconciliation can be used to improve the overall performance further. In this paper, we present a hierarchical-forecasting-with-alignment approach that treats the bottom-level forecasts as mutable to ensure higher forecasting accuracy on the upper levels of the hierarchy. We employ a pure deep learning forecasting approach, N-BEATS, for continuous time series at the top levels, and a widely used tree-based algorithm, LightGBM, for intermittent time series at the bottom level. The hierarchical-forecasting-with-alignment approach is a simple yet effective variant of the bottom-up method, accounting for biases that are difficult to observe at the bottom level. It allows suboptimal forecasts at the lower level to retain a higher overall performance. The approach in this empirical study was developed by the first author during the M5 Accuracy competition, ranking second place. The method is also business orientated and can be used to facilitate strategic business planning.
BibTeX:
```
@article{AndererM2022HierarchicalForecasting,
  author = {Anderer, Matthias and Li, Feng},
  title = {Hierarchical Forecasting with a Top-down Alignment of Independent-Level Forecasts},
  journal = {International Journal of Forecasting},
  year = {2022},
  volume = {38},
  number = {4},
  pages = {1405--1414},
  url = {https://arxiv.org/abs/2103.08250},
  doi = {10.1016/j.ijforecast.2021.12.015}
}
```
Fotios Petropoulos, Daniele Apiletti, Vassilios Assimakopoulos, Mohamed Zied Babai, Devon K. Barrow, Souhaib Ben Taieb, Christoph Bergmeir, Ricardo J. Bessa, Jakub Bijak, John E. Boylan, Jethro Browell, Claudio Carnevale, Jennifer L. Castle, Pasquale Cirillo, Michael P. Clements, Clara Cordeiro, Fernando Luiz Cyrino Oliveira, Shari De Baets, Alexander Dokumentov, Joanne Ellison, Piotr Fiszeder, Philip Hans Franses, David T. Frazier, Michael Gilliland, M. Sinan Gönül, Paul Goodwin, Luigi Grossi, Yael Grushka-Cockayne, Mariangela Guidolin, Massimo Guidolin, Ulrich Gunter, Xiaojia Guo, Renato Guseo, Nigel Harvey, David F. Hendry, Ross Hollyman, Tim Januschowski, Jooyoung Jeon, Victor Richmond R. Jose, Yanfei Kang, Anne B. Koehler, Stephan Kolassa, Nikolaos Kourentzes, Sonia Leva, Feng Li, Konstantia Litsiou, Spyros Makridakis, Gael M. Martin, Andrew B. Martinez, Sheik Meeran, Theodore Modis, Konstantinos Nikolopoulos, Dilek Önkal, Alessia Paccagnini, Anastasios Panagiotelis, Ioannis Panapakidis, Jose M. Pavía, Manuela Pedio, Diego J. Pedregal, Pierre Pinson, Patrícia Ramos, David E. Rapach, J. James Reade, Bahman Rostami-Tabar, Michał Rubaszek, Georgios Sermpinis, Han Lin Shang, Evangelos Spiliotis, Aris A. Syntetos, Priyanga Dilini Talagala, Thiyanga S. Talagala, Len Tashman, Dimitrios Thomakos, Thordis Thorarinsdottir, Ezio Todini, Juan Ramón Trapero Arenas, Xiaoqian Wang, Robert L. Winkler, Alisa Yusupova and Florian Ziel (2022). “Forecasting: Theory and Practice”. International Journal of Forecasting, Vol. 38 (3), pp. 705-871.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecasting has always been at the forefront of decision making and planning. The uncertainty that surrounds the future is both exciting and challenging, with individuals and organisations seeking to minimise risks and maximise utilities. The large number of forecasting applications calls for a diverse set of forecasting methods to tackle real-life challenges. This article provides a non-systematic review of the theory and the practice of forecasting. We provide an overview of a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organise, and evaluate forecasts. We then demonstrate how such theoretical concepts are applied in a variety of real-life contexts. We do not claim that this review is an exhaustive list of methods and applications. However, we wish that our encyclopedic presentation will offer a point of reference for the rich work that has been undertaken over the last decades, with some key insights for the future of forecasting theory and practice. Given its encyclopedic nature, the intended mode of reading is non-linear. We offer cross-references to allow the readers to navigate through the various topics. We complement the theoretical concepts and applications covered by large lists of free or open-source software implementations and publicly-available databases.
BibTeX:
```
@article{PetropoulosF2022ForecastingTheory,
  author = {Petropoulos, Fotios and Apiletti, Daniele and Assimakopoulos, Vassilios and Babai, Mohamed Zied and Barrow, Devon K. and Ben Taieb, Souhaib and Bergmeir, Christoph and Bessa, Ricardo J. and Bijak, Jakub and Boylan, John E. and Browell, Jethro and Carnevale, Claudio and Castle, Jennifer L. and Cirillo, Pasquale and Clements, Michael P. and Cordeiro, Clara and Cyrino Oliveira, Fernando Luiz and De Baets, Shari and Dokumentov, Alexander and Ellison, Joanne and Fiszeder, Piotr and Franses, Philip Hans and Frazier, David T. and Gilliland, Michael and Gönül, M. Sinan and Goodwin, Paul and Grossi, Luigi and Grushka-Cockayne, Yael and Guidolin, Mariangela and Guidolin, Massimo and Gunter, Ulrich and Guo, Xiaojia and Guseo, Renato and Harvey, Nigel and Hendry, David F. and Hollyman, Ross and Januschowski, Tim and Jeon, Jooyoung and Jose, Victor Richmond R. and Kang, Yanfei and Koehler, Anne B. and Kolassa, Stephan and Kourentzes, Nikolaos and Leva, Sonia and Li, Feng and Litsiou, Konstantia and Makridakis, Spyros and Martin, Gael M. and Martinez, Andrew B. and Meeran, Sheik and Modis, Theodore and Nikolopoulos, Konstantinos and Önkal, Dilek and Paccagnini, Alessia and Panagiotelis, Anastasios and Panapakidis, Ioannis and Pavía, Jose M. and Pedio, Manuela and Pedregal, Diego J. and Pinson, Pierre and Ramos, Patrícia and Rapach, David E. and Reade, J. James and Rostami-Tabar, Bahman and Rubaszek, Michał and Sermpinis, Georgios and Shang, Han Lin and Spiliotis, Evangelos and Syntetos, Aris A. and Talagala, Priyanga Dilini and Talagala, Thiyanga S. and Tashman, Len and Thomakos, Dimitrios and Thorarinsdottir, Thordis and Todini, Ezio and Trapero Arenas, Juan Ramón and Wang, Xiaoqian and Winkler, Robert L. and Yusupova, Alisa and Ziel, Florian},
  title = {Forecasting: Theory and Practice},
  journal = {International Journal of Forecasting},
  year = {2022},
  volume = {38},
  number = {3},
  pages = {705--871},
  url = {https://arxiv.org/abs/2012.03854},
  doi = {10.1016/j.ijforecast.2021.11.001}
}
```
Xixi Li, Yun Bai and Yanfei Kang (2022). “Exploring the Social Influence of the Kaggle Virtual Community on the M5 Competition”. International Journal of Forecasting, Vol. 38 (4), pp. 1507-1518.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: One of the most significant differences of M5 over previous forecasting competitions is that it was held on Kaggle, an online platform for data scientists and machine learning practitioners. Kaggle provides a gathering place, or virtual community, for web users who are interested in the M5 competition. Users can share code, models, features, and loss functions through online notebooks and discussion forums. Here, we study the social influence of this virtual community on user behavior in the M5 competition. We first research the content of the M5 virtual community by topic modeling and trend analysis. Further, we perform social media analysis to identify the potential relationship network of the virtual community. We study the roles and characteristics of some key participants who promoted the diffusion of information within the M5 virtual community. Overall, this study provides in-depth insights into the mechanism of the virtual community’s influence on the participants and has potential implications for future online competitions.
BibTeX:
```
@article{LiX2022ExploringSocial,
  author = {Li, Xixi and Bai, Yun and Kang, Yanfei},
  title = {Exploring the Social Influence of the Kaggle Virtual Community on the M5 Competition},
  journal = {International Journal of Forecasting},
  year = {2022},
  volume = {38},
  number = {4},
  pages = {1507--1518},
  url = {https://www.sciencedirect.com/science/article/pii/S0169207021001643},
  doi = {10.1016/j.ijforecast.2021.10.001}
}
```
Thiyanga S. Talagala, Feng Li and Yanfei Kang (2022). “FFORMPP: Feature-Based Forecast Model Performance Prediction”. International Journal of Forecasting, Vol. 38 (3), pp. 920-943.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: This paper introduces a novel meta-learning algorithm for time series forecast model performance prediction. We model the forecast error as a function of time series features calculated from historical time series with an efficient Bayesian multivariate surface regression approach. The minimum predicted forecast error is then used to identify an individual model or a combination of models to produce the final forecasts. It is well known that the performance of most meta-learning models depends on the representativeness of the reference dataset used for training. In such circumstances, we augment the reference dataset with a feature-based time series simulation approach, namely GRATIS, to generate a rich and representative time series collection. The proposed framework is tested using the M4 competition data and is compared against commonly used forecasting approaches. Our approach provides comparable performance to other model selection and combination approaches but at a lower computational cost and a higher degree of interpretability, which is important for supporting decisions. We also provide useful insights regarding which forecasting models are expected to work better for particular types of time series, the intrinsic mechanisms of the meta-learners, and how the forecasting performance is affected by various factors.
BibTeX:
```
@article{TalagalaTS2022FFORMPPFeaturebased,
  author = {Talagala, Thiyanga S. and Li, Feng and Kang, Yanfei},
  title = {FFORMPP: Feature-Based Forecast Model Performance Prediction},
  journal = {International Journal of Forecasting},
  year = {2022},
  volume = {38},
  number = {3},
  pages = {920--943},
  url = {https://arxiv.org/abs/1908.11500},
  doi = {10.1016/j.ijforecast.2021.07.002}
}
```
Yanfei Kang, Wei Cao, Fotios Petropoulos and Feng Li (2022). “Forecast with Forecasts: Diversity Matters”. European Journal of Operational Research, Vol. 301 (1), pp. 180-190.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecast combinations have been widely applied in the last few decades to improve forecasting. Estimating optimal weights that can outperform simple averages is not always an easy task. In recent years, the idea of using time series features for forecast combinations has flourished. Although this idea has been proved to be beneficial in several forecasting competitions, it may not be practical in many situations. For example, the task of selecting appropriate features to build forecasting models is often challenging. Even if there was an acceptable way to define the features, existing features are estimated based on the historical patterns, which are likely to change in the future. Other times, the estimation of the features is infeasible due to limited historical data. In this work, we suggest a change of focus from the historical data to the produced forecasts to extract features. We use out-of-sample forecasts to obtain weights for forecast combinations by amplifying the diversity of the pool of methods being combined. A rich set of time series is used to evaluate the performance of the proposed method. Experimental results show that our diversity-based forecast combination framework not only simplifies the modeling process but also achieves superior forecasting performance in terms of both point forecasts and prediction intervals. The value of our proposition lies on its simplicity, transparency, and computational efficiency, elements that are important from both an optimization and a decision analysis perspective.
BibTeX:
```
@article{KangY2022ForecastForecasts,
  author = {Kang, Yanfei and Cao, Wei and Petropoulos, Fotios and Li, Feng},
  title = {Forecast with Forecasts: Diversity Matters},
  journal = {European Journal of Operational Research},
  year = {2022},
  volume = {301},
  number = {1},
  pages = {180--190},
  url = {https://arxiv.org/abs/2012.01643},
  doi = {10.1016/j.ejor.2021.10.024}
}
```

Xixi Li, Fotios Petropoulos and Yanfei Kang (2022). “Improving Forecasting by Subsampling Seasonal Time Series”. International Journal of Production Research, pp. 1-17. Taylor & Francis

[BibTeX]

BibTeX:

@article{LiX2022ImprovingForecasting,
  author = {Li, Xixi and Petropoulos, Fotios and Kang, Yanfei},
  title = {Improving Forecasting by Subsampling Seasonal Time Series},
  journal = {International Journal of Production Research},
  publisher = {Taylor &amp; Francis},
  year = {2022},
  pages = {1--17}
}

Xuening Zhu, Feng Li and Hansheng Wang (2021). “Least-Square Approximation for a Distributed System”. Journal of Computational and Graphical Statistics, Vol. 30 (4), pp. 1004-1018.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: In this work, we develop a distributed least-square approximation (DLSA) method that is able to solve a large family of regression problems (e.g., linear regression, logistic regression, and Cox’s model) on a distributed system. By approximating the local objective function using a local quadratic form, we are able to obtain a combined estimator by taking a weighted average of local estimators. The resulting estimator is proved to be statistically as efficient as the global estimator. Moreover, it requires only one round of communication. We further conduct a shrinkage estimation based on the DLSA estimation using an adaptive Lasso approach. The solution can be easily obtained by using the LARS algorithm on the master node. It is theoretically shown that the resulting estimator possesses the oracle property and is selection consistent by using a newly designed distributed Bayesian information criterion. The finite sample performance and computational efficiency are further illustrated by an extensive numerical study and an airline dataset. The airline dataset is 52 GB in size. The entire methodology has been implemented in Python for a de-facto standard Spark system. The proposed DLSA algorithm on the Spark system takes 26 min to obtain a logistic regression estimator, which is more efficient and memory friendly than conventional methods. Supplementary materials for this article are available online.
BibTeX:
```
@article{ZhuX2021LeastSquareApproximation,
  author = {Zhu, Xuening and Li, Feng and Wang, Hansheng},
  title = {Least-Square Approximation for a Distributed System},
  journal = {Journal of Computational and Graphical Statistics},
  year = {2021},
  volume = {30},
  number = {4},
  pages = {1004--1018},
  url = {https://arxiv.org/abs/1908.04904},
  doi = {10.1080/10618600.2021.1923517}
}
```
Kasun Bandara, Hansika Hewamalage, Yuan-Hao Liu, Yanfei Kang and Christoph Bergmeir (2021). “Improving the Accuracy of Global Forecasting Models Using Time Series Data Augmentation”. Pattern Recognition, Vol. 120 pp. 108148.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Forecasting models that are trained across sets of many time series, known as Global Forecasting Models (GFM), have shown recently promising results in forecasting competitions and real-world applications, outperforming many state-of-the-art univariate forecasting techniques. In most cases, GFMs are implemented using deep neural networks, and in particular Recurrent Neural Networks (RNN), which require a sufficient amount of time series to estimate their numerous model parameters. However, many time series databases have only a limited number of time series. In this study, we propose a novel, data augmentation based forecasting framework that is capable of improving the baseline accuracy of the GFM models in less data-abundant settings. We use three time series augmentation techniques: GRATIS, moving block bootstrap (MBB), and dynamic time warping barycentric averaging (DBA) to synthetically generate a collection of time series. The knowledge acquired from these augmented time series is then transferred to the original dataset using two different approaches: the pooled approach and the transfer learning approach. When building GFMs, in the pooled approach, we train a model on the augmented time series alongside the original time series dataset, whereas in the transfer learning approach, we adapt a pre-trained model to the new dataset. In our evaluation on competition and real-world time series datasets, our proposed variants can significantly improve the baseline accuracy of GFM models and outperform state-of-the-art univariate forecasting methods.
BibTeX:
```
@article{BandaraK2021ImprovingAccuracy,
  author = {Bandara, Kasun and Hewamalage, Hansika and Liu, Yuan-Hao and Kang, Yanfei and Bergmeir, Christoph},
  title = {Improving the Accuracy of Global Forecasting Models Using Time Series Data Augmentation},
  journal = {Pattern Recognition},
  year = {2021},
  volume = {120},
  pages = {108148},
  url = {https://arxiv.org/abs/2008.02663},
  doi = {10.1016/j.patcog.2021.108148}
}
```
Yanfei Kang, Evangelos Spiliotis, Fotios Petropoulos, Nikolaos Athiniotis, Feng Li and Vassilios Assimakopoulos (2021). “Déjà vu: A Data-Centric Forecasting Approach through Time Series Cross-Similarity”. Journal of Business Research, Vol. 132 pp. 719-731.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Accurate forecasts are vital for supporting the decisions of modern companies. Forecasters typically select the most appropriate statistical model for each time series. However, statistical models usually presume some data generation process while making strong assumptions about the errors. In this paper, we present a novel data-centric approach — ‘forecasting with cross-similarity’, which tackles model uncertainty in a model-free manner. Existing similarity-based methods focus on identifying similar patterns within the series, i.e., ‘self-similarity’. In contrast, we propose searching for similar patterns from a reference set, i.e., ‘cross-similarity’. Instead of extrapolating, the future paths of the similar series are aggregated to obtain the forecasts of the target series. Building on the cross-learning concept, our approach allows the application of similarity-based forecasting on series with limited lengths. We evaluate the approach using a rich collection of real data and show that it yields competitive accuracy in both points forecasts and prediction intervals.
BibTeX:
```
@article{KangY2021DejaVu,
  author = {Kang, Yanfei and Spiliotis, Evangelos and Petropoulos, Fotios and Athiniotis, Nikolaos and Li, Feng and Assimakopoulos, Vassilios},
  title = {Déjà vu: A Data-Centric Forecasting Approach through Time Series Cross-Similarity},
  journal = {Journal of Business Research},
  year = {2021},
  volume = {132},
  pages = {719--731},
  url = {https://arxiv.org/abs/1909.00221},
  doi = {10.1016/j.jbusres.2020.10.051}
}
```
Megan G. Janeway, Xiang Zhao, Max Rosenthaler, Yi Zuo, Kumar Balasubramaniyan, Michael Poulson, Miriam Neufeld, Jeffrey J. Siracuse, Courtney E. Takahashi, Lisa Allee, Tracey Dechert, Peter A. Burke, Feng Li and Bindu Kalesan (2021). “Clinical Diagnostic Phenotypes in Hospitalizations Due to Self-Inflicted Firearm Injury”. Journal of Affective Disorders, Vol. 278 pp. 172-180.
[Abstract] [BibTeX] [DOI]

Abstract: Hospitalized self-inflicted firearm injuries have not been extensively studied, particularly regarding clinical diagnoses at the index admission. The objective of this study was to discover the diagnostic phenotypes (DPs) or clusters of hospitalized self-inflicted firearm injuries. Using Nationwide Inpatient Sample data in the US from 1993 to 2014, we used International Classification of Diseases, Ninth Revision codes to identify self-inflicted firearm injuries among those ≥18 years of age. The 25 most frequent diagnostic codes were used to compute a dissimilarity matrix and the optimal number of clusters. We used hierarchical clustering to identify the main DPs. The overall cohort included 14072 hospitalizations, with self-inflicted firearm injuries occurring mainly in those between 16 to 45 years of age, black, with co-occurring tobacco and alcohol use, and mental illness. Out of the three identified DPs, DP1 was the largest (n=10,110), and included most common diagnoses similar to overall cohort, including major depressive disorders (27.7%), hypertension (16.8%), acute post hemorrhagic anemia (16.7%), tobacco (15.7%) and alcohol use (12.6%). DP2 (n=3,725) was not characterized by any of the top 25 ICD-9 diagnoses codes, and included children and peripartum women. DP3, the smallest phenotype (n=237), had high prevalence of depression similar to DP1, and defined by fewer fatal injuries of chest and abdomen. There were three distinct diagnostic phenotypes in hospitalizations due to self-inflicted firearm injuries. Further research is needed to determine how DPs can be used to tailor clinical care and prevention efforts.
BibTeX:
```
@article{JanewayMG2021ClinicalDiagnostic,
  author = {Janeway, Megan G. and Zhao, Xiang and Rosenthaler, Max and Zuo, Yi and Balasubramaniyan, Kumar and Poulson, Michael and Neufeld, Miriam and Siracuse, Jeffrey J. and Takahashi, Courtney E. and Allee, Lisa and Dechert, Tracey and Burke, Peter A. and Li, Feng and Kalesan, Bindu},
  title = {Clinical Diagnostic Phenotypes in Hospitalizations Due to Self-Inflicted Firearm Injury},
  journal = {Journal of Affective Disorders},
  year = {2021},
  volume = {278},
  pages = {172--180},
  doi = {10.1016/j.jad.2020.09.067}
}
```
Evangelos Theodorou, Shengjie Wang, Yanfei Kang, Evangelos Spiliotis, Spyros Makridakis and Vassilios Assimakopoulos (2021). “Exploring the Representativeness of the M5 Competition Data”. International Journal of Forecasting, pp. S0169207021001175.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: The main objective of the M5 competition, which focused on forecasting the hierarchical unit sales of Walmart, was to evaluate the accuracy and uncertainty of forecasting methods in the field to identify best practices and highlight their practical implications. However, can the findings of the M5 competition be generalized and exploited by retail firms to better support their decisions and operation? This depends on the extent to which M5 data is sufficiently similar to unit sales data of retailers operating in different regions selling different product types and considering different marketing strategies. To answer this question, we analyze the characteristics of the M5 time series and compare them with those of two grocery retailers, namely Corporación Favorita and a major Greek supermarket chain, using feature spaces. Our results suggest only minor discrepancies between the examined data sets, supporting the representativeness of the M5 data.
BibTeX:
```
@article{TheodorouE2021ExploringRepresentativeness,
  author = {Theodorou, Evangelos and Wang, Shengjie and Kang, Yanfei and Spiliotis, Evangelos and Makridakis, Spyros and Assimakopoulos, Vassilios},
  title = {Exploring the Representativeness of the M5 Competition Data},
  journal = {International Journal of Forecasting},
  year = {2021},
  pages = {S0169207021001175},
  url = {https://linkinghub.elsevier.com/retrieve/pii/S0169207021001175},
  doi = {10.1016/j.ijforecast.2021.07.006}
}
```
Yitian Chen, Yanfei Kang, Yixiong Chen and Zizhuo Wang (2020). “Probabilistic Forecasting with Temporal Convolutional Neural Network”. Neurocomputing, Vol. 399 pp. 491-501.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: We present a probabilistic forecasting framework based on convolutional neural network (CNN) for multiple related time series forecasting. The framework can be applied to estimate probability density under both parametric and non-parametric settings. More specifically, stacked residual blocks based on dilated causal convolutional nets are constructed to capture the temporal dependencies of the series. Combined with representation learning, our approach is able to learn complex patterns such as seasonality, holiday effects within and across series, and to leverage those patterns for more accurate forecasts, especially when historical data is sparse or unavailable. Extensive empirical studies are performed on several real-world datasets, including datasets from JD.com, China’s largest online retailer. The results show that our framework compares favorably to the state-of-the-art in both point and probabilistic forecasting.
BibTeX:
```
@article{ChenY2020ProbabilisticForecasting,
  author = {Chen, Yitian and Kang, Yanfei and Chen, Yixiong and Wang, Zizhuo},
  title = {Probabilistic Forecasting with Temporal Convolutional Neural Network},
  journal = {Neurocomputing},
  year = {2020},
  volume = {399},
  pages = {491--501},
  url = {https://arxiv.org/abs/1906.04397},
  doi = {10.1016/j.neucom.2020.03.011}
}
```

Bindu Kalesan, Siran Zhao, Michael Poulson, Miriam Neufeld, Tracey Dechert, Jeffrey J. Siracuse, Yi Zuo and Feng Li (2020). “Intersections of Firearm Suicide, Drug-Related Mortality, and Economic Dependency in Rural America”. Journal of Surgical Research, Vol. 256 pp. 96-102. Elsevier

[BibTeX] [DOI]

BibTeX:

@article{KalesanB2020IntersectionsFirearm,
  author = {Kalesan, Bindu and Zhao, Siran and Poulson, Michael and Neufeld, Miriam and Dechert, Tracey and Siracuse, Jeffrey J and Zuo, Yi and Li, Feng},
  title = {Intersections of Firearm Suicide, Drug-Related Mortality, and Economic Dependency in Rural America},
  journal = {Journal of Surgical Research},
  publisher = {Elsevier},
  year = {2020},
  volume = {256},
  pages = {96--102},
  doi = {10.1016/j.jss.2020.06.011}
}

Xixi Li, Yanfei Kang and Feng Li (2020). “Forecasting with Time Series Imaging”. Expert Systems with Applications, Vol. 160 pp. 113680.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Feature-based time series representations have attracted substantial attention in a wide range of time series analysis methods. Recently, the use of time series features for forecast model averaging has been an emerging research focus in the forecasting community. Nonetheless, most of the existing approaches depend on the manual choice of an appropriate set of features. Exploiting machine learning methods to extract features from time series automatically becomes crucial in state-of-the-art time series analysis. In this paper, we introduce an automated approach to extract time series features based on time series imaging. We first transform time series into recurrence plots, from which local features can be extracted using computer vision algorithms. The extracted features are used for forecast model averaging. Our experiments show that forecasting based on automatically extracted features, with less human intervention and a more comprehensive view of the raw time series data, yields highly comparable performances with the best methods in the largest forecasting competition dataset (M4) and outperforms the top methods in the Tourism forecasting competition dataset.
BibTeX:
```
@article{LiX2020ForecastingTime,
  author = {Li, Xixi and Kang, Yanfei and Li, Feng},
  title = {Forecasting with Time Series Imaging},
  journal = {Expert Systems with Applications},
  year = {2020},
  volume = {160},
  pages = {113680},
  url = {https://arxiv.org/abs/1904.08064},
  doi = {10.1016/j.eswa.2020.113680}
}
```
Chengcheng Hao, Feng Li and Dietrich von Rosen (2020). “A Bilinear Reduced Rank Model”. In Contemporary Experimental Design, Multivariate Analysis and Data Mining. Springer Nature
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: This article considers a bilinear model that includes two different latent effects. The first effect has a direct influence on the response variable, whereas the second latent effect is assumed to first influence other latent variables, which in turn affect the response variable. In this article, latent variables are modelled via rank restrictions on unknown mean parameters and the models which are used are often referred to as reduced rank regression models. This article presents a likelihood-based approach that results in explicit estimators. In our model, the latent variables act as covariates that we know exist, but their direct influence is unknown and will therefore not be considered in detail. One example is if we observe hundreds of weather variables, but we cannot say which or how these variables affect plant growth.
BibTeX:
```
@incollection{HaoC2020BilinearReduced,
  author = {Hao, Chengcheng and Li, Feng and von Rosen, Dietrich},
  editor = {Fan, Jianqing and Pan, Jianxin},
  title = {A Bilinear Reduced Rank Model},
  booktitle = {Contemporary Experimental Design, Multivariate Analysis and Data Mining},
  publisher = {Springer Nature},
  year = {2020},
  url = {https://www.researchgate.net/publication/341587390},
  doi = {10.1007/978-3-030-46161-4_21}
}
```
Yanfei Kang, Rob J. Hyndman and Feng Li (2020). “GRATIS: GeneRAting TIme Series with Diverse and Controllable Characteristics”. Statistical Analysis and Data Mining: The ASA Data Science Journal, Vol. 13 (4), pp. 354-376.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: The explosion of time series data in recent years has brought a flourish of new time series analysis methods, for forecasting, clustering, classification and other tasks. The evaluation of these new methods requires either collecting or simulating a diverse set of time series benchmarking data to enable reliable comparisons against alternative approaches. We propose GeneRAting TIme Series with diverse and controllable characteristics, named GRATIS, with the use of mixture autoregressive (MAR) models. We simulate sets of time series using MAR models and investigate the diversity and coverage of the generated time series in a time series feature space. By tuning the parameters of the MAR models, GRATIS is also able to efficiently generate new time series with controllable features. In general, as a costless surrogate to the traditional data collection approach, GRATIS can be used as an evaluation tool for tasks such as time series forecasting and classification. We illustrate the usefulness of our time series generation process through a time series forecasting application.
BibTeX:
```
@article{KangY2020GRATISGeneRAting,
  author = {Kang, Yanfei and Hyndman, Rob J. and Li, Feng},
  title = {GRATIS: GeneRAting TIme Series with Diverse and Controllable Characteristics},
  journal = {Statistical Analysis and Data Mining: The ASA Data Science Journal},
  year = {2020},
  volume = {13},
  number = {4},
  pages = {354--376},
  url = {https://arxiv.org/abs/1903.02787},
  doi = {10.1002/sam.11461}
}
```

康雁飞 and 李丰 (2020). “预测：方法与实践”. 在线出版

[BibTeX] [Preprint]

BibTeX:

@book{kang2020fppcn,
  author = {康雁飞 and 李丰},
  title = {预测：方法与实践},
  publisher = {在线出版},
  year = {2020},
  url = {https://otexts.com/fppcn/}
}

康雁飞 and 李丰 (2020). “统计计算”. 在线出版

[BibTeX] [Preprint]

BibTeX:

@book{kang2020statcompcn,
  author = {康雁飞 and 李丰},
  title = {统计计算},
  publisher = {在线出版},
  year = {2020},
  url = {https://feng.li/files/statscompbook/}
}

Hannah M. Bailey, Yi Zuo, Feng Li, Jae Min, Krishna Vaddiparti, Mattia Prosperi, Jeffrey Fagan, Sandro Galea and Bindu Kalesan (2019). “Changes in Patterns of Mortality Rates and Years of Life Lost Due to Firearms in the United States, 1999 to 2016: A Joinpoint Analysis”. PLOS ONE, Vol. 14 (11), pp. e0225223. Public Library of Science
[Abstract] [BibTeX] [DOI]

Abstract: Background Firearm-related death rates and years of potential life lost (YPLL) vary widely between population subgroups and states. However, changes or inflections in temporal trends within subgroups and states are not fully documented. We assessed temporal patterns and inflections in the rates of firearm deaths and %YPLL due to firearms for overall and by sex, age, race/ethnicity, intent, and states in the United States between 1999 and 2016. Methods We extracted age-adjusted firearm mortality and YPLL rates per 100,000, and %YPLL from 1999 to 2016 by using the WONDER (Wide-ranging Online Data for Epidemiologic Research) database. We used Joinpoint Regression to assess temporal trends, the inflection points, and annual percentage change (APC) from 1999 to 2016. Results National firearm mortality rates were 10.3 and 11.8 per 100,000 in 1999 and 2016, with two distinct segments; a plateau until 2014 followed by an increase of APC = 7.2% (95% CI 3.1, 11.4). YPLL rates were from 304.7 and 338.2 in 1999 and 2016 with a steady APC increase in %YPLL of 0.65% (95% CI 0.43, 0.87) from 1999 to an inflection point in 2014, followed by a larger APC in %YPLL of 5.1% (95% CI 0.1, 10.4). The upward trend in firearm mortality and YPLL rates starting in 2014 was observed in subgroups of male, non-Hispanic blacks, Hispanic whites and for firearm assaults. The inflection points for firearm mortality and YPLL rates also varied across states. Conclusions Within the United States, firearm mortality rates and YPLL remained constant between 1999 and 2014 and has been increasing subsequently. There was, however, an increase in firearm mortality rates in several subgroups and individual states earlier than 2014.
BibTeX:
```
@article{BaileyHM2019ChangesPatterns,
  author = {Bailey, Hannah M. and Zuo, Yi and Li, Feng and Min, Jae and Vaddiparti, Krishna and Prosperi, Mattia and Fagan, Jeffrey and Galea, Sandro and Kalesan, Bindu},
  title = {Changes in Patterns of Mortality Rates and Years of Life Lost Due to Firearms in the United States, 1999 to 2016: A Joinpoint Analysis},
  journal = {PLOS ONE},
  publisher = {Public Library of Science},
  year = {2019},
  volume = {14},
  number = {11},
  pages = {e0225223},
  doi = {10.1371/journal.pone.0225223}
}
```
Feng Li and Zhuojing He (2019). “Credit Risk Clustering in a Business Group: Which Matters More, Systematic or Idiosyncratic Risk?”. Cogent Economics & Finance, Vol. 7 (1), pp. 1632528.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Understanding how defaults correlate across firms is a persistent concern in risk management. In this paper, we apply covariate-dependent copula models to assess the dynamic nature of credit risk dependence, which we define as “credit risk clustering”. We also study the driving forces of the credit risk clustering in CEC business group in China. Our empirical analysis shows that the credit risk clustering varies over time and exhibits different patterns across firm pairs in a business group. We also investigate the impacts of systematic and idiosyncratic factors on credit risk clustering. We find that the impacts of the money supply and the short-term interest rates are positive, whereas the impacts of exchange rates are negative. The roles of the CPI on credit risk clustering are ambiguous. Idiosyncratic factors are vital for predicting credit risk clustering. From a policy perspective, our results not only strengthen the results of previous research but also provide a possible approach to model and predict the extreme co-movement of credit risk in business groups with financial indicators.
BibTeX:
```
@article{LiF2019CreditRisk,
  author = {Li, Feng and He, Zhuojing},
  editor = {McMillan, David},
  title = {Credit Risk Clustering in a Business Group: Which Matters More, Systematic or Idiosyncratic Risk?},
  journal = {Cogent Economics &amp; Finance},
  year = {2019},
  volume = {7},
  number = {1},
  pages = {1632528},
  url = {http://doi.org/10.2139/ssrn.3182925},
  doi = {10.1080/23322039.2019.1632528}
}
```
Elizabeth C. Pino, Yi Zuo, Camila Maciel De Olivera, Shruthi Mahalingaiah, Olivia Keiser, Lynn L. Moore, Feng Li, Ramachandran S. Vasan, Barbara E. Corkey and Bindu Kalesan (2018). “Cohort Profile: The MULTI sTUdy Diabetes rEsearch (MULTITUDE) Consortium”. BMJ Open, Vol. 8 (5), pp. e020640.
[Abstract] [BibTeX] [DOI]

Abstract: Purpose Globally, the age-standardised prevalence of type 2 diabetes mellitus (T2DM) has nearly doubled from 1980 to 2014, rising from 4.7% to 8.5% with an estimated 422 million adults living with the chronic disease. The MULTI sTUdy Diabetes rEsearch (MULTITUDE) consortium was recently established to harmonise data from 17 independent cohort studies and clinical trials and to facilitate a better understanding of the determinants, risk factors and outcomes associated with T2DM. Participants Participants range in age from 3 to 88 years at baseline, including both individuals with and without T2DM. MULTITUDE is an individual-level pooled database of demographics, comorbidities, relevant medications, clinical laboratory values, cardiac health measures, and T2DM-associated events and outcomes across 45 US states and the District of Columbia. Findings to date Among the 135 156 ongoing participants included in the consortium, almost 25% (33 421) were diagnosed with T2DM at baseline. The average age of the participants was 54.3, while the average age of participants with diabetes was 64.2. Men (55.3%) and women (44.6%) were almost equally represented across the consortium. Non-whites accounted for 31.6% of the total participants and 40% of those diagnosed with T2DM. Fewer individuals with diabetes reported being regular smokers than their non-diabetic counterparts (40.3% vs 47.4%). Over 85% of those with diabetes were reported as either overweight or obese at baseline, compared with 60.7% of those without T2DM. We observed differences in all-cause mortality, overall and by T2DM status, between cohorts. Future plans Given the wide variation in demographics and all-cause mortality in the cohorts, MULTITUDE consortium will be a unique resource for conducting research to determine: differences in the incidence and progression of T2DM; sequence of events or biomarkers prior to T2DM diagnosis; disease progression from T2DM to disease-related outcomes, complications and premature mortality; and to assess race/ethnicity differences in the above associations.
BibTeX:
```
@article{PinoEC2018CohortProfile,
  author = {Pino, Elizabeth C. and Zuo, Yi and Olivera, Camila Maciel De and Mahalingaiah, Shruthi and Keiser, Olivia and Moore, Lynn L. and Li, Feng and Vasan, Ramachandran S. and Corkey, Barbara E. and Kalesan, Bindu},
  title = {Cohort Profile: The MULTI sTUdy Diabetes rEsearch (MULTITUDE) Consortium},
  journal = {BMJ Open},
  year = {2018},
  volume = {8},
  number = {5},
  pages = {e020640},
  doi = {10.1136/bmjopen-2017-020640}
}
```
Feng Li and Yanfei Kang (2018). “Improving Forecasting Performance Using Covariate-Dependent Copula Models”. International Journal of Forecasting, Vol. 34 (3), pp. 456-476.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Copulas provide an attractive approach to the construction of multivariate distributions with flexible marginal distributions and different forms of dependences. Of particular importance in many areas is the possibility of forecasting the tail-dependences explicitly. Most of the available approaches are only able to estimate tail-dependences and correlations via nuisance parameters, and cannot be used for either interpretation or forecasting. We propose a general Bayesian approach for modeling and forecasting tail-dependences and correlations as explicit functions of covariates, with the aim of improving the copula forecasting performance. The proposed covariate-dependent copula model also allows for Bayesian variable selection from among the covariates of the marginal models, as well as the copula density. The copulas that we study include the Joe-Clayton copula, the Clayton copula, the Gumbel copula and the Student’s t-copula. Posterior inference is carried out using an efficient MCMC simulation method. Our approach is applied to both simulated data and the S&P 100 and S&P 600 stock indices. The forecasting performance of the proposed approach is compared with those of other modeling strategies based on log predictive scores. A value-at-risk evaluation is also performed for the model comparisons.
BibTeX:
```
@article{LiF2018ImprovingForecasting,
  author = {Li, Feng and Kang, Yanfei},
  title = {Improving Forecasting Performance Using Covariate-Dependent Copula Models},
  journal = {International Journal of Forecasting},
  year = {2018},
  volume = {34},
  number = {3},
  pages = {456--476},
  url = {https://arxiv.org/abs/1401.0100},
  doi = {10.1016/j.ijforecast.2018.01.007}
}
```

Yanfei Kang, Rob J. Hyndman and Kate Smith-Miles (2017). “Visualising Forecasting Algorithm Performance Using Time Series Instance Spaces”. International Journal of Forecasting, Vol. 33 (2), pp. 345-358. Elsevier

[BibTeX] [DOI]

BibTeX:

@article{KangY2017VisualisingForecasting,
  author = {Kang, Yanfei and Hyndman, Rob J and Smith-Miles, Kate},
  title = {Visualising Forecasting Algorithm Performance Using Time Series Instance Spaces},
  journal = {International Journal of Forecasting},
  publisher = {Elsevier},
  year = {2017},
  volume = {33},
  number = {2},
  pages = {345--358},
  doi = {10.1016/j.ijforecast.2016.09.004}
}

李丰 (2016). “大数据分布式计算与案例”. 中国人民大学出版社

[BibTeX] [Preprint]

BibTeX:

@book{li2016distributedcn,
  author = {李丰},
  title = {大数据分布式计算与案例},
  publisher = {中国人民大学出版社},
  year = {2016},
  edition = {第一版},
  url = {https://feng.li/files/distcompbook/}
}

Yanfei Kang, Danijel Belušić and Kate Smith-Miles (2015). “Classes of Structures in the Stable Atmospheric Boundary Layer”. Quarterly Journal of the Royal Meteorological Society, Vol. 141 (691), pp. 2057-2069.

[BibTeX] [DOI]

BibTeX:

@article{KangY2015ClassesStructures,
  author = {Kang, Yanfei and Belušić, Danijel and Smith-Miles, Kate},
  title = {Classes of Structures in the Stable Atmospheric Boundary Layer},
  journal = {Quarterly Journal of the Royal Meteorological Society},
  year = {2015},
  volume = {141},
  number = {691},
  pages = {2057--2069},
  doi = {10.1002/qj.2501}
}

Yanfei Kang (2015). “Detection, Classification and Analysis of Events in Turbulence Time Series”. Bulletin of the Australian Mathematical Society, Vol. 91 (3), pp. 521-522.

[BibTeX]

BibTeX:

@article{KangY2015DetectionClassification,
  author = {Kang, Yanfei},
  title = {Detection, Classification and Analysis of Events in Turbulence Time Series},
  journal = {Bulletin of the Australian Mathematical Society},
  year = {2015},
  volume = {91},
  number = {3},
  pages = {521--522}
}

Yanfei Kang (2014). “Detection, Classification and Analysis of Events in Turbulence Time Series”. Thesis at: Monash University.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Time series is the major source of information to study characteristics of the atmospheric boundary layer (ABL), which is frequently dominated by various types of events embedded in the time series with different levels of noise. To understand physical dynamics of the atmospheric turbulence, the individual events need to be detected and studied about their physical and structural characteristics. In spite of the attention that has been given to studying events among the atmospheric sci- ence community, the detection of events still presents a challenge, and thus their characteristics and contributions to the ABL remain poorly understood. Besides the existence of high level noise in turbulence, the main diculty is that many of the events that are responsible for the variability in the atmospheric turbulence time series are previously unknown, especially in the stable ABL. This dissertation develops a new method for detecting and classifying structures from turbulence time series. The main idea of the method is in defining events as time-series subsequences that are significantly different from noise. This switches the focus of the event detection approach towards defining the characteristics of noise, which is in many situations an easier problem than defining a structure. For atmospheric time series, a natural characterization of the noise is red noise, which is a stationary AR(1) process. The proposed method consists of two steps. The first step of the method is event extraction based on noise tests. We perform a noise test on each subsequence extracted from the series using a sliding window. All the subsequences recognized as noise are removed from further analysis, and the events are extracted from the remaining non-noise subsequences. This step does not assume particular geometries or amplitudes of the flow structures. In the second step, the detected structures are classified into groups with similar characteristics. This step groups large numbers of detected events such that it opens a pathway for the detailed study of their characteristics, and helps to gain understanding of events with previously unknown origin. In order to account for the underlying characteristics of the extracted events, a feature-based clustering method is used, which first summarizes each event with its global measures before performing clustering in the feature space. It yields substantially better results than clustering based on raw data of the events. The developed R package TED is tested on artificial time series with different levels of complexity and real world atmospheric turbulence time series. The results on artificial data show that events used to generate the data can be exactly detected and clustered. The method is robust to high levels of noise, which is advantageous regarding very noisy turbulence time series. Application of the method to a well- known real world turbulence dataset demonstrates that the method successfully extracts realistic flow structures, which are in line with previous studies that have examined the underlying physical mechanisms of several isolated events on that dataset. From the application of the method to a more complicated turbulence dataset, about which no published results can be found regarding extraction of unknown events, the proposed method is able to detect and distinguish events with different dynamical characteristics even though the clustering step is only based on statistical measures of characteristics of events from time series.
BibTeX:
```
@thesis{KangY2014DetectionClassification,
  author = {Kang, Yanfei},
  title = {Detection, Classification and Analysis of Events in Turbulence Time Series},
  school = {Monash University},
  year = {2014},
  url = {https://bridges.monash.edu/articles/thesis/Detection_classification_and_analysis_of_events_in_turbulence_time_series/4684273/1},
  doi = {10.4225/03/58ae5d42bf02e}
}
```

Yanfei Kang, Danijel Belušić and Kate Smith-Miles (2014). “Detecting and Classifying Events in Noisy Time Series”. Journal of the Atmospheric Sciences, Vol. 71 (3), pp. 1090-1104.

[BibTeX] [DOI]

BibTeX:

@article{KangY2014DetectingClassifying,
  author = {Kang, Yanfei and Belušić, Danijel and Smith-Miles, Kate},
  title = {Detecting and Classifying Events in Noisy Time Series},
  journal = {Journal of the Atmospheric Sciences},
  year = {2014},
  volume = {71},
  number = {3},
  pages = {1090--1104},
  doi = {10.1175/jas-d-13-0182.1}
}

Yanfei Kang, Danijel Belušić and Kate Smith-Miles (2014). “A Note on the Relationship between Turbulent Coherent Structures and Phase Correlation”. Chaos: An Interdisciplinary Journal of Nonlinear Science, Vol. 24 (2), pp. 023114.

[BibTeX]

BibTeX:

@article{KangY2014NoteRelationship,
  author = {Kang, Yanfei and Belušić, Danijel and Smith-Miles, Kate},
  title = {A Note on the Relationship between Turbulent Coherent Structures and Phase Correlation},
  journal = {Chaos: An Interdisciplinary Journal of Nonlinear Science},
  year = {2014},
  volume = {24},
  number = {2},
  pages = {023114}
}

Feng Li and Mattias Villani (2013). “Efficient Bayesian Multivariate Surface Regression”. Scandinavian Journal of Statistics, Vol. 40 (4), pp. 706-723.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Methods for choosing a fixed set of knot locations in additive spline models are fairly well established in the statistical literature. The curse of dimensionality makes it nontrivial to extend these methods to nonadditive surface models, especially when there are more than a couple of covariates. We propose a multivariate Gaussian surface regression model that combines both additive splines and interactive splines, and a highly efficient Markov chain Monte Carlo algorithm that updates all the knot locations jointly. We use shrinkage prior to avoid overfitting with different estimated shrinkage factors for the additive and surface part of the model, and also different shrinkage parameters for the different response variables. Simulated data and an application to firm leverage data show that the approach is computationally efficient, and that allowing for freely estimated knot locations can offer a substantial improvement in out-of-sample predictive performance.
BibTeX:
```
@article{LiF2013EfficientBayesian,
  author = {Li, Feng and Villani, Mattias},
  title = {Efficient Bayesian Multivariate Surface Regression},
  journal = {Scandinavian Journal of Statistics},
  year = {2013},
  volume = {40},
  number = {4},
  pages = {706--723},
  url = {https://arxiv.org/abs/1110.3689},
  doi = {10.1111/sjos.12022}
}
```

Yanfei Kang, Kate Smith-Miles and Danijel Belušić (2013). “How to Extract Meaningful Shapes from Noisy Time-Series Subsequences?”. In Proceedings of the 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM). pp. 65-72. IEEE

[BibTeX]

BibTeX:

@inproceedings{KangY2013HowExtract,
  author = {Kang, Yanfei and Smith-Miles, Kate and Belušić, Danijel},
  title = {How to Extract Meaningful Shapes from Noisy Time-Series Subsequences?},
  booktitle = {Proceedings of the 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)},
  publisher = {IEEE},
  year = {2013},
  pages = {65--72}
}

Feng Li (2013). “Bayesian Modeling of Conditional Densities”. Thesis at: Department of Statistics, Stockholm University.
[Abstract] [BibTeX] [Preprint]

Abstract: This thesis develops models and associated Bayesian inference methods for flexible univariate and multivariate conditional density estimation. The models are flexible in the sense that they can capture widely differing shapes of the data. The estimation methods are specifically designed to achieve flexibility while still avoiding overfitting. The models are flexible both for a given covariate value, but also across covariate space. A key contribution of this thesis is that it provides general approaches of density estimation with highly efficient Markov chain Monte Carlo methods. The methods are illustrated on several challenging non-linear and non-normal datasets. In the first paper, a general model is proposed for flexibly estimating the density of a continuous response variable conditional on a possibly high-dimensional set of covariates. The model is a finite mixture of asymmetric student-t densities with covariate-dependent mixture weights. The four parameters of the components, the mean, degrees of freedom, scale and skewness, are all modeled as functions of the covariates. The second paper explores how well a smooth mixture of symmetric components can capture skewed data. Simulations and applications on real data show that including covariate-dependent skewness in the components can lead to substantially improved performance on skewed data, often using a much smaller number of components. We also introduce smooth mixtures of gamma and log-normal components to model positively-valued response variables. In the third paper we propose a multivariate Gaussian surface regression model that combines both additive splines and interactive splines, and a highly efficient MCMC algorithm that updates all the multi-dimensional knot locations jointly. We use shrinkage priors to avoid overfitting with different estimated shrinkage factors for the additive and surface part of the model, and also different shrinkage parameters for the different response variables. In the last paper we present a general Bayesian approach for directly modeling dependencies between variables as function of explanatory variables in a flexible copula context. In particular, the Joe-Clayton copula is extended to have covariate-dependent tail dependence and correlations. Posterior inference is carried out using a novel and efficient simulation method. The appendix of the thesis documents the computational implementation details.
BibTeX:
```
@thesis{LiF2013BayesianModeling,
  author = {Li, Feng},
  title = {Bayesian Modeling of Conditional Densities},
  school = {Department of Statistics, Stockholm University},
  year = {2013},
  url = {http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-89426}
}
```

Yanfei Kang (2012). “Real-Time Change Detection in Time Series Based on Growing Feature Quantization”. In Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN). pp. 1-6. IEEE

[BibTeX]

BibTeX:

@inproceedings{KangY2012RealtimeChange,
  author = {Kang, Yanfei},
  title = {Real-Time Change Detection in Time Series Based on Growing Feature Quantization},
  booktitle = {Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN)},
  publisher = {IEEE},
  year = {2012},
  pages = {1--6}
}

Feng Li, Mattias Villani and Robert Kohn (2011). “Modelling Conditional Densities Using Finite Smooth Mixtures”. In Mixtures: Estimation and Applications. pp. 123-144. John Wiley & Sons
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: Smooth mixtures, i.e. mixture models with covariate-dependent mixing weights, are very useful flexible models for conditional densities. Previous work shows that using too simple mixture components for modeling heteroscedastic and/or heavy tailed data can give a poor fit, even with a large number of components. This paper explores how well a smooth mixture of symmetric components can capture skewed data. Simulations and applications on real data show that including covariate-dependent skewness in the components can lead to substantially improved performance on skewed data, often using a much smaller number of components. Furthermore, variable selection is effective in removing unnecessary covariates in the skewness, which means that there is little loss in allowing for skewness in the components when the data are actually symmetric. We also introduce smooth mixtures of gamma and log-normal components to model positively-valued response variables.
BibTeX:
```
@incollection{LiF2011ModellingConditional,
  author = {Li, Feng and Villani, Mattias and Kohn, Robert},
  title = {Modelling Conditional Densities Using Finite Smooth Mixtures},
  booktitle = {Mixtures: Estimation and Applications},
  publisher = {John Wiley &amp; Sons},
  year = {2011},
  pages = {123--144},
  url = {https://archive.riksbank.se/en/Web-archive/Published/Other-reports/Working-Paper-Series/2010/No-245-Modeling-Conditional-Densities-Using-Finite-Smooth-Mixtures/index.html},
  doi = {10.1002/9781119995678.ch6}
}
```
Feng Li, Mattias Villani and Robert Kohn (2010). “Flexible Modeling of Conditional Distributions Using Smooth Mixtures of Asymmetric Student t Densities”. Journal of Statistical Planning and Inference, Vol. 140 (12), pp. 3638-3654.
[Abstract] [BibTeX] [DOI] [Preprint]

Abstract: A general model is proposed for flexibly estimating the density of a continuous response variable conditional on a possibly high-dimensional set of covariates. The model is a finite mixture of asymmetric student t densities with covariate-dependent mixture weights. The four parameters of the components, the mean, degrees of freedom, scale and skewness, are all modeled as functions of the covariates. Inference is Bayesian and the computation is carried out using Markov chain Monte Carlo simulation. To enable model parsimony, a variable selection prior is used in each set of covariates and among the covariates in the mixing weights. The model is used to analyze the distribution of daily stock market returns, and shown to more accurately forecast the distribution of returns than other widely used models for financial data.
BibTeX:
```
@article{LiF2010FlexibleModeling,
  author = {Li, Feng and Villani, Mattias and Kohn, Robert},
  title = {Flexible Modeling of Conditional Distributions Using Smooth Mixtures of Asymmetric Student t Densities},
  journal = {Journal of Statistical Planning and Inference},
  year = {2010},
  volume = {140},
  number = {12},
  pages = {3638--3654},
  url = {https://archive.riksbank.se/en/Web-archive/Published/Other-reports/Working-Paper-Series/2009/No-233-Flexible-Modeling-of-Conditional-Distributions-Using-Smooth-Mixtures-of-Asymmetric-Student-T-Densities/index.html},
  doi = {10.1016/j.jspi.2010.04.031}
}
```

Books

Hyndman, R.J., & Athanasopoulos, G.著. 预测：方法与实践（第2版），康雁飞、李丰（译）https://otexts.com/fppcn/
李丰（2016）大数据分布式计算与案例。中国人民大学出版社。ISBN 9787300230276. [ 第二版在线预览 ]
康雁飞，李丰（2021）统计计算。[ 在线预览版本 ]