Skip to main content
The Actuary: The magazine of the Institute and Faculty of Actuaries - return to the homepage Logo of The Actuary website
  • Search
  • Visit The Actuary Magazine on Facebook
  • Visit The Actuary Magazine on LinkedIn
  • Visit @TheActuaryMag on Twitter
Visit the website of the Institute and Faculty of Actuaries Logo of the Institute and Faculty of Actuaries

Main navigation

  • News
  • Features
    • General Features
    • Interviews
    • Students
    • Opinion
  • Topics
  • Knowledge
    • Business Skills
    • Careers
    • Events
    • Predictions by The Actuary
    • Whitepapers
    • Moody's - Climate Risk Insurers series
    • Webinars
    • Podcasts
  • Jobs
  • IFoA
    • CEO Comment
    • IFoA News
    • People & Social News
    • President Comment
  • Archive
Quick links:
  • Home
  • The Actuary Issues
  • November 2021
General Features

Around the block: learning-driven approach to mortality modelling

Open-access content Wednesday 3rd November 2021

Andrés M Villegas, Salvatory Kessy, Michael Sherris, Dilan SriDaran and Jonathan Ziveyi propose a new statistical learning paradigm for mortality modelling

web_p20_Around-the-block_CREDIT_Nick-Lowndes_Ikon_00017103.jpg

In recent decades, humanity has made significant progress in averting and delaying death. While indicative of tremendous societal progress, this longevity does present economic and social risks to healthcare, social security, private pension systems and life insurance. The main risk, however, is not the increased longevity per se, but the inherent uncertainty over the evolution of future human mortality. Dynamic mortality modelling is therefore a critical tool for actuaries, both in terms of facilitating an understanding of the past, and in providing a basis for future mortality projections.

In this article, we will discuss some of the limitations of current mortality forecasting approaches and introduce alternative data-driven tools for constructing, selecting and combining models. We have made these data-driven tools readily available through open-source code within the R packages StMoMo (github.com/amvillegas/StMoMo) and CoMoMo (github.com/kessysalvatory/CoMoMo).

The limitations of current mortality forecasting approaches

Generally, mortality rates are broken down into various functions of age, time and birth cohorts. These three features respectively aim to capture changes from growing older, systematic longevity improvements over time, and any factors that are unique to specific cohorts of individuals. Generalised age-period-cohort (GAPC) models have traditionally been favoured within actuarial spheres due to their ease of implementation and interpretability, as well as their ability to simulate future outcomes and develop probabilistic forecast intervals. GAPC models encompass a wide variety of models, including the commonly used Lee-Carter (LC) model and the Cairns-Blake-Dowd (CBD) family of models.

In such GAPC models, the force of mortality at age x in year symbols can be represented as

symbols


where the general age shape component of mortality is captured by symbols , systematic longevity improvements across specific ages over time are reflected by symbolsand any effects embedded in an individual’s year of birth are allowed for by symbols .

In theory, an infinite number of models could be produced within this GAPC framework, ranging from simplistic models that attribute an equal mortality improvement factor to all ages, to complex models that fit several different time trends that are specific to certain ages or cohorts. Simpler models are easier to interpret but may not capture more nuanced systematic features within historical data, while overly complex models may over-fit to noise and produce volatile, spurious forecasts.

In an ideal world, we would be able to consider every possible GAPC model so that we can select the optimal model for a given dataset and desired application. However, time and computational constraints mean this is not feasible. Instead, practitioners typically rely on a small (but growing) selection of models with predefined feature specifications, some of which are shown in Table 1.

 block_Table-1.jpg

However, it is perhaps unreasonable to expect that every dataset in the world could be explained by one of just six models. Moreover, the tools available for selecting the ‘optimal’ model – such as the Akaike information criterion and the Bayesian information criterion – are often ad hoc and typically focus on in-sample goodness-of-fit. Such approaches do not consider how well a model may forecast out of sample and into the future, and therefore do not allow users to robustly determine the model’s desired features and degree of complexity. In addition, model selection methods may yield contradicting information, making it difficult to select a single ‘optimal’ model.

Statistical learning tools for mortality modelling

To overcome these limitations, we have been exploring the use of different statistical learning tools in the context of mortality modelling and forecasting. In ‘A Group Regularisation Approach for Constructing Generalised Age-Period-Cohort Mortality Projection Models’ (bit.ly/GroupReg_APC), we propose a framework that uses group regularisation to produce bespoke GAPC models for specific datasets and applications, and in ‘Mortality Forecasting Using Stacked Regression Ensembles’ (bit.ly/MortForecast_Stacked), we propose a stacked regression approach to optimally combine existing GAPC models, rather than constructing bespoke models. Both approaches are underpinned by re-sampling techniques, which aim to determine the optimal degree of complexity for different applications by placing emphasis on how well a statistical model will generalise to unseen data.

dtd

Block cross-validation for model selection

To emphasise forecasting performance over in-sample goodness-of-fit, we propose the use of block cross-validation techniques that are specifically tailored to a user’s forecasting horizon of interest, as demonstrated in Figure 1. Mortality data typically comes in matrix form, with rows and columns denoting unique ages and calendar years respectively. In block cross-validation, this data is iteratively divided into training data, to fit the mortality models, and test data, to evaluate their out-of-sample performances. The test data can take different widths to represent different forecasting horizons. For example, when predicting one-year-ahead out-of-sample mortality rates, test data should be defined as one-year blocks (as in the top row of Figure 1), and it should be defined as three-year blocks when predicting three-years-ahead rates (as in the middle row of Figure 1).

One interesting aspect of our cross-validation approach is that we use data both to the left and right of each test set to train the model, in contrast to the more common rolling window out-of-sample evaluation (as in the bottom row of Figure 1). This allows us to conduct significantly more test sets for a given dataset than if we were restricted to fitting models using only data to the left, since this would place a limit on how far to the left test sets can occur. Moreover, we minimise the portion of data that is left completely unused (the grey dots in Figure 1), thereby extracting as much information as possible and producing more precise estimates of the out-of-sample prediction error.

Group regularisation for model construction

To avoid limiting ourselves to a finite number of GAPC models, we consider the use of regularisation techniques to explore a much richer array of potential features. Conceptually, the idea behind our approach is to first define a large basket of candidate features that could explain different patterns and trends. For example, one simple feature could aim to capture a systematic mortality improvement over time that affects all ages equally. Another, more complex, feature could resemble a ‘put’ option function by fitting a time trend to only ages below a certain value, with the impact of the trend increasing as age decreases. In our example below, we propose 27 unique features; however, our package StMoMo allows users to easily add elements to or remove elements from this basket using their own judgment.

To fit the models, we exploit the fact that GAPC models can be expressed as a generalised linear model of the form

symbols

 

 

where each symbolsrepresent, respectively, the design matrix and vector of coefficients associated with a unique feature in the basket. This allows us to use a group regularisation algorithm to estimate the coefficients of the model. Group regularisation is like traditional ordinary-least squares (OLS) or maximum likelihood estimation (MLE), except that it includes a penalty term symbolsthat biases groups of model coefficients to zero. When symbols, no penalty is applied, and fitting is identical to the OLS or MLE setting. This means that all coefficients attached to the basket of features will be non-zero, yielding a highly complex model that will likely over-fit the data. At the other extreme, when symbols, all coefficients will be shrunk to zero, meaning all features from the basket are ‘removed’ and we are left with an extremely simplistic model.

In our paper, we use the aforementioned cross-validation framework to determine the optimal value of symbols  and, by extension, which features from the basket to include in a model for each dataset and application.

Stacked regression ensembles for model combination

However, cross-validation offers more information than merely identifying which single model or set of features is the best. It also permits the estimation of combinations of models, which can yield more precise out-of-sample forecasts than their individual constituents. Therefore, as an alternative to picking a single best model with the lowest cross-validation error, in ‘Mortality Forecasting Using Stacked Regression Ensembles’ we use the cross-validation predictions to develop a stacked regression ensemble that combines the predictions from multiple individual mortality models.

_Around-the-block_Figure-2_0.jpg

eb_p22_Around-the-block_Figure-3_0.jpg

Stacked regression ensembles combine several diverse individual models into one powerful prediction function via a secondary meta-learning process, whereby the weights assigned to each individual model are optimised to reflect the ability of each model to generalise to new data. Owing to its empirical and theoretical benefits, stacked regression ensembles are often used by winning teams in data science competitions. Stacked regression ensemble methods have also been successfully applied, with improved predictive accuracy, to a range of problems – such as credit risk assessment, forecasting global energy consumption, financial time series forecasting, and prediction of infectious disease epidemics.

In a stacked regression ensemble, the combination weights are viewed as coefficients of a linear regression problem, in which the (log) mortality rates are treated as the dependent variable and the point predictions from individual mortality models are treated as the independent variables. Assuming that there are M models to combine, this is

symbols

 

where symbolsare observed mortality rates and symbolsare the mortality rate predictions from the individual models generated via cross-validation. For each forecast horizon h, the combination weights symbolssymbols can be learned using any supervised statistical learning algorithm, such as lasso regression, ridge regression or elastic-net penalisation.

Demonstration using male mortality data for England and Wales

To demonstrate these tools, we consider the Human Mortality Database’s mortality data for males in England and Wales aged 50-89, for the years 1960-2018.

For the group regularisation framework, we consider a basket of 27 features, which include polynomials up to order 10 and ‘put’ and ‘call’ option functions at ages 55, 60, …, 80, 85.  To search for the optimal level of regularisation, we consider symbolsvalues in the range symbols . As shown in Figure 2, this range of  ​symbols​ defines a regularisation path containing different GAPC mortality models with varying degrees of complexity.

Small values of ​symbols​ correspond to low levels of regularisation and, subsequently, more complex models. For example, to the left of Figure 2 when  ​symbols​ = 0.000123, we start with a model that includes six of the 27 initial features. Besides a general mortality improvement term symbolsand a cohort effect cohort, this also includes terms that capture the idiosyncrasies of mortality trends at different ages via call and put features symbols and symbols. As the value of symbols increases, the additional regularisation eliminates some of the age-period terms, resulting in more parsimonious models. For example, when symbols = 0.001412, we obtain a model that only includes the  symbolsterm capturing the general level of mortality, the symbolsterm capturing the differential mortality improvement of individuals older than age 75, and the cohort effect symbols. 

According to our cross-validation approach, the complex symbols = 0.000123 model is ‘optimal’ for the one-year forecasting horizon, while the simpler symbols = 0.001412 is ‘optimal’ for six-to-10-year forecasting horizons. In general, from our empirical analyses and simulations, we have determined that the longer the forecasting horizon, the higher the value of the ‘optimal’ symbols, the fewer the number of period terms, and the more parsimonious the selected model specification.

In Figure 3 we show the results of applying a stacked regression ensemble to combine the six mortality models in Table 1 for the same dataset. The stacked regression ensemble learns horizon-specific optimal weights for combining these individual mortality models at different forecasting horizons so that the mortality models with stronger predictive strength for a given horizon receive higher weights. The super-learner mortality model for forecasting one-year-ahead rates is given by combining the individual mortality models LC, RH, APC, CBD, M7 and PLAT using their corresponding weights 0.17, 0.18, 0.17, 0.14, 0.18 and 0.17 respectively, with the weights changing to 0.07, 0.44, 0.22, 0.03, 0.13 and 0.11 for the 10-year-ahead forecast. These weights reflect that the RH and APC models generalise well to new unknown future mortality data for this population and so get higher weights that increase with the forecasting horizon. In contrast, the weaker performing LC, CBD, M7 and PLAT models receive corresponding smaller weights that decrease with the forecasting horizon.


A bespoke approach

In our two academic papers, we have carried out exhaustive analyses of the out-of-sample performance of both the group regularisation and the stacked regression approaches, using more than 40 populations from the Human Mortality Database. These empirical analyses have revealed that no predefined model can perform well for every dataset and forecasting horizon, with our bespoke regularised models and model combinations consistently outperforming existing GAPC models. Moreover, our empirical analyses reaffirmed the commonly held heuristic that simpler models outperform their complex counterparts as forecasting horizons increase.

However, we must stress that while we have obtained very strong results across many datasets, we do not claim to provide all the answers or the perfect models. We simply aim to provide alternative data-driven approaches to thinking about mortality modelling through the lens of statistical learning techniques.


View the full PDF here

Andrés Villegas is senior lecturer in risk and actuarial studies at UNSW Sydney, Australia

Salvatory Kessy is PhD student in risk and actuarial studies at UNSW Sydney, Australia

Michael Sherris is professor in risk and actuarial studies at UNSW Sydney, Australia, and chief investigator at CEPAR

Dilan SriDaran is senior consultant at EY

Jonathan Ziveyi is associate professor in risk and actuarial studies at UNSW Sydney, Australia

Image Credit | Ikon

Linked ACT Nov21_Full LR.jpg
This article appeared in our November 2021 issue of The Actuary .
Click here to view this issue

You may also be interested in...

web_p40-41_Caught-on-camera_CREDIT_shutterstock_1231599400.jpg

Caught on camera: how data can help preserve biodiversity

Olga Mierzwa-Sulima, Robin Whytock and Jdrzej wieewski share their experience of building a machine learning algorithm that helps track biodiversity in Gabon’s tropical forests
Wednesday 3rd November 2021
Open-access content
web_p42_Avoiding-the-extremes_CREDIT_GettyImages-1297624297.jpg

Avoiding the extremes: sidestepping mistakes in temperature and mortality modelling

Darryl Brundle and Ashley Kanter on sidestepping mistakes in temperature and mortality modelling
Wednesday 3rd November 2021
Open-access content
web-p24-27-Indepth--Stack-of-Toy-cars--Getty--82862527.jpg

A better fit: presenting an intuitive and flexible non-linear regression model

Andrei Badescu, Tsz Chai Fung, X Sheldon Lin and Spark Tseung present a flexible nonlinear regression model and software for insurance risk classification, ratemaking and reserving
Wednesday 6th October 2021
Open-access content
Cutting the bias: Modelling the impact of Covid-19

Cutting the bias: Modelling the impact of Covid-19

Stephen Richards demonstrates how to model the impact of COVID-19 on portfolio mortality experience
Wednesday 1st December 2021
Open-access content
web_p13_Outward-bound_Sid-Malik_DSF8155_CREDIT_Peter-Searle.jpg

Sid Malik: Outward bound

Sid Malik talks to Dan Georgescu about the wide-ranging and boundary-bridging actuarial roles carried out at the Prudential Regulation Authority, as well as the pressing issues it is currently tackling
Wednesday 3rd November 2021
Open-access content
q

A fairer deal for diabetes

The insurance industry needs to update its understanding of diabetes, say Nicola Oliver and Scott Reid
Wednesday 3rd November 2021
Open-access content

Latest from Modelling/software

EG\

Uneven outcomes: findings on cancer mortality

Ayşe Arık, Andrew Cairns, Erengul Dodd, Adam Shao and George Streftaris share their findings on the impact of socio-economic differences and diagnostic delays on cancer mortality
Wednesday 1st June 2022
Open-access content
dtj

Talking census: making use of data

With the ONS starting to release the data from the 2021 census, Jeremy Keating considers how those working in insurance can make use of it
Wednesday 1st June 2022
Open-access content
hrts

Storm watch: Can IPCC models be used in cat modelling?

Can IPCC projections be used to adjust catastrophe models for climate change? Nigel Winspear and David Maneval investigate, using US hurricanes as an example
Wednesday 1st June 2022
Open-access content

Latest from Life insurance

ytg

Seek cover

When it comes to sustaining your products’ performance in a ‘polycrisis’, customer engagement is key. Marco Spagnuolo outlines how life insurers can weather today’s economic storm
Wednesday 1st March 2023
Open-access content
67

Knock-on effects: the risks of cyber crime for life insurers

Life and health insurers need to consider how cyber risk could potentially impact them, say Visesh Gosrani, Mikhail Norshteyn and Karl Oliver
Wednesday 30th November 2022
Open-access content
tsj

On the write track: using machine learning to predict underwriting decisions

Yafei (Patricia) Wang looks at the use of machine learning to predict underwriting decisions for life and health insurance
Wednesday 3rd August 2022
Open-access content

Latest from Risk & ERM

KV

Liability-driven investments: new landscape

What now for liability-driven investments, after last year’s crash in the market? Pensions experts Rakesh Girdharlal and Moiz Khan say it should lead to a more balanced approach
Wednesday 1st February 2023
Open-access content
cj

Natural capital investing

Chris Howells and Andrew Dreaneen discuss how today’s investments in natural capital profit portfolios as well as the planet and humanity
Wednesday 1st February 2023
Open-access content
bl

'Takaful' models of Islamic insurance

Ethical, varied and a growing market – ‘takaful’ Islamic insurance is worth knowing about, wherever you’re from and whatever your beliefs, says Ali Asghar Bhuriwala
Wednesday 1st February 2023
Open-access content

Latest from General Features

yguk

Is anybody out there?

There’s no point speaking if no one hears you. Effective communication starts with silence – this is the understated art of listening, says Tan Suee Chieh
Thursday 2nd March 2023
Open-access content
ers

By halves

Reducing the pensions gap between men and women is a work in progress – and there’s still a long way to go, with women retiring on 50% less than men, says Alexandra Miles
Thursday 2nd March 2023
Open-access content
web_Question-mark-lightbulbs_credit_iStock-1348235111.png

Figuring it out

Psychologist Wendy Johnson recalls how qualifying as an actuary and running her own consultancy in the US allowed her to overcome shyness and gave her essential skills for life
Wednesday 1st March 2023
Open-access content

Latest from November 2021

web_p5_CEO_Stephen-Mann.jpg

Finding our voice

We want our members to have rewarding and fulfilling careers. There is no set ‘landing zone’ for this, but underpinning our approach is the belief that the best thing we can do is ensure actuaries have a vision for the future, a growth mindset, and the knowledge that we will help them develop the skillsets and learning they need.
Thursday 4th November 2021
Open-access content
Puzzles-iStock-1146577830.jpg

Puzzles November 2021

Puzzles November 2021
Wednesday 3rd November 2021
Open-access content
web_p44_Society-News_Volunteer_CREDIT_Shutterstock_1910041621.jpg

People and society news: November

People and society news: November
Wednesday 3rd November 2021
Open-access content
Share
  • Twitter
  • Facebook
  • Linked in
  • Mail
  • Print

Latest Jobs

Shape the Future of Insurance

London (Central)
£ excellent package
Reference
149090

Senior Pricing Actuary - Life Reinsurance

London (Central)
£ excellent
Reference
149089

Insurance Investment Leadership Opportunities

Flexible / hybrid with 2 days p/w office-based
£ dependent upon experience
Reference
149088
See all jobs »
 
 
 
 

Sign up to our newsletter

News, jobs and updates

Sign up

Subscribe to The Actuary

Receive the print edition straight to your door

Subscribe
Spread-iPad-slantB-june.png

Topics

  • Data Science
  • Investment
  • Risk & ERM
  • Pensions
  • Environment
  • Soft skills
  • General Insurance
  • Regulation Standards
  • Health care
  • Technology
  • Reinsurance
  • Global
  • Life insurance
​
FOLLOW US
The Actuary on LinkedIn
@TheActuaryMag on Twitter
Facebook: The Actuary Magazine
CONTACT US
The Actuary
Tel: (+44) 020 7880 6200
​

IFoA

About IFoA
Become an actuary
IFoA Events
About membership

Information

Privacy Policy
Terms & Conditions
Cookie Policy
Think Green

Get in touch

Contact us
Advertise with us
Subscribe to The Actuary Magazine
Contribute

The Actuary Jobs

Actuarial job search
Pensions jobs
General insurance jobs
Solvency II jobs

© 2023 The Actuary. The Actuary is published on behalf of the Institute and Faculty of Actuaries by Redactive Publishing Limited. All rights reserved. Reproduction of any part is not allowed without written permission.

Redactive Media Group Ltd, 71-75 Shelton Street, London WC2H 9JQ