Deduplication and pension schemes

Deduplication is an essential part of data preparation for statistical modelling. The phenomenon of multiple policies per person is a major issue for annuity portfolios, and arises from life companies' policy-orientated view of the world. This makes perfect sense for insurers, of course, since their legal liability is the policy.

My expectation was that it would be less of an issue for pension schemes, which I thought would naturally have a more person-orientated view of their liabilities. However, I recently analysed the mortality of a UK pension scheme with over 38,000 benefit records, of which over 1,300 were clear duplicates. I hadn't reckoned on how often people return to a former employer, thus accruing two separate periods of pensionable service. Many people also meet their spouse at work, so the same person can end up with one record for their main pension benefit and another for a surviving spouse's benefit after the death of their partner.

It looks like deduplication is an essential part of data preparation for pension schemes as well.
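To make the idea concrete, here is a minimal sketch of person-level deduplication in Python. It is an illustration only, not the method used by Longevitas or by the scheme analysed above: the record fields, the choice of matching key (surname, first initial, date of birth, gender and postcode) and the decision to sum pension amounts across a person's records are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BenefitRecord:
    """One benefit record; all field names here are hypothetical."""
    record_id: str
    surname: str
    initial: str
    date_of_birth: date
    gender: str
    postcode: str
    annual_pension: float


def person_key(r: BenefitRecord) -> tuple:
    """Build a normalised matching key (an assumed scheme, not a standard one)."""
    return (
        r.surname.strip().upper(),
        r.initial.strip().upper(),
        r.date_of_birth,
        r.gender.strip().upper(),
        r.postcode.replace(" ", "").upper(),
    )


def deduplicate(records):
    """Group records sharing a key and sum their pensions per person."""
    groups = defaultdict(list)
    for r in records:
        groups[person_key(r)].append(r)
    return [
        {
            "key": key,
            "record_ids": [r.record_id for r in recs],
            "annual_pension": sum(r.annual_pension for r in recs),
        }
        for key, recs in groups.items()
    ]


# Invented data: records A1 and A2 are the same person with two
# separate periods of pensionable service.
records = [
    BenefitRecord("A1", "Smith", "J", date(1940, 5, 1), "F", "EH1 1AA", 4000.0),
    BenefitRecord("A2", "SMITH", "j", date(1940, 5, 1), "f", "eh11aa", 1500.0),
    BenefitRecord("B1", "Jones", "P", date(1938, 2, 9), "M", "G1 2BB", 2500.0),
]
people = deduplicate(records)
```

In this sketch the three benefit records collapse to two people, with the first person's two pensions added together. A real matching key would need to be chosen to suit the fields available and their reliability.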

Deduplication in Longevitas

Longevitas users can control all aspects of deduplication, including switching it off, in the Deduplication tab in the Configuration area. There are ten different deduplication schemes to choose from, depending on what data you have available.
