Risk Management and Insurance Review

© Risk Management and Insurance Review, 2018, Vol. 21, No. 3, 389-411 DOI: 10.1111/rmir.12112

FEATURE ARTICLE

A CONCEPTUAL MODEL FOR PRICING HEALTH AND LIFE INSURANCE USING WEARABLE TECHNOLOGY

Michael McCrea
Mark Farrell

ABSTRACT

A health risk score was created to investigate the possibility of using data provided by wearable technology to help predict overall health and mortality, with the ultimate goal of using this score to enhance the pricing of health or life insurance. Subjects were categorized into low-, increased-, and high-risk groups, and after results were adjusted for age and sex, Cox proportional hazards analysis revealed a high level of significance when predicting mortality. High-risk subjects were shown to have a hazard ratio of 2.1 relative to those in the low-risk group, which can be interpreted as an equivalent increase in age of 7.8 years. Our findings help to demonstrate the predictive capabilities of potential new rating factors, measured via wearables, that could feasibly be incorporated into actuarial insurance pricing models. The model also provides an initial step for insurers to begin to consider the incorporation of continuous wearable data into current risk models. With this in mind, an emphasis is placed on the limitations of the study in order to highlight the areas that must be addressed before incorporating aspects of this model within current pricing models.

INTRODUCTION

Much like the disruptions seen in the banking industry over the past decade, emerging technologies are revolutionizing the insurance industry. Traditional insurers are under pressure to innovate existing business models to retain a competitive edge (Hilton, 2017). Data from CB Insights showed funding to start-ups in the newly coined InsurTech industry has risen from $140 million in 2011 to $2.7 billion in 2016, and investment in the sector is expected to continue to grow as new technologies arise (Catlin et al., 2017; Jubraj et al., 2017).

The primary driver of this change has been the increasingly large amounts of personal data available to insurers, which offer the opportunity to predict the risk for each customer and charge them accordingly. Traditionally, in life insurance, underwriting data come from a questionnaire and a medical examination performed by a registered nurse or licensed physician, depending on the coverage amount and the age of the customer. A new potential source of this information is the Internet of Things (IoT), composed of the network of physical objects that can connect to the Internet and communicate with one another. Examples of these include mobile phones, pacemakers, onboard computers in cars, and of most interest to this study, wearable technology. This term, often shortened to just wearables, describes all technology that is worn comfortably on the body or combined with clothing (Tehrani and Michael, 2014). Common wearables include Fitbit, Garmin fitness bands, and the Oura ring. By use of the IoT, insurers have the potential to access huge amounts of real-time data, allowing them to build far more accurate risk profiles concerning the people they insure. Not only can this allow a more personal and fairer method of pricing, but through improved engagement with the customer, risks can be both managed and reduced.

Michael McCrea and Mark Farrell are in Queen’s Management School, Queen’s University Belfast, Riddel Hall, 185 Stranmillis Rd, Belfast BT9 5EE, UK; e-mail: mmccrea11@qub.ac.uk, mark.farrell@qub.ac.uk. The authors would like to thank Anthony Horn, whose invaluable suggestions contributed significantly to the planning of this research.

The overall research aim of this article is to demonstrate the potential that data derived from wearable devices may provide to insurance companies in terms of new rating factors for their pricing models. As such, we develop a conceptual risk model that utilizes data measurable by wearables and can classify a policyholder’s risk relative to the rest of the population. This risk model serves to highlight the potential for insurance companies to incorporate wearable device data in their own health- and life-insurance-related pricing models. As this is to be a preliminary model demonstrating potential and acting as a proof of concept, simplicity will be key in order to retain the model’s generalizability. This is the first study that attempts to create a health risk score comprising solely data that can be collected in a continuous manner by wearable technology.

The rest of the article is outlined as follows. The “Literature Review” section investigates the current state of the insurance industry with respect to the use of wearable technology, and then provides a review of medical literature used in the formation of health risk scores. The “Methodology” section describes the methods used to analyze the data. The “Statistical Analysis—Results” section comprises the results of the analyses and the diagnostic tests performed to confirm the validity of the findings. The “Discussion” section discusses the possible implications of the findings, and follows with an in-depth investigation into the limitations of the analyses performed. The article is concluded with suggestions for possible extensions of the study.

LITERATURE REVIEW

Current State of the Insurance Industry

The insurance industry is well aware of the challenges it is likely to face over the coming years, and so is investing heavily in research and data in order to evolve (Sultan, 2015). Over 63 percent of insurers expect wearables to affect the industry significantly in the next 2 years (Schwartz and Hamilton, 2015). A huge advantage of these pieces of technology is the ability to record and analyze data continuously with minimal interaction.

Already, a number of insurers have begun to incorporate wearables into their products, trialing innovative programs in an attempt to get ahead of their competitors and break into markets of potential customers previously considered uninsurable. A main area of concern for insurers is customers’ willingness to participate in these kinds of programs. Opt-in rates can be as low as 5 percent; however, PwC found that if the wearable was provided for free, over two-thirds of customers or employees would wear the device (Dart, 2015). With this in mind, companies such as John Hancock Financial and MLC have provided Fitbits and smartwatches to customers for free (Becher, 2016). As the main goal at this stage of the process is the collection of data to analyze, they have agreed to lower premiums as an incentive for customers to release their personal data. A more original method to collect data was used by United Health, which used a penalty rather than a reward system to motivate consumers (Dart, 2015). By requiring users to reach fitness targets in order to avoid the purchase cost of the wearable, their system was three times more successful in collecting data.

Other companies have gone even further and have launched programs making direct use of the technology available and the data they receive. One of the first insurers to use wearables in their “Vitality” product was South African company Discovery (Abraham, 2016). Vitality provides rewards such as discounted travel or accommodation if certain activity levels are met. This is profitable as when policyholders become healthier, the expected cost of the risk pool is lowered. Similar programs are offered by other insurers such as MLC in Australia, who provide premium discounts when healthy behaviors are displayed. These products are also being marketed toward large businesses that purchase insurance, as healthier employees have on average greater levels of productivity (Abraham, 2016). Existing wearables used for these sorts of programs include the Fitbit¹ and the Apple Watch.² These products often contain accelerometers to detect movement/sleep/heart rate; data can usually be accessed remotely via smartphone with the relevant GPS tracking data.

The usage of real-time data from wearables draws parallels with insurance telematics programs, which have increasingly gained market share in the automotive industry (Wahlström et al., 2015). GPS can measure metrics such as mileage, speeding, and location, whereas an accelerometer known as the “Black Box” can record instances of hard braking, sharp turns, and sudden acceleration (Iqbal and Lim, 2006). Analysis of these data can help build a more personalized and accurate estimate of the level of risk the policyholder places on the insurer. In addition to this, drivers tend to improve their driving when monitored by telematic devices in order to lower their premiums (Azzopardi and Cortis, 2013); thus, the policies can encourage safer driving behaviors, lowering the expected cost of the risk pool. If wearables can follow automotive telematics and gain a foothold in the insurance industry, they have the potential to become an integral part of many health and life insurance policies.

Health Risk Scores in the Literature

The key question for research is how insurers can transform wearable technology’s raw data into meaningful information that could be used to price their products. Without being able to find a quantifiable link between the measurements and the health of an individual, the data have no value (Abraham, 2016). One possible option to achieve this could be using these data to create a health risk score. The concept of summarizing a patient’s data into a single score is not new in academia. The Framingham Risk Score (Wilson et al., 1998) is used worldwide as an estimate of cardiovascular risk, and the probability of onset of Type 2 diabetes is typically predicted by the Diabetes Risk Score (Lindström and Tuomilehto, 2003).

¹ www.fitbit.com
² www.apple.com/uk/watch/

Although the risk of a specific health condition can be modeled to a reasonable level of accuracy using known causal variables, the formation of a health score using simple metrics to quantify an individual’s overall health and attempt to predict all-cause mortality is a more challenging task. Due to their known strong association with mortality, certain factors such as smoking, alcohol consumption, diet, and physical activity are prevalent in the majority of studies (Knoops et al., 2004; van Dam et al., 2008; Khaw et al., 2008; Gopinath et al., 2010; Kvaavik et al., 2010; Nechuta et al., 2010; van den Brandt, 2011; Hamer et al., 2011; Ding et al., 2015). In health risk scores whereby each good (bad) behavior is assigned a point, higher scores were consistently associated with a decreased (increased) risk of mortality. Some studies went further and considered the individual risk combinations and the possibility of synergistic relationships between the factors (Ding et al., 2015), with smoking and excess alcohol consumption having substantially more effect on mortality when combined. A key objective of many studies was to attempt to discover new factors to incorporate into their risk scores. Nechuta et al. (2010) find waist-hip ratio may be an even stronger predictor of mortality than body mass index (BMI), and include both of these factors in their health risk scores. Ding et al. (2015) incorporate metrics to measure a sedentary lifestyle, finding that both prolonged sitting and unhealthy sleep duration could be used in combination with physical activity levels in a health score.

Methods used to classify diets also vary considerably between studies. The most common way this is performed is by summing up the quantity of fruits and vegetables eaten and using this as a proxy for a healthy diet, but researchers have endeavored to improve this simple method by including a range of different foods eaten (van den Brandt, 2011). A further step was performed by Khaw et al. (2008) who use blood plasma vitamin C concentrations as a proxy, allowing a measured value to be reported rather than a potentially biased and inaccurate self-reported value.

An interesting idea can be drawn from Glei et al. (2014) and Gruenewald et al. (2006), who discuss the notion of using particular biomarkers to represent the functionality of different biological systems. Gruenewald et al. (2006) give suggestions of biomarkers to act as proxies for neurological function, immune activity, cardiovascular function, and metabolic activity. From the perspective of creating a health risk score, ensuring chosen metrics can represent the functionality of all major biological systems could be a route to creating a more complete picture of overall health.

Using walking activity as a metric to predict mortality has had success in both young and elderly populations (Tudor-Locke et al., 2011). Walking is a particularly useful metric; while improving cardiovascular or respiratory health, it can also suggest a conscious decision to lead a healthy lifestyle when done for pleasure. Furthermore, a lack of walking can also be indicative of underlying chronic conditions. Simple measures of walking associated with mortality include distance walked per day (Hakim et al., 1998) or, equivalently, the average number of steps each day (Tudor-Locke et al., 2011). Ganna and Ingelsson (2015) find self-reported walking pace to be one of the strongest lifestyle predictors of mortality, greater even than smoking habits. This measurement could be recorded by wearables using a combination of GPS data and accelerometers with the ability to distinguish between walking, running, and other types of movement.

Because failure of the cardiovascular system is responsible for a large proportion of deaths, it is only natural that many studies have focused on finding ways to measure this risk. Elevated resting heart rate has been shown by many studies to be an independent predictor of both cardiovascular and all-cause mortality (Seccareccia et al., 2001; Jensen et al., 2013; Zhang et al., 2015). This is not the only possible metric, however; blood pressure is shown to be strongly associated with the occurrence of a stroke, and is also highly correlated with all-cause mortality (Georgakis et al., 2017). As technology progresses, heart rate variability, which is typically measured by ECG, will likely become measurable on a continuous basis and shows much promise in being used to predict heart failure (Lucena et al., 2016).

Ding et al. (2015) incorporate sleep duration into their health risk score, yet this is only one way to measure sleep. Wong et al. (2012) find that in addition to duration, sleep quality also had a marked effect on physical well-being in a sample of Chinese students. A connection between poor sleeping patterns and the risk of onset of Type 2 diabetes is found by Cappuccio et al. (2010a); as a long-term illness, this can be very expensive for an insurer.

The list of metrics discussed here is not exhaustive, but merely an indication of how much potential exists for this academic area to be developed. Subject to availability of data, the “Assessment of Health Metrics” section attempts to utilize these possible metrics to create a wearable-focused score.

METHODOLOGY

Study Population

This analysis is based on data from participants of the Health and Lifestyle Survey (HALS) (Cox, 1988), with the target population defined as individuals 18 years and over in England, Wales, and Scotland. The methods and rationale of this study have been reported elsewhere (Cox et al., 1987). In brief, 12,672 addresses were selected randomly from electoral registers, yielding 12,254 suitable households, from each of which one person was randomly chosen. A response rate of 73 percent generated 9,003 in-person interviews, with 82 percent (7,414) agreeing to a further visit from a study nurse to carry out various health measurements. Comparison with the 1981 census showed that the sample was representative of the adult British population (Blaxter, 1987). The current status of the participants (alive or deceased) as of June 2009 was provided by the UK National Health Service (NHS) Central Registry.

Relevant areas of the interviews included lifestyle habits such as alcohol consumption, smoking, physical activity, and sleep duration. Information about previous diagnoses and health history was also recorded at this time. Height, weight, blood pressure, and resting heart rate were measured by a study nurse in the follow-up visit.

Assessment of Health Metrics

The chosen health metrics were included for a number of reasons, but the most important factor was the ability to effectively quantify the information provided by the HALS to separate subjects into healthy and unhealthy groups. The commonly used metrics (alcohol consumption, smoking, and BMI) are health metrics that have been shown in previous literature to have an effect on mortality risk (Khaw et al., 2008; Kuh et al., 2009; Hamer et al., 2011). The metrics measurable by wearables were chosen due to the existence of wearables that can currently record these metrics to a certain level of accuracy. Exercise activity can be detected and distinguished from walking or regular activity through movement sensors and heart rate increases (Comstock, 2015). Sleeping activity can be tracked to various degrees of accuracy through a multitude of devices, including most mobile phones via applications (Mann, 2017). Blood pressure measurement has only begun to enter the wearables market recently; however, as the technology is developed it is likely to become more widespread and available in mainstream devices (Redlitz, 2017). Heart rate is one of the most common health metrics, available in virtually all mainstream fitness wearables including Fitbit, Jawbone, and Apple Watch. Similarly, almost all devices have some form of pedometer measuring steps, walking duration, pace, and distance walked when combined with GPS.

TABLE 1 Health Metric Classifications

Health Metric        Variable  Point Awarded                             Percentage
Alcohol consumption  Al        Intake of > 14 units of alcohol per week  18.3%
Smoking              Sm        Current smoker                            36.1%
BMI                  Bmi       ≥ 30 kg/m² (obese)                        9.5%
Physical activity    Phy       ≤ 120 minutes/week leisure time exercise  76.0%
Sleep duration       Sd        Sleeping < 7 or > 9 hours/day             39.8%
Blood pressure       Bp        Hypertensive reading                      8.6%
Resting heart rate   Rhr       Rate ≥ 90 bpm                             4.7%
Walking duration     Wd        Walking < 20 minutes/day                  18.7%

Poor health metrics were classified as shown in Table 1 using results from previous literature and official bodies (World Health Organization, 1995; World Health Organization and International Society of Hypertension Writing Group, 2003; Leitzmann et al., 2007; Cappuccio et al., 2010b; Kvaavik et al., 2010; Jensen et al., 2013; Department of Health, 2016). The effect of these metrics on subject survival time is assessed by the Cox proportional hazards model.³

³ For further information on the Cox model, see Cox (1972).

STATISTICAL ANALYSIS—RESULTS

Of the 7,414 participants who agreed to a visit from the study nurse, 291 (3.9 percent) had incomplete measurements and had to be excluded from the analysis. A further 238 (3.3 percent) were unable to be categorized in the June 2009 survey due to reasons such as having departed overseas, no longer being registered on the NHS, or simply being unable to be contacted. This left n = 6,885 suitable subjects for the following analyses, out of which 2,160 (31.4 percent) died prior to June 1, 2009. The principal outcome in this study was survival time, measured as the time in years between collection of baseline data until death or date of censorship (June 1, 2009). All statistical tests and analyses were performed using Stata 14.1 (StataCorp, 2015).

TABLE 2 Results of Individual Cox Regressions

Health Metric        Deaths  Coefficient  p-Value  HR (95% CI)
Alcohol consumption  25.9%   0.13         0.04     1.14 (1.01–1.29)
Smoking              33.7%   0.53         0.00     1.69 (1.55–1.85)
BMI                  43.4%   0.28         0.00     1.32 (1.17–1.50)
Physical activity    36.4%   0.33         0.00     1.39 (1.22–1.59)
Sleep duration       39.6%   0.15         0.00     1.16 (1.06–1.26)
Blood pressure       68.1%   0.23         0.00     1.26 (1.13–1.41)
Resting heart rate   45.3%   0.47         0.00     1.60 (1.36–1.90)
Walking duration     40.7%   0.22         0.00     1.25 (1.13–1.38)

A log-rank test was performed to assess the Kaplan–Meier (survival) functions of males and females, with the null hypothesis for the test assuming the estimates for the two sexes are equal. A p-value of 0.00 indicated this was not the case. Accordingly, all Cox proportional hazards regression models were adjusted for sex and age.
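As a rough illustration of the Kaplan–Meier (product-limit) survival estimate that the log-rank test compares between groups, the sketch below computes the curve for a handful of made-up observations. The function name `kaplan_meier` and the toy data are assumptions for illustration only; the study itself used Stata, and nothing here reproduces the HALS data.

```python
def kaplan_meier(times, events):
    """Return [(time, survival)] at each distinct death time.

    times  : follow-up time for each subject
    events : 1 if the subject died at that time, 0 if censored
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all subjects who leave observation at the same time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            # Product-limit step: multiply by the share surviving time t.
            survival *= 1 - deaths / n_at_risk
            curve.append((t, survival))
        n_at_risk -= removed
    return curve


# Hypothetical toy data: five subjects, two of them censored.
toy_curve = kaplan_meier([2, 3, 3, 5, 8], [1, 1, 0, 1, 0])
for t, s in toy_curve:
    print(f"t={t}: S(t)={s:.2f}")
```

Censored subjects lower the number at risk without producing a step in the curve, which is why the estimate uses information from participants whose status was known only up to the censoring date.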

Individual Health Metrics

Adjusted Cox regressions were run for all eight metrics individually, with results displayed in Table 2. The p-values in the table report the significance of the coefficients using the Wald test statistic, where the null hypothesis assumes βi = 0. We can see that all the variables were statistically significant at the 99.5 percent level of confidence except for alcohol consumption, which was significant at the 96 percent level. The hazard ratios show smoking raised the probability of death to the greatest extent, whereas alcohol consumption and abnormal sleep duration had the least effect. It is important to note the nonlinear relationship between the percentage of deaths of those with a poor health metric and the corresponding hazard ratio. A total of 68.1 percent of those with high blood pressure died in the study versus only 33.7 percent of smokers; however, the hazard ratio of smokers was much higher. Adjusting for age or taking into account the survival times after measurement could have marked effects on the coefficients of our model. For reference, the hazard ratios for smoking and high blood pressure after removing the adjustment for age were 1.11 (from 1.69) and 3.37 (from 1.26), respectively. This could suggest high blood pressure may be in part caused by age, be more likely to cause death when the subject is at an older age, or be related to age in another way entirely.
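The link between the columns of Table 2 can be sketched as follows: the hazard ratio is the exponential of the coefficient, and a Wald statistic can be approximately recovered from a reported 95 percent CI. The helper name `wald_from_hazard_ratio` is hypothetical, and the calculation assumes the CI was built as exp(β ± 1.96·SE), which is the usual construction but is not stated in the text.

```python
import math


def wald_from_hazard_ratio(hr, ci_low, ci_high, z_crit=1.96):
    """Approximate coefficient, standard error, Wald z, and two-sided p-value.

    Assumes the interval was computed as exp(beta +/- z_crit * se).
    """
    beta = math.log(hr)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se
    # Two-sided p-value under a standard normal reference distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p


# Alcohol consumption row of Table 2: HR 1.14 (1.01-1.29).
beta, se, z, p = wald_from_hazard_ratio(1.14, 1.01, 1.29)
print(f"beta={beta:.2f}, se={se:.3f}, z={z:.2f}, p={p:.3f}")
```

For the alcohol row this gives a coefficient of about 0.13 and a p-value a little under 0.04, in line with the rounded values in the table; small differences for other rows are expected because the published figures are rounded to two decimals.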

Combined Health Metrics

For the combined health metrics discussed below, each variable was given a point of 1 if the subject had a poor health metric classification and 0 otherwise, with classifications defined as in the “Assessment of Health Metrics” section.
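The point assignment described above can be sketched in a few lines. The thresholds are those listed in Table 1; the `Subject` record, its field names, and the function `poor_metric_points` are hypothetical names introduced for illustration. The hypertension cut-off is not given in this section, so the sketch takes a ready-made boolean rather than guessing one.

```python
from dataclasses import dataclass


@dataclass
class Subject:
    alcohol_units_per_week: float
    current_smoker: bool
    bmi: float                        # kg/m^2
    exercise_minutes_per_week: float  # leisure time exercise
    sleep_hours_per_day: float
    hypertensive: bool                # taken as given; cut-off not stated here
    resting_heart_rate: float         # beats per minute
    walking_minutes_per_day: float


def poor_metric_points(s: Subject) -> dict:
    """Return 1 for each poor health metric (Table 1 thresholds), else 0."""
    return {
        "Al": int(s.alcohol_units_per_week > 14),
        "Sm": int(s.current_smoker),
        "Bmi": int(s.bmi >= 30),
        "Phy": int(s.exercise_minutes_per_week <= 120),
        "Sd": int(s.sleep_hours_per_day < 7 or s.sleep_hours_per_day > 9),
        "Bp": int(s.hypertensive),
        "Rhr": int(s.resting_heart_rate >= 90),
        "Wd": int(s.walking_minutes_per_day < 20),
    }


# Hypothetical subject, not drawn from the HALS data.
example = Subject(20, False, 31.0, 100, 6.5, False, 72, 30)
print(poor_metric_points(example))
```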

TABLE 3 Distribution of Health Score I

Points  Frequency  Percentage of Total  Deaths  Percentage Died
0       573        8.3%                 54      9.4%
1       1,949      28.3%                355     18.2%
2       2,378      34.5%                797     33.5%
3       1,428      20.7%                646     45.2%
4       475        6.9%                 250     52.6%
5       73         1.1%                 51      69.9%
6       9          0.1%                 7       77.8%

A Cox regression was then run on all the variables at once. There were no major differences between this and the individual regressions, except that the variable for alcohol consumption was no longer significant. Further investigation showed that this was mainly due to collinearity between alcohol consumption and another explanatory variable: smoking. This would be expected considering 51 percent of those with a poor health classification for alcohol consumption were current smokers, whereas only 26 percent of smokers were considered to have poor drinking behavior; thus, much of alcohol’s effects would likely be incorporated into the smoking variable. This lack of significance of an alcohol variable has been seen in similar studies on different populations such as that by Ding et al. (2015), who note that while it may not show significance by itself, when in combination with other metrics such as smoking or physical inactivity it can have a strong association with all-cause mortality. There is also a general consensus of alcohol and mortality having a U-shaped relationship, in which both heavy drinkers and nondrinkers have an increased risk (Khaw et al., 2008). The model showed a high level of significance overall with χ²(10) for the log-rank test giving a p-value of 0.00. We can also see the significance of the wearable-related health metrics when combined with the more commonly used metrics. This suggests that there may be some benefit to a model consisting of measurements made by wearable technology.
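The two overlap percentages quoted for alcohol and smoking can be cross-checked against the prevalences in Table 1 with a short conditional-probability calculation. This is only a consistency check, and it assumes the Table 1 percentages and the 51 percent figure refer to the same set of subjects, which the text does not state explicitly; the function name is hypothetical.

```python
def reverse_conditional(p_a, p_b, p_b_given_a):
    """Return P(A | B) from P(A), P(B), and P(B | A) via Bayes' rule."""
    joint = p_b_given_a * p_a  # P(A and B)
    return joint / p_b


p_alcohol = 0.183               # poor alcohol classification, Table 1
p_smoker = 0.361                # current smoker, Table 1
p_smoker_given_alcohol = 0.51   # quoted in the text

p_alcohol_given_smoker = reverse_conditional(
    p_alcohol, p_smoker, p_smoker_given_alcohol
)
print(f"Share of smokers with poor drinking behavior: {p_alcohol_given_smoker:.0%}")
```

Under that assumption the implied share is about 26 percent, which agrees with the figure reported in the paragraph.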

Health Score I. We formulate Health Score I, composed of all significant variables in the previous analyses. The health score was created by summing the points for each individual subject, giving a possible range of 0 to 7 points. A lower score was indicative of a healthier lifestyle, and thus it was hypothesized that survival probability would decrease with the increase of poor health metrics. Health Score I can be summarized as

\[
\text{Health Score I} = \mathit{Sm} + \mathit{Bmi} + \mathit{Phy} + \mathit{Sd} + \mathit{Bp} + \mathit{Rhr} + \mathit{Wd}, \tag{1}
\]

with variables defined in Table 1. A Cox regression was run on Health Score I, assessing both the overall explanatory power of the score and its effectiveness across its range.

The distribution of Score I is shown in Table 3. We can see that no subjects achieved the maximum points tally of 7, and only nine subjects had 6 points, which could affect the significance of the regressions at this value. The final column shows that as the number of points increases, the percentage of deaths for each total number of points is strictly increasing, as hypothesized.

TABLE 4 Results of Cox Regression for Health Score I

Variable  Coefficient  p-Value  HR (95% CI)
Score I   0.27         0.00     1.31 (1.26–1.36)
0                               1 (Reference)
1         0.02         0.88     1.02 (0.77–1.36)
2         0.51         0.00     1.66 (1.25–2.19)
3         0.74         0.00     2.10 (1.59–2.78)
4         0.86         0.00     2.37 (1.76–3.19)
5         1.31         0.00     3.69 (2.51–5.42)
6         1.10         0.01     3.00 (1.36–6.61)

Running an adjusted Cox regression showed that Health Score I was able to predict survival time to a high level of significance, with p-value 0.00. The first row of Table 4 tells us that for an increase of 1 point, a subject has on average a 31 percent higher hazard of death at any given time. In a stratified analysis of the score, we can see that there is little evidence to suggest that the presence of one poor metric had any effect on the hazard ratio of a participant. After this, each additional poor metric shows an increase in predicted hazard ratio until a total of 6 is reached, where the estimate falls; this drop is likely due to the small sample size for this score value, which is apparent on inspection of the wide range seen in the 95 percent confidence intervals (CIs). The CIs for adjacent point totals also overlap, which may suggest that an increase of just 1 point in Score I is not statistically significant, or may indicate that too many metrics are being used.
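One way to read the first row of Table 4 is that a single linear score term implies hazard ratios that compound by 1.31 per point. The sketch below compares that implied value with the stratified estimates from the same table; it is a reading aid under that assumption rather than part of the authors' analysis, and the names used are hypothetical.

```python
PER_POINT_HR = 1.31  # Score I row of Table 4

# Stratified hazard ratios from Table 4 (0 points is the reference).
STRATIFIED_HR = {1: 1.02, 2: 1.66, 3: 2.10, 4: 2.37, 5: 3.69, 6: 3.00}


def implied_hr(points, per_point_hr=PER_POINT_HR):
    """Hazard ratio versus 0 points if each point multiplies the hazard equally."""
    return per_point_hr ** points


for points, observed in STRATIFIED_HR.items():
    print(f"{points} points: linear model {implied_hr(points):.2f}, "
          f"stratified {observed:.2f}")
```

The two sets of values differ most at 1 point and at the sparsely populated top of the range, which matches the observations made in the paragraph above.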

Health Score II. Here, we create an alternative health score that consisted of only the health metrics deemed viable to be measured by wearables, as seen in the “Assessment of Health Metrics” section. Health Score II was defined by

\[
\text{Health Score II} = \mathit{Phy} + \mathit{Sd} + \mathit{Bp} + \mathit{Rhr} + \mathit{Wd} \tag{2}
\]

and was analyzed in a similar manner to Health Score I.
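Equations (1) and (2) are plain sums of 0/1 indicators, so both scores can be computed from the same table of points. A minimal sketch, assuming the indicators are held in a dictionary keyed by the variable names of Table 1 (the function and constant names are hypothetical):

```python
SCORE_I_VARS = ("Sm", "Bmi", "Phy", "Sd", "Bp", "Rhr", "Wd")  # Equation (1)
SCORE_II_VARS = ("Phy", "Sd", "Bp", "Rhr", "Wd")              # Equation (2)


def health_score(points, variables):
    """Sum the 0/1 poor-metric indicators for the chosen variables."""
    return sum(points[name] for name in variables)


# Hypothetical subject: heavy drinker and smoker, inactive, high resting heart rate.
points = {"Al": 1, "Sm": 1, "Bmi": 0, "Phy": 1, "Sd": 0, "Bp": 0, "Rhr": 1, "Wd": 0}
print("Health Score I: ", health_score(points, SCORE_I_VARS))
print("Health Score II:", health_score(points, SCORE_II_VARS))
```

Alcohol consumption appears in neither tuple, reflecting its exclusion from Score I after it lost significance in the combined regression, and Score II additionally drops smoking and BMI because they are not wearable-measured metrics.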

Again the maximum points total, in this case 5, had a very low frequency of subjects and so was unlikely to be able to provide a strong level of significance of increased mortality above those with 4 points. This can be viewed in Table 5. As before, the percentage of deaths for each total number of points is increasing, with the difference between each level more distinct.

The Cox regression for this score illustrated a strong ability to predict relative survival time, producing a p-value of 0.00 as seen in Table 6. Score II shows an expected 25 percent increase in the hazard of death for each increase of 1 point. The presence of one single poor metric was again not statistically significant at the 95 percent level; however, with a p-value of 0.08 and a hazard ratio of 1.21, there appears to be an indication of one poor metric having some effect on survival time. When stratifying the score, again we see the hazard ratio increases as we increase the number of points until we reach a total of 5, where it dips slightly (2.45 versus 2.50). Once more, this is likely due to the sample size of five subjects (0.1 percent) above anything else. Overlapping CIs are seen again with the individual points totals, suggesting that there is still room to improve the model.

TABLE 5 Distribution of Health Score II

Points  Frequency  Percentage of Total  Deaths  Percentage Died
0       871        12.7%                93      10.7%
1       2,820      41.0%                647     22.9%
2       2,345      34.1%                927     39.5%
3       740        10.8%                415     56.1%
4       104        1.5%                 73      70.2%
5       5          0.1%                 5       100.0%

TABLE 6 Results of Cox Regression for Health Score II

Variable  Coefficient  p-Value  HR (95% CI)
Score II  0.23         0.00     1.25 (1.19–1.31)
0                               1 (Reference)
1         0.19         0.08     1.21 (0.97–1.51)
2         0.46         0.00     1.58 (1.27–1.96)
3         0.64         0.00     1.88 (1.50–2.38)
4         0.92         0.00     2.50 (1.83–3.41)
5         0.90         0.05     2.45 (0.99–6.05)

Health Score III. We further refine Health Score II in order to achieve nonoverlapping CIs, resulting in the creation of Health Score III, a final model that categorized subjects as low, increased, and high risk. This score categorized subjects into three groups as shown in Table 7. The aim of Health Score III was to summarize participants into distinct groups without any overlapping of 95 percent CIs.

Adjusted Cox regressions were run on Score III with results displayed in Table 8. Similar to Score II, the model is statistically significant, with a greater hazard ratio as would be expected due to the presence of more poor metrics between each neighboring value.
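The grouping behind Health Score III pairs adjacent Score II values, so it can be written as an integer division. The sketch below applies that mapping and aggregates the Score II frequencies from Table 5 into the three risk groups; the function names are hypothetical, and the group totals are simple sums of the published frequencies rather than figures quoted in the text.

```python
RISK_LABELS = {0: "Low risk", 1: "Increased risk", 2: "High risk"}

# Frequencies of Score II values 0..5, copied from Table 5.
TABLE_5_FREQUENCIES = {0: 871, 1: 2820, 2: 2345, 3: 740, 4: 104, 5: 5}


def health_score_iii(score_ii: int) -> int:
    """Map Score II (0-5) to Score III: 0-1 -> 0, 2-3 -> 1, 4-5 -> 2."""
    if not 0 <= score_ii <= 5:
        raise ValueError("Score II must be between 0 and 5")
    return score_ii // 2


def group_sizes(frequencies):
    """Aggregate Score II frequencies into Score III groups."""
    sizes = {0: 0, 1: 0, 2: 0}
    for score_ii, count in frequencies.items():
        sizes[health_score_iii(score_ii)] += count
    return sizes


for group, size in group_sizes(TABLE_5_FREQUENCIES).items():
    print(f"{RISK_LABELS[group]}: {size} subjects")
```

The high-risk group is small relative to the other two, which is worth keeping in mind when reading its wider confidence interval.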

In the stratified analysis, we can also see a score of 1 is no longer insignificant, which addresses a major flaw present in the previous models. In addition, we have now managed to successfully remove the overlapping of 95 percent CIs with adjacent scores. We can interpret this as meaning that there is over a 95 percent probability that an arbitrary subject will be classified in the survival category that describes their survival function best.

TABLE 7 Classification of Health Score III

Points in Score II  Score III Variable  Risk Classification
0                   0                   Low risk
1                   0                   Low risk
2                   1                   Increased risk
3                   1                   Increased risk
4                   2                   High risk
5                   2                   High risk

TABLE 8 Results of Cox Regression for Health Score III

Variable   Coefficient  p-Value  HR (95% CI)
Score III  0.35         0.00     1.42 (1.31–1.54)
Age        0.10         0.00     1.10 (1.10–1.11)
Sex        0.49         0.00     1.63 (1.49–1.77)
0                                1 (Reference)
1          0.34         0.00     1.40 (1.27–1.53)
2          0.74         0.00     2.10 (1.66–2.65)
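The overlap criterion used to motivate Health Score III can be checked mechanically from the published intervals. The sketch below tests adjacent 95 percent CIs for the stratified Score II and Score III hazard ratios (Tables 6 and 8); the helper names are hypothetical, and the check says nothing beyond whether the printed intervals intersect.

```python
def intervals_overlap(a, b):
    """True if closed intervals a=(low, high) and b=(low, high) intersect."""
    return a[0] <= b[1] and b[0] <= a[1]


def adjacent_overlaps(intervals):
    """Overlap flag for each pair of neighboring score values."""
    return [
        intervals_overlap(intervals[i], intervals[i + 1])
        for i in range(len(intervals) - 1)
    ]


# 95% CIs for score values 1..5 (Table 6) and 1..2 (Table 8).
SCORE_II_CIS = [(0.97, 1.51), (1.27, 1.96), (1.50, 2.38), (1.83, 3.41), (0.99, 6.05)]
SCORE_III_CIS = [(1.27, 1.53), (1.66, 2.65)]

print("Score II adjacent overlaps: ", adjacent_overlaps(SCORE_II_CIS))
print("Score III adjacent overlaps:", adjacent_overlaps(SCORE_III_CIS))
```

Every neighboring pair overlaps for Score II, whereas the two Score III intervals are separated, and the Score III interval for a score of 1 also lies entirely above the reference value of 1.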

We can now write an equation for the hazard function λ(t) of a subject, with respect to vector Xi consisting of Health Score III, age, and sex as

\[
\lambda(t \mid X_i) = \lambda_0(t)\exp\left[0.35\,(\text{Score III}_i) + 0.10\,(\text{age}_i) + 0.49\,(\text{sex}_i)\right], \tag{3}
\]

where λ0(t) denotes the baseline hazard function.

The age and sex variables are included as a result of the model adjustment. Our Cox model suggests a hazard ratio of exp[0.10] ≈ 1.11 for an increase in age of 1 year when the rounded coefficient is used (Table 8 reports 1.10), so the hazard of death for an arbitrary subject increases by approximately 10 to 11 percent each year. A value of 0 for the variable sex denoted a female, and 1 a male; thus, the model predicts that, all else being equal, the hazard rate of a male subject will be 63 percent higher than that of a female.
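Equation (3) can be evaluated without knowing the baseline hazard as long as only ratios between subjects are needed, because λ0(t) cancels. The sketch below uses the rounded coefficients from Equation (3); the function names are hypothetical. It also shows one plausible route to the "equivalent increase in age" quoted in the abstract, dividing the log of the high-risk hazard ratio by the log of the per-year hazard ratio from Table 8; the source does not spell out that calculation, so treat it as an assumption.

```python
import math

# Rounded coefficients from Equation (3).
BETA_SCORE, BETA_AGE, BETA_SEX = 0.35, 0.10, 0.49


def relative_hazard(score_iii, age, sex):
    """exp(linear predictor); multiply by the baseline hazard for the full hazard."""
    return math.exp(BETA_SCORE * score_iii + BETA_AGE * age + BETA_SEX * sex)


def hazard_ratio(subject_a, subject_b):
    """Ratio of hazards for two (score_iii, age, sex) tuples; baseline cancels."""
    return relative_hazard(*subject_a) / relative_hazard(*subject_b)


def age_equivalent_years(hr, hr_per_year=1.10):
    """Years of aging giving the same hazard ratio (hr_per_year from Table 8)."""
    return math.log(hr) / math.log(hr_per_year)


# Male versus female, same age and score (sex: 0 = female, 1 = male).
print(f"Male vs female: {hazard_ratio((0, 50, 1), (0, 50, 0)):.2f}")
# High-risk versus low-risk stratified hazard ratio of 2.10 (Table 8).
print(f"Age equivalent of HR 2.10: {age_equivalent_years(2.10):.1f} years")
```

With these inputs the age equivalent comes out at roughly 7.8 years, matching the abstract; using the rounded coefficient 0.10 directly instead of the tabulated hazard ratio would give a slightly smaller figure.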

A graphical estimate of the baseline hazard function (low risk) was calculated to complement the model, along with estimates for the other categories. The baseline hazard term λ0(t) cannot be represented as a function in (3); however, Stata can use standard kernel-smoothing methodology (Gray, 1990) to approximate a smooth curve to the Nelson–Aalen⁴ estimate, thus making it differentiable. The resulting estimated baseline hazard rate can be viewed in Figure 1. It is represented by the blue line in the plot, for which Health Score III is 0. Hazard rates for scores of 1 and 2 are also plotted for reference. As expected, the hazard rate increases exponentially over time, noticeable through the slight convexity of the functions. The sharp drop at the end is caused by the censoring of subjects after differing lengths of time under observation, resulting from different measurement dates, and has no bearing on the true hazard function.

FIGURE 1 Estimated Hazard Functions [Color figure can be viewed at wileyonlinelibrary.com]

[Figure: smoothed hazard function (vertical axis, 0 to 0.025) plotted against t (horizontal axis, 0 to 25), with one curve each for Score III = 0, Score III = 1, and Score III = 2.]

This graph also illustrates the consequence of the proportional-hazards assumption. It is clear that the smoothed hazard functions are proportional and would be parallel if scaled logarithmically.⁵
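The kernel smoothing of the Nelson–Aalen estimate mentioned above can be illustrated with a short sketch. This is not Stata's implementation: the Epanechnikov kernel, the bandwidth, and all function names are assumptions chosen for illustration, and no boundary correction is applied.

```python
from typing import List, Sequence, Tuple


def nelson_aalen_increments(
    times: Sequence[float], events: Sequence[int]
) -> List[Tuple[float, float]]:
    """Return (event_time, d_j / n_j) jumps of the Nelson-Aalen estimator.

    times  : follow-up time for each subject
    events : 1 if a death was observed at that time, 0 if censored
    """
    increments = []
    for t in sorted({t for t, e in zip(times, events) if e == 1}):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e == 1)
        at_risk = sum(1 for ti in times if ti >= t)
        increments.append((t, deaths / at_risk))
    return increments


def epanechnikov(u: float) -> float:
    """Epanechnikov kernel, supported on [-1, 1] (an assumed choice)."""
    return 0.75 * (1.0 - u * u) if abs(u) <= 1.0 else 0.0


def smoothed_hazard(
    t: float, increments: Sequence[Tuple[float, float]], bandwidth: float
) -> float:
    """Kernel-weighted sum of Nelson-Aalen jumps around time t."""
    total = sum(epanechnikov((t - tj) / bandwidth) * dh for tj, dh in increments)
    return total / bandwidth


if __name__ == "__main__":
    # Hypothetical toy data: follow-up times in years and death indicators.
    toy_times = [2, 3, 3, 5, 8, 8, 9, 12]
    toy_events = [1, 1, 0, 1, 1, 1, 0, 1]
    jumps = nelson_aalen_increments(toy_times, toy_events)
    for t in (3.0, 6.0, 9.0):
        print(t, smoothed_hazard(t, jumps, bandwidth=3.0))
```

The cumulative Nelson–Aalen estimate is a step function; spreading each jump over a window of width set by the bandwidth is what yields a smooth, plottable hazard curve.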

Diagnostics

If we are to discuss the Cox model with Health Score III, and its ability to be used in the life insurance market, we must first run several diagnostic tests. Although the Cox model is semiparametric, it still must be checked for misspecification, goodness of fit, outliers, and influential points.

The simple yet powerful link test was run as a general specification test for the model (Cleves et al., 2010), with no evidence of misspecification uncovered. When Schoenfeld (1982) residuals were analyzed graphically to check the proportional hazards assumption, no violation of the assumption was apparent. The assumption was further supported using a log–log plot for Health Score III, with curves appearing roughly parallel as expected.
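The reason parallel curves are expected in the log–log plot can be seen from the standard Cox survival relation. The derivation below is a sketch using the rounded coefficient from Equation (3); the symbols S and S_0 (the survival function and the baseline survival function) are notation introduced here for illustration.

```latex
\begin{align*}
S(t \mid X_i) &= S_0(t)^{\exp(\beta^{\top} X_i)} \\
-\log S(t \mid X_i) &= \exp(\beta^{\top} X_i)\,\bigl[-\log S_0(t)\bigr] \\
\log\bigl[-\log S(t \mid X_i)\bigr] &= \beta^{\top} X_i + \log\bigl[-\log S_0(t)\bigr]
\end{align*}
```

For two subjects who differ only by one point in Score III, the curves therefore differ by the constant 0.35 at every t, which is why they should appear parallel if the proportional hazards assumption holds.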

4 See Nelson (1972) and Aalen (1978) for further information.
5 With a slight allowance due to the kernel-smoothing process.


FIGURE 2 Comparison of Kaplan–Meier (Observed) and Cox (Predicted) Survival Functions [Plot of survival probability (vertical axis, 0.20 to 1.00) against t (horizontal axis, 0 to 25), showing observed and predicted curves for Score III = 0, 1, and 2.]

Model-agnostic observed Kaplan–Meier curves (Kaplan and Meier, 1958) were plotted alongside the predicted survival functions for each score produced by our Cox model to observe how the predictions compared to the data. This is shown in Figure 2, with the Cox model appearing to be an excellent fit for estimating survival probability of subjects who scored 0 or 1 in Score III. There is a slight deviation between predicted and observed for those who scored 2. We would hope that this is due to the smaller sample size of this category, and perhaps to the presence of a few outliers, rather than a deviation from proportionality. Ideally, if the sample size were to approach infinity, the observed Kaplan–Meier curves would become indistinguishable from those predicted by the Cox regression.
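The comparison described above sets a nonparametric estimate against a model-based one. The sketch below shows both ingredients under simplifying assumptions: a product-limit (Kaplan–Meier) estimator, and the standard Cox relation S(t|X) = S0(t)^exp(β'X) for predicted survival. The function names and toy data are hypothetical, and this is not the Stata routine used in the article.

```python
import math
from typing import List, Sequence, Tuple


def kaplan_meier(
    times: Sequence[float], events: Sequence[int]
) -> List[Tuple[float, float]]:
    """Return (event_time, S(t)) pairs of the Kaplan-Meier estimator.

    times  : follow-up time for each subject
    events : 1 if a death was observed at that time, 0 if censored
    """
    survival = 1.0
    curve = []
    for t in sorted({t for t, e in zip(times, events) if e == 1}):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e == 1)
        at_risk = sum(1 for ti in times if ti >= t)
        survival *= 1.0 - deaths / at_risk
        curve.append((t, survival))
    return curve


def cox_survival(baseline_survival: float, linear_predictor: float) -> float:
    """Predicted survival under a Cox model: S0(t) ** exp(beta' x)."""
    return baseline_survival ** math.exp(linear_predictor)


if __name__ == "__main__":
    # Hypothetical toy data for a single score category.
    print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
    # A subject one Score III point above baseline (coefficient 0.35).
    print(cox_survival(0.90, 0.35))
```

Plotting the first function per score category against the second, evaluated on a common baseline, reproduces the kind of observed-versus-predicted comparison shown in Figure 2.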

Cox–Snell residuals (Cox and Snell, 1968) examining the overall fit of the model showed little divergence between observed and predicted values. The predictive power of the model was evaluated with Harrell’s C concordance statistic (Harrell et al., 1982), which indicated that the model correctly predicted the order of survival times in 86 percent of instances.
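To make the concordance statistic concrete, the following brute-force sketch computes Harrell's C for right-censored data. It is an illustration under stated assumptions (ties in predicted risk count one half, tied survival times are not treated as comparable) and is not the routine used in the article; the function name is hypothetical.

```python
from typing import Sequence


def harrell_c(
    times: Sequence[float], events: Sequence[int], risk: Sequence[float]
) -> float:
    """Harrell's C for right-censored data (brute force, O(n^2)).

    A pair (i, j) is comparable when subject i has the shorter time and an
    observed death. It is concordant when subject i also has the higher
    predicted risk; ties in predicted risk count as one half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if events[i] == 1 and times[i] < times[j]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


if __name__ == "__main__":
    # Hypothetical toy data: higher risk scores die earlier, so C is high.
    print(harrell_c([1, 2, 3, 4], [1, 1, 0, 1], [2.0, 1.5, 1.0, 0.5]))
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which gives context for the 86 percent reported above.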

Martingale residuals were calculated, with no evidence in the residuals to suggest that any covariates had an incorrect functional form. Deviance residuals⁶ and dfbetas were used to determine the influence that outliers exerted on Score III, with no individual subjects considered highly influential (Belsley et al., 2005).

DISCUSSION

The model created in this study demonstrates that insurance rating factors that can feasibly be measured by wearable devices have predictive power in relation to all-cause mortality.

6 For further information on deviance residuals, see Therneau et al. (1990).


Benefits of Model

A key benefit of Health Score III is the simplicity of its categorization into low, increased, and high risk. This simplicity should help facilitate the possible inclusion of a similar model within current pricing models. Furthermore, using simple logarithm rules and the data in Table 8, we can interpret the scores in terms of years added on to the age of the insured. Each additional point in Health Score III is equivalent to an increase of 3.7 years above the subject’s true age. Using the hazard rates, we can also say that being considered high risk is equivalent to a low-risk subject being 7.8 years older. The difference in mortality risk due to being male can be quantified as an increase in age of 5.1 years, which is slightly higher than the average difference in life expectancy between males and females of 3.7 years (Office for National Statistics, 2016). The effect on life expectancy would likely be greater at younger ages as well, due to the higher expected survival times in that population. Being able to transform the health scores into an equivalent increase in age could drastically simplify the process needed to integrate them into existing models.
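The "logarithm rules" referred to above amount to solving exp(β_factor) = exp(β_age × Δ) for Δ, giving Δ = β_factor / β_age. The sketch below applies this to the rounded coefficients printed in Equation (3); the function name is hypothetical. With those rounded inputs the results are roughly 3.5 and 4.9 years, slightly below the 3.7 and 5.1 years quoted in the text, which presumably rest on the unrounded estimates in Table 8 (an assumption, since that table is not reproduced here).

```python
import math


def equivalent_years(beta_factor: float, beta_age: float) -> float:
    """Years of ageing with the same effect on the log-hazard as the factor.

    Solves exp(beta_factor) = exp(beta_age * years) for `years`.
    """
    return beta_factor / beta_age


if __name__ == "__main__":
    beta_age = 0.10  # rounded age coefficient from Equation (3)
    print(f"one Score III point: {equivalent_years(0.35, beta_age):.1f} years")
    print(f"male versus female:  {equivalent_years(0.49, beta_age):.1f} years")
    # A hazard ratio can be converted by taking its logarithm first.
    print(f"hazard ratio of 2.1: {equivalent_years(math.log(2.1), beta_age):.1f} years")
```

Expressing a rating factor as an age loading in this way is what would let it slot into a pricing basis that is already indexed by age.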

Data were available in the HALS for other factors that could have been used to adjust the analyses, such as household income or socioeconomic status; however, in the interest of simplicity they were not included. By only adjusting for age and sex, the predictive ability of the model could be retained, and the goal was not to address causality (Ding et al., 2015). Simplicity reasons were also used to justify the lack of weighting variables; however, this imprecision would only be expected to reduce the significance of the score, and strong significance was still present.

Our model acts as a simple proof of concept to help demonstrate, in a rudimentary sense, that insurance companies may be able to utilize data from wearables as part of their premium rating process. Thus, we extend our discussion to the benefits of using wearable-derived data, in insurance pricing, from a general point of view.

General Benefits of Using Wearables Data in Insurance Pricing Models

The recent proliferation of wearable devices, and the resulting explosion in personal self-quantified health data, has opened up the potential for new rating factors to be included as part of current life and health insurance pricing models.

There are numerous benefits regarding the use of a wearables model. First, by recording data on individuals’ health behavior (e.g., biometric self-quantification data collected via wearable device technology), the information asymmetry between the policyholder and the insurer is reduced, thus enabling more granular risk differentiation based on the true risk levels of the drivers. This potentially reduces the problem of adverse selection, allowing the insurer to price individuals at a more personalized and accurate level, which should result in a more stable cohort of policyholders who are fairly priced. It should be noted that there is a concern that being under obligation to provide personal data may penalize uninsurables (Yates, 2017); however, it could be argued that those who do not attempt to remain healthy penalize low-risk policyholders due to the information asymmetry between insurer and consumer, and the inevitable adverse selection that comes with it (Gatzert and Wesker, 2014). Although more individualized pricing may open insurance cover to previously uninsurable risks (e.g., diabetics who very carefully manage their diet and exercise regimes), it is important to also consider that some restrictions on risk classification, and hence an acceptable level of adverse selection, can increase loss coverage and so make insurance work better for society as a whole (Thomas, 2008).

A further significant benefit is the potential reduction in underwriting costs borne by the insurer and consequently the policyholder. In younger and healthier age groups, costs from frequent medical examinations can actually exceed the expected value of claims over the same period (Pitacco, 2014). If used in combination with other nonwearable metrics that require measurement, the select period required between examinations could be increased due to the reduced risk. The presence of more information throughout the life of the policyholder, due to the ideally continuous nature of the model, will reduce the variability of costs from their expected level (Pitacco, 2014). Although this could not be achieved in this article’s model, the use of a continuous data set would not require much modification, as at this point the model coefficients are assumed to remain constant over time and only the covariates can change.

One area in which insurance companies have always struggled is customer engagement, with customers considering insurance policies more of an obligation, or a “grudge purchase,” rather than a product. The ability to self-monitor could be an incentive to increase good behaviors in individuals due to the increased engagement with their own health data through mobile devices (Abraham, 2016). Policies can also be tailored to individuals’ specific needs. These factors will be important to retain interest, as up to 50 percent of customers become disinterested and stop using their wearables within 1 year of purchase (Gore, 2015). Thus, it could be argued that wearables may play a future role in enhancing the customer relationship, possibly even to the extent that insurance companies begin to play a greater role in the preclaim period by incentivizing healthy behaviors. This has many implications, including the potential for insurers to help with earlier identification of conditions such as diabetes and heart disease, chronic disease management, and tackling the obesity epidemic through financial incentives and encouragement. Clearly, there is potential for enhancing the policyholder relationship as well as bringing about interventions that ultimately lead to a reduction in claims. Wearables may also bring about a more fluid and continuous relationship between the insurer and policyholder. Historically, there was little to no interaction between the two parties between point of sale and claim or renewal. Data provided on a more continuous basis with potential ensuing rewards (e.g., monthly premium discounts based on activity levels) provide the opportunity for more “touch points” in the relationship and thus may lead to lower churn rates.

Although there is great potential for insurance companies to incorporate wearables into their insurance products, there are many hurdles yet to overcome. Of paramount importance are the issues of fraud detection and the questionable accuracy of many devices/metrics. At present, some wearable data are open to fraudulent reporting as individuals may be able to record data that are not indicative of their own behavior. As can be easily imagined, this is particularly problematic in relation to metrics such as number of steps taken. Device accuracy represents another problem. Certain metrics are currently measured consistently and accurately via wearables, whereas other metrics show a large discrepancy. For example, a 2017 Stanford study found that energy expenditure readings were very inaccurate, whereas heart rate metrics were found to be within 5 percent of the true value for most devices (Shcherbina et al., 2017).

LIMITATIONS

Despite the numerous diagnostic tests used to validate the model, the findings in this article must be interpreted in the light of the study’s limitations.

Measurement Limitations

Despite the large sample size in the HALS, certain categories investigated were small, such as the high-risk category for Health Score III, and to an even greater extent the higher points totals for Health Scores I and II. An increased number of subjects in these categories would allow for a more accurate calculation of model coefficients, hopefully narrowing their 95 percent CIs to a more acceptable level.

It is also worth considering the possibility that the selection of subjects analyzed was itself biased. As there was a response rate of 73 percent in the survey, nonparticipation bias might have affected the prevalence of associations, which would impact their generalizability; however, due to the multivariable nature of the model, the health scores would not be affected to the same degree. In addition to this, Galea and Tracy (2007) find that lower participation rates are unlikely to have a substantial effect on exposure event associations. This suggests that associations, relative to prevalence, are less reliant on sample representativeness.

A particularly significant shortfall in the HALS was that the follow-up period spans only 25 years. This is simply due to the date of the initial data collection, and so the data set will improve in time; however, it was speculated that the model may only be relevant for certain segments of the population. For example, a 30-year-old with high blood pressure who exercises infrequently is still unlikely to die in the next 25 years, whereas an 80-year-old is likely to die over the same period regardless of the underlying health metrics they possess. Thus, the effect of possessing these metrics is hidden from our data set. In order to investigate this, the sample was stratified into age groups spanning 10 years starting from 30 years old. Log-rank tests were performed for Score III within each of the age groups. The results showed that the model is only successful in predicting survival times between ages 40 and 80. This means that there may not be optimal data for 44.7 percent of the participants in our data set. In fact, these 44.7 percent of participants accounted for only 13.1 percent of deaths. While 100 percent of those above 80 years old died in the follow-up period, there were deaths in only 4.0 percent of those under 40 years old. With 96.0 percent of subjects under 40 being censored in June 2009, it would be very difficult to find significant results for that population, as only a very small proportion provide an exact survival time.

A final area identified in the HALS was the possibility of measurement error. Although several metrics were taken by a study nurse, others such as physical activity, walking, and sleeping duration were not subject to the same level of scrutiny. When variables are self-reported, they are almost always subject to misclassification bias (Maudsley and Williams, 1996). Physical activity and walking were estimated by calculating the average amount achieved over the previous fortnight; however, for many participants these 2 weeks may not have been representative of their lifestyle as a whole. Something as simple as bad weather in a region could have impacted reported walking levels for its subjects. The survey question on average sleep duration was little more than a best guess by participants, and an estimate over the long term would likely be difficult for most to answer accurately. There is also the possibility that this bias could be nonrandom: on average, subjects tend to report favorable behaviors due to social desirability bias. Fortunately, the nature of this study means that this bias will more often be toward the null (Ford et al., 2011).

Methodological Limitations

While the HALS had its limitations, the design of the model itself had its own shortcomings. Poor indicators for different metrics may have the same underlying cause, and thus the presence of a second poor metric may exaggerate the mortality risk due to the additive nature of the health score. This is particularly relevant as our model considers the metrics to be indicators of risk rather than causes. Confounding could also be in effect, which occurs when an included variable is correlated with both the dependent variable and an independent variable. Walking duration is a prime example of this, as it and physical activity were shown in the “Combined Health Metrics” section to be individually significant in the same model and are also likely to be related to one another. On the other hand, studies have shown that associations of particular pairs of metrics with mortality can be much higher than the sum of their individual associations as measured by hazard ratios (Ding et al., 2015), and so the inclusion of a score multiplier could be a useful addition in these cases. A further modification that could be made to improve the modeling of multiple metrics would be the inclusion of weighting factors to represent their effects on hazard ratios. In the Cox regression performed in the “Combined Health Metrics” section, resting heart rate had a much higher hazard ratio (1.45) relative to high blood pressure (1.17). Allowing a greater weighting to resting heart rate in this scenario might increase the predictive ability of our health scores. As mentioned before, simplicity was a key aspect of the model, and the capturing of these kinds of interactions was not a primary concern when there was already a strong relationship with mortality present.
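One way the weighting idea above could be realized is to weight each poor metric by its log hazard ratio, so that metrics with stronger mortality associations contribute more to the score. The article does not specify a weighting scheme, so the choice of log hazard ratios, the function names, and the restriction to the two metrics quoted in the text are all assumptions made for illustration.

```python
import math
from typing import Dict

# Hazard ratios quoted in the text for two metrics; other metrics are omitted.
HAZARD_RATIOS = {"resting_heart_rate": 1.45, "high_blood_pressure": 1.17}


def log_hr_weights(hazard_ratios: Dict[str, float]) -> Dict[str, float]:
    """Weight each metric by its log hazard ratio (an assumed scheme)."""
    return {name: math.log(hr) for name, hr in hazard_ratios.items()}


def weighted_score(flags: Dict[str, int], weights: Dict[str, float]) -> float:
    """Sum of weights over the metrics flagged as poor (flag == 1)."""
    return sum(weights[name] * flag for name, flag in flags.items())


if __name__ == "__main__":
    weights = log_hr_weights(HAZARD_RATIOS)
    only_heart_rate = {"resting_heart_rate": 1, "high_blood_pressure": 0}
    only_blood_pressure = {"resting_heart_rate": 0, "high_blood_pressure": 1}
    print(weighted_score(only_heart_rate, weights))
    print(weighted_score(only_blood_pressure, weights))
```

Under this scheme an unweighted score is the special case where every weight equals 1, which makes the trade-off between simplicity and discrimination explicit.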

A common step taken in epidemiology studies is to exclude any subjects with previous diagnoses or chronic diseases such as cancer, heart disease, or stroke. Such a condition could affect not just survival time, but also whether a subject presents poor health metrics. However, as this study was not investigating causation, only indicators of poor health, we opted to leave these subjects in the main analyses. Poor metrics associated with these conditions were something we hoped to capture, as from an insurance perspective our score must be based on the likelihood of a claim for death or illness being made. The presence of previous conditions affecting results in this way is known as reverse causality. In the interest of thoroughness, a separate Cox regression was run excluding all those who possessed these conditions at measurement or died within the first 2 years of follow-up, as per the methods of Ding et al. (2015), with results displayed in Table 9. The exclusion of 144 subjects did not affect the significance of the score, and comparison with the main analysis results in Table 8 showed little difference apart from a slight reduction in the hazard ratio for a value of 2. This is more than likely a result of the sample size, with this additional regression excluding 13 of 78 deaths in this category.

TABLE 9 Results of Cox Regression Adjusted for Reverse Causality

Variable     Coefficient   p-Value   HR (95% CI)
Score III    0.32          0.00      1.38 (1.27–1.51)
  0                                  1 (Reference)
  1          0.32          0.00      1.38 (1.26–1.52)
  2          0.66          0.00      1.93 (1.49–2.49)

In the interest of simplicity, only all-cause mortality was considered as the primary outcome, yet much information could be gained by recording cause-specific mortality or onset of particular conditions as well. Without consideration of cause of death, we may be misrepresenting the significance of our health score as deaths may not always be for health reasons. For example, the leading cause of death for 20- to 34-year-olds was suicide, with 24 and 12 percent of male and female deaths in this age group, respectively (Office for National Statistics, 2017). These deaths, among others, would have no causal relationship with our health score.

Finally, a key attraction of using wearables to price insurance is their ability to take measurements consistently through time, allowing the insured’s risk profile to be updated in real time without the need to visit a doctor. Due to the nature of the survey data used in this study, the covariates in our model are assumed to remain constant in time. Future waves of follow-up data could be incorporated, increasing the applicability of the model to the real world at the sacrifice of simplicity. This would help to account for behavioral or physical changes resulting in misclassification (Kvaavik et al., 2010). The stability of certain behaviors or metrics differs over time, with several studies describing the stability of physical activity over time as low or moderate (Telama et al., 2005; Parsons et al., 2006). In this study, we must assume that some degree of stability exists, as evidenced by the significance of the model’s coefficients.

CONCLUSION

In conclusion, Health Score III acts as a proof of concept, demonstrating the potential for rating factors based on wearables data to be included in health and life insurance pricing models. The model also potentially acts as a starting point for including wearable-derived data in a more fully formed pricing model, especially for models that wish to utilize rating factors such as resting heart rate, blood pressure, sleep duration, and walking duration. The suitability of the existing metrics would require further evaluation, with weighting, substitution, and erasures taking place. With this in mind, there are several areas that could provide the basis for future research.

As this model only considered all-cause mortality as an event of interest, it is not directly applicable to pricing health insurance in its current form. An investigation into cause-specific mortality, however, would be the first movement in this direction, and inclusion of the onset of disease or other conditions would be a logical next step. It is worth noting that a larger data set would be required to provide enough occurrences of each condition to produce statistically significant results. At a certain point, it would also be necessary to run the model on a continuous data set in order to better simulate the real-world data it was developed for.


REFERENCES

Aalen, O., 1978, Nonparametric Inference for a Family of Counting Processes, Annals of Statistics, 6(4): 701-726.

Abraham, M., 2016, Wearable Technology: A Health-and-Care Actuary’s Perspective, Institute and Faculty of Actuaries.

Azzopardi, M., and D. Cortis, 2013, Implementing Automotive Telematics for Insurance Covers of Fleets, Journal of Technology Management & Innovation, 8(4): 59-67.

Becher, S., 2016, Wearables—A New Chance for Private Insurance Companies From the Underwriting View, Zeitschrift für die gesamte Versicherungswissenschaft, 105(5): 563-565.

Belsley, D. A., E. Kuh, and R. E. Welsch, 2005, Regression Diagnostics: Identifying Influential Data and Sources of Collinearity, Vol. 571 (Hoboken, NJ: John Wiley & Sons).

Blaxter, M., 1987, Evidence on Inequality in Health From a National Survey, Lancet, 330(8549): 30-33.

Cappuccio, F. P., L. D’Elia, P. Strazzullo, and M. A. Miller, 2010a, Quantity and Quality of Sleep and Incidence of Type 2 Diabetes, Diabetes Care, 33(2): 414-420.

Cappuccio, F. P., L. D’Elia, P. Strazzullo, and M. A. Miller, 2010b, Sleep Duration and All-Cause Mortality: A Systematic Review and Meta-Analysis of Prospective Studies, Sleep, 33(5): 585-592.

Catlin, T., J. T. Lorenz, B. Münstermann, B. Olesen, and V. Ricciardi, 2017, Insurtech—The Threat That Inspires (Stamford, CT: McKinsey & Company, Financial Services).

Cleves, M., W. Gould, R. G. Gutierrez, and Y. V. Marchenko, 2010, An Introduction to Survival Analysis Using Stata (College Station, TX: Stata Press).

Comstock, J., 2015, Fitbit Adds Auto-Detection of Biking, Running, Elliptical, and More. Retrieved from https://www.mobihealthnews.com/48764/fitbit-adds-auto-detection-of-biking-running-elliptical-and-more

Cox, B. D., 1988, Health and Lifestyle Survey, 1984-1985 [data collection], SN: 2218 (Essex, England: UK Data Service).

Cox, B. D., M. Blaxter, A. L. J. Buckle, N. P. Fenner, J. F. Golding, M. Gore, F. A. Huppert, J. Nickson, M. Roth, J. Stark, et al., 1987, The Health and Lifestyle Survey: Preliminary Report of a Nationwide Survey of the Physical and Mental Health, Attitudes and Lifestyle of a Random Sample of 9,003 British Adults (London: Health Promotion Research Trust).

Cox, D. R., 1972, Regression Models and Life Tables (with Discussion), Journal of the Royal Statistical Society, 34: 187-220.

Cox, D. R., and E. J. Snell, 1968, A General Definition of Residuals, Journal of the Royal Statistical Society Series B (Methodological), 30: 248-275.

Dart, A., 2015, The Case for Connected Wearables in Insurance, Asia Insurance Review. Retrieved from http://www.asiainsurancereview.com/Magazine/ReadMagazineArticle/aid/35855/The-case-for-Connected-Wearables-in-Insurance

Department of Health, 2016, UK Chief Medical Officers Low Risk Drinking Guidelines.

Ding, D., K. Rogers, H. van der Ploeg, E. Stamatakis, and A. E. Bauman, 2015, Traditional and Emerging Lifestyle Risk Behaviors and All-Cause Mortality in Middle-Aged and Older Adults: Evidence From a Large Population-Based Australian Cohort, PLoS Medicine, 12(12): e1001917.


Ford, E. S., G. Zhao, J. Tsai, and C. Li, 2011, Low-Risk Lifestyle Behaviors and All-Cause Mortality: Findings From the National Health and Nutrition Examination Survey III Mortality Study, American Journal of Public Health, 101(10): 1922-1929.

Galea, S., and M. Tracy, 2007, Participation Rates in Epidemiologic Studies, Annals of Epidemiology, 17(9): 643-653.

Ganna, A., and E. Ingelsson, 2015, 5 Year Mortality Predictors in 498 103 UK Biobank Participants: A Prospective Population-Based Study, Lancet, 386(9993): 533-540.

Gatzert, N., and H. Wesker, 2014, Mortality Risk and Its Effect on Shortfall and Risk Management in Life Insurance, Journal of Risk and Insurance, 81(1): 57-90.

Georgakis, M. K., A. D. Protogerou, E. I. Kalogirou, E. Kontogeorgi, I. Pagonari, F. Sarigianni, S. G. Papageorgiou, E. Kapaki, C. Papageorgiou, D. Tousoulis, et al., 2017, Blood Pressure and All-Cause Mortality by Level of Cognitive Function in the Elderly: Results From a Population-Based Study in Rural Greece, Journal of Clinical Hypertension, 19(2): 161-169.

Glei, D. A., N. Goldman, G. Rodríguez, and M. Weinstein, 2014, Beyond Self-Reports: Changes in Biomarkers as Predictors of Mortality, Population and Development Review, 40(2): 331-360.

Gopinath, B., V. M. Flood, G. Burlutsky, and P. Mitchell, 2010, Combined Influence of Health Behaviors on Total and Cause-Specific Mortality, Archives of Internal Medicine, 170(17): 1605-1607.

Gore, R., 2015, Insurance, Innovation and IoT: Insurers Have Their Say on the Internet of Things, FC Business Intelligence.

Gray, R. J., 1990, Some Diagnostic Methods for Cox Regression Models Through Hazard Smoothing, Biometrics, 46(1): 93-102.

Gruenewald, T. L., T. E. Seeman, C. D. Ryff, A. S. Karlamangla, and B. H. Singer, 2006, Combinations of Biomarkers Predictive of Later Life Mortality, Proceedings of the National Academy of Sciences, 103(38): 14158-14163.

Hakim, A. A., H. Petrovitch, C. M. Burchfiel, G. W. Ross, B. L. Rodriguez, L. R. White, K. Yano, J. D. Curb, and R. D. Abbott, 1998, Effects of Walking on Mortality Among Nonsmoking Retired Men, New England Journal of Medicine, 338(2): 94-99.

Hamer, M., C. J. Bates, and G. D. Mishra, 2011, Multiple Health Behaviors and Mortality Risk in Older Adults, Journal of the American Geriatrics Society, 59(2): 370-372.

Harrell, F. E., R. M. Califf, D. B. Pryor, K. L. Lee, and R. A. Rosati, 1982, Evaluating the Yield of Medical Tests, Journal of the American Medical Association, 247(18): 2543-2546.

Hilton, A., 2017, Insurtech Is Set to Take the Insurance Industry by Storm, Raconteur.

Iqbal, M. U., and S. Lim, 2006, A Privacy Preserving GPS-Based Pay-as-You-Drive Insurance Scheme, Symposium on GPS/GNSS (IGNSS2006), pp. 17-21.

Jensen, M. T., P. Suadicani, H. O. Hein, and F. Gyntelberg, 2013, Elevated Resting Heart Rate, Physical Fitness and All-Cause Mortality: A 16-Year Follow-Up in the Copenhagen Male Study, Heart, 99(12): 882-887.


Jubraj, R., S. Watson, and S. Tottman, 2017, The Rise of Insurtech, Accenture.

Kaplan, E. L., and P. Meier, 1958, Nonparametric Estimation from Incomplete Observations, Journal of the American Statistical Association, 53(282): 457-481.

Khaw, K., N. Wareham, S. Bingham, A. Welch, R. Luben, and N. Day, 2008, Combined Impact of Health Behaviours and Mortality in Men and Women: The Epic-Norfolk Prospective Population Study, PLoS Medicine, 5(1): e12.

Knoops, K. T., L. C. de Groot, D. Kromhout, A.-E. Perrin, O. Moreiras-Varela, A. Menotti, and W. A. Van Staveren, 2004, Mediterranean Diet, Lifestyle Factors, and 10-Year Mortality in Elderly European Men and Women: The Hale Project, JAMA, 292(12): 1433-1439.

Kuh, D., R. Hardy, M. Hotopf, D. A. Lawlor, B. Maughan, R. Westendorp, R. Cooper, S. Black, and G. Mishra, 2009, A Review of Lifetime Risk Factors for Mortality, British Actuarial Journal, 15(S1): 17-64.

Kvaavik, E., G. D. Batty, G. Ursin, R. Huxley, and C. R. Gale, 2010, Influence of Individual and Combined Health Behaviors on Total and Cause-Specific Mortality in Men and Women: The United Kingdom Health and Lifestyle Survey, Archives of Internal Medicine, 170(8): 711-718.

Leitzmann, M. F., Y. Park, A. Blair, R. Ballard-Barbash, T. Mouw, A. R. Hollenbeck, and A. Schatzkin, 2007, Physical Activity Recommendations and Decreased Risk of Mortality, Archives of Internal Medicine, 167(22): 2453-2460.

Lindström, J., and J. Tuomilehto, 2003, The Diabetes Risk Score, Diabetes Care, 26(3): 725-731.

Lucena, F., A. K. Barros, and N. Ohnishi, 2016, The Performance of Short-Term Heart Rate Variability in the Detection of Congestive Heart Failure, BioMed Research International, 2016: Article No. 1675785.

Mann, J., 2017, The Ultimate Guide to Sleep Tracking. Retrieved from https://sleepjunkies.com/features/the-ultimate-guide-to-sleep-tracking/

Maudsley, G., and E. Williams, 1996, Inaccuracy in Death Certification—Where Are We Now, Journal of Public Health, 18(1): 59-66.

Nechuta, S. J., X.-O. Shu, H.-L. Li, G. Yang, Y.-B. Xiang, H. Cai, W.-H. Chow, B. Ji, X. Zhang, W. Wen, et al., 2010, Combined Impact of Lifestyle-Related Factors on Total and Cause-Specific Mortality Among Chinese Women: Prospective Cohort Study, PLoS Medicine, 7(9): e1000339.

Nelson, W., 1972, Theory and Applications of Hazard Plotting for Censored Failure Data, Technometrics, 14(4): 945-966.

Office for National Statistics, 2016, National Life Tables, UK: 2013–2015 [Statistical Bulletin].

Office for National Statistics, 2017, Avoidable Mortality in England and Wales: 2015 [Statistical Bulletin].

Parsons, T. J., C. Power, and O. Manor, 2006, Longitudinal Physical Activity and Diet Patterns in the 1958 British Birth Cohort, Medicine and Science in Sports and Exercise, 38(3): 547-554.


Pitacco, E., 2014, Health Insurance: Basic Actuarial Models (Cham, Switzerland: Springer International Publishing).

Redlitz, H., 2017, Wrist-Size Wearables Will Help You Keep Your Blood Pressure in Check. Retrieved from https://wearablezone.com/news/wearables-track-blood-pressure-levels/

Schoenfeld, D., 1982, Partial Residuals for the Proportional Hazards Regression Model, Biometrika, 69(1): 239-241.

Schwartz, J. L., and M. A. Hamilton, 2015, Accessory Overload: Wearable Technology’s Impact on the Insurance Industry, Voice Magazine.

Seccareccia, F., F. Pannozzo, F. Dima, A. Minoprio, A. Menditto, C. Lo Noce, and S. Giampaoli, 2001, Heart Rate as a Predictor of Mortality: The Matiss Project, American Journal of Public Health, 91(8): 1258-1263.

Shcherbina, A., C. M. Mattsson, D. Waggott, H. Salisbury, J. W. Christle, T. Hastie, M. T. Wheeler, and E. A. Ashley, 2017, Accuracy in Wrist-Worn, Sensor-Based Measurements of Heart Rate and Energy Expenditure in a Diverse Cohort, Journal of Personalized Medicine, 7(2): 3.

StataCorp, 2015, Stata Statistical Software: Release 14 (College Station, TX: StataCorp LP).

Sultan, N., 2015, Reflective Thoughts on the Potential and Challenges of Wearable Technology for Healthcare Provision and Medical Education, International Journal of Information Management, 35(5): 521-526.

Tehrani, K., and A. Michael, 2014, Wearable Technology and Wearable Devices–Everything You Need to Know. Wearable Devices. Retrieved from http://www.wearabledevices.com/what-is-a-wearable-device/

Telama, R., X. Yang, J. Viikari, I. Välimäki, O. Wanne, and O. Raitakari, 2005, Physical Activity from Childhood to Adulthood: A 21-Year Tracking Study, American Journal of Preventive Medicine, 28(3): 267-273.

Therneau, T. M., P. M. Grambsch, and T. R. Fleming, 1990, Martingale-Based Residuals for Survival Models, Biometrika, 77(1): 147-160.

Thomas, R. G., 2008, Loss Coverage as a Public Policy Objective for Risk Classification Schemes, Journal of Risk and Insurance, 75(4): 997-1018.

Tudor-Locke, C., C. L. Craig, Y. Aoyagi, R. C. Bell, K. A. Croteau, I. De Bourdeaudhuij, B. Ewald, A. W. Gardner, Y. Hatano, L. D. Lutes, et al., 2011, How Many Steps/Day Are Enough? For Older Adults and Special Populations, International Journal of Behavioral Nutrition and Physical Activity, 8(1): 80.

van Dam, R. M., T. Li, D. Spiegelman, O. H. Franco, and F. B. Hu, 2008, Combined Impact of Lifestyle Factors on Mortality: Prospective Cohort Study in US Women, BMJ, 337: a1440.

van den Brandt, P. A., 2011, The Impact of a Mediterranean Diet and Healthy Lifestyle on Premature Mortality in Men and Women, American Journal of Clinical Nutrition, 94(3): 913-920.

Wahlström, J., I. Skog, and P. Händel, 2015, Driving Behavior Analysis for Smartphone-Based Insurance Telematics, 2nd Workshop on Physical Analytics, WPA 2015, May 22, pp. 19-24. ACM Digital Library.

Wilson, P. W. F., R. B. D’Agostino, D. Levy, A. M. Belanger, H. Silbershatz, and W. B. Kannel, 1998, Prediction of Coronary Heart Disease Using Risk Factor Categories, Circulation, 97(18): 1837-1847.

Wong, M. L., E. Y. Y. Lau, J. H. Y. Wan, S. F. Cheung, C. H. Hui, and D. S. Y. Mok, 2012, The Interplay Between Sleep and Mood in Predicting Academic Functioning, Physical Health and Psychological Health: A Longitudinal Study, Journal of Psychosomatic Research, 74(4): 271-277.

World Health Organization, 1995, Physical Status: The Use of and Interpretation of Anthropometry Report of a WHO Expert Committee (Geneva, Switzerland: World Health Organization).

World Health Organization and International Society of Hypertension Writing Group, 2003, 2003 World Health Organization (WHO)/International Society of Hypertension (ISH) Statement on Management of Hypertension, Journal of Hypertension, 21(11): 1983-1992.

Yates, H., 2017, Personal Data May Penalise “Uninsurables,” Raconteur.

Zhang, D., X. Shen, and X. Qi, 2015, Resting Heart Rate and All-Cause and Cardiovascular Mortality in the General Population: A Meta-Analysis, Canadian Medical Association Journal, 188(3): E53-E63.
