Discussion

yasernoory

P8-1.pdf

Home >Engineering homework help >Discussion

Heart-to-Heart (H2H): Authentication for Implanted Medical Devices

Masoud Rostami Rice University Houston, TX

[email protected]

Ari Juels RSA Laboratories Cambridge, MA

[email protected]

Farinaz Koushanfar Rice University Houston, TX

[email protected]

ABSTRACT We present Heart-to-Heart (H2H), a system to authenticate external medical device controllers and programmers to Implantable Medi- cal Devices (IMDs). IMDs, which include pacemakers and cardiac defibrillators, are therapeutic medical devices partially or wholly embedded in the human body. They often have built-in radio com- munication to facilitate non-invasive reprogramming and data read- out. Many IMDs, though, lack well designed authentication proto- cols, exposing patients to over-the-air attack and physical harm.

H2H makes use of ECG (heartbeat data) as an authentication mechanism, ensuring access only by a medical instrument in phys- ical contact with an IMD-bearing patient. Based on statistical anal- ysis of real-world data, we propose and analyze new techniques for extracting time-varying randomness from ECG signals for use in H2H. We introduce a novel cryptographic device pairing proto- col that uses this randomness to protect against attacks by active adversaries, while meeting the practical challenges of lightweight implementation and noise tolerance in ECG readings. Finally, we describe an end-to-end implementation in an ARM-Cortex M-3 mi- crocontroller that demonstrates the practicality of H2H in current IMD hardware.

Previous schemes have had goals much like those of H2H, but with serious limitations making them unfit for deployment—such as naïvely designed cryptographic pairing protocols (some of them recently broken). In addition to its novel analysis and use of ECG entropy, H2H is the first physiologically-based IMD device pairing protocol with a rigorous adversarial model and protocol analysis.

Categories and Subject Descriptors J.3 [Computer Applications]: Life and Medical Sciences - Medical information systems; C.3 [Computer Systems Organiza- tion]: Special-Purpose and Application-Based System, Real-time and embedded systems

General Terms Security, Design

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full cita- tion on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re- publish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. CCS’13, November 4–8, 2013, Berlin, Germany. Copyright 2013 ACM 978-1-4503-2477-9/13/11 ...$15.00. http://dx.doi.org/10.1145/2508859.2516658.

Keywords Implantable Medical Devices; IMD Security; Security Protocols

1. INTRODUCTION Implantable Medical Devices (IMDs) apply continuous monitor-

ing and automatic therapies to the treatment of chronic medical disorders. Implanted either partially or fully in patients’ bodies, IMDs are often sophisticated devices containing batteries, embed- ded CPUs, radios, sensors, and actuators. As clinical trials validate IMDs’ efficacy [22] and as IMDs treat a broadening range of disor- ders, their use is growing. For instance, in the United States, over 100,000 patients a year receive implantable cardioverter defibrilla- tors (ICDs) [19], which detect dangerous heart rhythms and admin- ister electric shocks to restore normal activity. Other IMDs include pacemakers, neurostimulators, and implantable drug pumps.

Powered IMDs generally contain radios for communication with external devices called commercial device programmers that can reprogram IMDs and extract patient data from them. Such wireless communication permits safe, non-invasive access to IMDs. But it also brings the security risks of embedded control into the human body. Seminal work by Halperin et al. [20], for example, exposes design flaws in a common ICD that enable attackers to seize unau- thorized control wirelessly, and potentially harm victims.

Our work here addresses the tension between two critical re- quirements for IMDs. On the one hand, IMDs must offer rea- sonably permissive access-control policies when life-threatening medical events occur. Emergency responders may need to repro- gram IMDs or extract patient data from them, and shouldn’t in- cur fatal treatment delays by contacting care providers for device- specific keys or passwords. On the other hand, overly loose access- control policies expose IMDs to unauthorized wireless access, such as those illustrated by the Halperin et al. attack, that can physically harm patients or expose their medical data [20].

1.1 Heart-to-Heart (H2H) Our solution is a system called Heart-to-Heart (H2H). H2H

implements a simple access-control policy for IMDs that we call “touch-to-access”: A medical instrument (e.g., commercial device programmer), which we call generically a Programmer, obtains ac- cess to a patient’s IMD if and only if it has significant physical con- tact with the patient’s body. An important facet of touch-to-access is forward security. Authentication to the IMD lapses once the in- strument loses physical contact with the patient.

Touch-to-access offers a practical and effective balance between the competing access requirements of permissiveness in emergen- cies and resistance to attacks. The policy is also common sense: Physical access to a patient means the ability to harm or cure.

1099

Figure 1: Basic H2H operation. The Programmer and IMD take individual ECG readings, respectively β and α. If β ≈ α, then the IMD grants access to the Programmer.

H2H enforces a touch-to-access policy using a time-varying bio- metric, often called a physiological value (PV). When a Program- mer seeks access to an IMD, it initiates an authentication session. The IMD takes a reading α of the PV; at the same time, the Pro- grammer takes its own reading β. If β is “nearly equal” to α, then the Programmer obtains access to the IMD. (“Near equality,” as we explain later, is needed because PV readings are noisy.)

The H2H architecture can in principle rely on any PV, but we fo- cus here specifically on use of the waveform produced by the heart, known as an ECG (electrocardiogram). Thus H2H is well suited for authentication to cardiac IMDs such as ICDs and pacemakers, to- day the largest class of powered IMDs. In principle, though, H2H can work with any IMD equipped to measure ECG anywhere in the body, not just cardiac devices. As we show, suitably processed ECG samples effectively constitute a low-bandwidth stream of ran- dom bits well suited to forward-secure authentication.

Briefly, then, in H2H a Programmer and IMD take independent, time-synchronous ECG readings. The IMD compares the two re- sults to enforce a touch-to-access policy on wireless access by the Programmer. Figure 1 depicts the basic operation of H2H.

1.2 Challenges and contributions Several previous schemes have sought to perform ECG-based

pairing with IMDs like H2H, but have had serious shortcomings. Most notably, previous schemes have relied on cryptographic pair- ing protocols without rigorous adversarial modeling or security analysis. As a result, two of the most recent of them [23, 48] were shown in a 2013 paper [39] to have serious cryptographic flaws.

Thus designing a practical system such as H2H with rigorous security assurances has effectively remained an open problem, one that raises several technical challenges.

The first challenge is demonstrating that ECG is a suitable PV for authentication. H2H derives a key source from the patient’s ECG signal, a sequence of key bits that authenticate a Programmer to an IMD. Secure touch-to-access authentication requires that the key source be truly random, ideally that constituent bits have high entropy and are statistically independent from one another and over time. The key source is then hard for an attacker to guess without physical access to the patient and also ensures forward-security, i.e., that old key source bits don’t reveal future ones.

Previous work has explored the statistical properties of ECG waveforms for key generation, but not the important impact of read error rates on authentication false positive and false negative rates. We present experiments on real patient ECG data showing that it’s possible (with errors) to extract roughly four truly random and sta- tistically uncorrelated bits from the ECG wave corresponding to a single heartbeat. Collection over a 15-second interval suffices for strong Programmer authentication (a false acceptance rate of about 2.7×10−9 and false rejection rate of 10−4).

We also introduce an optimal scheme in H2H for testing PV validity, i.e., testing α ≈ β. Our scheme relies on the Neyman- Pearson Lemma, rather than naïvely on Hamming distance, as in previous work.

Good statistical properties, however, don’t ensure that ECG can enforce the touch-to-access policy in H2H. Recently developed sys- tems can read cardiac rhythms remotely via videocamera, and even accurately measure a patient’s pulse. (Skin color changes subtly with cardiac rhythms.) In Section 3.3, we very briefly report our implementation and test results for the best known of these sys- tems [35] and show that it doesn’t reveal statistically significant information about the ECG key source used by H2H.

Given a good key source, a cryptographic pairing protocol is needed between the Programmer and IMD. Two features of H2H make cryptographic pairing a challenge. First, when the Program- mer and IMD synchronously sample the key source, their respec- tive readings β and α are noisy: Often β ≈ α, but exact equal- ity β = α isn’t obtained. Cryptographic tools such as password- authenticated key agreement (e.g., [5]) require exact equality, while error-tolerant ones, e.g., [14,24], sacrifice entropy needlessly in our setting here.

Second, the IMD has tight computational and power constraints. Microcontrollers in common use for IMDs today can perform only lightweight cryptography. As IMDs are long-lived devices (with an average lifetime today of five to seven years [15]), and battery replacement requires surgical intervention, power conservation is essential. H2H can protect new IMDs as well as legacy in-vivo IMDs with upgradable firmware, as long as the H2H implementa- tion meets the IMD’s limited memory and computational resources.

We present a new pairing protocol that exploits the fact that key source bits are statistically uncorrelated, and thus that we can treat α and β as one-time authentication values. We demonstrate its security in a strong adversarial model that includes man-in-the- middle attacks, such as the jam-and-replay attacks feasible in a wireless environment.

Our H2H pairing protocol requires only a low-exponent RSA encryption (tens of modular multiplications) and a few AES invo- cations and hash computations by the IMD. We demonstrate a full implementation of H2H on an ARM Cortex-M3 processor.

In summary then, our contributions are:

• Statistical characterization of ECG for authentication: Us- ing real-world ECG measurements [30], we experimentally quantify the extractable entropy in ECG signals. (We demon- strate use of the Neyman-Pearson Lemma to achieve optimal use of this randomness.)

• Cryptographic pairing protocol: We present a novel, lightweight, noise-tolerant cryptographic scheme for Programmer-to-IMD pairing in H2H. We formalize an adversarial model and outline proofs of security.

• Implementation: We describe a full implementation of H2H in an ARM Cortex-M3 processor, reporting resource require- ments such as code size and power consumption, and demon- strating the feasibility of H2H for use in contemporary IMDs.

1100

Organization We introduce our statistical model for ECG waveforms and op- erational, trust, and adversarial models for H2H in Section 2. A detailed statistical analysis of real-world ECG waveforms and our proposed entropy extraction techniques follows in Section 3. We specify the cryptographic device-pairing protocol for H2H in Sec- tion 4. We describe an implementation of H2H in Section 5. We review related work in Section 6, and conclude in Section 7 with a discussion of future work. Appendix A outlines a security analysis of the H2H device-pairing protocol; detailed modeling and analysis is deferred for the full version of this paper.

2. MODELING Before diving into details on H2H, some basics on ECG and our

associated statistical model are in order, as well as discussion of our operational, trust, and adversarial models.

2.1 ECG model Figure 2 is a schematic depiction of the ECG waveform of a

healthy patient. The so-called R-peak is the most prominent feature of the ECG waveform; it corresponds to the “beat” in a heartbeat. The time between two consecutive R-peaks, or the heartbeat dura- tion, is commonly referred as the inter-pulse interval (IPI). As the figure shows, a typical ECG cycle includes other physiologically significant, named features: The P-wave, which occurs before the R-peak, the QRS complex, which includes sharp valleys before and after the R-peak, denoted by Q and S respectively, and the T-wave, following the S valley.

The heart rhythm is governed by the parasympathetic nervous system, in which many non-linearly interacting processes give the IPI its well-studied chaotic nature [7, 31]. The ECG waveform, and parasympathetic network more generally, are influenced by both long-term trends such as circadian rhythm and short-term temper- ature and respiratory changes. Thus ECG waves simultaneously exhibit both long term patterns and short-term chaotic behavior.

The ECG signal is well modeled as a stochastic process. The existence of long-term patterns renders the process non-stationary, meaning that the parameters of its underlying distribution, e.g., mean and variance, fluctuate over time. We introduce transforms for H2H, however, that eliminate long-term variations, creating a residual signal that is well-approximated by a wide-sense station- ary stochastic process, i.e., one whose first and second moments don’t change over time. Previous work observed a strongly random element in IPI time series values [7], motivating later use of IPIs as a natural source of randomness [9].

Like H2H, previous systems also exploited the natural synchro- nization property of IPIs. Slight shifts in the time interval over which IPIs are derived don’t impact IPI values, which are com- puted relative to R-peaks, not absolute time.

Entropy and security: We show that it is possible to extract four high-grade random bits per IPI from our processed ECG source, i.e., bits that have maximal entropy and are fully uncorrelated. We use this entropy measure to characterize the security of H2H for- mally using our main theorem, Theorem 1. An important new as- pect of our work is quantification and use of the different error rates incurred by individual high-grade random bits via the Neyman- Pearson Lemma. This simple approach marks a notable advance over earlier work, which assigned the same significance to all ran- dom bits. Our improvement enables authentication with the opti- mal, i.e., minimum possible, false positive rate for a given false negative constraint.

0 200 400 600 800 −2

−1.5

−1

−0.5

0.5

1.5

Time (ms)

V (m

V )

Q S

Figure 2: A typical ECG waveform from lead V2 which is recorded from chest. R-peak is the most prominent feature.

2.2 Operational and trust models We envision use of H2H primarily for emergency authentica-

tion, when medical personnel, e.g., emergency medical techni- cians (EMTs), need access to a patient’s IMD, but have no pre- established keys or trust relationship. Rapid and reliable access to the IMD is important. Thus H2H harvests PV randomness effi- ciently to achieve quick authentication.

We assume no public-key infrastructure (PKI) for certification of trustworthy programmers. The challenges of key revocation, tamperproofing of programmers to prevent key compromise, etc., are substantial. Similarly, we assume it’s impractical for medical personnel to contact an authority on the fly for access credentials, as this approach would require an infrastructure of broad (indeed, worldwide) and robust trust relationships. Thus H2H relies exclu- sively on the touch-to-access policy for authentication.

The ECG waveform goes flat when an acute heart attack occurs. Similarly, in some late-stage terminal diseases, the parasympathetic network collapses and as a result, the ECG waveform loses most of its entropy. The hugely distorted ECG waveform resulting from such conditions is readily identifiable. In such cases, H2H is de- signed to enter a promiscuous mode in which any Programmer may access the IMD: For these acute events, the risks of medical failure greatly outweigh those of malicious attack. Additionally, these ex- treme medical conditions occur rarely.

In non-emergency situations, for instance, when a patient is re- ceiving routine medical care, it may be practical for medical per- sonnel to retrieve device-specific keys. But in unusual situations, e.g., patients traveling abroad, lost keys, and so forth, H2H is addi- tionally useful as a secondary or backup authentication mechanism.

2.3 Adversarial model We design H2H for a strong adversarial model that assumes the

presence of an attacker during a Programmer-to-IMD authentica- tion session. This adversary is active. It has complete network control, i.e., can drop (jam), modify, replay, and forge messages at will. (The adversary can’t compromise the Programmer or IMD, which would render protection of the IMD impossible.) While the presence of an adversary during a medical emergency is admittedly a strong assumption, we believe it is prudent to design robust secu- rity for critical systems such as IMDs by default.

Protecting against strong, active adversaries in a pairing protocol isn’t straightforward. As mentioned above, recent breaks of two recent such protocols illustrate the challenge [39].

1101

3. AUTHENTICATION PROCESS Several papers, e.g., [36, 45, 48], have proposed IMD authenti-

cation based on IPIs. Their common motif is extracting the purely uncorrelated random bits from IPIs and utilizing them as a key. As previously stated, IPIs are not independent across time. So most previous approaches were confined to utilizing only a portion of the quantized IPIs for key derivation. These protocols generate a key by quantizing IPIs and then concatenating the three or four least significant bits of the quantized IPIs. The Programmer is then au- thenticated if and only if the Hamming distance between the keys generated by the two communicating parties is less than a prede- fined threshold value.

The four least significant bits of IPIs (IPI4) are known to be independently and identically distributed (i.i.d.). We confirmed this characteristic by studying a 2Mbit dataset consisting of 48 half- hour ECG records of 47 subjects from the MIT-BIH Arrhythmia Database [30], 549 two-minute records of 290 subjects from the PTB Database [6], and 250 records of 250 patients from MGH/MF Waveform Database [47], each roughly 30 minutes in length. All of these databases are available at [1]. We note that while these widely referenced databases contain records from patients with abnormal cardiac rhythms, these and similar databases remain standards for the study of ECG-based biometric authentication. (See, e.g., [45, 48].) Such use is substantiated by an extensive body of literature (e.g., [17, 42, 49]) documenting stronger chaotic effects in healthy hearts than in diseased ones.

We applied the NIST suite of statistical tests [41] to our dataset. The outputs of the NIST statistical tests are p-values listed in Ta- ble 1. These p-values represent the probability that the dataset was generated by a random process. If this value is less than a threshold (usually 1%), the randomness hypothesis is rejected. Table 1 shows that the p-values are all greater than 1%.

Table 1: p-value of several NIST statistical tests for IPI4. These bits pass all of the random tests.

NIST test p-value Runs 0.311310 Rank 0.879647

Longest runs 0.185359 Frequency 0.011830

Universal test 0.013223 Approximate entropy test 0.464725

FFT test 0.131301 Linear complexity 0.612269

The lack of correlation between bits allowed previous work to use a simple Hamming distance metric to compare received and measured bits. Prior work, however, did not characterize and op- timize for the error rates on sampled bits resulting from noisy IPI readings. More precisely, we define the error rate as the probability, for a given IPI-derived bit, that two devices, e.g., a Programmer and IMD, read the same IPI at different points on the body but output differing bit values.

Lacking the ability to obtain IPI measurements from IMDs in our lab, we estimate the error rates for IPI-derived bits in the H2H setting by means of two external ECG leads, on the left arm and right arm of subjects. The electric potential of the ECG lead III between the left arm of subjects and their left foot is taken as a sur- rogate for the ECG from the IMD. Similarly, the electric potential of the ECG lead II between the right arm and left foot is taken as a surrogate for the ECG recorded by the Programmer and IMD. It should be noted that we are using the interval between consecu-

tive prominent R-peaks, instead of analyzing the whole waveform as in [44]. Therefore, H2H is not sensitive to the location of leads on the body. Any other lead configuration could have been used in our analysis. We performed this analysis with other lead configu- rations and didn’t find any significant variations in the error rate of least significant bits.

In the next step of analysis, the IPI values of these two readings from leads II and III are calculated and quantized. They are then converted to a Gray-code representation to minimize the difference between the quantized bits caused by error in measurement.

The Hamming distance between these two sets of bits is then taken as a surrogate for the error rate between IPI measurements by the IMD and Programmer. The results of our experiment are reflected in the “Error rate” column of Table 2 for IPI values quan- tized to 8-bit representation. The last column shows the 95% con- fidence interval of the error rates. We again used an aggregate of MIT-BIH [30], PTB Database [6], and MGH/MF Database [47] to estimate these error rates.

The table shows that the error rate varies considerably across quantized bits; the lower the significance of the bit, the higher its error rate and entropy. In the next subsection, we describe our sta- tistical approach—a departure from the naïve Hamming approach of previous work—to compare PV readings during authentication with suitable weighting for individual bit errors.

Bit Entropy Error rate 95% CI Denoted by 8 (MSB) 0.27 0.001 – –

7 0.80 0.003 – – 6 0.90 0.004 – – 5 0.98 0.006 – – 4 1 0.009 (e4) 0.008 – 0.012 x4 3 1 0.018 (e3) 0.015 – 0.021 x3 2 1 0.039 (e2) 0.035 – 0.043 x2

1 (LSB) 1 0.080 (e1) 0.075 – 0.086 x1

Table 2: Average entropy and the estimated error rate of quan- tized bits, along with their 95% Confidence Interval (CI).

3.1 Quantifying the probability of skin con- tact

It’s convenient to treat the IMD PV α as correct. Error rates then characterize honest or attacker deviation from α.

A PV in H2H includes only the four least significant bits of an IPI, which we denote collectively by IPI4. The bits in IPI4 are i.i.d. random variables. Thus, an adversary that hasn’t made skin contact with a victim, and has no information about IPIs, can at best guess an IPI4 value by assigning random values to each of its constituent bits. Suppose that n is the number of distinct IPI4 instances read in an H2H authentication session. Then the total number of incorrect guesses by Adv for any given one of the four IPI4 bit positions can be modeled as a binomial distribution with Bernoulli trial probability of 0.5, denoted by B(n,0.5).

The total number of incorrect bit outputs for a given bit position i by a valid Programmer with skin contact can be modeled by another binomial distribution B(n,ei). Here, ei is the error rate of bit i ∈ {1,2,3,4} as given in the third column of Table 2.

Figure 3 compares the distributions of incorrect guesses by an adversary (with no skin contact) against those of a valid Program- mer, for n = 20. The solid line is the distribution for the adversary on any of the four bit positions. For the Programmer, xi = B(n,ei) denotes the random variable corresponding to total incorrect values

1102

in bit position i. The adversary is seen to produce significantly more errors than the Programmer in all bit positions.

0 5 10 15 20 0

0.2

0.4

0.6

0.8

Incorrect guesses by adversary and incorrect outputs by programmer (x

4 ,x

3 , x

2 ,x

1 )

P ro

b a b il

it y

Adv

x 1

x 2

x 3

x 4

Figure 3: Probability distributions on incorrect guesses by a valid Programmer (dotted lines) and an adversary (solid line), for n = 20 (reading of 20 IPI4 values). Here, x1, x2, x3, and x4 denote random variables on Programmer errors for bits posi- tions 1, 2, 3, and 4 respectively. The distribution for the adver- sary is identical across bit positions. Clear separation is seen between valid and adversarial distributions, even for bit posi- tions with relatively high error rates (e.g., for x1).

3.2 Neyman-Pearson hypothesis testing Recall that the goal in the H2H authentication process is to de-

termine whether α ≈ β, where the IMD reads PV α and the Pro- grammer submits PV β. Determining whether Programmer PV β is authentic, i.e., resulting from skin contact, may be viewed as a hy- pothesis test. The underlying hypothesis is that the Programmer’s claimed PV β is drawn from the probability distribution of an hon- est Programmer, instead of an adversary’s guessing distribution.

This observation motivates use of the well known Neyman- Pearson Lemma [32] to distinguish between honest and adversarial authentication attempts. Let error value u denote the set of errors in β, i.e., bit positions that differ from α. In our context, then, Neyman-Pearson Lemma states that for a given maximum accept- able false negative rate, the false positive rate is minimized as fol- lows. For a fixed threshold value Th (whose computation we dis- cuss below), a submitted Programmer value β is accepted as valid only when the following criterion holds:

log

( P (u)

Q(u)

) > Th, (1)

where P(·) denotes the probability of an adversary with no skin contact yielding error value u and Q(·) denotes the probability of a valid Programmer yielding u. We model distributions P(·) and Q(·) according to the binomial distributions discussed above and depicted in Figure 3.

We observe that as the bits in a given bit position i are i.i.d., the correctness of a PV is invariant to which IPI4 values contain er- roneous bits. The authenticity of a PV β is thus determined based on only the total number of correct or incorrect values in each bit position. So we can treat u as an equivalence class of PVs. In par- ticular, it’s convenient to regard u as a vector ~u = 〈u1,u2,u3,u4〉, where ui denotes the total number of IPIs in β that are incorrect in bit position i—again, that differ from those in α.

Now, P(~u) = ∏4 i=1

Pi(ui) and Q(~u) = ∏4 i=1

Qi(ui), where Pi(ui) and Qi(ui) denote the probability of a total of ui incorrect IPI values for bit position i in adversarial and honest scenarios, respectively. It follows that:

log

( P (~u)

Q(~u)

) =

4∑ i=1

log (Pi (ui))− 4∑ i=1

log (Qi (ui)) . (2)

Based on Equation 2, we can construct complete, compact rep- resentations of P(~u) and Q(~u). For a given value of n, it suf- fices to build a table containing log Pi(ui) and log Qi(ui) for i ∈ {1,2,3,4} and ui ∈ Zn. Because the error rate of the ad- versary is 1/2 for all bit positions, Qi(ui) = Qj(uj) for any i,j ∈ {1,2,3,4}. Thus, it suffices to store log Qi(ui) val- ues for i = 1 only. Consequently, the full table contains just (4 + 1)× (n + 1) = 5n + 5 values.

Given such a table, performing the Neyman-Pearson test in Equation 1 requires just (4+1) = 5 table lookups, eight additions, and one subtraction. This computation is online, i.e., performed during authentication. The storage and computational efficiency of our table-driven approach to Neyman-Pearson testing proves valu- able in our implementation of H2H, described later.

One issue remains. The Neyman-Pearson Lemma states the exis- tence of threshold Th, but doesn’t specify how to compute Th. We now describe an algorithm to compute Th in our setting. Note that computation of Th takes place offline: Th need only be computed once and can then be programmed into an H2H-enabled IMD.

Computing Neyman-Pearson threshold value Th: It turns out that the space (Zn)4 of possible values of ~u is relatively small. As we show below, n ≤ 50 is sufficient to achieve our desired strength of authentication for H2H, meaning that the total number of possi- ble values of ~u is at most 504 = 6,250,000. Consequently, we can compute Th essentially by means of a brute force algorithm.

We specify this algorithm in pseudocode below as Algorithm 1. Algorithm 1 computes Th for a target false-negative rate FNReq. It first constructs a matrix M[n4][3] with n4 rows, one for each ~u ∈ (Zn)4, and three columns. For each row ~u, Column 1 contains P(~u), Column 2 contains Q(~u) and Column 3 contains log P(~u)

Q(~u) .

The rows of M are sorted in ascending order with respect to Column 3 values. Then, from top (smallest) to bottom (largest), Column 1 values are accumulated in a variable p until the lowest row τ is reached for which the cumulative value p ≤ FNReq. The Column 3 value of row τ, namely M[τ][3], is the optimum thresh- old value Th. By summing Column 2 values over the first τ rows, we also obtain the corresponding false-positive rate (FP ). (A com- putation failure outputs special symbol ⊥.)

The dominant cost of Algorithm 1 is sorting. Thus its asymptotic complexity is O(n4 log n). In practice, as n is small, the algorithm executes quickly. For example, we implemented Algorithm 1 in MATLAB on a machine with a 3.4GHz Intel i7-2600 CPU running Windows 7. It took 0.2 second to calculate Th for n = 15 and around 8 seconds for all values of n from 1 to 25.

Again, we emphasize that Algorithm 1 is run as a precomputa- tion offline, not in the IMD.

Setting parameters in H2H: In our H2H implementation, we set the false negative rate (FNReq) to 10−4. In practical terms, this means that a valid Programmer with skin access would fail on av- erage in one in every 10,000 attempts; it would fail twice consecu- tively at most once in every 100,000,000 attempts. We believe this choice achieves adequate failure resilience for real-world scenarios.

1103

Algorithm 1 Neyman-Pearson threshold Th computation Inputs: n, {ei}4i=1, FNReq Outputs: Th, FP

P [1 : n + 1] ← binomial(n,0.5); for i = 1 to 4 do

Q[1 : n + 1][i] ← binomial(n,ei) end for j = 1; for ~u = 〈u1,u2,u3,u4〉 ∈ (Zn)4 do

M[j][1] = 4∏ i=1

P [ui];

M[j][2] = 4∏ i=1

Q[ui][i];

M[j][3] = log( M[j][1] M[j][2]

); j ← j + 1;

end for sort M on M[·][3] (Column 3); p ← 0;j ← 0 while p ≤ FNReq do

j ← j + 1; p ← p + M[j][2];

end while τ ← j − 1; if τ < 1 then output ⊥; halt end if FP ←

τ∑ k=1

M[k][1];

Th ← M[τ][3];

Figure 4 illustrates the tradeoffs between false negative rates (FNReq) and false positive rates (FP ), for varying numbers n of IPI4 values used in authentication. Table 3 gives detailed FP values for our implementation choice FNReq = 10−4 and, for comparison, for FNReq = 10−3. Naturally, the lower FNReq, the higher FP .

In addition to FNReq, the other key parameter choice in H2H is the number n of IPI4 values measured for authentication. The larger n is, the better FNReq and FP are. As n grows, though, so does the ECG measurement time in an H2H authentication.

In our implementation, we have chosen to set n = 15. Given our choice of FNReq = 10−4, the corresponding false positive rate is FP = 2.7×10−9. We chose this FP to demonstrate the feasibility of a strong level of authentication. As a point of comparison, this FP is lower than the false acceptance rate of a typical, eight-digit RSA SecurID token [40]. (While the false acceptance rate for such a token is nominally 1×10−8, allowances for synchronization er- rors and multiple tries make it somewhat weaker.) Lower FPs, and thus lower values of n, are likely to be acceptable in practice.

Table 3: FP values achieved for our implementation choice FNReq = 10

−4 and, for comparison, for FNReq = 10−3. Aver- age PV read time is given in the last column. n FNReq = 10

−3 FNReq = 10 −4 Avg. read time (secs.)

5 3×10−3 1.1×10−3 3– 5 10 1.15×10−6 7.9×10−6 7–10 15 0.2×10−9 2.7×10−9 11–15 20 0.27×10−12 5.38×10−13 15–20 25 0.32×10−17 8.4×10−17 18–25

The average resting heart rate is 60-80 beats per minute [2]. Thus, for n = 15, the recording time would on average take be-

5 10 15 20 25

−50

−40

−30

−20

−10

# of committed IPI 4

L o

g 2 (F

a ls

e p

o si

ti v

e r

a te

)

10 −3

10 −4

(used in H2H)

10 −5

10 −6

Figure 4: False positive rates achieved by Neyman-Pearson hy- pothesis testing for various false negative rates.

tween 7 to 10 seconds. The last column of Table 3 lists the average range of PV read times (in seconds) for different choices of n.

Summary: In H2H, we apply Neyman-Pearson hypothesis testing to determine whether to accept a Programmer-submitted PV β as authentic or reject it. We first compute the error value ~u of β. Error value ~u captures the total number of errors ui in each IPI4 bit po- sition i in β (by comparison with the IMD PV α). We then perform Neyman-Pearson testing of ~u via Equation 1. This test involves computation using Equation 2, and is made efficient by precom- puting a small table of ui value probabilities and Neyman-Pearson threshold Th, both of which are stored in the IMD.

An attacker can, of course, make multiple attempts against the IMD and the Programmer as well. In Section 4, we formally char- acterize the success probability of such attacks relative to FP . Ex- ponential backoff in the IMD is one helpful countermeasure.

3.3 Remote attack We also briefly investigated an attack on H2H based on re-

mote cardiac activity monitoring. The best reported remote heart rate monitoring result is achieved by photoplethysmography (PPG) [35]. PPG traces changes in skin color caused by temporal varia- tions in the concentration of blood on the skin surface.

Poh et al. [35] have reported moderately accurate IPI estimation using a commercial webcam at an approximate distance of 50cm from human subjects. We reproduced and evaluated their scheme at the same distance with a 30 frame per second (FPS) camera recording video of the subjects’ faces for PPG evaluation over a two minute period. Our camera had twice the FPS rate of the cam- era in [35].

We have not been able, however, to achieve error rates as low as those reported in [35] for any of our test subjects. (We have con- tacted the authors and requested clarification of details and their original dataset for validation, but have received no response.) The average error rate for the four least significant bits (IPI4) of all four subjects in our PPG experiment were close to 50%, i.e., to random guessing— substantially higher than those achievable by a Programmer with skin access and yielding little advantage to a re- mote attacker targeting H2H. We conclude that without significant advances, PPG is unlikely to pose a significant threat to H2H.

1104

4. PROGRAMMER-TO-IMD PAIRING PROTOCOL

We now describe the design of the H2H PV-based cryptographic pairing protocol. First we explain our design principles, both why we don’t use existing cryptographic protocols and how we exploit special features of the H2H setting to achieve simple, efficient pro- tocol design. We then present the protocol and a security analysis.

A naïve approach to authentication might work as follows. The Programmer establishes a secure connection with the IMD (via TLS, for instance). The two devices then take respective PV read- ings α (IMD) and β (Programmer). The Programmer transmits β to the IMD. If β ≈ α, i.e., β is close to α, the IMD accepts the Programmer as valid.

The problem with this approach is that it’s vulnerable to a man- in-the-middle attack. An adversary Adv can simultaneously pose as the IMD in a session with the Programmer and as the Programmer with the IMD. On receiving β from the Programmer, Adv forwards it to the IMD, resulting in a successful authentication.

Password-authenticated key-exchange (PAKE) schemes [5], are designed precisely to address such attacks, and might seem an ap- propriate tool for H2H. The PVs α and β measured respectively by the IMD and Programmer may be treated as passwords: The Pro- grammer gains access to the IMD by demonstrating its approximate knowledge of “password” α, i.e., that it knows β such that α ≈ β.

There are two problems with PAKEs. First, due to read errors in our setting, the IMD must check for approximate equality, i.e., α ≈ β. But a PAKE requires exact equality. More involved ap- proaches, e.g., bit-by-bit password testing, or use of fuzzy extrac- tion, e.g., [14] can convert PAKE into a “fuzzy” tool for approxi- mate equality testing.

But PAKE presents a second problem: Computational cost. While the several modular exponentiations required by a single PAKE execution are feasible on many devices, they constitute more computation—and more energy expenditure, in particular—than desired on IMDs, which are highly constrained in terms of power and computational resources. A “fuzzy” PAKE would require even more computation.

Thankfully, as it turns out, PAKE is overengineered for H2H. It’s possible to support approximate matching of α and β and gain better computational efficiency than PAKE.

4.1 Protocol overview Our key observation is that the readings α and β in H2H are one-

time values. In contrast to passwords, which are generally multi- use, α and β are transient. Fresh readings may be used to authen- ticate every session and, as we have demonstrated experimentally above, readings are statistically independent across time.

Consequently, it is possible to reveal α and β safely at the end of our authentication protocol—something not possible, of course, with static passwords. The protocol can thus rely primarily on (very fast) symmetric-key commitment and decommitment rounds and explicit IMD testing of the condition α ≈ β, rather than minimal- knowledge cryptographic comparison.

Our protocol has two phases: (1) A secure-channel setup phase, which uses (lightweight) public-key cryptography to create a se- cure but unauthenticated channel between the IMD and Program- mer and (2) An authentication phase, in which the two devices use a commitment / decommitment scheme to check whether α ≈ β.

Secure-channel setup. In the first phase of our protocol, the IMD and Programmer establish a secure channel via TLS. The IMD assumes the role of the TLS client; the Programmer, that of

a TLS server. That is, only the Programmer presents a certificate. When instantiated with RSA, TLS requires little client computa- tion, just one low-exponent (e = 216 + 1) modular exponentiation.

Our protocol makes use of an output from TLS session what we call a label s. Given that at least one of the two entities is honest, s is random and thus unique (with overwhelming probability). It isn’t secret, however. In practice, s might be, e.g., the hash of the TLS session key. For convenience, we abstract away the details of TLS and just model it as a protocol SecChannel that establishes a secure channel between two entities and outputs random label s.

SecChannel (in practice, TLS) creates a secure channel in the sense that it provides confidentiality, integrity, and freshness. But it doesn’t provide authentication: The IMD doesn’t present a certifi- cate, and doesn’t validate the Programmer’s. (As explained above, H2H avoids the burden of a PKI.) Put another way, when an IMD first sets up a secure channel, it has no assurance that it has paired with a valid Programmer, i.e., one actually in contact with the pa- tient. Similarly, a Programmer doesn’t know if it’s communicating with a valid IMD. Thus the next protocol phase.

Authentication. In the authentication phase, the two devices commit to their respective PV readings α and β. Each device binds its commitments to the label s of the secure channel on which it is communicating (preventing its re-use, prior to decommitment, on a different channel).

The IMD can then safely decommit α for the Programmer, as it has already received a commitment for β.

If the Programmer determines that α ≈ β, then it decommits β. Otherwise, it rejects the session. This selective decommitment helps ensure that the Programmer only reveals β to a valid IMD (one that knows α ≈ β), preventing re-use of β by an adversary. If the Programmer had been the party who decommits first, an adver- sary would have easily mounted a man-in-the-middle attack.

The IMD itself then verifies that α ≈ β, and makes an accept / reject authentication decision.

After an invalid authentication attempt, IMD waits a full PV read cycle before accepting a new authentication request. This delay prevents interleaving attacks, in which a Programmer’s ses- sion overlaps with two IMD sessions. (This is in fact necessary to achieve Theorem 1 below.)

4.2 Protocol specification The H2H authentication protocol is specified in Figure 5. Some

technical preliminaries are needed. Define V as the space of valid PVs. Let dist : V ×V → R+0

denote a pairwise distance metric on V. Let τ denote the time re- quired for a device to read a PV.

We make use of a commitment scheme Commit with message space V and key space {0,1}k×{0,1}k. We denote a commitment of message pair (m,s) ∈ V ×{0,1}k under key w ∈ {0,1}k by C = Commit((m,s);w). We adopt the convention of decommit- ment as verification of correct commitment, i.e., decommitting m under key w involves the check C

? = Commit((m,s);w).

For simplicity of analysis, we treat Commit as an ideal function- ality [8], i.e., as unconditionally hiding and binding. When either device outputs the message reject, rejecting the session, it termi- nates communication on the session channel. Additionally, devices support only serial sessions, not concurrent ones.

4.3 Security analysis We consider an adversary Adv that fully controls the channel be-

tween the IMD and Programmer, i.e., Adv can deliver, drop, mod- ify, and forge messages as desired. Adv can’t corrupt the IMD or

1105

IMD Programmer

SecChannel R→s∈{0,1}k ←→

read PV α(∈V); read PV β (∈V); wA

R←{0,1}k; wB R←{0,1}k;

CA ← Commit((α,s);wA) CB ← Commit((β,s);wB) CA−→ CB←− wA−→

if CA 6= Commit((α,s);wA) or dist(α,β) > d then wB ← reject

wB←− if CB 6= Commit((β,s);wB) or dist(α,β) > d then

auth ← reject else

auth ← accept auth−→

if auth = reject wait for time τ

Figure 5: H2H pairing protocol.

Programmer, however. (If it could, protection of the IMD wouldn’t be possible.) Here we briefly present our formal security model and main theorem. Appendix A contains more details.

We assume that readings α and β by the IMD and Programmer come from probability distributions defined by a model M. Given M, p1 denotes the maximum probability that Adv can guess a valid PV reading in one try, i.e., given legitimate PV reading α, that Adv can guess a v such that dist(v,α) ≤ d. In other words, p1 is the authentication probability characterized in Section 3.

Similarly, p2 denotes the maximum probability that Adv does so in exactly two tries. More precisely, given simultaneous PV readings α and β, p2 is the maximum probability, given a known, failed guess for α, that Adv guesses a valid PV reading for β.

We define the security of H2H in terms of an experiment in which Adv succeeds if it can authenticate to the IMD, i.e., cause it to output accept for a session with Adv on SecChannel. To convey intuition for the resistance of H2H to man-in-the-middle attacks, we give the following lemma with a proof sketch. Here succH2HAdv (1,1) denotes the probability that Adv succeeds when it initiates a single session each with the Programmer and IMD.

LEMMA 1. For PV model M, succH2HAdv (1,1) ≤ p1 + p2 − p1p2.

Proof: [sketch] Adv’s goal is to initiate a session on SecChannel with the IMD such that it outputs accept. To do so, Adv must send the IMD (simulated Programmer commitment) C′B, with cor- responding value β′ such that dist(α,β′) ≤ d. Recall that we model Commit as an ideal functionality, both unconditionally hid- ing and binding. As the commitment CA of the IMD is thus hiding, Adv can obtain an advantage over random guessing for β′ only by interacting additionally with the Programmer.

Suppose therefore that Adv has initiated a session with the Pro- grammer prior to completion of its session with the IMD. Adv sends the Programmer (simulated IMD) commitment C′′A on PV

α′′. Each instance of SecChannel emits a uniformly random label. Thus the Adv-IMD session has label s′ and the Adv-Programmer session label s′′ such that s′ 6= s′′. (For simplicity, we disregard the negligible-probability event s′′ = s′.)

Two cases arise, depending on whether Adv sends (simulated Programmer) commitment C′B to the IMD before or after sending a decommitment w′′A to the Programmer. Case 1: Commitment C′B precedes decommitment w

′′ A. CB is

bound to label s′′. To cause the IMD to accept, however, C′B must be bound to label s′. (Intuitively, binding commitments to SecChannel labels prevents Adv from “stitching together” two dis- tinct channels in a man-in-the-middle attack.) Given these bindings and the hiding property of Commit for CB and CA, then, Adv must commit to PV β′ in C′B that is independent of PVs α and β. Thus Adv must guess β′ at random, and the IMD outputs accept with probability p1. Case 2: Commitment C′B follows decommitment w

′′ A. As Adv

sends C′′A prior to any decommitments, and commitments are hid- ing, Adv commits in C′′A to a PV α

′′ that is independent of previous transcript values, and thus correct with probability at most p1. On sending decommitment w′′A, Adv learns whether α

′′ was correct. With this knowledge Adv may then submit a PV guess β′ in C′B to the IMD with success probability at most p2.

Adv’s maximum success probability, achieved for Case 2, is p1+ (1−p1)p2.

Defining succH2HAdv (qi,qr) as the probability that Adv succeeds in at least one session with at most qi queries (session initiations) with IMD and qr with the Programmer yields our main theorem:

THEOREM 1. Given PV model M and q = qi + qr with even- valued q, succH2HAdv (qi,qr) ≤ 1− (1− (p1 + p2 −p1p2))

q/2.

Theorem 1 reflects the fact that Adv’s best strategy against H2H is to initiate sessions simultaneously with the IMD and Program-

1106

mer, which respectively read PVs α and β. Adv tries to authenti- cate to Programmer by guessing β. If this fails, Adv tries to au- thenticate to IMD by guessing α; it gains a small advantage from knowing that its guess for β was incorrect. (The need for even q in our theorem is technical: The proof assumes the adversary can mount q/2 distinct attempts against the two devices.) A proof is sketched in Appendix A.

H2H-specific analysis: For the special case of H2H, Theorem 1 can be simplified. Let MH2H and distH2H denote the PV distri- bution model and (Neyman-Pearson-induced) distance metric re- spectively for H2H. We can then show:

COROLLARY 1. Given PV model MH2H and distance metric distH2H, and q = qi + qr with even-valued q, succH2HAdv (qi,qr) ≤ 1− (1−2p1)q/2.

In other words, the probability of a successful attack in a given session by Adv is at most twice its authentication probability, as characterized in Section 3.

4.4 Privacy H2H protects patient privacy in two senses. First, the IMD

doesn’t release a public key (as a Programmer does), or any other static identifier. As α is random, and protocol values are random (or pseudorandom), H2H thus provides logical-layer tracking privacy: An adversary can’t correlate distinct RF sightings of a given IMD, i.e., can’t track a patient wirelessly from a distance. (For cautions about physical-layer wireless tracking, however, see [12].) Second, the randomness of α prevents leakage of medically significant data, e.g., cardiac abnormalities evident in a full ECG waveform.

5. PROTOTYPE IMPLEMENTATION

ECG board

ARM

Board

Wireless

interface

Wireless

interface

IMD prototype

Programmer

IMD

Figure 6: High-level view of the H2H prototype. The IMD parts are in the dotted box on top. The Programmer runs on a PC.

In this section, we present a prototype implementation of H2H. A high-level architecture is shown in Figure 6. The IMD prototype consists of three boards:

1. A Leopard Gecko EFM-32 microcontroller (EFM32LG- DK3650);

2. an ECG analog A/D front end (TI ADS1298); and

3. a wireless sensor modem (TI CC430F5137). Leopard Gecko is a 32-bit ARM Cortex-M3 processor with an

attractive power-consumption profile and convenient power debug- ging tools. In our implementation, the microcontroller communi- cates with the ECG analog front end and the wireless board. The EFM-32 also extracts ECG features and communicates with the Programmer using TLS. Figure 7 shows our implementation com- ponents. The following three subsections give details.

ECG interface

Wireless

interface

TLS

ECG parameter extraction

& verification

Leopard Gecko EFM-32

AES RSA

Figure 7: Main components of the implementation.

5.1 Secure channel implementation Recall that the IMD and Programmer establish a channel be-

tween them using TLS. The IMD performs the operations of an or- dinary TLS client and the Programmer those of an ordinary server.

TLS is designed to provide an encrypted and authenticated chan- nel between two communicating parties [26]. Standard TLS au- thentication assumes a PKI, however, which H2H doesn’t, as noted above. Thus the one deviation from normal TLS usage in H2H is that the IMD doesn’t verify the Programmer certificate against a PKI. Instead, H2H authentication, i.e., ECG PV comparison, is performed after the TLS handshake to authenticate the channel.

Our H2H prototype uses RSA for the master secret key exchange in TLS, AES-128 for encryption, and SHA256 as the hash function. SHA256 also serves as the commitment function Commit(.) in the H2H pairing protocol.

We chose RSA for key exchange because RSA encryption with a small public exponent is the fastest key-exchange option for TLS [33]. In our implementation, the RSA public exponent is set to 216 +1. The RSA modulus and message length are set to 2048 bits to conform with current NIST key length recommendations [4].

Our RSA implementation is designed to comply with the MISRA-C standard. MISRA-C [28] is a set of software develop- ment recommendations to achieve high levels of reliability for crit- ical embedded devices. For example, MISRA-C prohibits use of dynamic memory allocations. The code size, the number of clock cycles, and approximate power consumption of various blocks of the TLS handshake are listed in Table 4.

Table 4: Approximate resource overhead of each of the com- ponent blocks of our H2H implementation. Here, “# of cycles” and “power” denote resource costs for a single call to the block.

Block name Size (Kb) # of cycles Power (µ Watt) AES encryption 2 6600 8 AES decryption 2 8400 10 RSA encryption 5 5000000 20000

MD5 1 5000 4 SHA256 3 10000 5

R-peak detection 4 5000 2

5.2 Random number generation H2H requires a cryptographically secure pseudo-random num-

ber generator (PRNG) for RSA ciphertext padding, key generation and nonce selection in TLS, and commitment (as shown in Fig- ure 5). We use a NIST-recommended PRNG based on cipher-block chaining (CBC) [27], with AES as the underlying block cipher. The PRNG requires an initial random seed. We generate this seed of- fline and store it in the IMD’s non-volatile memory. (In a commer- cial IMD, it can be, e.g., set at the time of manufacture.)

1107

5.3 ECG parameter extraction Our H2H prototype annotates ECG R-peaks by applying a sim-

ple length transformation to the ECG waveform using an open- source algorithm called “WQRS.” In this algorithm, the arc length of the waveform over a moving window is compared against a threshold to detect heartbeats [34]. Its resource overhead is is spec- ified in Table 4. (An implementation of WQRS is available on the PhysioNet website [1].)

6. RELATED WORK Several early research and development efforts in medical elec-

tronics have addressed safety and reliability of IMD devices, par- ticularly the problem of unexpected failures [29]. Increased net- working of embedded devices and emergence of pervasive health- care technologies motivated security and privacy investigations for general sensor networks and body sensor networks [25, 43, 46].

Halperin et al. [21] first discussed the security and privacy chal- lenges caused by resource constraints and inflexibility in existing IMD designs, and highlighted fundamental tensions among privacy, security, safety, and utility. Fu [16] argued that improving IMD se- curity requires a balance between technology and regulation.

Halperin et al. [20] gave the first systematic and pragmatic secu- rity analysis of a real commercial IMD, an implantable cardiovas- cular defibrillator (ICD). They showed that these devices are sus- ceptible to attacks by malicious programmers that breach patient privacy and, even more seriously, can effect changes to data and functioning, potentially harming patients. Their work highlighted the pressing need for authentication of programmers to IMDs.

Programmer-to-IMD authentication, as noted above, is straight- forward if the Programmer and IMD share a preexisting key. (Au- thentication tokens have the same simplicity, but similar risks of IMD inaccessibility with credential loss.) Authentication without a pre-established relationship, as treated by H2H, is more challeng- ing. Three broad technical approaches bear comparison with H2H.

Distance bounding. Halperin et al. [20] introduce authentication techniques involving an implanted piezo device that generates a random key and emits it acoustically such that the Programmer can only receive it at close range. This approach is complementary to H2H. It results in faster key agreement, needing a mere 400ms to emit a 128-bit key. A serious drawback, however, is that it requires special implantation of the piezo device. This implantation must be at a depth of 1 cm or so from the skin, ruling out incorporation into deep-body IMDs, such as ICDs. Additionally, eavesdropping on acoustic emanations isn’t a well studied security problem.

A promising related approach, by Rasmussen et al. [38], uses ultrasound-based distance bounding (with an RF channel as well) to authenticate Programmer access to an IMD, achieving an ac- cess policy similar to, but slightly looser than touch-to-access. Their system requires RF shielding, however, amplifying IMD en- gineering complexity. In contrast, H2H is agnostic to transmis- sion medium and does not incur the added cost or energy for ul- trasound. For some IMDs, e.g., brain implants, RF antennas are of prohibitive length, and alternatives, e.g., infrared, are preferred. Finally, distance-bounding protocols’ security models have histor- ically proven fragile (see, e.g., [11, 37]).

Shielding. The idea of blocking inappropriate access to an IMD was first proposed in [13] via a device called a Communication Cloaker. The idea was further explored by Gollakota et al. [18]. Their proposed device, called a shield, is worn near the body and used to authenticate / mediate Programmer (or other) communica- tions with the IMD. A shield doesn’t require modification of exist-

ing IMDs. It protects communications with the IMD using a full duplex radio device acting as a jammer-cum-receiver.

IMDGuard [48] is a similar method for IMD protection involv- ing a third party, high-powered device worn externally, called a Guardian. The Guardian authenticates programmers on behalf of the IMD. Like H2H, it requires special-purpose IMD functionality.

While these devices provide more general functionality, H2H has several advantages over them: (1) H2H doesn’t require an external device, which is a burden on patients and increases the risks of system failures and unreliability; (2) H2H doesn’t require jamming, which, as employed to counter attacks in [18,48], can interfere with other RF devices and potentially lead to legal complications.

While the shield has the benefit of legacy compatibility, we note that a growing number of legacy IMDs are built using pro- grammable microcontrollers with in-vivo upgradable firmware, al- lowing an upgrade to the H2H protocol as long as its (lightweight) resource requirements are satisfied.

PV/ECG-based authentication. The use of PVs to secure inter- sensor communications in body area networks (BANs) was first introduced in [9]. Numerous works subsequently used the random- ness in ECG IPIs for IMD authentication, e.g., [3,10,36,45]. None, however, provided a rigorous entropy or protocol-security analysis. (In fact, the motivation for PV-derived keys in BANs is unclear. To pair user-controlled devices in non-emergency settings, even device passwords would seem practical.)

Most similar to H2H are two schemes: A protocol in IMDGuard for pairing the Guardian with an IMD and a generic body-area network pairing protocol called OPFKA (Ordered-Physiological- Feature-based Key Agreement) [23]. Like H2H, these protocols make use of ECG measurements to authenticate a Programmer to an IMD. As noted above, however, both protocols lack rigorous security analysis and have been shown in recent work to have se- rious cryptographic weaknesses [39]. (We also note that the re- ported hardware implementation overhead for the IMDGuard pro- tocol greatly exceeds that of H2H.)

Compared with previous PV-based authentication schemes in general, H2H is the first work that: (1) Includes a statistical anal- ysis of the full stochastic ECG waveform to demonstrate bit inde- pendence over time; (2) Quantifies and uses the individual error rates of the high-grade random bits to distinguish between honest and adversarial Programmers in an optimal way (via the Neyman- Pearson Lemma); and (3) Offers a formally analyzed PV-based cryptographic device pairing protocol.

7. CONCLUSIONS AND FUTURE DIREC- TIONS

This paper addressed the problem of authenticating external medical controllers and programmers to Implantable Medical De- vices (IMD). Presently available IMD devices can be wirelessly accessed and even upgraded / controlled by external devices under loose access-control policies, rendering them vulnerable to attack. This threat, and the vital role of most IMDs, argue an urgent need for trustworthy Programmer-to-IMD authentication schemes. The main challenges are a lack of pre-existing keys in emergency and other situations and the fact that IMD resource constraints forbid the use of heavy cryptographic or signal-processing modules with high energy consumption.

We presented the design and implementation of Heart-to-Heart (H2H), a lightweight “touch-to-access” scheme for Programmer- to-IMD authentication. The touch-to-access policy is enforced in H2H by a time-varying biometric, ECG heartbeat data. We per- formed new statistical analyses of the ECG data, including quan-

1108

tification of the error rates of high entropy bits. H2H draws on these analyses to achieve the first ECG-based authentication scheme that distinguishes honest from adversarial ECG signals in a rigorous statistical model and with a minimal false positive rate for a given false negative bound. We devised a novel cryptographic device pairing protocol for H2H that exploits ECG randomness to secure against active attacks, while satisfying the lightweight implemen- tation requirements and noise margins for reliable authentication. Our end-to-end realization in an ARM Cortex-M3 microcontroller confirmed the practicality and low overhead of H2H for current- generation IMDs.

8. ACKNOWLEDGMENTS This research was supported in part by an Office of Naval Re-

search grant (ONR R17460) and an Army Research Office YIP award grant (No. R17450) to the ACES lab at Rice University . Thanks to Kevin Fu and Denis Foo Kune for their very insight- ful comments on this work. Special thanks as well to Nav Ravin- dranath for his help with embedded-device implementation of our cryptographic algorithms. Finally, the authors would like to thank the anonymous reviewers of this paper for their helpful comments and suggestions.

9. REFERENCES [1] A. L. Goldberger et al. PhysioBank, PhysioToolkit, and

PhysioNet: Components of a new research resource for complex physiologic signals. Circulation, 101(23), 2000.

[2] American Heart Association. Physical activity and blood pressure, 2012.

[3] S. D. Bao, C. C. Y. Poon, Y. T. Zhang, and L. F. Shen. Using the timing information of heartbeats as an entity identifier to secure body sensor network. IEEE Trans. on Info. Tech. in Biomedicine, 12(6):772–779, 2008.

[4] E. Barker and A. Roginsky. Transitions: Recommendation for transitioning the use of cryptographic algorithms and key lengths. NIST Special Publication, 800:131A, 2011.

[5] M. Bellare, D. Pointcheval, and P. Rogaway. Authenticated key exchange secure against dictionary attacks. In Eurocrypt, pages 139–155, 2000.

[6] R. Bousseljot, D. Kreiseler, and A. Schnabel. Nutzung der ekg-signaldatenbank cardiodat der ptb über das internet. Biomedizinische Technik Biomedical Engineering, 40(s1):317–318, 2009.

[7] K. A. Brownley, B. E. Hurwitz, and N. Schneiderman. Cardiovascular psychophysiology. Handbook of psychophysiology, 2:224–264, 2000.

[8] R. Canetti and M. Fischlin. Universally composible commitments. In Crypto, pages 332–349, 2001.

[9] S. Cherukuri, K. K. Venkatasubramanian, and S. K. S. Gupta. Biosec: a biometric based approach for securing communication in wireless networks of biosensors implanted in the human body. In Parallel Processing Workshop, pages 432–439, 2003.

[10] K. Cho and D. Lee. Biometric based secure communications without pre-deployed key for biosensor implanted in body sensor networks. In Information Security Applications, pages 203–218, 2012.

[11] C. Cremers, K. B. Rasmussen, B. Schmidt, and S. Capkun. Distance hijacking attacks on distance bounding protocols. In IEEE Symp. on Security and Privacy, pages 113–127, 2012.

[12] B. Danev, D. Zanetti, and S. Capkun. On physical-layer identification of wireless devices. ACM Computing Surveys, 2011.

[13] T. Denning, K. Fu, and T. Kohno. Absence makes the heart grow fonder: New directions for implantable medical device security. In USENIX HotSec, 2008.

[14] Y. Dodis, L. Reyzin, and A. Smith. Fuzzy extractors: How to generate strong keys from biometrics and other noisy data. SIAM J. on Computing, 38(1):97–139, 2008.

[15] T. Drew and M. Gini. Implantable medical devices as agents and part of multiagent systems. In Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS), pages 1534–1541, 2006.

[16] K. Fu. Inside risks: Reducing risks of implantable medical devices. Communications of the ACM, 52(6):25–27, June 2009.

[17] A. L. Goldberger, D. R. Rigney, and B. J. West. Chaos and fractals in human physiology. Scientific American, 262:42–49, 1990.

[18] S. Gollakota, H. Hassanieh, B. Ransford, D. Katabi, and K. Fu. They can hear your heartbeats: non-invasive security for implantable medical devices. In ACM SIGCOMM, pages 2–13, 2011.

[19] A. J. Greenspon, J. D. Patel, E. Lau, J. A. Ochoa, D. R. Frisch, R. T. Ho, B. B. Pavri, and S. M. Kurtz. 16-year trends in the infection burden for pacemakers and implantable cardioverter-defibrillators in the united states: 1993 to 2008. Journal of the American College of Cardiology, 58(10):1001 – 1006, 2011.

[20] D. Halperin, T. S. Heydt-Benjamin, B. Ransford, S. S. Clark, B. Defend, W. Morgan, K. Fu, T. Kohno, and W. H. Maisel. Pacemakers and implantable cardiac defibrillators: Software radio attacks and zero-power defenses. In IEEE Symp. on Security and Privacy, pages 129–142, 2008.

[21] D. Halperin, T. Kohno, T.S. Heydt-Benjamin, K. Fu, and W.H. Maisel. Security and privacy for implantable medical devices. IEEE Pervasive Computing, 7(1):30–39, Jan.-Mar. 2008.

[22] K. E. Hanna, F. J. Manning, P. Bouxsein, and A. Pope, editors. Innovation and Invention in Medical Devices: Workshop Summary. The National Academies Press, 2001.

[23] C. Hu, X. Cheng, F. Zhangand, D. Wuand, X. Liao, and D. Chen. OPFKA: Secure and efficient ordered-physiological-feature-based key agreement for wireless body area networks. In IEEE INFOCOM, 2013.

[24] A. Juels and M. Wattenberg. A fuzzy commitment scheme. In ACM CCS, pages 28–36, 1999.

[25] D. Karaouglan and A. Levi. A survey on the development of security mechanisms for body area networks. The Computer Journal, 2013.

[26] C. Kaufman, R. Perlman, and M. Speciner. Network security: private communication in a public world. Prentice Hall Press, 2002.

[27] S. S. Keller. NIST-recommended random number generator based on ANSI X9.31 appendix A.2.4 using the 3-key triple DES and AES algorithms. Technical report, NIST, 2005.

[28] MISRA Limited. Misra-c:2004 - guidelines for the use of the c language in critical systems. Technical report, 2004.

[29] W. H. Maisel. Safety issues involving medical devices. Journal of the American Medical Association, 294(8):955–958, Aug. 2005.

1109

[30] G. B. Moody and R. G. Mark. The impact of the MIT-BIH arrhythmia database. IEEE Engineering in Medicine and Biology Magazine, 20(3):45–50, 2001.

[31] J. H. Nagel, K. Han, B. E. Hurwitz, and N. Schneiderman. Assessment and diagnostic applications of heart rate variability. Biomedical engineering-applications, basis & communications, 5:147–158, 1993.

[32] J. Neyman and E. S. Pearson. On the problem of the most efficient tests of statistical hypotheses. Phil. Trans. of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character, 231:289–337, 1933.

[33] C. Paar, J. Pelzl, and B. Preneel. Understanding cryptography: a textbook for students and practitioners. Springer, 2010.

[34] E. Pino, L. Ohno-Machado, E. Wiechmann, and D. Curtis. Real–time ECG algorithms for ambulatory patient monitoring. In AMIA Annual Symp., volume 2005, page 604, 2005.

[35] M. Z. Poh, D. J. McDuff, and R. W. Picard. Non-contact, automated cardiac pulse measurements using video imaging and blind source separation. Optics Express, 18:10762–10774, 2010.

[36] C. C. Y. Poon, Y. T. Zhang, and S. D. Bao. A novel biometrics method to secure wireless body area sensor networks for telemedicine and m-health. IEEE Communications Magazine, 44(4):73–81, 2006.

[37] M. Poturalski, M. Flury, P. Papadimitratos, J.-P. Hubaux, and J.-Y. Le Boudec. Distance bounding with ieee 802.15.4a: Attacks and countermeasures. IEEE Trans. on Wireless Comms., 10(4):1334– 1344, 2011.

[38] K. B. Rasmussen, C. Castelluccia, T. S. Heydt-Benjamin, and S. Capkun. Proximity-based access control for implantable medical devices. In Proc. of Computer and communications security, pages 410–419, 2009.

[39] M. Rostami, W. Burleson, F. Koushanfar, and A. Juels. Balancing security and utility in medical devices? In Proc. of Design Automation Conference, pages 1–6, 2013.

[40] RSA, The Security Division of EMC. RSA SecurID authentication in action: Securing privileged user access, 2013.

[41] A. Rukhin, J. Soto, J. Nechvatal, M. Smid, and E. Barker. A statistical test suite for random and pseudorandom number generators for cryptographic applications. Technical report, DTIC Document, 2001.

[42] M. G. Signorini, F. Marchetti, and S. Cerutti. Applying nonlinear noise reduction in the analysis of heart rate variability. Engineering in Medicine and Biology Magazine, IEEE, 20(2):59–68, 2001.

[43] F. Stajano and R. J. Anderson. The resurrecting duckling: Security issues for ad-hoc wireless networks. In Int. Workshop of Security Protocols, pages 172–194, 1999.

[44] K. K. Venkatasubramanian, A. Banerjee, and S. K. S. Gupta. PSKA: usable and secure key agreement scheme for body area networks. IEEE Trans. on Information Technology in Biomedicine, 14(1):60–68, 2010.

[45] K. K. Venkatasubramanian and S. K. S. Gupta. Physiological value-based efficient usable security solutions for body sensor networks. ACM Trans. Sensor Networks, 6(4):31:1–31:36, July 2010.

[46] S. Warren, J. Lebak, J. Yao, J. Creekmore, A. Milenkovic, and E. Jovanov. Interoperability and security in wireless

body area network infrastructures. In IEEE Engineering in Medicine and Biology Society, pages 3837–3840, 2005.

[47] J. P. Welch, P. J. Ford, R. S. Teplick, and R. M. Rubsamen. The Massachusetts General Hospital-Marquette Foundation hemodynamic and electrocardiographic database – comprehensive collection of critical care waveforms. Clinical Monitoring, 7(1):96–97, 1991.

[48] F. Xu, Z. Qin, C.C. Tan, B. Wang, and Q. Li. IMDGuard: Securing implantable medical devices with the external wearable guardian. In Proc. of IEEE INFOCOM, pages 1862–1870, 2011.

[49] R. Yulmetyev, P. Hänggi, and F. Gafarov. Quantification of heart rate variability by discrete nonstationary non-Markov stochastic processes. Physical Review E, 65(4):046107, 2002.

APPENDIX A. SECURITY ANALYSIS

In this appendix, we briefly sketch a security analysis of the H2H authentication protocol, outlining a formal model and giving our main theorem with proof sketches. We defer a complete model, analysis, and proofs for the full version of this paper.

A.1 Model overview

Statistical modeling. We model the process of PV sampling in terms of a set V of PV values and function pair (sample,noise). We call M = (V; (sample,noise)) the PV model for H2H.

At the time of PV reading a true PV γ is sampled from V under a distribution defined by probabilistic function sample(t,τ) → γ ∈ V, where t denotes the sampling time and τ the sampling interval length. We assume that sample(t,τ) and sample(t + τ′,τ) are independent and identically distributed for any τ′ ≥ τ. We also assume that sample(t,τ) is identically distributed for any t, i.e., that it’s a stationary process. Thus we let sample(·,τ) denote a PV sample of duration τ taken at an arbitrary time.

We model noise in PV reading by the IMD and Programmer re- spectively by α ← noise(γ) and β ← noise(γ), for probabilistic function noise : V →V. (A model extension can capture different noise in the IMD and Programmer.)

Cryptographic modeling. We treat SecChannel as an ideal functionality. A player P can invoke SecChannel with any other player P ′ of its choice. The functionality then outputs a unique label s ∈ {0,1}k to P and P ′, or else outputs a failure symbol ⊥. All messages labeled with s are privately delivered between P and P ′; an adversary can block messages, but otherwise can’t see, modify, or reorder them. Honest players support only one instance of SecChannel at a given time. Recall that we also treat Commit as an ideal functionality, i.e., perfectly hiding and binding.

Adv can at any time cause the Programmer to initate an H2H session or itself initiate an H2H session with the IMD.

Adversarial model. We assume a Programmer and IMD exe- cuting serial sessions and uncorrupted by Adv. We define secu- rity with respect to an experiment involving an adversary Adv that knows M and fully controls the channel between the IMD and Pro- grammer. There is a query interface send that communicates mes- sages to the IMD and Programmer. Adv may send arbitrary queries m of the form send(entity,m) for entity ∈ (IMD, Programmer). A special query send(entity,start) causes a device to initiate

1110

the H2H protocol, i.e., execute SecChannel. To cause the IMD and Programmer to pair, Adv calls send(Programmer,start), then sends send(IMD,start)) from the Programmer to IMD.

Suppose Adv sends at most qi start queries to the IMD and qr start queries to the Programmer over the course of the se- curity experiment. We define succH2HAdv (qi,qr) as the probability that Adv causes the IMD to output accept for a session where it communicates with Adv on SecChannel.

A.2 Main theorem We now summarize our main result. First, define:

p1 = maxa′∈V(pr[dist(a ′ ,a) ≤ d |

a ← noise(γ),γ ← sample(·,τ)]).

Here, p1 is the probability that making an unconditioned query, i.e., knowing M only, Adv can successfully guess a valid PV . (We can think of p1 as a type of minentropy.)

Similarly, define: p2 = maxa′,b′∈V(pr[dist(a

′ ,a) ≤ d |dist(b′,b) > d,

a ← noise(γ),b ← noise(γ),γ ← sample(·,τ)]).

Here, p2 is the maximum probability, given a failed PV guess b′

for β, that Adv can guess a valid PV a′ for a. We have earlier presented Lemma 1, which is as follows.

LEMMA 1. For PV model M, succH2HAdv (1,1) ≤ p1 + p2 − p1p2.

We now build on Lemma 1, to show that Adv maximizes its probability of success by making q/2 pairs of queries to the IMD and Programmer, and that its success probability for each pair of queries is at most succH2HAdv (1,1). Theorem 1 results:

THEOREM 1. Given PV model M and q = qi + qr with even- valued q, succH2HAdv (qi,qr) ≤ 1− (1− (p1 + p2 −p1p2))

q/2.

Proof: [sketch] Given Lemma 1, it suffices to show that Adv max- imizes its probability of success by making q/2 pairs of queries to the IMD and Programmer, and that its success probability for each pair of queries is at most succH2HAdv (1,1).

Given output reject, the IMD waits a full cycle (time τ) be- fore initiating another session (taking input start). Suppose, then, that the IMD initiates local session i at time t, and thus reads αi ← noise(sample(t,τ)). Then the IMD will only initiate a fresh session i + 1 at time ≥ t + τ.

Thus if the Programmer initiates a session with PV β, then β will be independent of αi provided that β is read at time t + τ or later. Thus, as the Programmer only initiates a session at time t + τ, any Programmer PV reading β is independent of at least one of αi or αi+1. In general, then, any PV reading by the Programmer correlates with at most one αi.

Consequently, can make at most one conditioned query, i.e., query with information about γ, per unconditioned query. It can do so only by initiating overlapping sessions with the IMD and Programmer. Given q queries in total, Adv can create at most q/2 such sessions. Thus, succH2HAdvqi,qr ≤ 1− (1− succ

H2H Adv1,1

)q/2.

A.3 Application to H2H H2H carries two distinctive properties of uniformity that permit

a simplification of Theorem 1. In particular:

• Uniformly random PVs: In PV probability model MH2H for H2H, PVs are distributed uniformly at random (but cor- related). That is, α,β ∈U V. • Uniform regions of validity: For our Neyman-Pearson-

derived distance metric distH2H in H2H, the number of valid PV guesses β is identical for any α in V. (The distance be- tween two PVs depends on their bit differences, not the PVs’ specific bit values.)

Thus we can show (proof omitted):

COROLLARY 1. Given PV model MH2H and distance metric distH2H, and q = qi + qr with even-valued q, succH2HAdv (qi,qr) ≤ 1− (1−2p1)q/2.

1111