9 Pages Work

John.Quality
folder-1.zip

assignment-1.txt

Milestone 1 - Introduction Now that we have had an introduction to security protocols that will help reduce the risk of a security incident we are going to study security incidents. Each student will select a recent security incident that had their data stolen by an outside attack, insider threat, or some other mechanism. There are multiple examples that can be used, i.e., Target, Capital One, Home Depot, OPM, etc. As you evaluate the incident you evaluate several components throughout the remainder of the course. The assignment will have a couple of milestones over the course with a culmination in week 16 of our final evaluation. 1. select a security incident, there are examples above 2. create a 1-page introduction of the security incident 3. Your response must be properly APA formatted with every sentence that comes from another source cited or quoted. 4. The submission will go through safe-assign to check for plagiarism. If a high level of plagiarism is detected this will result in a zero for the assignment and the school will be notified. 5. There are no makeup attempts on any assignments

course_text_books/Cyber_Attacks_Protecting_National_Infrastructure_Edward_G_Amoroso_2010.pdf

C y b e r A t t a c k s

“Dr. Amoroso’s fi fth book Cyber Attacks: Protecting National Infrastructure outlines the chal- lenges of protecting our nation’s infrastructure from cyber attack using security techniques established to protect much smaller and less complex environments. He proposes a brand new type of national infrastructure protection methodology and outlines a strategy presented as a series of ten basic design and operations principles ranging from deception to response. The bulk of the text covers each of these principles in technical detail. While several of these principles would be daunting to implement and practice they provide the fi rst clear and con- cise framework for discussion of this critical challenge. This text is thought-provoking and should be a ‘must read’ for anyone concerned with cybersecurity in the private or government sector.”

— Clayton W. Naeve, Ph.D. , Senior Vice President and Chief Information Offi cer,

Endowed Chair in Bioinformatics, St. Jude Children’s Research Hospital,

Memphis, TN

“Dr. Ed Amoroso reveals in plain English the threats and weaknesses of our critical infra- structure balanced against practices that reduce the exposures. This is an excellent guide to the understanding of the cyber-scape that the security professional navigates. The book takes complex concepts of security and simplifi es it into coherent and simple to understand concepts.”

— Arnold Felberbaum , Chief IT Security & Compliance Offi cer,

Reed Elsevier

“The national infrastructure, which is now vital to communication, commerce and entertain- ment in everyday life, is highly vulnerable to malicious attacks and terrorist threats. Today, it is possible for botnets to penetrate millions of computers around the world in few minutes, and to attack the valuable national infrastructure.

“As the New York Times reported, the growing number of threats by botnets suggests that this cyber security issue has become a serious problem, and we are losing the war against these attacks.

“While computer security technologies will be useful for network systems, the reality tells us that this conventional approach is not effective enough for the complex, large-scale national infrastructure. “Not only does the author provide comprehensive methodologies based on 25 years of expe- rience in cyber security at AT&T, but he also suggests ‘security through obscurity,’ which attempts to use secrecy to provide security.”

— Byeong Gi Lee , President, IEEE Communications Society, and

Commissioner of the Korea Communications Commission (KCC)

C y b e r A t t a c k s Protecting National Infrastructure

Edward G. Amoroso

AMSTERDAM • BOSTON • HEIDELBERG • LONDON

NEW YORK • OXFORD • PARIS • SAN DIEGO

SAN FRANCISCO • SINGAPORE • SYDNEY • TOKYO Butterworth-Heinemann is an imprint of Elsevier

Acquiring Editor: Pam Chester Development Editor: Gregory Chalson Project Manager: Paul Gottehrer Designer: Alisa Andreola

Butterworth-Heinemann is an imprint of Elsevier 30 Corporate Drive, Suite 400, Burlington, MA 01803, USA

© 2011 Elsevier Inc. All rights reserved

No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions .

This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein).

Notices Knowledge and best practice in this fi eld are constantly changing. As new research and experience broaden our understanding, changes in research methods or professional practices, may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information or methods described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility.

To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein.

Library of Congress Cataloging-in-Publication Data Amoroso, Edward G. Cyber attacks : protecting national infrastructure / Edward Amoroso. p. cm. Includes index. ISBN 978-0-12-384917-5 1. Cyberterrorism—United States—Prevention. 2. Computer security—United States. 3. National security—United States. I. Title. HV6773.2.A47 2011 363.325�90046780973—dc22 2010040626

British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.

Printed in the United States of America 10 11 12 13 14 10 9 8 7 6 5 4 3 2 1

For information on all BH publications visit our website at www.elsevierdirect.com/security

CONTENTS v

CONTENTS Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Acknowledgment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi

Chapter 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 National Cyber Threats, Vulnerabilities, and Attacks . . . . . . . . . . . . . . . . 4 Botnet Threat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 National Cyber Security Methodology Components . . . . . . . . . . . . . . . 9 Deception . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 Diversity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 Consistency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Discretion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Correlation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Awareness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Response . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Implementing the Principles Nationally . . . . . . . . . . . . . . . . . . . . . . . . 28

Chapter 2 Deception . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Scanning Stage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 Deliberately Open Ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Discovery Stage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Deceptive Documents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Exploitation Stage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Procurement Tricks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Exposing Stage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Interfaces Between Humans and Computers . . . . . . . . . . . . . . . . . . . . 47 National Deception Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

vi CONTENTS

Chapter 3 Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 What Is Separation? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 Functional Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 National Infrastructure Firewalls . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 DDOS Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 SCADA Separation Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Physical Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Insider Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65 Asset Separation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 Multilevel Security (MLS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

Chapter 4 Diversity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 Diversity and Worm Propagation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 Desktop Computer System Diversity . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 Diversity Paradox of Cloud Computing . . . . . . . . . . . . . . . . . . . . . . . . . 80 Network Technology Diversity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 Physical Diversity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 National Diversity Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

Chapter 5 Commonality. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Meaningful Best Practices for Infrastructure Protection . . . . . . . . . . . . 92 Locally Relevant and Appropriate Security Policy . . . . . . . . . . . . . . . . 95 Culture of Security Protection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 Infrastructure Simplifi cation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 Certifi cation and Education . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Career Path and Reward Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Responsible Past Security Practice . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 National Commonality Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

Chapter 6 Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 Effectiveness of Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Layered Authentication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 Layered E-Mail Virus and Spam Protection . . . . . . . . . . . . . . . . . . . . . . 119

CONTENTS vii

Layered Access Controls . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Layered Encryption . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122 Layered Intrusion Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124 National Program of Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

Chapter 7 Discretion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Trusted Computing Base . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 Security Through Obscurity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133 Information Sharing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 Information Reconnaissance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 Obscurity Layers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139 Organizational Compartments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 National Discretion Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143

Chapter 8 Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145 Collecting Network Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148 Collecting System Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150 Security Information and Event Management . . . . . . . . . . . . . . . . . . 154 Large-Scale Trending . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 Tracking a Worm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159 National Collection Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161

Chapter 9 Correlation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163 Conventional Security Correlation Methods . . . . . . . . . . . . . . . . . . . . 167 Quality and Reliability Issues in Data Correlation . . . . . . . . . . . . . . . . 169 Correlating Data to Detect a Worm . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170 Correlating Data to Detect a Botnet . . . . . . . . . . . . . . . . . . . . . . . . . . . 172 Large-Scale Correlation Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174 National Correlation Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176

Chapter 10 Awareness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179 Detecting Infrastructure Attacks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183 Managing Vulnerability Information . . . . . . . . . . . . . . . . . . . . . . . . . . 184

viii CONTENTS

Cyber Security Intelligence Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . 186 Risk Management Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188 Security Operations Centers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190 National Awareness Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192

Chapter 11 Response. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193 Pre- Versus Post-Attack Response . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195 Indications and Warning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197 Incident Response Teams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198 Forensic Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201 Law Enforcement Issues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203 Disaster Recovery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204 National Response Program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206

Appendix Sample National Infrastructure Protection Requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207

Sample Deception Requirements (Chapter 2) . . . . . . . . . . . . . . . . . . . 208 Sample Separation Requirements (Chapter 3) . . . . . . . . . . . . . . . . . . 209 Sample Diversity Requirements (Chapter 4) . . . . . . . . . . . . . . . . . . . . . 211 Sample Commonality Requirements (Chapter 5) . . . . . . . . . . . . . . . . 212 Sample Depth Requirements (Chapter 6) . . . . . . . . . . . . . . . . . . . . . . 213 Sample Discretion Requirements (Chapter 7) . . . . . . . . . . . . . . . . . . . 214 Sample Collection Requirements (Chapter 8) . . . . . . . . . . . . . . . . . . . 214 Sample Correlation Requirements (Chapter 9) . . . . . . . . . . . . . . . . . . 215 Sample Awareness Requirements (Chapter 10) . . . . . . . . . . . . . . . . . 216 Sample Response Requirements (Chapter 11) . . . . . . . . . . . . . . . . . . 216

Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219

PREFACE ix

PREFACE

Man did not enter into society to become worse than he was before, nor to have fewer rights than he had before, but to have those rights better secured.

Thomas Paine in Common Sense

Before you invest any of your time with this book, please take a moment and look over the following points. They outline my basic philosophy of national infrastructure security. I think that your reaction to these points will give you a pretty good idea of what your reaction will be to the book. 1. Citizens of free nations cannot hope to express or enjoy

their freedoms if basic security protections are not provided. Security does not suppress freedom—it makes freedom possible.

2. In virtually every modern nation, computers and networks power critical infrastructure elements. As a result, cyber attackers can use computers and networks to damage or ruin the infrastructures that citizens rely on.

3. Security protections, such as those in security books, were designed for small-scale environments such as enterprise computing environments. These protections do not extrapo- late to the protection of massively complex infrastructure.

4. Effective national cyber protections will be driven largely by cooperation and coordination between commercial, indus- trial, and government organizations. Thus, organizational management issues will be as important to national defense as technical issues.

5. Security is a process of risk reduction, not risk removal. Therefore, concrete steps can and should be taken to reduce, but not remove, the risk of cyber attack to national infrastructure.

6. The current risk of catastrophic cyber attack to national infra- structure must be viewed as extremely high, by any realistic measure. Taking little or no action to reduce this risk would be a foolish national decision. The chapters of this book are organized around ten basic

principles that will reduce the risk of cyber attack to national infrastructure in a substantive manner. They are driven by

x PREFACE

experiences gained managing the security of one of the largest, most complex infrastructures in the world, by years of learning from various commercial and government organizations, and by years of interaction with students and academic researchers in the security fi eld. They are also driven by personal experiences dealing with a wide range of successful and unsuccessful cyber attacks, including ones directed at infrastructure of considerable value. The implementation of the ten principles in this book will require national resolve and changes to the way computing and networking elements are designed, built, and operated in the context of national infrastructure. My hope is that the sugges- tions offered in these pages will make this process easier.

ACKNOWLEDGMENT xi

ACKNOWLEDGMENT

The cyber security experts in the AT&T Chief Security Offi ce, my colleagues across AT&T Labs and the AT&T Chief Technology Offi ce, my colleagues across the entire AT&T business, and my graduate and undergraduate students in the Computer Science Department at the Stevens Institute of Technology, have had a profound impact on my thinking and on the contents of this book. In addition, many prominent enterprise customers of AT&T with whom I’ve had the pleasure of serving, especially those in the United States Federal Government, have been great infl uencers in the preparation of this material.

I’d also like to extend a great thanks to my wife Lee, daugh- ter Stephanie (17), son Matthew (15), and daughter Alicia (9) for their collective patience with my busy schedule.

Edward G. Amoroso Florham Park, NJ September 2010

This page intentionally left blank

1 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00001-9 2011

INTRODUCTION Somewhere in his writings—and I regret having forgotten where— John Von Neumann draws attention to what seemed to him a contrast. He remarked that for simple mechanisms it is often easier to describe how they work than what they do, while for more complicated mechanisms it was usually the other way round .

Edsger W. Dijkstra 1

National infrastructure refers to the complex, underlying delivery and support systems for all large-scale services considered abso- lutely essential to a nation. These services include emergency response, law enforcement databases, supervisory control and data acquisition (SCADA) systems, power control networks, mili- tary support services, consumer entertainment systems, fi nancial applications, and mobile telecommunications. Some national services are provided directly by government, but most are pro- vided by commercial groups such as Internet service provid- ers, airlines, and banks. In addition, certain services considered essential to one nation might include infrastructure support that is controlled by organizations from another nation. This global interdependency is consistent with the trends referred to collec- tively by Thomas Friedman as a “fl at world.” 2

National infrastructure, especially in the United States, has always been vulnerable to malicious physical attacks such as equipment tampering, cable cuts, facility bombing, and asset theft. The events of September 11, 2001, for example, are the most prominent and recent instance of a massive physical attack directed at national infrastructure. During the past couple of decades, however, vast portions of national infrastructure have become reliant on software, computers, and networks. This reli- ance typically includes remote access, often over the Internet, to

1

1 E.W. Dijkstra, Selected Writings on Computing: A Personal Perspective , Springer-Verlag, New York, 1982, pp. 212–213. 2 T. Friedman, The World Is Flat: A Brief History of the Twenty-First Century , Farrar, Straus, and Giroux, New York, 2007. (Friedman provides a useful economic backdrop to the global aspect of the cyber attack trends suggested in this chapter.)

2 Chapter 1 INTRODUCTION

the systems that control national services. Adversaries thus can initiate cyber attacks on infrastructure using worms, viruses, leaks, and the like. These attacks indirectly target national infra- structure through their associated automated controls systems (see Figure 1.1 ).

A seemingly obvious approach to dealing with this national cyber threat would involve the use of well-known computer security techniques. After all, computer security has matured substantially in the past couple of decades, and considerable expertise now exists on how to protect software, computers, and networks. In such a national scheme, safeguards such as fi re- walls, intrusion detection systems, antivirus software, passwords, scanners, audit trails, and encryption would be directly embed- ded into infrastructure, just as they are currently in small-scale environments. These national security systems would be con- nected to a centralized threat management system, and inci- dent response would follow a familiar sort of enterprise process. Furthermore, to ensure security policy compliance, one would expect the usual programs of end-user awareness, security train- ing, and third-party audit to be directed toward the people build- ing and operating national infrastructure. Virtually every national infrastructure protection initiative proposed to date has followed this seemingly straightforward path. 3

While well-known computer security techniques will certainly be useful for national infrastructure, most practical experience to date suggests that this conventional approach will not be suf- fi cient. A primary reason is the size, scale, and scope inherent in complex national infrastructure. For example, where an enter- prise might involve manageably sized assets, national infrastruc- ture will require unusually powerful computing support with the ability to handle enormous volumes of data. Such volumes

Indirect Cyber Attacks

Direct Physical Attacks

“Worms, Viruses, Leaks”

“Tampering, Cuts,

Bombs”

National

Infrastructure

Automated Control

Software

Computers

Networks

Figure 1.1 National infrastructure cyber and physical attacks.

3 Executive Offi ce of the President, Cyberspace Policy Review: Assuring a Trusted and Resilient Information and Communications Infrastructure , U.S. White House, Washington, D.C., 2009 ( http://handle.dtic.mil/100.2/ADA501541 ).

Chapter 1 INTRODUCTION 3

will easily exceed the storage and processing capacity of typical enterprise security tools such as a commercial threat manage- ment system. Unfortunately, this incompatibility confl icts with current initiatives in government and industry to reduce costs through the use of common commercial off-the-shelf products.

In addition, whereas enterprise systems can rely on manual intervention by a local expert during a security disaster, large- scale national infrastructure generally requires a carefully orches- trated response by teams of security experts using predetermined processes. These teams of experts will often work in different groups, organizations, or even countries. In the worst cases, they will cooperate only if forced by government, often sharing just the minimum amount of information to avoid legal conse- quences. An additional problem is that the complexity associated with national infrastructure leads to the bizarre situation where response teams often have partial or incorrect understand- ing about how the underlying systems work. For these reasons, seemingly convenient attempts to apply existing small-scale security processes to large-scale infrastructure attacks will ulti- mately fail (see Figure 1.2 ).

As a result, a brand-new type of national infrastructure protec- tion methodology is required—one that combines the best ele- ments of existing computer and network security techniques with the unique and diffi cult challenges associated with complex, large- scale national services. This book offers just such a protection methodology for national infrastructure. It is based on a quarter century of practical experience designing, building, and operating

Small-Scale

Small Volume

Possibly Manual

Local Expert

High

Focused

High Volume

Large-Scale

Process-Based

Distributed Expertise

Partial or Incorrect

Broad

Collection

Emergency

Expertise

Knowledge

Analysis

Large-Scale Attributes Complicate Cyber Security

Figure 1.2 Differences between small- and large-scale cyber security.

National infrastructure databases far exceed the size of even the largest commercial databases.

4 Chapter 1 INTRODUCTION

cyber security systems for government, commercial, and con- sumer infrastructure. It is represented as a series of protection principles that can be applied to new or existing systems. Because of the unique needs of national infrastructure, especially its mas- sive size, scale, and scope, some aspects of the methodology will be unfamiliar to the computer security community. In fact, certain elements of the approach, such as our favorable view of “security through obscurity,” might appear in direct confl ict with conven- tional views of how computers and networks should be protected.

National Cyber Threats, Vulnerabilities, and Attacks Conventional computer security is based on the oft-repeated tax- onomy of security threats which includes confi dentiality, integrity, availability, and theft. In the broadest sense, all four diverse threat types will have applicability in national infrastructure. For example, protections are required equally to deal with sensitive information leaks (confi dentiality ), worms affecting the operation of some criti- cal application (integrity), botnets knocking out an important system (availability), or citizens having their identities compromised (theft). Certainly, the availability threat to national services must be viewed as particularly important, given the nature of the threat and its rela- tion to national assets. One should thus expect particular attention to availability threats to national infrastructure. Nevertheless, it makes sense to acknowledge that all four types of security threats in the conventional taxonomy of computer security must be addressed in any national infrastructure protection methodology.

Vulnerabilities are more diffi cult to associate with any taxon- omy. Obviously, national infrastructure must address well-known problems such as improperly confi gured equipment, poorly designed local area networks, unpatched system software, exploit- able bugs in application code, and locally disgruntled employ- ees. The problem is that the most fundamental vulnerability in national infrastructure involves the staggering complexity inher- ent in the underlying systems. This complexity is so pervasive that many times security incidents uncover aspects of computing functionality that were previously unknown to anyone, including sometimes the system designers. Furthermore, in certain cases, the optimal security solution involves simplifying and cleaning up poorly conceived infrastructure. This is bad news, because most large organizations are inept at simplifying much of anything.

The best one can do for a comprehensive view of the vulner- abilities associated with national infrastructure is to address their

Any of the most common security concerns— confi dentiality, integrity, availability, and theft— threaten our national infrastructure.

Chapter 1 INTRODUCTION 5

relative exploitation points. This can be done with an abstract national infrastructure cyber security model that includes three types of malicious adversaries: external adversary (hackers on the Internet), internal adversary (trusted insiders), and supplier adversary (vendors and partners). Using this model, three exploi- tation points emerge for national infrastructure: remote access (Internet and telework), system administration and normal usage (management and use of software, computers, and networks), and supply chain (procurement and outsourcing) (see Figure 1.3 ).

These three exploitation points and three types of adversaries can be associated with a variety of possible motivations for initi- ating either a full or test attack on national infrastructure.

Remote

Access System

Administration and

Normal Usage

External

Adversary

Three Exploitation Points

National Infrastructure

Three Adversaries

Supply

Chain

Internal

Adversary

Software

Computers

Networks Supplier

Adversary

Figure 1.3 Adversaries and exploitation points in national infrastructure.

Five Possible Motivations for an Infrastructure Attack

● Country-sponsored warfare —National infrastructure attacks sponsored and funded by enemy countries must be considered the most signifi cant potential motivation, because the intensity of adversary capability and willingness to attack is potentially unlimited.

● Terrorist attack —The terrorist motivation is also signifi cant, especially because groups driven by terror can easily obtain suffi cient capability and funding to perform signifi cant attacks on infrastructure.

● Commercially motivated attack —When one company chooses to utilize cyber attacks to gain a commercial advantage, it becomes a national infrastructure incident if the target company is a purveyor of some national asset.

● Financially driven criminal attack —Identify theft is the most common example of a fi nancially driven attack by criminal groups, but other cases exist, such as companies being extorted to avoid a cyber incident.

● Hacking —One must not forget that many types of attacks are still driven by the motivation of hackers, who are often just mischievous youths trying to learn or to build a reputation within the hacking community. This is much less a sinister motivation, and national leaders should try to identify better ways to tap this boundless capability and energy.

6 Chapter 1 INTRODUCTION

Each of the three exploitation points might be utilized in a cyber attack on national infrastructure. For example, a supplier might use a poorly designed supply chain to insert Trojan horse code into a software component that controls some national asset, or a hacker on the Internet might take advantage of some unprotected Internet access point to break into a vulnerable ser- vice. Similarly, an insider might use trusted access for either sys- tem administration or normal system usage to create an attack. The potential also exists for an external adversary to gain valu- able insider access through patient, measured means, such as gaining employment in an infrastructure-supporting organiza- tion and then becoming trusted through a long process of work performance. In each case, the possibility exists that a limited type of engagement might be performed as part of a planned test or exercise. This seems especially likely if the attack is country or terrorist sponsored, because it is consistent with past practice.

At each exploitation point, the vulnerability being used might be a well-known problem previously reported in an authoritative public advisory, or it could be a proprietary issue kept hidden by a local organization. It is entirely appropriate for a recognized authority to make a detailed public vulnerability advisory if the benefi ts of notifying the good guys outweigh the risks of alert- ing the bad guys. This cost–benefi t result usually occurs when many organizations can directly benefi t from the information and can thus take immediate action. When the reported vulner- ability is unique and isolated, however, then reporting the details might be irresponsible, especially if the notifi cation process does not enable a more timely fi x. This is a key issue, because many government authorities continue to consider new rules for man- datory reporting. If the information being demanded is not prop- erly protected, then the reporting process might result in more harm than good.

Botnet Threat Perhaps the most insidious type of attack that exists today is the botnet . 4 In short, a botnet involves remote control of a collec- tion of compromised end-user machines, usually broadband- connected PCs. The controlled end-user machines, which are referred to as bots , are programmed to attack some target that is designated by the botnet controller. The attack is tough to stop

4 Much of the material on botnets in this chapter is derived from work done by Brian Rexroad, David Gross, and several others from AT&T.

When to issue a vulnerability risk advisory and when to keep the risk confi dential must be determined on a case- by-case basis, depending on the threat.

Chapter 1 INTRODUCTION 7

because end-user machines are typically administered in an inef- fective manner. Furthermore, once the attack begins, it occurs from sources potentially scattered across geographic, political, and service provider boundaries. Perhaps worse, bots are pro- grammed to take commands from multiple controller systems, so any attempts to destroy a given controller result in the bots sim- ply homing to another one.

The Five Entities That Comprise a Botnet Attack ● Botnet operator —This is the individual, group, or country that creates the botnet, including its setup and operation.

When the botnet is used for fi nancial gain, it is the operator who will benefi t. Law enforcement and cyber security initiatives have found it very diffi cult to identify the operators. The press, in particular, has done a poor job reporting on the presumed identity of botnet operators, often suggesting sponsorship by some country when little supporting evidence exists.

● Botnet controller —This is the set of servers that command and control the operation of a botnet. Usually these servers have been maliciously compromised for this purpose. Many times, the real owner of a server that has been compromised will not even realize what has occurred. The type of activity directed by a controller includes all recruitment, setup, communication, and attack activity. Typical botnets include a handful of controllers, usually distributed across the globe in a non-obvious manner.

● Collection of bots —These are the end-user, broadband-connected PCs infected with botnet malware. They are usually owned and operated by normal citizens, who become unwitting and unknowing dupes in a botnet attack. When a botnet includes a concentration of PCs in a given region, observers often incorrectly attribute the attack to that region. The use of smart mobile devices in a botnet will grow as upstream capacity and device processing power increase.

● Botnet software drop —Most botnets include servers designed to store software that might be useful for the botnets during their lifecycle. Military personnel might refer to this as an arsenal . Like controllers, botnet software drop points are usually servers compromised for this purpose, often unknown to the normal server operator.

● Botnet target —This is the location that is targeted in the attack. Usually, it is a website, but it can really be any device, system, or network that is visible to the bots. In most cases, botnets target prominent and often controversial websites, simply because they are visible via the Internet and generally have a great deal at stake in terms of their availability. This increases gain and leverage for the attacker. Logically, however, botnets can target anything visible.

The way a botnet works is that the controller is set up to com- municate with the bots via some designated protocol, most often Internet Relay Chat (IRC). This is done via malware inserted into the end-user PCs that comprise the bots. A great challenge in this regard is that home PCs and laptops are so poorly administered. Amazingly, over time, the day-to-day system and security admin- istration task for home computers has gravitated to the end user.

8 Chapter 1 INTRODUCTION

This obligation results in both a poor user experience and gen- eral dissatisfaction with the security task. For example, when a typical computer buyer brings a new machine home, it has prob- ably been preloaded with security software by the retailer. From this point onward, however, that home buyer is then tasked with all responsibility for protecting the machine. This includes keep- ing fi rewall, intrusion detection, antivirus, and antispam software up to date, as well as ensuring that all software patches are cur- rent. When these tasks are not well attended, the result is a more vulnerable machine that is easily turned into a bot. (Sadly, even if a machine is properly managed, expert bot software designers might fi nd a way to install the malware anyway.)

Once a group of PCs has been compromised into bots, attacks can thus be launched by the controller via a command to the bots, which would then do as they are instructed. This might not occur instantaneously with the infection; in fact, experi- ence suggests that many botnets lay dormant for a great deal of time. Nevertheless, all sorts of attacks are possible in a bot- net arrangement, including the now-familiar distributed denial of service attack (DDOS). In such a case, the bots create more inbound traffi c than the target gateway can handle. For example, if some theoretical gateway allows for 1 Gbps of inbound traffi c, and the botnet creates an inbound stream larger than 1 Gbps, then a logjam results at the inbound gateway, and a denial of service condition occurs (see Figure 1.4 ).

Any serious present study of cyber security must acknowl- edge the unique threat posed by botnets. Virtually any Internet- connected system is vulnerable to major outages from a botnet-originated DDOS attack. The physics of the situation are especially depressing; that is, a botnet that might steal 500 Kbps

Broadband

Carriers

Capacity Excess

Creates Jam

Bots

Target A’s

Designated

Carrier

1 Gbps

Ingress Target A

1 Gbps DDOS Traffic

Aimed at Target A

Figure 1.4 Sample DDOS attack from a botnet.

Home PC users may never know they are being used for a botnet scheme.

A DDOS attack is like a cyber traffi c jam.

Chapter 1 INTRODUCTION 9

of upstream capacity from each bot (which would generally allow for concurrent normal computing and networking) would only need three bots to collapse a target T1 connection. Following this logic, only 16,000 bots would be required theoretically to fi ll up a 10-Gbps connection. Because most of the thousands of bot- nets that have been observed on the Internet are at least this size, the threat is obvious; however, many recent and prominent bot- nets such as Storm and Confi cker are much larger, comprising as many as several million bots, so the threat to national infrastruc- ture is severe and immediate.

National Cyber Security Methodology Components Our proposed methodology for protecting national infrastruc- ture is presented as a series of ten basic design and operation principles. The implication is that, by using these principles as a guide for either improving existing infrastructure components or building new ones, the security result will be desirable, includ- ing a reduced risk from botnets. The methodology addresses all four types of security threats to national infrastructure; it also deals with all three types of adversaries to national infrastructure, as well as the three exploitation points detailed in the infrastruc- ture model. The list of principles in the methodology serves as a guide to the remainder of this chapter, as well as an outline for the remaining chapters of the book: ● Chapter 2: Deception —The openly advertised use of deception

creates uncertainty for adversaries because they will not know if a discovered problem is real or a trap. The more common hid- den use of deception allows for real-time behavioral analysis if an intruder is caught in a trap. Programs of national infrastruc- ture protection must include the appropriate use of deception, especially to reduce the malicious partner and supplier risk.

● Chapter 3: Separation —Network separation is currently accomplished using fi rewalls, but programs of national infra- structure protection will require three specifi c changes. Specifi cally, national infrastructure must include network- based fi rewalls on high-capacity backbones to throttle DDOS attacks, internal fi rewalls to segregate infrastructure and reduce the risk of sabotage, and better tailoring of fi rewall fea- tures for specifi c applications such as SCADA protocols. 5

5 R. Kurtz, Securing SCADA Systems , Wiley, New York, 2006. (Kurtz provides an excellent overview of SCADA systems and the current state of the practice in securing them.)

10 Chapter 1 INTRODUCTION

● Chapter 4: Diversity —Maintaining diversity in the products, services, and technologies supporting national infrastruc- ture reduces the chances that one common weakness can be exploited to produce a cascading attack. A massive program of coordinated procurement and supplier management is required to achieve a desired level of national diversity across all assets. This will be tough, because it confl icts with most cost-motivated information technology procurement initia- tives designed to minimize diversity in infrastructure.

● Chapter 5: Commonality —The consistent use of security best practices in the administration of national infrastructure ensures that no infrastructure component is either poorly managed or left completely unguarded. National programs of standards selection and audit validation, especially with an emphasis on uniform programs of simplifi cation, are thus required. This can certainly include citizen end users, but one should never rely on high levels of security compliance in the broad population.

● Chapter 6: Depth —The use of defense in depth in national infrastructure ensures that no critical asset is reliant on a single security layer; thus, if any layer should fail, an addi- tional layer is always present to mitigate an attack. Analysis is required at the national level to ensure that all critical assets are protected by at least two layers, preferably more.

● Chapter 7: Discretion —The use of personal discretion in the sharing of information about national assets is a practical technique that many computer security experts fi nd diffi cult to accept because it confl icts with popular views on “security through obscurity.” Nevertheless, large-scale infrastructure protection cannot be done properly unless a national culture of discretion and secrecy is nurtured. It goes without saying that such discretion should never be put in place to obscure illegal or unethical practices.

● Chapter 8: Collection —The collection of audit log informa- tion is a necessary component of an infrastructure security scheme, but it introduces privacy, size, and scale issues not seen in smaller computer and network settings. National infrastructure protection will require a data collection approach that is acceptable to the citizenry and provides the requisite level of detail for security analysis.

● Chapter 9: Correlation —Correlation is the most fundamen- tal of all analysis techniques for cyber security, but modern attack methods such as botnets greatly complicate its use for attack-related indicators. National-level correlation must be performed using all available sources and the best available

Chapter 1 INTRODUCTION 11

technology and algorithms. Correlating information around a botnet attack is one of the more challenging present tasks in cyber security.

● Chapter 10: Awareness —Maintaining situational awareness is more important in large-scale infrastructure protection than in traditional computer and network security because it helps to coordinate the real-time aspect of multiple infrastructure components. A program of national situational awareness must be in place to ensure proper management decision- making for national assets.

● Chapter 11: Response —Incident response for national infra- structure protection is especially diffi cult because it gener- ally involves complex dependencies and interactions between disparate government and commercial groups. It is best accomplished at the national level when it focuses on early indications, rather than on incidents that have already begun to damage national assets. The balance of this chapter will introduce each principle, with

discussion on its current use in computer and network security, as well as its expected benefi ts for national infrastructure protection.

Deception The principle of deception involves the deliberate introduc- tion of misleading functionality or misinformation into national infrastructure for the purpose of tricking an adversary. The idea is that an adversary would be presented with a view of national infrastructure functionality that might include services or inter- face components that are present for the sole purpose of fakery. Computer scientists refer to this functionality as a honey pot , but the use of deception for national infrastructure could go far beyond this conventional view. Specifi cally, deception can be used to protect against certain types of cyber attacks that no other security method will handle. Law enforcement agen- cies have been using deception effectively for many years, often catching cyber stalkers and criminals by spoofi ng the reported identity of an end point. Even in the presence of such obvi- ous success, however, the cyber security community has yet to embrace deception as a mainstream protection measure.

Deception in computing typically involves a layer of clev- erly designed trap functionality strategically embedded into the internal and external interfaces for services. Stated more simply, deception involves fake functionality embedded into real inter- faces. An example might be a deliberately planted trap link on

Deception is an oft-used tool by law enforcement agencies to catch cyber stalkers and predators.

12 Chapter 1 INTRODUCTION

a website that would lead potential intruders into an environ- ment designed to highlight adversary behavior. When the decep- tion is open and not secret, it might introduce uncertainty for adversaries in the exploitation of real vulnerabilities, because the adversary might suspect that the discovered entry point is a trap. When it is hidden and stealth, which is the more common situa- tion, it serves as the basis for real-time forensic analysis of adver- sary behavior. In either case, the result is a public interface that includes real services, deliberate honey pot traps, and the inevi- table exploitable vulnerabilities that unfortunately will be pres- ent in all nontrivial interfaces (see Figure 1.5 ).

Only relatively minor tests of honey pot technology have been reported to date, usually in the context of a research effort. Almost no reports are available on the day-to-day use of decep- tion as a structural component of a real enterprise security program. In fact, the vast majority of security programs for com- panies, government agencies, and national infrastructure would include no such functionality. Academic computer scientists have shown little interest in this type of security, as evidenced by the relatively thin body of literature on the subject. This lack of interest might stem from the discomfort associated with using computing to mislead. Another explanation might be the relative ineffectiveness of deception against the botnet threat, which is clearly the most important security issue on the Internet today. Regardless of the cause, this tendency to avoid the use of decep- tion is unfortunate, because many cyber attacks, such as subtle break-ins by trusted insiders and Trojan horses being maliciously inserted by suppliers into delivered software, cannot be easily remedied by any other means.

The most direct benefi t of deception is that it enables foren- sic analysis of intruder activity. By using a honey pot, unique insights into attack methods can be gained by watching what is occurring in real time. Such deception obviously works best in a hidden, stealth mode, unknown to the intruder, because if

Interface to

Valid Services

Trap Interface

to Honey Pot

Should Resemble

Valid Services

Vulnerabilities Possible

Uncertainty

Real

Assets

Honey

Pot

???

Figure 1.5 Components of an interface with deception.

Deception is less effective against botnets than other types of attack methods.

Chapter 1 INTRODUCTION 13

the intruder realizes that some vulnerable exploitation point is a fake, then no exploitation will occur. Honey pot pioneers Cliff Stoll, Bill Cheswick, and Lance Spitzner have provided a major- ity of the reported experience in real-time forensics using honey pots. They have all suggested that the most diffi cult task involves creating believability in the trap. It is worth noting that connect- ing a honey pot to real assets is a terrible idea.

An additional potential benefi t of deception is that it can introduce the clever idea that some discovered vulnerability might instead be a deliberately placed trap. Obviously, such an approach is only effective if the use of deception is not hidden; that is, the adversary must know that deception is an approved and accepted technique used for protection. It should therefore be obvious that the major advantage here is that an accidental vulnerability, one that might previously have been an open door for an intruder, will suddenly look like a possible trap. A further profound notion, perhaps for open discussion, is whether just the implied statement that deception might be present (perhaps without real justifi cation) would actually reduce risk. Suppliers, for example, might be less willing to take the risk of Trojan horse insertion if the procuring organization advertises an open research and development program of detailed software test and inspection against this type of attack.

Separation The principle of separation involves enforcement of access policy restrictions on the users and resources in a computing environ- ment. Access policy restrictions result in separation domains, which are arguably the most common security architectural concept in use today. This is good news, because the creation of access-policy-based separation domains will be essential in the protection of national infrastructure. Most companies today will typically use fi rewalls to create perimeters around their presumed enterprise, and access decisions are embedded in the associated rules sets. This use of enterprise fi rewalls for separation is com- plemented by several other common access techniques: ● Authentication and identity management —These methods are

used to validate and manage the identities on which separa- tion decisions are made. They are essential in every enterprise but cannot be relied upon solely for infrastructure security. Malicious insiders, for example, will be authorized under such systems. In addition, external attacks such as DDOS are unaf- fected by authentication and identity management.

Do not connect honey pots to real assets!

14 Chapter 1 INTRODUCTION

● Logical access controls —The access controls inherent in oper- ating systems and applications provide some degree of sepa- ration, but they are also weak in the presence of compromised insiders. Furthermore, underlying vulnerabilities in appli- cations and operating systems can often be used to subvert these methods.

● LAN controls —Access control lists on local area network (LAN) components can provide separation based on infor- mation such as Internet Protocol (IP) or media access control (MAC) address. In this regard, they are very much like fi rewalls but typically do not extend their scope beyond an isolated segment.

● Firewalls —For large-scale infrastructure, fi rewalls are particu- larly useful, because they separate one network from another. Today, every Internet-based connection is almost certainly protected by some sort of fi rewall functionality. This approach worked especially well in the early years of the Internet, when the number of Internet connections to the enterprise was small. Firewalls do remain useful, however, even with the massive connectivity of most groups to the Internet. As a result, national infrastructure should continue to include the use of fi rewalls to protect known perimeter gateways to the Internet. Given the massive scale and complexity associated with

national infrastructure, three specifi c separation enhancements are required, and all are extensions of the fi rewall concept.

Required Separation Enhancements for National Infrastructure Protection

1. The use of network-based fi rewalls is absolutely required for many national infrastructure applications, especially ones vulnerable to DDOS attacks from the Internet. This use of network-based mediation can take advantage of high-capacity network backbones if the service provider is involved in running the fi rewalls.

2. The use of fi rewalls to segregate and isolate internal infrastructure components from one another is a mandatory technique for simplifying the implementation of access control policies in an organization. When insiders have malicious intent, any exploit they might attempt should be explicitly contained by internal fi rewalls.

3. The use of commercial off-the-shelf fi rewalls, especially for SCADA usage, will require tailoring of the fi rewall to the unique protocol needs of the application. It is not acceptable for national infrastructure protection to retrofi t the use of a generic, commercial, off-the-shelf tool that is not optimized for its specifi c use (see Figure 1.6 ).

Chapter 1 INTRODUCTION 15

With the advent of cloud computing, many enterprise and government agency security managers have come to acknowl- edge the benefi ts of network-based fi rewall processing. The approach scales well and helps to deal with the uncontrolled complexity one typically fi nds in national infrastructure. That said, the reality is that most national assets are still secured by placing a fi rewall at each of the hundreds or thousands of pre- sumed choke points. This approach does not scale and leads to a false sense of security. It should also be recognized that the fi rewall is not the only device subjected to such scale problems. Intrusion detection systems, antivirus fi ltering, threat manage- ment, and denial of service fi ltering also require a network-based approach to function properly in national infrastructure.

An additional problem that exists in current national infrastruc- ture is the relative lack of architectural separation used in an internal, trusted network. Most security engineers know that large systems are best protected by dividing them into smaller systems. Firewalls or packet fi ltering routers can be used to segregate an enterprise net- work into manageable domains. Unfortunately, the current state of the practice in infrastructure protection rarely includes a disciplined approach to separating internal assets. This is unfortunate, because it allows an intruder in one domain to have access to a more expan- sive view of the organizational infrastructure. The threat increases when the fi rewall has not been optimized for applications such as SCADA that require specialized protocol support.

Required New Separation

Mechanisms

(Less Familiar)

Existing Separation

Mechanisms

(Less Familiar)

Internet Service Provider

Commercial and

Government

Infrastructure

Commercial

Off-the-Shelf

Perimeter

Firewalls

Authentification and

Identity Management,

Logical Access Controls,

LAN Controls

Internal

Firewalls

Tailored

Firewalls

(SCADA)

Network-Based

Firewalls

(Carrier)

Figure 1.6 Firewall enhancements for national infrastructure.

Parceling a network into manageable smaller domains creates an environment that is easier to protect.

16 Chapter 1 INTRODUCTION

Diversity The principle of diversity involves the selection and use of tech- nology and systems that are intentionally different in substan- tive ways. These differences can include technology source, programming language, computing platform, physical location, and product vendor. For national infrastructure, realizing such diversity requires a coordinated program of procurement to ensure a proper mix of technologies and vendors. The purpose of introducing these differences is to deliberately create a measure of non-interoperability so that an attack cannot easily cascade from one component to another through exploitation of some common vulnerability. Certainly, it would be possible, even in a diverse environment, for an exploit to cascade, but the likelihood is reduced as the diversity profi le increases.

This concept is somewhat controversial, because so much of computer science theory and information technology prac- tice in the past couple of decades has been focused on maxi- mizing interoperability of technologies. This might help explain the relative lack of attentiveness that diversity considerations receive in these fi elds. By way of analogy, however, cyber attacks on national infrastructure are mitigated by diversity technol- ogy just as disease propagation is reduced by a diverse biologi- cal ecosystem. That is, a problem that originates in one area of infrastructure with the intention of automatic propagation will only succeed in the presence of some degree of interoperability. If the technologies are suffi ciently diverse, then the attack propa- gation will be reduced or even stopped. As such, national asset managers are obliged to consider means for introducing diver- sity in a cost-effective manner to realize its security benefi ts (see Figure 1.7 ).

Attack Target

Component

3

Attack Target

Component

2

Non-Diverse

(Attack Propagates)

Diverse

(Attack Propagation Stops)

Attack

Adversary Target

Component

1

Figure 1.7 Introducing diversity to national infrastructure.

Chapter 1 INTRODUCTION 17

Diversity is especially tough to implement in national infra- structure for several reasons. First, it must be acknowledged that a single, major software vendor tends to currently dominate the personal computer (PC) operating system business landscape in most government and enterprise settings. This is not likely to change, so national infrastructure security initiatives must sim- ply accept an ecosystem lacking in diversity in the PC landscape. The profi le for operating system software on computer servers is slightly better from a diversity perspective, but the choices remain limited to a very small number of available sources. Mobile oper- ating systems currently offer considerable diversity, but one can- not help but expect to see a trend toward greater consolidation.

Second, diversity confl icts with the often-found organiza- tional goal of simplifying supplier and vendor relationships; that is, when a common technology is used throughout an organiza- tion, day-to-day maintenance, administration, and training costs are minimized. Furthermore, by purchasing in bulk, better terms are often available from a vendor. In contrast, the use of diversity could result in a reduction in the level of service provided in an organization. For example, suppose that an Internet service pro- vider offers particularly secure and reliable network services to an organization. Perhaps the reliability is even measured to some impressive quantitative availability metric. If the organization is committed to diversity, then one might be forced to actually introduce a second provider with lower levels of reliability.

In spite of these drawbacks, diversity carries benefi ts that are indisputable for large-scale infrastructure. One of the great chal- lenges in national infrastructure protection will thus involve fi nd- ing ways to diversify technology products and services without increasing costs and losing business leverage with vendors.

Consistency The principle of consistency involves uniform attention to secu- rity best practices across national infrastructure components. Determining which best practices are relevant for which national asset requires a combination of local knowledge about the asset, as well as broader knowledge of security vulnerabilities in generic infrastructure protection. Thus, the most mature approach to consistency will combine compliance with relevant standards such as the Sarbanes–Oxley controls in the United States, with locally derived security policies that are tailored to the organiza- tional mission. This implies that every organization charged with the design or operation of national infrastructure must have a

Enforcing diversity of products and services might seem counterintuitive if you have a reliable provider.

18 Chapter 1 INTRODUCTION

local security policy. Amazingly, some large groups do not have such a policy today.

The types of best practices that are likely to be relevant for national infrastructure include well-defi ned software lifecycle methodologies, timely processes for patching software and sys- tems, segregation of duty controls in system administration, threat management of all collected security information, secu- rity awareness training for all system administrators, operational confi gurations for infrastructure management, and use of soft- ware security tools to ensure proper integrity management. Most security experts agree on which best practices to include in a generic set of security requirements, as evidenced by the inclu- sion of a common core set of practices in every security standard. Attentiveness to consistency is thus one of the less controversial of our recommended principles.

The greatest challenge in implementing best practice consis- tency across infrastructure involves auditing. The typical audit process is performed by an independent third-party entity doing an analysis of target infrastructure to determine consistency with a desired standard. The result of the audit is usually a numeric score, which is then reported widely and used for management decisions. In the United States, agencies of the federal govern- ment are audited against a cyber security standard known as FISMA (Federal Information Security Management Act). While auditing does lead to improved best practice coverage, there are often problems. For example, many audits are done poorly, which results in confusion and improper management deci- sions. In addition, with all the emphasis on numeric ratings, many agencies focus more on their score than on good security practice.

Today, organizations charged with protecting national infra- structure are subjected to several types of security audits. Streamlining these standards would certainly be a good idea, but some additional items for consideration include improving the types of common training provided to security administrators, as well as including past practice in infrastructure protection in common audit standards. The most obvious practical consid- eration for national infrastructure, however, would be national- level agreement on which standard or standards would be used to determine competence to protect national assets. While this is a straightforward concept, it could be tough to obtain wide con- currence among all national participants. A related issue involves commonality in national infrastructure operational confi gu- rations; this reduces the chances that a rogue confi guration

A good audit score is important but should not replace good security practices.

A national standard of competence for protecting our assets is needed.

Chapter 1 INTRODUCTION 19

installed for malicious purposes, perhaps by compromised insiders.

Depth The principle of depth involves the use of multiple security layers of protection for national infrastructure assets. These layers pro- tect assets from both internal and external attacks via the familiar “defense in depth” approach; that is, multiple layers reduce the risk of attack by increasing the chances that at least one layer will be effective. This should appear to be a somewhat sketchy situ- ation, however, from the perspective of traditional engineering. Civil engineers, for example, would never be comfortable design- ing a structure with multiple fl awed supports in the hopes that one of them will hold the load. Unfortunately, cyber security experts have no choice but to rely on this fl awed notion, perhaps highlighting the relative immaturity of security as an engineering discipline.

One hint as to why depth is such an important requirement is that national infrastructure components are currently con- trolled by software, and everyone knows that the current state of software engineering is abysmal. Compared to other types of engineering, software stands out as the only one that accepts the creation of knowingly fl awed products as acceptable. The result is that all nontrivial software has exploitable vulnerabilities, so the idea that one should create multiple layers of security defense is unavoidable. It is worth mentioning that the degree of diversity in these layers will also have a direct impact on their effectiveness (see Figure 1.8 ).

To maximize the usefulness of defense layers in national infra- structure, it is recommended that a combination of functional

Software engineering standards do not contain the same level of quality as civil and other engineering standards.

Attack Gets

Through Here...

...Hopefully

Stopped Here

Multiple Layers of Protection

Adversary Target Asset

Asset Protected

Via Depth Approach

Figure 1.8 National infrastructure security through defense in depth.

20 Chapter 1 INTRODUCTION

and procedural controls be included. For example, a common fi rst layer of defense is to install an access control mechanism for the admission of devices to the local area network. This could involve router controls in a small network or fi rewall access rules in an enterprise. In either case, this fi rst line of defense is clearly functional. As such, a good choice for a second layer of defense might involve something procedural, such as the deployment of scanning to determine if inappropriate devices have gotten through the fi rst layer. Such diversity will increase the chances that the cause of failure in one layer is unlikely to cause a similar failure in another layer.

A great complication in national infrastructure protection is that many layers of defense assume the existence of a defi ned net- work perimeter. For example, the presence of many fl aws in enter- prise security found by auditors is mitigated by the recognition that intruders would have to penetrate the enterprise perimeter to exploit these weaknesses. Unfortunately, for most national assets, fi nding a perimeter is no longer possible. The assets of a country, for example, are almost impossible to defi ne within some geo- graphic or political boundary, much less a network one. Security managers must therefore be creative in identifying controls that will be meaningful for complex assets whose properties are not always evident. The risk of getting this wrong is that in providing multiple layers of defense, one might misapply the protections and leave some portion of the asset base with no layers in place.

Discretion The principle of discretion involves individuals and groups making good decisions to obscure sensitive information about national infrastructure. This is done by combining formal man- datory information protection programs with informal discre- tionary behavior. Formal mandatory programs have been in place for many years in the U.S. federal government, where docu- ments are associated with classifi cations, and policy enforce- ment is based on clearances granted to individuals. In the most intense environments, such as top-secret compartments in the intelligence community, violations of access policies could be interpreted as espionage, with all of the associated criminal implications. For this reason, prominent breaches of highly clas- sifi ed government information are not common.

In commercial settings, formal information protection pro- grams are gaining wider acceptance because of the increased need to protect personally identifi able information (PII) such as

Naturally, top-secret information within the intelligence community is at great risk for attack or infi ltration.

Chapter 1 INTRODUCTION 21

credit card numbers. Employees of companies around the world are starting to understand the importance of obscuring certain aspects of corporate activity, and this is healthy for national infra- structure protection. In fact, programs of discretion for national infrastructure protection will require a combination of corpo- rate and government security policy enforcement, perhaps with custom-designed information markings for national assets. The resultant discretionary policy serves as a layer of protection to prevent national infrastructure-related information from reach- ing individuals who have no need to know such information.

A barrier in our recommended application of discretion is the maligned notion of “security through obscurity.” Security experts, especially cryptographers, have long complained that obscurity is an unacceptable protection approach. They correctly reference the problems of trying to secure a system by hiding its underly- ing detail. Inevitably, an adversary discovers the hidden design secrets and the security protection is lost. For this reason, con- ventional computer security correctly dictates an open approach to software, design, and algorithms. An advantage of this open approach is the social review that comes with widespread adver- tisement; for example, the likelihood is low of software ever being correct without a signifi cant amount of intense review by experts. So, the general computer security argument against “security through obscurity” is largely valid in most cases.

Nevertheless, any manager charged with the protection of nontrivial, large-scale infrastructure will tell you that discretion and, yes, obscurity are indispensable components in a protec- tion program. Obscuring details around technology used, soft- ware deployed, systems purchased, and confi gurations managed will help to avoid or at least slow down certain types of attacks. Hackers often claim that by discovering this type of informa- tion about a company and then advertising the weaknesses they are actually doing the local security team a favor. They suggest that such advertisement is required to motivate a security team toward a solution, but this is actually nonsense. Programs around proper discretion and obscurity for infrastructure information are indispensable and must be coordinated at the national level.

Collection The principle of collection involves automated gathering of sys- tem-related information about national infrastructure to enable security analysis. Such collection is usually done in real time and involves probes or hooks in applications, system software, net- work elements, or hardware devices that gather information of

“Security through obscurity” may actually leave assets more vulnerable to attack than an open approach would.

22 Chapter 1 INTRODUCTION

interest. The use of audit trails in small-scale computer security is an example of a long-standing collection practice that introduces very little controversy among experts as to its utility. Security devices such as fi rewalls produce log fi les, and systems purported to have some degree of security usefulness will also generate an audit trail output. The practice is so common that a new type of product, called a security information management system (SIMS), has been developed to process all this data.

The primary operational challenge in setting up the right type of collection process for computers and networks has been two- fold: First, decisions must be made about what types of informa- tion are to be collected. If this decision is made correctly, then the information collected should correspond to exactly the type of data required for security analysis, and nothing else. Second, decisions must be made about how much information is actu- ally collected. This might involve the use of existing system func- tions, such as enabling the automatic generation of statistics on a router; or it could involve the introduction of some new type of function that deliberately gathers the desired information. Once these considerations are handled, appropriate mechanisms for collecting data from national infrastructure can be embedded into the security architecture (see Figure 1.9 ).

The technical and operational challenges associated with the collection of logs and audit trails are heightened in the protec- tion of national assets. Because national infrastructure is so com- plex, determining what information should be collected turns out to be a diffi cult exercise. In particular, the potential arises with large-scale collection to intrude on the privacy of individu- als and groups within a nation. As such, any initiative to protect

Typical Infrastructure

Collection Points

Type and Volume

Issues

Device Status Monitors

Distributed Across

Government and Industry

Interpretation

and Action

Operating System Logs

Network Monitors

Application Hooks

Transport

Issues

Privacy

Issues

Data

Collection

Repositories

Figure 1.9 Collecting national infrastructure-related security information.

Chapter 1 INTRODUCTION 23

infrastructure through the collection of data must include at least some measure of privacy policy determination. Similarly, the vol- umes of data collected from large infrastructure can exceed prac- tical limits. Telecommunications collection systems designed to protect the integrity of a service provider backbone, for example, can easily generate many terabytes of data in hours of processing.

In both cases, technical and operational expertise must be applied to ensure that the appropriate data is collected in the proper amounts. The good news is that virtually all security protection algorithms require no deep, probing information of the type that might generate privacy or volumetric issues. The challenge arises instead when collection is done without proper advance analysis which often results in the collection of more data than is needed. This can easily lead to privacy problems in some national collection repositories, so planning is particularly necessary. In any event, a national strategy of data collection is required, with the usual sorts of legal and policy guidance on who collects what and under which circumstances. As we sug- gested above, this exercise must be guided by the requirements for security analysis—and nothing else.

Correlation The principle of correlation involves a specifi c type of analysis that can be performed on factors related to national infrastructure protection. The goal of correlation is to identify whether security- related indicators might emerge from the analysis. For example, if some national computing asset begins operating in a sluggish man- ner, then other factors would be examined for a possible correlative relationship. One could imagine the local and wide area networks being analyzed for traffi c that might be of an attack nature. In addi- tion, similar computing assets might be examined to determine if they are experiencing a similar functional problem. Also, all soft- ware and services embedded in the national asset might be ana- lyzed for known vulnerabilities. In each case, the purpose of the correlation is to combine and compare factors to help explain a given security issue. This type of comparison-oriented analysis is indispensable for national infrastructure because of its complexity.

Interestingly, almost every major national infrastructure pro- tection initiative attempted to date has included a fusion cen- ter for real-time correlation of data. A fusion center is a physical security operations center with means for collecting and ana- lyzing multiple sources of ingress data. It is not uncommon for such a center to include massive display screens with colorful,

What and how much data to collect is an operational challenge.

Only collect as much data as is necessary for security purposes.

Monitoring and analyzing networks and data collection may reveal a hidden or emerging security threat.

24 Chapter 1 INTRODUCTION

visualized representations, nor is it uncommon to fi nd such cen- ters in the military with teams of enlisted people performing the manual chores. This is an important point, because, while such automated fusion is certainly promising, best practice in cor- relation for national infrastructure protection must include the requirement that human judgment be included in the analysis. Thus, regardless of whether resources are centralized into one physical location, the reality is that human beings will need to be included in the processing (see Figure 1.10 ).

In practice, fusion centers and the associated processes and correlation algorithms have been tough to implement, even in small-scale environments. Botnets, for example, involve the use of source systems that are selected almost arbitrarily. As such, the use of correlation to determine where and why the attack is occurring has been useless. In fact, correlating geographic infor- mation with the sources of botnet activity has even led to many false conclusions about who is attacking whom. Countless hours have been spent by security teams poring through botnet infor- mation trying to determine the source, and the best one can

Correlation Process

Output

Recommended

Actions

Multiple

Ingress Data

Feeds

Comparison and

Analysis of

Relevant Factors

Derive

Real-Time

Conclusions

Figure 1.10 National infrastructure high-level correlation approach.

Three Steps to Improve Current Correlation Capabilities

1. The actual computer science around correlation algorithms needs to be better investigated. Little attention has been placed in academic computer science and applied mathematics departments to multifactor correlation of real-time security data. This could be changed with appropriate funding and grant emphasis from the government.

2. The ability to identify reliable data feeds needs to be greatly improved. Too much attention has been placed on ad hoc collection of volunteered feeds, and this complicates the ability for analysis to perform meaningful correlation.

3. The design and operation of a national-level fusion center must be given serious consideration. Some means must be identifi ed for putting aside political and funding problems in order to accomplish this important objective.

Chapter 1 INTRODUCTION 25

hope for might be information about controllers or software drops. In the end, current correlation approaches fall short.

What is needed to improve present correlation capabilities for national infrastructure protection involves multiple steps.

Awareness The principle of awareness involves an organization under- standing the differences, in real time and at all times, between observed and normal status in national infrastructure. This status can include risks, vulnerabilities, and behavior in the target infra- structure. Behavior refers here to the mix of user activity, system processing, network traffi c, and computing volumes in the soft- ware, computers, and systems that comprise infrastructure. The implication is that the organization can somehow characterize a given situation as being either normal or abnormal. Furthermore, the organization must have the ability to detect and measure differences between these two behavioral states. Correlation analysis is usually inherent in such determinations, but the real challenge is less the algorithms and more the processes that must be in place to ensure situational awareness every hour of every day. For example, if a new vulnerability arises that has impact on the local infrastructure, then this knowledge must be obtained and factored into management decisions immediately.

Managers of national infrastructure generally do not have to be convinced that situational awareness is important. The big issue instead is how to achieve this goal. In practice, real-time aware- ness requires attentiveness and vigilance rarely found in normal computer security. Data must fi rst be collected and enabled to fl ow into a fusion center at all times so correlation can take place. The results of the correlation must be used to establish a profi led baseline of behavior so differences can be measured. This sounds easier than it is, because so many odd situations have the ability to mimic normal behavior (when it is really a problem) or a problem (when it really is nothing). Nevertheless, national infrastructure protection demands that managers of assets create a locally rele- vant means for being able to comment accurately on the state of security at all times. This allows for proper management decisions about security (see Figure 1.11 ).

Interestingly, situational awareness has not been considered a major component of the computer security equation to date. The concept plays no substantive role in small-scale security, such as in a home network, because when the computing base to be protected is simple enough, characterizing real-time situational status is just not necessary. Similarly, when a security manager puts in place security controls for a small enterprise, situational

Awareness builds on collection and correlation, but is not limited to those areas alone.

26 Chapter 1 INTRODUCTION

awareness is not the highest priority. Generally, the closest one might expect to some degree of real-time awareness for a small system might be an occasional review of system log fi les. So, the transition from small-scale to large-scale infrastructure protec- tion does require a new attentiveness to situational awareness that is not well developed. It is also worth noting that the general notion of “user awareness” of security is also not the principle specifi ed here. While it is helpful for end users to have knowl- edge of security, any professionally designed program of national infrastructure security must presume that a high percentage of end users will always make the wrong sorts of security deci- sions if allowed. The implication is that national infrastructure protection must never rely on the decision-making of end users through programs of awareness.

A further advance that is necessary for situational awareness involves enhancements in approaches to security metrics report- ing. Where the non-cyber national intelligence community has done a great job developing means for delivering daily intelligence briefs to senior government offi cials, the cyber security commu- nity has rarely considered this approach. The reality is that, for sit- uation awareness to become a structural component of national infrastructure protection, valid metrics must be developed to accurately portray status, and these must be codifi ed into a suit- able type of regular intelligence report that senior offi cials can use to determine security status. It would not be unreasonable to expect this cyber security intelligence to fl ow from a central point such as a fusion center, but in general this is not a requirement.

Response The principle of response involves assurance that processes are in place to react to any security-related indicator that becomes

Large-scale infrastructure protection requires a higher level of awareness than most groups currently employ.

Targeted at

Managers Collection

Raw Data

Combined Automation and Manual Process

Fusion

Intelligence

Situational

Awareness

Figure 1.11 Real-time situation awareness process fl ow.

Chapter 1 INTRODUCTION 27

available. These indicators should fl ow into the response pro- cess primarily from the situational awareness layer. National infrastructure response should emphasize indicators rather than incidents. In most current computer security applications, the response team waits for serious problems to occur, usually including complaints from users, applications running poorly, and networks operating in a sluggish manner. Once this occurs, the response team springs into action, even though by this time the security game has already been lost. For essential national infrastructure services, the idea of waiting for the service to degrade before responding does not make logical sense.

An additional response-related change for national infra- structure protection is that the maligned concept of “false posi- tive” must be reconsidered. In current small-scale environments, a major goal of the computer security team is to minimize the number of response cases that are initiated only to fi nd that nothing was wrong after all. This is an easy goal to reach by sim- ply waiting for disasters to be confi rmed beyond a shadow of a doubt before response is initiated. For national infrastructure, however, this is obviously unacceptable. Instead, response must follow indicators, and the concept of minimizing false positives must not be part of the approach. The only quantitative metric that must be minimized in national-level response is risk (see Figure 1.12 ).

A challenge that must be considered in establishing response functions for national asset protection is that relevant indica- tors often arise long before any harmful effects are seen. This suggests that infrastructure protecting must have accurate situ- ational awareness that considers much more than just visible impacts such as users having trouble, networks being down, or services being unavailable. Instead, often subtle indicators must

• Higher False-Positive Rate

• Lower Security Risk

• Recommended for National Infrastructure

Response Process

(pre-attack)

indicator indicator indicator

• Lower False-Positive Rate

• Higher Security Risk

• Use for National Infrastructure Only If Required

effect effect effect

Response Process

(post-attack)

attack threshold time

Figure 1.12 National infrastructure security response approach.

28 Chapter 1 INTRODUCTION

be analyzed carefully, which is where the challenges arise with false positives. When response teams agree to consider such indi- cators, it becomes more likely that such indicators are benign. A great secret to proper incident response for national infrastruc- ture is that higher false positive rates might actually be a good sign.

It is worth noting that the principles of collection, correlation, awareness, and response are all consistent with the implemen- tation of a national fusion center. Clearly, response activities are often dependent on a real-time, ubiquitous operations center to coordinate activities, contact key individuals, collect data as it becomes available, and document progress in the response activ- ities. As such, it should not be unexpected that national-level response for cyber security should include some sort of central- ized national center. The creation of such a facility should be the centerpiece of any national infrastructure protection program and should involve the active participation of all organizations with responsibility for national services.

Implementing the Principles Nationally To effectively apply this full set of security principles in practice for national infrastructure protection, several practical imple- mentation considerations emerge: ● Commissions and groups —Numerous commissions and

groups have been created over the years with the purpose of national infrastructure protection. Most have had some minor positive impact on infrastructure security, but none has had suffi cient impact to reduce present national risk to accept- able levels. An observation here is that many of these commis- sions and groups have become the end rather than the means toward a cyber security solution. When this occurs, their likeli- hood of success diminishes considerably. Future commissions and groups should take this into consideration.

● Information sharing —Too much attention is placed on infor- mation sharing between government and industry, perhaps because information sharing would seem on the surface to carry much benefi t to both parties. The advice here is that a comprehensive information sharing program is not easy to implement simply because organizations prefer to maintain a low profi le when fi ghting a vulnerability or attack. In addi- tion, the presumption that some organization—government or commercial—might have some nugget of information that could solve a cyber attack or reduce risk is not generally

A higher rate of false positives must be tolerated for national infrastructure protection.

Chapter 1 INTRODUCTION 29

consistent with practice. Thus, the motivation for a commer- cial entity to share vulnerability or incident-related informa- tion with the government is low; very little value generally comes from such sharing.

● International cooperation —National initiatives focused on creating government cyber security legislation must acknowl- edge that the Internet is global, as are the shared services such as the domain name system (DNS) that all national and global assets are so dependent upon. Thus, any program of national infrastructure protection must include provisions for interna- tional cooperation, and such cooperation implies agreements between participants that will be followed as long as everyone perceives benefi t.

● Technical and operational costs —To implement the princi- ples described above, considerable technical and operational costs will need to be covered across government and commer- cial environments. While it is tempting to presume that the purveyors of national infrastructure can simply absorb these costs into normal business budgets, this has not been the case in the past. Instead, the emphasis should be on rewards and incentives for organizations that make the decision to imple- ment these principles. This point is critical because it suggests that the best possible use of government funds might be as straightforward as helping to directly fund initiatives that will help to secure national assets. The bulk of our discussion in the ensuing chapters is techni-

cal in nature; that is, programmatic and political issues are conve- niently ignored. This does not diminish their importance, but rather is driven by our decision to separate our concerns and focus in this book on the details of “what” must be done, rather than “how.”

This page intentionally left blank

31 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00002-0 2011

DECEPTION

Create a highly controlled network. Within that network, you place production systems and then monitor, capture, and analyze all activity that happens within that network Because this is not a production network, but rather our Honeynet, any traffic is suspicious by nature .

The Honeynet Project 1

The use of deception in computing involves deliberately mislead- ing an adversary by creating a system component that looks real but is in fact a trap. The system component, sometimes referred to as a honey pot , is usually functionality embedded in a computing or networking system, but it can also be a physical asset designed to trick an intruder. In both cases, a common interface is presented to an adversary who might access real functionality connected to real assets, but who might also unknowingly access deceptive functionality connected to bogus assets. In a well-designed decep- tive system, the distinction between real and trap functionality should not be apparent to the intruder (see Figure 2.1 ).

The purpose of deception, ultimately, is to enhance security, so in the context of national infrastructure it can be used for large-scale protection of assets. The reason why deception works is that it helps accomplish any or all of the following four security objectives: ● Attention —The attention of an adversary can be diverted from

real assets toward bogus ones. ● Energy —The valuable time and energy of an adversary can be

wasted on bogus targets.

2

1 The Honeynet Project, Know Your Enemy: Revealing the Security Tools, Tactics, and Motives of the Blackhat Community , Addison–Wesley Professional, New York, 2002. (I highly recommend this amazing and original book.) See also B. Cheswick and S. Bellovin, Firewalls and Internet Security: Repelling the Wily Hacker , 1st ed., Addison– Wesley Professional, New York, 1994; C. Stoll, The Cuckoo’s Egg: Tracking a Spy Through the Maze of Computer Espionage , Pocket Books, New York, 2005.

32 Chapter 2 DECEPTION

● Uncertainty —Uncertainty can be created around the veracity of a discovered vulnerability.

● Analysis —A basis can be provided for real-time security analy- sis of adversary behavior. The fact that deception diverts the attention of adversaries,

while also wasting their time and energy, should be familiar to anyone who has ever used a honey pot on a network. As long as the trap is set properly and the honey pot is suffi ciently realistic, adversaries might direct their time, attention, and energy toward something that is useless from an attack perspective. They might even plant time bombs in trap functionality that they believe will be of subsequent use in targeting real assets. Obviously, in a honey pot, this is not the case. This type of deception is a pow- erful deterrent, because it defuses a cyber attack in a way that could fool an adversary for an extended period of time.

The possibility that deception might create uncertainty around the veracity of a discovered vulnerability has been poorly explored to date. The idea here is that when an intruder inevitably stumbles onto an exploitable hole it would be nice if that intruder were led to believe that the hole might be a trap. Thus, under the right circumstances, the intruder might actu- ally choose to avoid exploitation of a vulnerability for fear that it has been intentionally planted. While this might seem diffi cult to implement in many settings, the concept is powerful because it allows security managers to defuse existing vulnerabilities without even knowing about them . This is a signifi cant enough concept that it deserves repeating: The use of deception in com- puting allows system security managers to reduce the risk of vul- nerabilities that they might not even know are present .

The fact that real-time analysis can be performed on a honey pot is reasonably well known in the computing community today.

Connected to

Real Assets

Connected to

Bogus Assets

Computing

Functionality

(Real)

Computing

Functionality

(Deceptive)

Normal

User

Normal Access

Common

Interface

Malicious

User

Normal Access

Figure 2.1 Use of deception in computing.

Deception is a powerful security tool, as it protects even unknown vulnerabilities.

Chapter 2 DECEPTION 33

Perhaps this is because it is a widely accepted best practice that security administrators should try to observe the behavior of intruders that have been detected. Most intrusion detection sys- tems, for example, include threat management back-end systems that are designed to support such an objective. In the best case, the forensic analysis gathered during deception is suffi ciently detailed to allow for identifi cation of the adversary and possibly even pros- ecution. In the most typical case, however, accurate traceability to the original human source of a problem is rarely accomplished.

Luckily, the success of deceptive traps is assisted by the fact that intruders will almost always view designers and opera- tors of national assets as being sloppy in their actions, defi cient in their training, and incompetent in their knowledge. This extremely negative opinion of the individuals running national infrastructure is a core belief in virtually every hacking com- munity in the world (and is arguably justifi ed in some environ- ments). Ironically, this low expectation is an important element that helps make stealth deception much more feasible, because honey pots do not always have to mimic a perfectly managed environment. Instead, adversaries can generally be led to fi nd a system environment that is poorly administered, and they will not bat an eyelash. This helps the deception designer.

The less well-understood case of openly advertised deception relies on the adversary believing that designers and operators of national assets are competent enough to plant a believable trap into a national asset. This view represents a hurdle, because the hacking community will need to see convincing evidence before they will ever believe that anyone associated with a large organization would be competent enough to manage a complex program of deceptive computing. This is too bad, because open use of deception carries great advantages, as we will explain in more detail below. In any event, the psychology of understanding and managing adversary views is not straightforward. This soft issue must become part of the national infrastructure protection equation but will obviously require a new set of skills among security experts.

The most common implementation of deception involves the insertion of fake attack entry points, such as open service ports, that adversaries might expect to see in a normal system. The hope is that an adversary would discover (perhaps with a scan- ner) and then connect to these open service ports, which would in turn then lead to a honey pot. As suggested above, creating realism in a honey pot is not an easy task, but several design options do exist. One approach involves routing inbound open port connections to physically separate bogus systems that are isolated from real assets. This allows for a “forklift”-type copying

Honey pots should not necessarily mimic perfect environments.

Effective cyber deception involves understanding your adversary.

34 Chapter 2 DECEPTION

of real functionality (perhaps with sensitive data sanitized) to an isolated, safe location where no real damage can be done.

Recall that, if the deception is advertised openly, the possibil- ity arises that an adversary will not bother to attempt an attack. Admittedly, this scenario is a stretch, but the possibility does arise and is worth mentioning. Nevertheless, we will assume for the balance of this discussion that the adversary fi nds the decep- tive entry point, presumes that it is real, and decides to move forward with an attack. If the subsequent deception is properly managed, then the adversary should be led down a controlled process path with four distinct attack stages: scanning , discovery , exploitation , and exposing (see Figure 2.2 ).

During the initial scanning stage, an adversary is search- ing through whatever means is available for exploitable entry points. The presumption in this stage is that the service inter- face includes trap functionality, such as bogus links on prox- ied websites that lead to a honey pot for collecting information. It is worth noting, however, that this “searching” process does not always imply the use of a network by an adversary. Instead, the adversary might be searching for exploitable entry points in contracts, processes, locked cabinets, safes, or even relation- ships with national infrastructure personnel. In practice, one might even expect a combination of computing and noncomput- ing searches for information about exploitable entry points. The deception must be designed accordingly.

During the discovery phase, an adversary fi nds an exploit- able entry point, which might be real or fake. If the vulnerability is real, then one hopes that good back-end security is in place to avoid an infrastructure disaster. Nevertheless, the decision on the

Forensics Performed

in This Stage

National Asset

Interface

Trap

Honey

Pot

Scanning Discovery Exploitation Exposing

Adversary

Figure 2.2 Stages of deception for national infrastructure protection.

Bear in mind that a cyber honey pot might require coordination with a tangible exploitable point outside the cyber world.

Chapter 2 DECEPTION 35

part of the intruder to exploit a discovered vulnerability, real or fake, is an important trigger point. Good infrastructure security systems would need to connect this exploitation point to a threat management system that would either open a security trouble ticket or would alert a security administrator that an intruder has either started an attack or fallen for the deceptive bait. Obviously, such alerts should not signal an intruder that a trap is present.

During the exploitation stage, the adversary makes use of the discovered vulnerability for whatever purposes they might have. If the vulnerability is real, then the usual infrastructure break-in scenario results. If the vulnerability is a trap, however, then its effectiveness will be directly related to the realism of the honey pot. For both stealth and non-stealth deception, this is the initial stage during which data becomes available for forensic analy- sis. A design consideration is that the actual asset must never become compromised as a result of the trap. This requirement will likely result in deceptive functionality running on computing “islands” that are functionally separated from the real assets.

During the exposing stage in deception, adversary behavior becomes available for observation. Honey pots should include suffi cient monitoring to expose adversary technique, intent, and identity. This is generally the stage during which management decisions are made about whether response actions are war- ranted. It is also a stage where real-time human actions are often required to help make the deceptive functionality look real. As we stated above, a great advantage that arises here is the low expec- tation the adversary will have regarding system administrative competency on the part of the infrastructure team. This allows the security team to use the excuse of poor setup to cover func- tional gaps that might exist in the deception.

Any one of the four stages of deception can raise signifi cant legal and social issues, so any program of national infrastruc- ture protection must have participation from the national legal community to determine what is considered acceptable. The difference between a passive trap and an active lure, for exam- ple, is subtle and must be clarifi ed before a live deployment is made into infrastructure. From a social perspective, one might hope that the acceptance that exists for using deception to catch online stalkers would be extended to the cyber security commu- nity for catching adversaries targeting national infrastructure.

Scanning Stage In this fi rst stage, the presumption is that an adversary is scan- ning whatever is available to fi nd exploitation points to attack

Actual assets must remain separate and protected so they are not compromised by a honey pot trap.

Monitoring honey pots takes security to the next level: potential for responsive action.

36 Chapter 2 DECEPTION

national infrastructure. This scanning can include online searches for web-based information, network scans to determine port availability, and even offl ine searches of documents for rele- vant information. Deception can be used to divert these scanning attempts by creating false entry points with planted vulnerabili- ties. To deal with the offl ine case, the deception can extend to noncomputing situations such as intentionally leaving a nor- mally locked cabinet or safe door open with bogus documents inserted to deceive a malicious insider.

The deceptive design goal during scanning is to make avail- able an interface with three distinct components: authorized services , real vulnerabilities , and bogus vulnerabilities . In a per- fect world, there would be no vulnerabilities, only authorized services. Unfortunately, given the extreme complexity associated with national infrastructure services, this is an unrealistic expec- tation, so real vulnerabilities will always be present in some way, shape, or form. When deception is used, these real vulnerabilities are complemented by fake ones and should be indistinguishable. Thus, an adversary will see three components when presented with a national asset interface with deception (see Figure 2.3 ).

Bogus vulnerabilities will generally be inserted based on the usual sorts of problems found in software. This is one of the few cases where the defi ciencies of the software engineering disci- pline can actually be put to good use for security. One might imag- ine situations where new vulnerabilities are discovered and then immediately implemented as traps in systems that require protec- tion. Nevertheless, planted holes do not always have to be based on such exploitable software bugs or system misconfi gurations. In some cases, they might correspond to properly administered func- tionality, but that might not be considered acceptable for local use.

Valid User

Adversary

National Asset

Authorized Service

Uncertainty About

Which is Real

Three Components of

Service Interface

Real

Vulnerabilities

Bogus

Vulnerabilities

Honey

Pots

Figure 2.3 National asset service interface with deception.

Chapter 2 DECEPTION 37

Honey Pots can be Built into Websites A good example of a trap based on properly administered functionality might be a promiscuous tab on a website that openly solicits leaks of information; this is found sometimes on some of the more controversial blog sites. If legal and policy acceptance is given, then these links might be connected in a local proxied Intranet to a honey pot collection site. Insiders to an organization might then consider leaking information directly using this link to the seemingly valid Internet site, only to be duped into providing the leak to the local security team. Again, this should only be considered for deployment if all legal and policy requirements are met, but the example does help illustrate the possibilities.

A prominent goal of deception is to observe the adversary in action. This is done via real-time collection of data about intruder activity, along with reasoned analysis about intent. For example, if the intruder seems to be guessing passwords over and over again to gain access to a honey pot system, the administrator might decide in real time to simply grant access. A great challenge is that the automation possibilities of such response are not currently well understood and are barely included in security research pro- grams. This is too bad, because such cases could really challenge and ultimately improve the skills of a good security administra- tor. One could even imagine national groups sponsoring contests between live intruders and live administrators who are battling against each other in real time in a contrived honey pot.

Deliberately Open Ports Intruders routinely search the Internet for servers that allow connections to exploitable inbound services. These services are exploitable generally because they contain some weakness such as a buffer overfl ow condition that can be tripped to gain privi- leged access. Once privileged access is obtained, the intruder can perform administrative tasks such as changing system fi les, installing malware, and stealing sensitive information. All good system administrators understand the importance of harden- ing servers by disabling all exploitable and unnecessary services. The problem is that hardening is a complex process that is made more diffi cult in environments where the operating system is proprietary and less transparent. Amazingly, most software and server vendors still deliver their products in confi gurations that include most services being default enabled.

The deliberate insertion of open service ports on an Internet- facing server is the most straightforward of all deceptive computing

Allowing an intruder access increases your risk level but also allows the security administrator to monitor the intruder’s moves.

38 Chapter 2 DECEPTION

practices. The deliberately open ports are connected to back-end honey pot functionality, which is connected to monitoring systems for the purpose of observation and analysis. The result is that serv- ers would thus present adversaries of national infrastructure with three different views of open service ports: (1) valid open ports one might expect, such as HTTP, DNS, and SMTP; (2) open ports that are inadvertently left open and might correspond to exploit- able software; and (3) open ports that are deliberately inserted and connected to bogus assets in a honey pot. As long as it is generally understood that deception could potentially be deployed, there could be some uncertainty on the part of the adversary about which open ports are deliberate and which are inadvertent (see Figure 2.4 ).

Security managers who use port scanners as part of a normal program of enterprise network protection often cringe at this use of deception. What happens is that their scanners will fi nd these open ports, which will result in the generation of reports that highlight the presumed vulnerabilities to managers, users, and auditors. Certainly, the output can be manually cropped to avoid such exposure, but this might not scale well to a large enterprise. Unfortunately, solutions are not easily identifi ed that solve this incompatibility between the authorized use of port scanners and the deliberate use of open ports as traps. It represents yet another area for research and development in deceptive computing.

Valid Open Ports

TCP 80: HTTP

TCP 53: DNS

TCP 25: SMTP

. . .

To Real Assets

To Bogus Assets

Trap Open Ports (Deliberate):

UDP 1820

UDP 1830

. . .

System A

Valid Open Ports

TCP 80: HTTP

TCP 53: DNS

TCP 25: SMTP

. . .

To Real Assets

To Bogus Assets

Open Ports (Inadvertant):

UDP 1334

UDP 1862

. . .

System B

Which is deliberate and which is inadvertent?

Intruder

Figure 2.4 Use of deceptive open ports to bogus assets.

Another challenge is for security managers to knowingly keep open ports after running scanners that highlight these vulnerabilities.

Chapter 2 DECEPTION 39

An additional consideration with the deliberate use of open ports is that care must be taken on the back end to ensure that real assets cannot be exploited. Not surprisingly, practical tech- niques for doing this are not well known. For example, if the back-end deceptive software connected to deliberately open ports shares resources with valid assets, then the potential exists for negative side effects. The only reasonable approach today would involve deliberately open ports on bogus servers that are honey pots with no valid resources. These servers should be subtly embedded into server complexes so they look normal, but they should be hardwired to separate honey pot assets. This reduces the likelihood of negative side effects on normal servers (see Figure 2.5 ).

In practice, the real challenge to the deceptive use of open ports is creating port-connected functionality that is suffi ciently valid to fool an expert adversary but also properly separated from valid services so no adversary could make use of the honey pot to advance an attack. Because computer science does not cur- rently offer much foundational assistance in this regard, national infrastructure protection initiatives must include immediate programs of research and development to push this technique forward.

Discovery Stage The discovery stage corresponds to the adversary fi nding and accepting the security bait embedded in the trap. The two corre- sponding security goals during this stage are to make an intruder believe that real vulnerabilities could be bogus and that bogus

Honey

Pot

Complex of Internet-Connected Servers

Servers

Should

Look Same

(Normal and

Honey Pot)

Subtly Embedded

Honey Pot Server

Normal

Server

Normal

Server

Figure 2.5 Embedding a honey pot server into a normal server complex.

40 Chapter 2 DECEPTION

vulnerabilities could be real. The fi rst of these goals is accom- plished by making the deception program well-established and openly known. Specifi c techniques for doing this include the following: ● Sponsored research —The use of deception in national infra-

structure could become generally presumed through the open sponsorship and funding of unclassifi ed research and devel- opment work in this area.

● Published case studies —The open publication of case studies where deception has been used effectively in national asset protection increases the likelihood that an adversary might consider a found vulnerability to be deliberate.

● Open solicitations —Requests for Information (RFIs) and Requests for Proposals (RFPs) should be openly issued by national asset protectors. This implies that funding must be directed toward security projects that would actually use deceptive methods. Interestingly, the potential that an adversary will hesitate

before exploiting a real vulnerability increases only when the use of deception appears to be a real possibility. It would seem a hollow goal, for example, to simply announce that deception is being used without honest efforts to really deploy such decep- tions in national infrastructure. This is akin to placing a home protection sign in the landscaping without ever installing a real security system. For openly advertised deception to work, the national infrastructure team must be fully committed to actually doing the engineering, deployment, and operation.

The second goal of making bogus vulnerabilities look real will be familiar to computer security experts who have considered the use of honey pots. The technique of duplication is often used in honey pot design, where a bogus system is a perfect copy of a real one but without the back-end connectivity to the real asset being protected. This is generally done by duplicating the front- end interface to a real system and placing the duplicate next to a back-end honey pot. Duplication greatly increases realism and is actually quite easy to implement in practice (see Figure 2.6 ).

As suggested above, the advantage of duplication in honey pot design is that it maximizes authenticity. If one fi nds, for example, a real vulnerability in some front-end server, then an image of that vulnerable server could be used in future deceptive confi g- urations. Programs of national infrastructure protection should thus fi nd ways to effectively connect vulnerability discovery pro- cesses to honey pot design. Thus, when a truly interesting vulner- ability is found, it can become the front end to a future deceptive trap.

Openly advertised use of deception may cause adversaries to question whether a discovered vulnerability is valid or bogus.

Turn discovered vulnerabilities into advantages by mimicking them in honey pot traps.

Chapter 2 DECEPTION 41

Deceptive Documents The creation and special placement of deceptive documents is an example method for tricking adversaries during discovery. This technique, which can be done electronically or manually, is espe- cially useful for detecting the presence of a malicious insider and will only work under two conditions: ● Content —The bogus document must include information that

is convincingly realistic. Duplication of a valid document with changes to the most sensitive components is a straightforward means for doing this.

● Protection —The placement of the bogus document should include suffi cient protections to make the document appear truly realistic. If the protection approach is thin, then this will raise immediate suspicion. Sabotage can be detected by pro- tecting the bogus document in an environment that cannot be accessed by anyone other than trusted insiders. An illustrative approach for national infrastructure protec-

tion would follow these steps: First, a document is created with information that references a specially created bogus asset, such as a phone number, physical location, or server. The informa- tion should never be real, but it should be very realistic. Next, the document is stored in a highly protected location, such as a locked safe (computer or physical). The presumption is that under normal circumstances the document should sit idly in the locked safe, as it should have no real purpose to anyone. Finally, the specially created bogus asset is monitored carefully for any attempted compromise. If someone fi nds and grabs the document, then one can conclude that some insider is not to be trusted.

Should BE NO OBVIOUS

or VISIBLE DIFFERENCES

to an ADVERSARY Off-line Duplication

(“Make a Copy of the Real Interface”)

Real Front-End

Interface

Real Back-End

Asset

Same Front-End

Interface

Back-End

Honey Pot

Figure 2.6 Duplication in honey pot design.

42 Chapter 2 DECEPTION

It should be obvious that the example scheme shown in Figure 2.7 works as well for an electronic document protected by encryption and access control as for a manual paper document locked in a protected safe. In both cases, one would expect that no one would ever correlate these bogus references. If it turns out that the monitoring shows access to these bogus assets in some related way, then one would have to assume that the pro- tected enclave has been compromised. (Monitoring a hotel might require complex logistics, such as the use of hidden cameras.) In any event, these assets would provide a platform for subsequent analysis of exploitation activity by the adversary.

Exploitation Stage The third stage of the deception lifecycle for an adversary involves exploitation of a discovered vulnerability. This is a key

Steps to Planting a Bogus Document To effectively plant a bogus document, consider following these steps: 1. Create a fi le with instructions for obtaining what would appear to be extremely sensitive information. The fi le could

include a phone number, an Internet address for a server, and perhaps a room location in some hotel. 2. Encrypt the fi le and store it on a server (or print and lock it in a safe) that one would presume to be protected from

inside or outside access. 3. Put monitoring of the server or safe in place, with no expectation of a time limit. In fact, the monitoring might go on

indefi nitely, because one would expect to see no correlative behavior on these monitored assets (see Figure 2.7 ).

Protected Enclave

(Should Prevent Normal Access)

Correlative Monitoring

(Only Invoked if Bogus Document Used) Adversary

Believes

Bogus

Document Bogus AssetsBogus Document

In-line references to . . .

telephone (987) 654-3210

address 192.123.4567

hotel rm. 1a, 23 main st.

(987)

654-3210

1a, 23 main st.

192.

123.4567

Figure 2.7 Planting a bogus document in a protected enclave.

Chapter 2 DECEPTION 43

step in the decision process for an adversary because it is usu- ally the fi rst stage in which policy rules or even laws are actu- ally violated. That is, when an intruder begins to create a cyber attack, the initial steps are preparatory and generally do not vio- late any specifi c policy rules or laws. Sometimes security experts refer to this early activity as low radar actions , and when they are detected they are referred to as indications and warnings . Determining whether to respond to indications and warnings is a challenge, because response requires time and energy. If the track record of the security team involves many response actions to indications and warnings that are largely false positives, then the organization is often tempted to reduce the response trig- ger point. This is a bad idea for national infrastructure, because the chances increase that a real event will occur that is not responded to promptly.

As such, the protection of national infrastructure should involve a mind shift away from trying to reduce false positive responses to indications and warnings. Instead, the goal should be to deal with all instances in which indication and warning actions would appear to be building up to the threshold at which exploitation begins. This is especially important, because this threshold marks the fi rst stage during which real assets, if tar- geted, might actually be damaged (see Figure 2.8 ).

The key requirement at this decision point is that any exploi- tation of a bogus asset must not cause disclosure, integrity, theft, or availability problems with any real asset. Such non- interference between bogus and real assets is easiest to accom- plish when these assets are kept as separate as possible. Physical separation of assets is straightforward; a real software applica- tion with real data, for example, could be separated from a bogus application with fake data by simply hosting each on different

Post-Attack

Stages

Pre-Attack

Stages

ExploitationDiscoveryScanning

Decision to

Exploit Discovered

Vulnerability Vulnerability

Discovered

Figure 2.8 Pre- and post-attack stages at the exploitation stage.

Responding to a large number of false positives is necessary to adequately protect national infrastructure.

44 Chapter 2 DECEPTION

servers, perhaps even on different networks. This is how most honey pots operate, and the risk of interference is generally low.

Achieving noninterference in an environment where resources are shared between real and fake assets is more challenging. To accomplish this goal, the deception designer must be creative. For example, if some business process is to be shared by both real and fake functionality, then care must be taken by the decep- tion operators to ensure that real systems are not degraded in any way. Very little research has been done in this area, especially for availability threats. Allowing a malicious adversary to execute programs on a live, valid system, for example, would provide opportunities for malicious resource exhaustion. Nevertheless, the general approach has considerable promise and deserves more attention.

A related issue involves the possibility that intrusion detection and incident response systems might be fooled during exploita- tion into believing that trap functionality is real. White hat teams in companies have dealt with this problem for years, and they must coordinate with security teams to ensure that their activity does not cause a false alarm. This can be accomplished in several ways: ● Process coordination —This involves the honey pot team coor-

dinating their activities in advance with the infrastructure security teams. The problem is that it tends to highlight the deception and might destroy some hidden transparency, espe- cially if the deception is designed to detect malicious insiders.

● Trap isolation —This involves making certain that security systems, such as intrusion detection probes, are not in the deceptive path. Ongoing architectural analysis is required in this case to make sure that the condition holds throughout the system lifecycle.

● Back-end insiders —If the security team includes some trusted insiders involved in response activities, then they might be able to ensure that a response to deceptive action does not waste time and resources. This works best if the insider is a decision-maker.

● Process allowance —In this case, the detection and response activities are allowed to proceed. This is the recommended case if the deception is considered sensitive and must be totally hidden from insiders. The exploitation stage is the most dangerous part of the

deceptive lifecycle because it involves real malicious activity from an adversary. Infrastructure teams must learn to under- stand and respect the potential for real damage if this stage is not managed carefully.

When bogus and real assets reside on the same server, vulnerability risk increases dramatically.

Chapter 2 DECEPTION 45

Procurement Tricks One way to understand adversary behavior is to compare it in dif- ferent environments. An example involves a deception designer creating two separate supplier solicitations for a given product or service. One solicitation would be for a benign, noncritical, non- sensitive application; the other would be for an obviously sensi- tive, nationally critical application. In both cases, exactly the same product or service would be requested, but when they are deliv- ered to each application detailed examination would be initiated to identify differences. Any differences found in the delivered product by the intentionally planted “critical buyer” and “benign buyer” would be viewed as potential exploits (see Figure 2.9 ).

The deception described above only works if suffi cient foren- sic capability exists to compare the two delivered products. For any product or service, this could include comparison of rela- tive software size, system performance, product documenta- tion, service functionality, or technical support. One could even imagine a second level of deception using social engineering where an impassioned plea would be made to the supplier for some undocumented type of emergency support, usually remote administration. If either of the delivered products is set up for such remote administration, then the national asset manager would know that something is wrong.

Delivered

Product

Deception Designer

Exactly

Same

Differences

Must Be

Explained

Should Be

Exactly Same

Ensure

Specs

Same

Monitor

Supplier

Activity

Carefully

Compare

Products

Delivered

Product

Supplier

Create

Two

Buyers

Benign Buyer

(e.g., school)

Critical Buyer

(e.g., military)

Order

Specification

Order

Specification

Figure 2.9 Using deception against malicious suppliers.

46 Chapter 2 DECEPTION

The procurement lifecycle is one of the most underestimated components in national infrastructure protection from an attack perspective. Generally, security teams focus on selecting, test- ing, installing, and operating functionality, with seemingly mun- dane procurement tasks left to the supply chain team. This is a huge mistake, and adversaries understand this point well. Thus, national infrastructure protection initiatives must extend to the procurement process, and the clever use of deception is a power- ful tool in this regard.

Exposing Stage The fi nal stage in the deception lifecycle involves the adversary exposing behavior to the deception operator. Presumably, in this stage, the adversary is now hacking away at the trap functionality, convinced that all systems and assets are real. All sorts of possi- bilities arise in terms of how this hacking will proceed. It could be a fl urry of intense activity in a short period of time or it could be a drawn-out process of low and slow actions, so the decep- tion team must have patience. Also, during this stage, the adver- sary might expose the use of well-known hacking techniques and tools or, alternatively, could demonstrate use of techniques not previously seen by the security team (see Figure 2.10 ).

The challenge in this stage is that the deception must allow a window for observation of intruder activity, but must also be hidden. It must provide a convenient interface for the deception team to collect data but also must provide a way for commands to be issued or changes to be made in real time. Unfortunately, few commercial products exist that are designed to support these features. Specifi c functional requirements for the monitoring

Real-Time Forensic Analysis

of Adversary Behavior

Exploitation/ExposingDiscoveryScanning

Vulnerability is

Discovered

Decision to Exploit

Vulnerability

Figure 2.10 Adversary exposing stage during deception.

National infrastructure protection must extend from procurement to operating functionality in order to be truly effective.

Chapter 2 DECEPTION 47

environment during the exposing stage of deception include the following: ● Suffi cient detail —The monitoring environment must provide

suffi cient detail so the deception operator can determine what is going on. For example, overly cryptic audit logs in terse for- mat with gaps in certain areas would not be the best approach. The usual administrative interface to an operating system (generally through a command interface) is often the most desirable approach. One should not expect fancy, colorful dis- plays for the monitoring task because most security analysts prefer the convenience of a terse command line interface.

● Hidden probes —Monitoring in this stage of the deception works only if it is completely hidden. If an adversary fi g- ures out that someone is watching, then behavior modifi ca- tion would occur immediately. Simple tasks must therefore be implemented such as suppressed listing of any processes launched by the deception team (unless desired). The art of creating realistic functionality to hide probes requires support and nurturing in the security community.

● Real-time observation —The deception operator should have access to information about exposed behavior as it happens. The degree of real time for such monitoring (e.g., instantaneous, within seconds, within minutes) would depend on the local cir- cumstances. In most cases, this observation is simply done by watching system logs, but more advanced tools are required to record and store information about intruder behavior. As we suggested above, in all cases of deception monitoring

the key design goal should be to ensure a believable environ- ment. No suspicious or unexplainable processes should be pres- ent that could tip off an intruder that logging is ongoing. Fake audit logs are also a good way to create believability; if a honey pot is developed using an operating system with normal audit logging, then this should be enabled. A good adversary will likely turn it off. The idea is that hidden monitoring would have to be put in place underneath the normal logging—and this would be functionality that the adversary could not turn off.

Interfaces Between Humans and Computers The gathering of forensic evidence during the analysis of intruder behavior in a honey pot often relies on detailed understanding of how systems, protocols, and services interact. Specifi cally, this type of communication can be performed in four different ways: human-to-human , human-to-computer , computer-to-human , and

Observing intruder activity can be an informative but risky process during the exposure stage.

48 Chapter 2 DECEPTION

computer-to-computer . If we take the fi rst term (human or com- puter) to mean the intruder and we take the second term to mean the honey pot manager, then we can make some logical distinctions.

First, it should be obvious that, in an automated attack such as a botnet, the real-time behavior of the attack system will not change based on some subjective observation of honey pot func- tionality. Certainly, the interpretation of the results of the bot- net could easily affect the thinking of the botnet operator, but the real-time functionality is not going to be affected. As such, the most powerful cases in real-time forensic analysis of honey pot behavior will be the cases where human-to-human and human-to-computer interactions are being attempted by an intruder. Let’s examine each in turn.

The most common human-to-human interaction in national infrastructure involves help desk or customer care support func- tions, and the corresponding attack approach involves social engineering of such activity. The current state of the art in deal- ing with this vulnerability is to train operators and customer care personnel to detect attempts at social engineering and to report them to the security team. Deception, however, introduces a more interesting option. If the likelihood is high that social engi- neering is being attempted, then an advanced approach to pro- tection might involve deceiving the adversary into believing that they have succeeded. This can be accomplished quite easily by simply training operators to divert social engineering attempts to specially established help desks that are phony. The operators at these phony desks would reverse social engineer such attackers to get them to expose their identity or motivation (see Figure 2.11 ).

The most common human-to-computer interaction occurs when an intruder is trying to gain unauthorized access through a series of live, interactive commands. The idea is that intruders should be led to believe that their activity is invoking services on the target system, as in the usual type of operating system hacking. A good example might involve an intruder repeatedly trying to execute some command or operation in a trap system. If

1. Attempt to Social Engineer

Help Desk

(Real)

2. Suspicious Call Diverted

3. Reverse Social Engineering (Attempt to Determine Identity)

Help Desk

(Deceptive)

Figure 2.11 Deceptively exploiting the human-to-human interface.

Real-time forensic analysis is not possible for every scenario, such as a botnet attack.

Chapter 2 DECEPTION 49

the security team notices this intent and can act quickly enough, the desired command or operation could be deliberately led to execute. This is a tricky engagement, because an expert adver- sary might notice that the target confi guration is changing, which obviously is not normal.

National Deception Program One might hope that some sort of national deception program could be created based on a collection of traps strategically planted across national infrastructure components, tied together by some sort of deception analysis backbone. Such an approach is unlikely, because deception remains a poorly understood secu- rity approach, and infrastructure managers would be very hesi- tant to allow traps to be implanted in production systems. These traps, if they malfunction or do not work as advertised, could trick authorized users or impede normal operations.

Any realistic assessment of current security and information technology practice suggests that large-scale adoption of decep- tion for national infrastructure protection would not be widely accepted today. As a result, programs of national deception would be better designed based on the following assumptions: ● Selective infrastructure use —One must assume that cer-

tain infrastructure components are likely to include decep- tive traps but that others will not. At the time of this writing, many infrastructure teams are still grappling with basic com- puter security concepts; the idea that they would agree to install traps is not realistic. As such, any program of national deception must assume that not all components would utilize honey pots in the same manner.

● Sharing of results and insights —Programs of national decep- tion can and should include a mechanism for the sharing of results and insights gained through operational use of traps and honey pots. Certainly, insight obtained through forensic analysis of adversary behavior can be shared in a structured manner.

● Reuse of tools and methods —National deception programs could serve as means for making honey pot and trap software available for deployment. In some cases, deception tools and methods that work in one infrastructure area can be reused in another. The most common criticism of deception in large-scale

national security is that automated tools such as botnets are not affected by trap functionality. While it is true that botnets attack

An expert adversary may become aware of the security team observing the attempted intrusion.

50 Chapter 2 DECEPTION

infrastructure in a blindly automated manner regardless of whether the target is real or fake, the possibility remains that trap functionality might have some positive impact. A good example might be national coordination of numerous bogus endpoints that might be ready and willing to accept botnet software. If these endpoints are designed properly, one could imagine them being deliberately designed to mess up the botnet communication, perhaps by targeting the controllers themselves. This approach is often referred to as a tarpit , and one might imagine this method being quite interesting for degrading the effectiveness of a botnet.

51 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00003-2 2011

SEPARATION

A limitation of firewalls is that they can only be as good as their access controls and filters. They might fail to detect subversive packets. In some situations, they might be bypassed altogether. For example, if a computer behind a firewall has a dial-up port, as is all too common, an intruder can get access by dialing the machine .

Dorothy Denning 1

The separation of network assets from malicious intruders using a fi rewall is perhaps the most familiar protection approach in all of computer security. Today, you will fi nd some sort of fi rewall deployed in or around virtually every computer, application, sys- tem, and network in the world. They serve as the centerpiece in most organizations’ security functionality, including intrusion detection, antivirus fi ltering, and even identity management. An enormous fi rewall industry has emerged to support such mas- sive deployment and use, and this industry has done nothing but continue to grow for years and years.

In spite of this widespread adoption, fi rewalls as separation mechanisms for large-scale infrastructure have worked to only a limited degree. The networks and systems associated with national infrastructure assets tend to be complex, with a multi- tude of different entry points for intruders through a variety of Internet service providers. In addition, the connectivity require- ments for complex networks often result in large rule sets that permit access for many different types of services and source addresses. Worse, the complexity of large-scale networks often leads to unknown, unprotected entry points into and out of the enterprise (see Figure 3.1 ).

Certainly, the use of traditional perimeter fi rewalls will con- tinue to play a role in the protection of national assets, as we will describe below. Egress fi ltering, for example, is often most effi ciently performed at the perceived perimeter of an organiza- tion. Similarly, when two or more organizations share a private

3

1 D. Denning, Information Warfare and Security , Addison–Wesley, New York, 1999, p. 354.

Firewalls are valuable and frequently employed but may not provide enough protection to large-scale networks.

52 Chapter 3 SEPARATION

connection, the connection endpoints are often the most natu- ral place to perform fi rewall fi ltering, especially if traditional circuit-switched connections are involved. To achieve optimal separation in the protection of large-scale national assets, how- ever, three new fi rewall approaches will be required: ● Network-based separation —Because the perimeter of any

complex national infrastructure component will be diffi cult to defi ne accurately, the use of separation methods such as network-based fi rewalls is imperative. Such cloud-based func- tionality allows a broader, more accurate view of the egress and ingress activity for an organization. It also provides a richer environment for fi ltering high-capacity attacks. The fi ltering of denial of service attacks aimed at infrastructure, for example, can only be stopped with special types of cloud- based fi ltering fi rewalls strategically placed in the network.

● Internal separation —National infrastructure protection will require a program of internal asset separation using fi rewalls strategically placed in infrastructure. This type of separation of internal assets using fi rewalls or other separation mecha- nisms (such as operating system access controls) is not gener- ally present in most infrastructure environments. Instead, the

Perimeter

Single Firewall

Simple

Network

Single

Internet Service

Provider

Large,

Non-Uniform

Rule Bases

Complexity

of Multiple

Providers

Unknown,

Unprotected

Link

Complex

Connectivity

to Firewall

Multiple

Internet

Service

Providers

Complex

Network

Figure 3.1 Firewalls in simple and complex networks.

Chapter 3 SEPARATION 53

idea persists that insiders should have unrestricted access to internal resources and that perimeter fi rewalls should protect resources from untrusted, external access. This model breaks down in complex infrastructure environments because it is so easy to plant insiders or penetrate complex network perimeters.

● Tailored separation —With the use of specialized protocols in national infrastructure management, especially supervisory control and data acquisition (SCADA), tailoring fi rewalls to handle unique protocols and services is a requirement. This is a challenge because commercial fi rewalls are generally designed for generic use in a wide market and tailoring will require a more focused effort. The result will be more accurate fi rewall operation without the need to open large numbers of service ports to enable SCADA applications. The reader might be amused to consider the irony pre-

sented today by network connectivity and security separation. Twenty years ago, the central problem in computer network- ing involved the rampant interoperability that existed between systems. Making two computers connect over a network was a signifi cant challenge, one that computer scientists worked hard to overcome. In some instances, large projects would be initi- ated with the goal of connecting systems together over networks. Amazingly, the challenge we deal with today is not one of con- nectivity, but rather one of separation. This comes from the ubiq- uity of the Internet Protocol (IP), which enables almost every system on the planet to be connected with trivial effort. Thus, where previously we did not know how to interconnect systems, today we don’t know how to separate them!

What Is Separation? In the context of national infrastructure protection, separation is viewed as a technique that accomplishes one of the following security objectives: ● Adversary separation —The fi rst separation goal involves sepa-

rating an asset from an adversary to reduce the risk of direct attack. Whatever implementation is chosen should result in the intruder having no direct means for accessing national assets.

● Component distribution —The second separation goal involves architecturally separating components in an infrastructure to distribute the risk of compromise. The idea here is that a com- promise in one area of infrastructure should not be allowed to propagate directly.

Now that we are able to connect systems with ease, we must learn to separate them for protection!

Commercially available fi rewalls are not designed for the large-scale complexity of our national infrastructure networks.

54 Chapter 3 SEPARATION

The access restrictions that result from either of these separa- tion approaches can be achieved through functional or physical means. Functional means involve software, computers, and net- works, whereas physical means include tangible separations such as locks, safes, and cabinets. In practice, most separation access restrictions must be designed to focus on either the insider or outsider threat. The relationship between these different separa- tion options can be examined based on the three primary factors involved in the use of separation for protecting infrastructure (see box).

A Working Taxonomy of Separation Techniques The three primary factors involved in the use of separation for protecting infrastructure include the source of the threat (insider or outsider), the target of the security control (adversary or asset), and the approach used in the security control (functional or physical). We can thus use these three factors to create a separation taxonomy that might help to compare and contrast the various options for separating infrastructure from adversaries (see Figure 3.2 ).

The fi rst column in the taxonomy shows that separation controls are focused on keeping either insiders or outsiders away from some asset. The key difference here is that insiders would typically be more trusted and would have more opportunity to gain special types of access. The second column indicates that the separation controls are focused on either keeping an adversary away from some asset or inherently separating components of the actual asset, perhaps through distribution. The third column identifi es whether the separation approach uses computing functionality or would rely instead on some tangible, physical control.

Threat

Insider Adversary Functional Internal access control

Outsider Adversary Functional Internet-facing firewall

Insider Asset Functional Application separation

Functional

Adversary

Techniques

Functional

Asset

Techniques

Physical

Adversary

and Asset

Techniques

Outsider Asset Functional Application distribution

Insider Adversary Physical Project compartmentalization

Outsider Adversary Physical Information classification

Insider Asset Physical Internal network diversity

Outsider Asset Physical Physical host distribution

Target Approach Example

Figure 3.2 Taxonomy of separation techniques.

Chapter 3 SEPARATION 55

Functional Separation Functional separation of an adversary from any computing asset is most commonly achieved using an access control mechanism with the requisite authentication and identity management. Access controls defi ne which users can perform which actions on which entities. The access rules should be predetermined in a security policy. They should specify, for example, which users can access a given application, and, obviously, the validation of user identity must be accurate. In some cases, security policy rules must be more dynamic, as in whether a new type of traffi c stream is allowed to proceed to some Internet ingress point. This might be determined by real-time analysis of the network fl ow.

An access policy thus emerges for every organization that identifi es desired allowances for users requesting to perform actions on system entities. Firewall policies are the most com- mon example of this; for example, users trying to connect to a web server might be subjected to an access control policy that would determine if this was to be permitted. Similarly, the IP addresses of some organization might be keyed into a fi rewall rule to allow access to some designated system. A major prob- lem that occurs in practice with fi rewalls is that the rule base can grow to an enormous size, with perhaps thousands of rules. The result is complexity and a high potential for error. National infrastructure initiatives must identify rewards and incentives for organizations to keep their fi rewall rule bases as small as pos- sible. Some organizations have used optimization tools for this purpose, and this practice should be encouraged for national assets.

From the fi rst two rows of the taxonomy, it should be clear that internal access controls demonstrate a functional means for separating insider adversaries from an asset, whereas Internet fi rewalls achieve roughly the same end for outside adversaries. These fi rewalls might be traditional devices, as one might fi nd in an enterprise, or special fi ltering devices placed in the network to throttle volume attacks. The third and fourth rows show that logical separation of an application is a good way to complicate an insider attack; this is comparably done for outsiders by distributing the application across different Internet-facing hosts. The last four rows in Figure 3.2 demonstrate different ways to use physical means to protect infrastructure, ranging from keeping projects and people separate from an asset to maintaining diversity and distribution of infrastructure assets. The following sections provide more detail on these separation taxonomy elements.

56 Chapter 3 SEPARATION

Two broad categories of security can be followed when trying to achieve functional separation of adversaries from any type of national infrastructure assets. The fi rst involves distributing the responsibility for access mediation to the owners of smaller asset components such as individual computers or small networks; the second involves deployment of a large, centralized mediation mechanism through which all access control decisions would be made (see Figure 3.3 ).

The distributed approach has had considerable appeal for the global Internet community to date. It avoids the problem of having to trust a large entity with mediation decisions, it allows for com- mercial entities to market their security tools on a large scale to end users, and it places control of access policy close to the asset, which presumably should increase the likelihood that the policy is appropriate. The massive global distribution of computer security responsibility to every owner of a home personal computer is an example of this approach. End users must decide how to protect their assets, rather than relying on some centralized authority.

Unfortunately, in practice, the distributed approach has led to poor results. Most end users are unqualifi ed to make good deci- sions about security, and even if a large percentage make excellent decisions, the ones who do not create a big enough vulnerability as to place the entire scheme at risk. Botnets, for example, prey on poorly managed end-user computers on broadband connections. When a home computer is infected with malware, there really is no centralized authority for performing a cleansing function. This lack of centralization on the Internet thus results in a huge secu- rity risk. Obviously, the Internet will never be redesigned to include centralized control; that would be impractical, if not impossible.

One Firewall

Centralized MediationDistributed Mediation

Multiple Firewalls

Internet

Figure 3.3 Distributed versus centralized mediation.

In large networks, fi rewall rules can become so numerous that they actually increase the margin for error.

Chapter 3 SEPARATION 57

For national infrastructure, however, the possibility does exist for more centralized control. The belief here is that an increased reliance on centralized protection, especially in con- junction with the network service provider, will improve overall national asset protection methods. This does not imply, how- ever, that distributed protection is not necessary. In fact, in most environments, skilled placement of both centralized and distributed security will be required to avoid national infra- structure attack.

National Infrastructure Firewalls The most common application of a fi rewall involves its place- ment between a system or enterprise to be protected and some untrusted network such as the Internet. In such an arrangement for the protection of a national asset, the following two possibili- ties immediately arise: ● Coverage —The fi rewall might not cover all paths between the

national asset to be protected and the untrusted network such as the Internet. This is a likely case given the general complex- ity associated with most national infrastructure.

● Accuracy —The fi rewall might be forced to allow access to the national asset in a manner that also provides inadvertent, unau- thorized access to certain protected assets. This is common in large-scale settings, especially because specialized protocols such as those in SCADA systems are rarely supported by com- mercial fi rewalls. As a result, the fi rewall operator must compen- sate by leaving certain ports wide open for ingress traffi c. To address these challenges, the design of national security

infrastructure requires a skillful placement of separation func- tionality to ensure that all relevant traffi c is mediated and that no side effects occur when access is granted to a specifi c asset. The two most effective techniques include aggregation of protections in the wide area network and segregation of protections in the local area network (see Figure 3.4 ).

Aggregating fi rewall functionality at a defi ned gateway is not unfamiliar to enterprise security managers. It helps ensure cov- erage of untrusted connections in more complex environments. It also provides a means for focusing the best resources, tools, and staff to one aggregated security complex. Segregation in a local area network is also familiar, albeit perhaps less practiced. It is effective in reducing the likelihood that external access to System A has the side effect of providing external access to System B. It requires management of more devices and does

Centralized control versus multiple, independent fi rewalls—both have their advantages, so which is best for national infrastructure?

58 Chapter 3 SEPARATION

generally imply higher cost. Nevertheless, both of these tech- niques will be important in national infrastructure fi rewall placement.

A major challenge to national infrastructure comes with the massive increase in wireless connectivity that must be presumed for all national assets in the coming years. Most enterprise work- ers now carry around some sort of smart device that is ubiqui- tously connected to the Internet. Such smart devices have begun to resemble computers in that they can support browsing, e-mail access, and even virtual private network (VPN) access to applica- tions that might reside behind a fi rewall. As such, the ease with which components of infrastructure can easily bypass defi ned fi rewall gateways will increase substantially. The result of this increased wireless connectivity, perhaps via 4G deployment, will be that all components of infrastructure will require some sort of common means for ensuring security.

Massive distribution of security to smart wireless endpoint devices may not be the best option, for all the reasons previously cited. It would require massive distribution, again, of the security responsibility to all owners of smart devices. It also requires vigi- lance on the part of every smart device owner, and this is not a reasonable expectation. An alternative approach involves iden- tifying a common transport infrastructure to enforce desired policy. This might best be accomplished via the network trans- port carrier. Network service providers offer several advantages with regard to centralized security: ● Vantage point —The network service provider has a wide van-

tage point that includes all customers, peering points, and

Firewall Aggregation

(Wide Area)

Firewall Segregation

(Local Area)

Internet

A

C

B

Figure 3.4 Wide area fi rewall aggregation and local area fi rewall segregation.

Effective protection of national infrastructure will undoubtedly be expensive due to the increased management of devices.

Smart devices have added another layer of complexity to network protection.

Chapter 3 SEPARATION 59

gateways. Thus, if some incident is occurring on the Internet, the service provider will observe its effects.

● Operations —Network service providers possess the opera- tional capability to ensure up-to-date coverage of signatures, updates, and new security methods, in contrast to the inabil- ity of most end users to keep their security software current.

● Investment —Where most end users, including enterprise groups, are unlikely to have funds suffi cient to install multiple types of diverse or even redundant security tools, service pro- viders can often support a business case for such investment. For these reasons, a future view of fi rewall functionality for

national infrastructure will probably include a new aggregation point—namely, the concept of implementing a network-based fi rewall in the cloud (see Figure 3.5 ).

In the protection of national infrastructure, the use of net- work-based fi rewalls that are embedded in service provider fab- ric will require a new partnership between carriers and end-user groups. Unfortunately, most current telecommunications ser- vice level agreements (SLAs) are not compatible with this notion, focusing instead on packet loss and latency issues, rather than policy enforcement. This results in too many current cases of a national infrastructure provider being attacked, with the ser- vice provider offering little or no support during the incident.

Service

Provider

Fabric

Network-Based Firewall

(Provider Managed)

Internet

A

B

Wireless

Connection

(3G/4G)

C

Wired

Connection

Figure 3.5 Carrier-centric network-based fi rewall.

A fi rewall in the cloud may be the future of fi rewall functionality.

60 Chapter 3 SEPARATION

Obviously, this situation must change for the protection of national assets.

DDOS Filtering A major application of the network-based fi rewall concept includes a special type of mediation device embedded in the wide area network for the purpose of throttling distributed denial of service (DDOS) attacks. This device, which can be crudely referred to as a DDOS fi lter , is essential in modern networking, given the magnifi ed risk of DDOS attacks from botnets. Trying to fi lter DDOS attacks at the enterprise edge does not make sense given the physics of network ingress capacity. If, for example, an enterprise has a 1-Gbps ingress connection from the Internet, then a botnet directing an inbound volume of anything greater than 1 Gbps will overwhelm the connection.

The solution to this volume problem is to move the fi lter- ing upstream into the network. Carrier infrastructure gener- ally provides the best available option here. The way the fi ltering would work is that volumetric increases in ingress traffi c would cause a real-time redirection of traffi c to a DDOS fi ltering com- plex charged with removing botnet-originating traffi c from valid traffi c. Algorithms for performing such fi ltering generally key on the type of traffi c being sent, the relative size of the traffi c, and any other hint that might point to the traffi c being of an attack nature. Once the traffi c has been fi ltered, it is then funneled to the proper ingress point. The result is like a large safety valve or shock absorber in the wide area network that turns on when an attack is under way toward some target enterprise (see Figure 3.6 ).

Quantitative analysis associated with DDOS protection of national infrastructure is troubling. If, for example, we assume that bots can easily steal 500 Kbps of broadband egress from the unknowing infected computer owner, then it would only require three bots to overwhelm a T1 (1.5-Mbps) connection. If one carries out this argument, then botnets with 16,000 bots are suffi cient to overwhelm a 10-Gbps connection. Given the exis- tence of prominent botnets such as Storm and Confi cker, which some experts suggest could have as many as 2 or 3 million bots, the urgency associated with putting DDOS fi ltering in place cannot be understated. An implication is that national infrastruc- ture protection initiatives must include some measure of DDOS fi ltering to reduce the risk of DDOS attacks on national assets.

A serious problem that must be addressed, however, in current DDOS attacks on infrastructure involves a so-called

The risk of DDOS attacks must be effectively addressed.

Moving the fi ltering functionality into the network will allow legitimate traffi c to pass through and the discovery of potential DDOS attacks.

Chapter 3 SEPARATION 61

amplifi cation approach. Modern DDOS attacks are generally designed in recognition of the fact that DDOS fi lters exist to detect large inbound streams of unusual traffi c. Thus, to avoid inbound fi ltering in carrier infrastructure, adversaries have begun to follow two design heuristics. First, they design DDOS traffi c to mimic normal system behavior, often creating transac- tions that look perfectly valid. Second, they design their attack to include small inbound traffi c that utilizes some unique aspect of the target software to create larger outbound responses. The result is a smaller, less obvious inbound stream which then pro- duces much larger outbound response traffi c that can cause the DDOS condition.

1 Gbps

Ingress

>> 1 Gbps DDOS Traffic

Redirected to Filters

<< 1 Gbps Valid Traffic

Tunneled to Target A

Carriers

Target A

Bots >> 1 Gbps DDOS Traffic

Aimed at Target A

Target A’s

Designated

Carrier

Figure 3.6 DDOS fi ltering of inbound attacks on target assets.

The Great Challenge of Filtering Out DDOS Attacks The great challenge regarding current DDOS attacks is that the only way to avoid the sort of problem mentioned in the text is through nontrivial changes in target infrastructure. Two of these nontrivial changes are important to mention here: 1. Stronger authentication of inbound inquiries and transactions from users is imperative. This is not desirable for

e-commerce sites designed to attract users from the Internet and also designed to minimize any procedures that might scare away customers.

2. To minimize the amplifi cation effects of some target system, great care must go into analyzing the behavior of Internet-visible applications to determine if small inquiries can produce much larger responses. This is particularly important for public shared services such as the domain name system, which is quite vulnerable to amplifi cation attacks.

These types of technical considerations must be included in modern national infrastructure protection initiatives.

Modern DDOS attacks take into account a more advanced fi ltering system and thus design the DDOS traffi c accordingly.

62 Chapter 3 SEPARATION

SCADA Separation Architecture Many critical national infrastructure systems include supervi- sory control and data acquisition (SCADA) functionality. These systems can be viewed as the set of software, computers, and networks that provide remote coordination of controls systems for tangible infrastructures such as power generation systems, chemical plants, manufacturing equipment, and transportation systems. The general structure of SCADA systems includes the following components: ● Human-machine interface (HMI) —The interface between the

human operator and the commands relevant to the SCADA system

● Master terminal unit (MTU) —The client system that gathers data locally and transmits it to the remote terminal unit

● Remote terminal unit (RTU) —The server that gathers data remotely and sends control signals to fi eld control systems

● Field control systems —Systems that have a direct interface to fi eld data elements such as sensors, pumps, and switches The primary security separation issue in a SCADA system archi-

tecture is that remote access from an MTU to a given RTU must be properly mediated according to a strong access control policy. 2 The use of fi rewalls between MTUs and RTUs is thus imperative in any SCADA system architecture. This separation must also enforce policy from any type of untrusted network, such as the Internet, into the RTUs. If this type of protection is not present, then the obvious risk emerges that an adversary can remotely access and change or infl uence the operation of a fi eld control system.

As one might expect, all the drawbacks associated with large- scale fi rewall deployment are also present in SCADA systems. Coverage and accuracy issues must be considered, as well as the likelihood that individual components have direct or wireless connections to the Internet through unknown or unapproved channels. This implies that protection of RTUs from unauthor- ized access will require a combination of segregated local area fi rewalls, aggregated enterprise-wide fi rewalls, and carrier- hosted network-based fi rewalls (see Figure 3.7 ).

The biggest issue for SCADA separation security is that most of the associated electromechanical systems were designed and evolved in an environment largely separate from conventional computing and networking. Few computing texts explain the sub- tle details in SCADA system architecture; in fact, computer scien- tists can easily complete an advanced program of study without the slightest exposure to SCADA issues. Thus, in far too many

2 R. Krutz, Securing SCADA Systems , John Wiley & Sons, New York, 2006.

Remote access from MTUs to RTUs opens the door for adversaries to take advantage of this separation.

Chapter 3 SEPARATION 63

SCADA environments, the computerized connections between tangible systems and their control networks have occurred in an ad hoc manner, often as a result of establishing local convenience such as remote access. For this reason, the likelihood is generally low that state-of-the-art protection mechanisms are in place to protect a given SCADA system from cyber attack.

An additional problem that emerges for SCADA fi rewall usage is that commercial fi rewalls do not generally support SCADA protocols. When this occurs, the fi rewall operator must exam- ine which types of ports are required for usage of the protocol, and these would have to be opened. Security experts have long known that one of the great vulnerabilities in a network is the inadvertent opening of ports that can be attacked. Obviously, national infrastructure protection initiatives must be considered that would encourage and enable new types of fi rewall function- ality such as special proxies that could be embedded in SCADA architecture to improve immediate functionality.

Physical Separation One separation technique that is seemingly obvious, but amaz- ingly underrepresented in the computer security literature, is the physical isolation of one network from another. On the sur- face, one would expect that nothing could be simpler for sepa- rating one network from any untrusted environment than just unplugging all external connections. The process is known as

Internet

SCADA

Service

Provider

Fabric

SCADA

Enterprise

Firewall

SCADA

Enterprise

LAN

Network-Based SCADA

Firewall (Provider-Managed)

Wireless

Connection

(3G/4G)

Wired

Connection

Adversary

To Field

Data Element

RTU CMTUs To Field

Data Element

RTU B

To Field

Data Element

RTU B

Figure 3.7 Recommended SCADA system fi rewall architecture.

Protection mechanisms must be updated to effectively protect a SCADA system from cyber attack.

Opening ports, although necessary, is a risky endeavor, as it subjects the SCADA system to increased vulnerabilities.

64 Chapter 3 SEPARATION

air gapping , and it has the great advantage of not requiring any special equipment, software, or systems. It can be done to sepa- rate enterprise networks from the Internet or components of an enterprise network from each other.

The problem with physical separation as a security technique is that as complexity increases in some system or network to be isolated, so does the likelihood that some unknown or unautho- rized external connection will arise. For example, a small com- pany with a modest local area network can generally enjoy high confi dence that external connections to the Internet are well known and properly protected. As the company grows, however, and establishes branch offi ces with diverse equipment, people, and needs, the likelihood that some generally unrecognized external connectivity will arise is high. Physical separation of net- work thus becomes more diffi cult.

So how does one go about creating a truly air-gapped net- work? The answer lies in the following basic principles: ● Clear policy —If a network is to be physically isolated, then

clear policy must be established around what is and what is not considered an acceptable network connection. Organizations would thus need to establish policy checks as part of the net- work connection provision process.

● Boundary scanning —Isolated networks, by defi nition, must have some sort of identifi able boundary. Although this can cer- tainly be complicated by fi rewalls embedded in the isolated net- work, a program of boundary scanning will help to identify leaks.

● Violation consequences —If violations occur, clear conse- quences should be established. Government networks in the U.S. military and intelligence communities, such as SIPRNet and Intelink, are protected by laws governing how individuals must use these classifi ed networks. The consequences of vio- lation are not pleasant.

● Reasonable alternatives —Leaks generally occur in an isolated network because someone needs to establish some sort of communication with an external environment. If a network connection is not a reasonable means to achieve this goal, then the organization must provide or support a reasonable work-around alternative. Perhaps the biggest threat to physical network isolation

involves dual-homing a system to both an enterprise network and some external network such as the Internet. Such dual- homing can easily arise where an end user utilizes the same system to access both the isolated network and the Internet. As laptops have begun to include native 3G wireless access, this like- lihood of dual-homing increases. Regardless of the method, if any

Air gapping allows for physical separation of the network from untrusted environments.

As a company grows, physical separation as a protection feature becomes increasingly complex.

Chapter 3 SEPARATION 65

sort of connectivity is enabled simultaneously to both systems, then the end user creates an inadvertent bridge (see Figure 3.8 ).

It is worth mentioning that the bridge referenced above does not necessarily have to be established simultaneously. If a sys- tem connects to one network and is infected with some sort of malware, then this can be spread to another network upon sub- sequent connectivity. For this reason, laptops and other mobile computing devices need to include some sort of native protec- tion to minimize this problem. Unfortunately, the current state of the art for preventing malware downloads is poor.

A familiar technique for avoiding bridges between networks involves imposing strict policy on end-user devices that can be used to access an isolated system. This might involve preventing certain laptops, PCs, and mobile devices from being connected to the Internet; instead, they would exist solely for isolated net- work usage. This certainly reduces risk, but is an expensive and cumbersome alternative. The advice here is that for critical sys- tems, especially those involving safety and life-critical applica- tions, if such segregation is feasible then it is probably worth the additional expense. In any event, additional research in multi- mode systems that ensure avoidance of dual-homing between networks is imperative and recommended for national infra- structure protection.

Insider Separation The insider threat in national infrastructure protection is espe- cially tough to address because it is relatively easy for determined

Dual-homing creates another area of vulnerability for enterprise networks.

Isolated Environment

Isolated

Network Internet

End Users

Simultaneous

Dual-Homing

Leak

Figure 3.8 Bridging an isolated network via a dual-homing user.

Imposing strict policies regarding connection of laptops, PCs, and mobile devices to a network is both cumbersome and expensive but necessary.

66 Chapter 3 SEPARATION

adversaries to obtain trusted positions in groups with responsi- bility for national assets. This threat has become even more diffi - cult to counter as companies continue to partner, purchase, and outsource across political boundaries. Thus, the ease with which an adversary in one country can gain access to the internal, trusted infrastructure systems of another country is both growing and troubling.

Traditionally, governments have dealt with this challenge through strict requirements on background checking of any individuals who require access to sensitive government systems. This practice continues in many government procurement set- tings, especially ones involving military or intelligence infor- mation. The problem is that national infrastructure includes so much more than just sensitive government systems. It includes SCADA systems, telecommunications networks, transportation infrastructure, fi nancial networks, and the like. Rarely, if ever, are requirements embedded in these commercial environments to ensure some sort of insider controls against unauthorized data collection, inappropriate access to customer records, or admin- istrative access to critical applications. Instead, it is typical for employees to be granted access to the corporate Intranet, from which virtually anything can be obtained.

Techniques for reducing the risk of unauthorized insider access do exist that can be embedded in the design and operation of national infrastructure operation. These techniques include the following: ● Internal fi rewalls —Internal fi rewalls separating components

of national assets can reduce the risk of insider access. Insiders with access to component A, for example, would have to suc- cessfully negotiate through a fi rewall to gain access to com- ponent B. Almost every method for separating insiders from assets will include some sort of internal fi rewall. They can be implemented as fully confi gured fi rewalls, or as packet fi lter- ing routers; but regardless, the method of separating insiders from assets using fi rewalls must become a pervasive control in national infrastructure.

● Deceptive honey pots —As we discussed in Chapter 2, internal honey pots can help identify malicious insiders. If the decep- tion is openly advertised, then malicious insiders might be more uncertain in their sabotage activity; if the deception is stealth, however, then operators might observe malicious behavior and potentially identify the internal source.

● Enforcement of data markings —Many organizations with responsibility for national infrastructure do not properly mark their information. Every company and government agency

An adversarial threat may come from a trusted partner.

The commercially run components of our national infrastructure do not have the same stringent personnel requirements as the government-run components.

Chapter 3 SEPARATION 67

must identify, defi ne, and enforce clearly visible data markings on all information that could be mishandled. Without such markings, the likelihood of proprietary information being made available inadvertently to adversaries increases sub- stantially. Some companies have recently begun to use new data markings for personally identifi able information (PII).

● Data leakage protection (DLP) systems —Techniques for sniff- ing gateway traffi c for sensitive or inappropriate materials are becoming common. Tools called DLP systems are routinely deployed in companies and agencies. At best, they provide weak protection against insider threats, but they do help identify erro- neous leaks. Once deployed, they provide statistics on where and how insiders might be using corporate systems to spill informa- tion. In practice, however, no knowledgeable insider would ever be caught by a data leakage tool. Instead, the leak would be done using non-company-provided computers and networks. One of the more effective controls against insider threats

involves a procedural practice that can be embedded into virtu- ally every operation of an organization. The technique is known as segregation of duties , and it should be familiar to anyone who has dealt with Sarbanes-Oxley requirements in the United States. Security researchers will recognize the related separation of duties notion introduced in the Clark-Wilson integrity model. In both cases, critical work functions are decomposed so that work com- pletion requires multiple individuals to be involved. For example, if a fi nancial task requires two different types of activities for com- pletion, then a segregation of duties requirement would ensure that no one individual could ever perform both operations.

The purpose of this should be obvious. By ensuring that mul- tiple individuals are involved in some sensitive or critical task, the possibility of a single insider committing sabotage is greatly reduced. Of course, multiple individuals could still collude to cre- ate an internal attack, but this is more diffi cult and less likely in most cases. If desired, the risk of multiple individuals creating sabotage can be reduced by more complex segregation of duty policies, perhaps supported by the use of security architectural controls, probably based on internally positioned fi rewalls. In fact, for network-based segregation tasks, the use of internal fi re- walls is the most straightforward implementation.

In general, the concept of segregation of duties can be rep- resented via a work function ABC that is performed either by a single operator A or as a series of work segments by multiple operators. This general schema supports most instances of segre- gation of duties, regardless of the motivation or implementation details (see Figure 3.9 ).

Segregation of duties offers another layer of protection.

Internal fi rewalls create a straightforward de facto separation of duties.

68 Chapter 3 SEPARATION

The idea of breaking down work functions into components is certainly not new. Managers have decomposed functions into smaller tasks for many years; this is how assembly lines origi- nated. Unfortunately, most efforts at work function decomposi- tion result in increased bureaucracy and decreased worker (and end-user) satisfaction. The stereotyped image arises of the gov- ernment bureau where customers must stand in line at this desk for this function and then stand in line at that desk for that func- tion, and so on. The process is clearly infuriating but, ironically, is also diffi cult to sabotage by a malicious insider.

The challenge for national infrastructure protection is to inte- grate segregation of duty policies into all aspects of critical asset management and operation, but to do so in a manner that mini- mizes the increased bureaucracy. This will be especially diffi - cult in government organizations where the local culture always tends to nurture and embrace new bureaucratic processes.

Asset Separation Asset separation involves the distribution, replication, decompo- sition, or segregation of national assets to reduce the risk of an isolated compromise. Each of these separation techniques can be described as follows: ● Distribution involves creating functionality using multiple

cooperating components that work together as a distributed system. The security advantage is that if the distributed system is designed properly then one or more of the components can be compromised without breaking the overall system function.

● Replication involves copying assets across disparate compo- nents so that if one asset is broken then replicated versions

Original Work

Function with One

Operator

Operator A

Work Function ABC

Decomposed

Function with

Segregation of

Duties

Operator BOperator A Operator C

Work Function A Work Function B Work Function C

Figure 3.9 Decomposing work functions for segregation of duty.

How to effectively separate duties without increasing the unwieldy bureaucracy is a challenge that must be addressed.

Chapter 3 SEPARATION 69

will continue to be available. Database systems have been pro- tected in this way for many years. Obviously, no national asset should exist without a degree of replication to reduce risk.

● Decomposition is the breaking down of complex assets into individual components so that isolated compromise of a com- ponent will be less likely to break the overall asset. A common implementation of a complex business process, for example, generally includes some degree of decomposition into smaller parts.

● Segregation is the logical separation of assets through spe- cial access controls, data markings, and policy enforcement. Operating systems, unfortunately, provide weak controls in this regard, largely because of the massive deployment of single- user machines over the past couple of decades. Organizations thus implement logical separation of data by trying to keep it on different PCs and laptops. This is a weak implementation. Each of these techniques is common in modern infrastruc-

ture management. For example, content distribution networks (CDNs) are rarely cited as having a positive impact on national infrastructure security, but the reality is that the distribution and replication inherent in CDNs for hosting are powerful techniques for reducing risk. DDOS attacks, for example, are more diffi cult to complete against CDN-hosted content than for content resident only on an origination host. Attackers have a more diffi cult time targeting a single point of failure in a CDN (see Figure 3.10 ).

It is important to emphasize that the use of a CDN certainly does not ensure protection against a DDOS attack, but the rep- lication and distribution inherent in a CDN will make the attack more diffi cult. By having the domain name system (DNS) point

Segregation is one method of separation.

CarriersBots DDOS Attack Aimed

at Origination Host

Target A’s

Designated

Carrier CDN

Replicated

Hosts

Same Possibly

Unaffected by

DDOS Attack

Origination Host

Network

CDN

CDN

Figure 3.10 Reducing DDOS risk through CDN-hosted content.

70 Chapter 3 SEPARATION

to CDN-distributed assets, the content naturally becomes more robust. National infrastructure designers and operators are thus obliged to ensure that CDN hosting is at least considered for all critically important content, especially multimedia content (streaming and progressive download) and any type of critical software download.

This is becoming more important as multimedia provi- sion becomes more commonly embedded into national assets. In the recent past, the idea of providing video over the Internet was nothing more than a trivial curiosity. Obviously, the mas- sive proliferation of video content on sites such as YouTube.com has made these services more mainstream. National assets that rely on video should thus utilize CDN services to increase their robustness. Additional DDOS protection of content from the backbone service provider would also be recommended.

Multilevel Security (MLS) A technique for logical separation of assets that was popular in the computer security community during the 1980s and 1990s is known as multilevel security (MLS). MLS operating systems and applications were marketed aggressively to the security commu- nity during that time period. A typical implementation involved embedding mandatory access controls and audit trail hooks into the underlying operating system kernel. Assurance methods would then be used to ensure that the trusted component of the kernel was correct, or at least as correct as could be reasonably verifi ed. Today, for reasons largely economic, MLS systems are no longer available, except in the most esoteric classifi ed govern- ment applications.

The idea behind MLS was that, by labeling the fi les and direc- tories of a computer system with meaningful classifi cations and by also labeling the users of that system with meaningful clear- ances, a familiar security policy could be enforced. This scheme, which was motivated largely by paper methods used to protect information in government, produced a logical separation of cer- tain assets from certain users, based on the existing policy. For example, fi les marked “secret” could only be read by users with suffi cient clearances. Similarly, users not cleared to the level of “top secret” would not be allowed to read fi les that were so labeled. The result was an enforced policy on requesting users and protected assets (see Figure 3.11 ).

Several models of computer system behavior with such MLS functionality were developed in the early years of computer

The increase in multimedia components within national infrastructure networks argues for increased reliance on CDN services.

The familiar notion of “top- secret clearance” comes from MLS systems.

Chapter 3 SEPARATION 71

security. The Bell-La Padula disclosure and Biba integrity models are prominent examples. Each of these models stipulated policy rules that, if followed, would help to ensure certain desirable security properties. Certainly, there were problems, especially as networking was added to isolated secure systems, but, unfortu- nately, most research and development in MLS dissolved myste- riously in the mid-1990s, perhaps as a result of the economic pull of the World Wide Web. This is unfortunate, because the function- ality inherent in such MLS separation models would be valuable in today’s national infrastructure landscape. A renewed interest in MLS systems is thus strongly encouraged to improve protec- tion of any nation’s assets.

Requesting

Users

Top Secret

Cleared

Secret

Cleared

MLS Policy

Enforcement

Read (allowed)

Read (blocked)

Read (allowed)

Protected

Assets

Top Secret

Classified

Secret

Classified

MLS Produces Logical

Separation of Assets

TS

TS

TS S

S

S

TS

TS

S

S

Figure 3.11 Using MLS logical separation to protect assets.

Implementing a National Separation Program Implementation of a national separation program would involve verifi cation and validation of certain design goals in government agencies and companies with responsibility for national infrastructure. These goals, related to policy enforcement between requesting users and the protected national assets, would include the following: ● Internet separation —Certain critical national assets simply should not be accessible from the Internet. One would

imagine that the control systems for a nuclear power plant, for example, would be good candidates for separation from the Internet. Formal national programs validating such separation would be a good idea. If this requires changes in business practice, then assistance and guidance would be required to transition from open, Internet connectivity to something more private.

● Network-based fi rewalls —National infrastructure systems should be encouraged to utilize network-based fi rewalls, preferably ones managed by a centralized group. The likelihood is higher in such settings that signatures will be

MLS systems seem to have gone by the wayside but should be revived as another weapon in the national infrastructure protection arsenal.

72 Chapter 3 SEPARATION

Obviously, once a national program is in place, consideration of how one might separate assets between different cooperat- ing nations would seem a logical extension. Certainly, this would seem a more distant goal given the complexity and diffi culty of creating validated policy enforcement in one nation.

kept up to date and that security systems will be operated properly on a 24/7 basis. Procurement programs in government, in particular, must begin to routinely include the use of network-based security in any contract with an Internet service provider.

● DDOS protection —All networks associated with national assets should have a form of DDOS protection arranged before an attack occurs. This protection should be provided on a high-capacity backbone that will raise the bar for attackers contemplating a capacity-based cyber attack. If some organization, such as a government agency, does not have a suitable DDOS protection scheme, this should be likened to having no disaster recovery program.

● Internal separation —Critical national infrastructure settings must have some sort of incentive to implement an internal separation policy to prevent sabotage. The Sarbanes-Oxley requirements in the United States attempted to enforce such separation for fi nancial systems. While the debate continues about whether this was a successful initiative, some sort of program for national infrastructure seems worth considering. Validation would be required that internal fi rewalls exist to create protection domains around critical assets.

● Tailoring requirements —Incentives must be put in place for vendors to consider building tailored systems such as fi rewalls for specialized SCADA environments. This would greatly reduce the need for security administrators in such settings to confi gure their networks in an open position.

73 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00004-4 2011

DIVERSITY

We are looking at computers the way a physician would look at genetically related patients, each susceptible to the same disorder .

Mike Reiter, professor of electrical and computer engineering and computer science at Carnegie-Mellon University 1

Making national infrastructure more diverse in order to create greater resilience against cyber attack seems to be a pretty sen- sible approach. For example, natural scientists have known for years that a diverse ecosystem is always more resilient to disease than a monoculture. When a forest includes only one tree, the possibility arises that a single disease could wipe out the entire ecosystem. This type of situation arises even in business. Certain airlines, for example, have decided to use only one model of air- craft. This reduces the cost of maintenance and training but does create a serious risk if that particular aircraft were grounded for some reason. The airline would be out of business—a risk that is avoided by a diversity approach.

So it would stand to reason that the process of securing any set of national assets should always include some sort of diver- sity strategy. This diversity should extend to all applications, soft- ware, computers, networks, and systems. Unfortunately, with the exception of familiar geographic requirements on network routes and data centers, diversity is not generally included in infra- structure protection. In fact, the topic of deliberately introduc- ing diversity into national infrastructure to increase its security has not been well explored by computer scientists. Only recently have some researchers begun to investigate the benefi ts of diver- sity in software deployment.

Diversity in national infrastructure involves the introduc- tion of intentional differences into systems. Relevant differences include the vendor source, deployment approach, network con- nectivity, targeted standards, programming language, operating

4

1 Quoted in “Taking Cues from Mother Nature to Foil Cyber Attacks” (press release), Offi ce of Legislative and Public Affairs, National Science Foundation, Washington, D.C., 2003 ( http://www.nsf.gov/od/lpa/news/03/pr03130.htm ).

Introducing diversity at all levels of functionality has not been properly explored as a protection strategy.

74 Chapter 4 DIVERSITY

system, application base, software version, and so on. Two sys- tems are considered diverse if their key attributes differ, and non- diverse otherwise (see Figure 4.1 ).

The general idea is that an adversary will make assumptions about each of the relevant attributes in a target system. In the absence of diversity, a worst-case scenario results if the adversary makes the right assumptions about each attribute. If, for exam- ple, the adversary creates an attack on a set of computers that assumes an underlying Microsoft ® operating system environ- ment, and the national asset at risk employs only these types of systems, then the effect could be signifi cant. In the presence of diversity, however, it becomes much more diffi cult for an adver- sary to create an attack with maximal reach. This is especially rel- evant for attacks that are designed to automatically propagate. Eventually, the attack will reach a point where it can no longer copy itself or remotely execute, and the process will cease.

Why, then, is diversity so underrepresented in national infra- structure protection? To understand this, one must fi rst recognize the near-obsessive goal of enforcing sets of common standards that the information technology and security communities have attempted to achieve. In nearly every facet of computing, sets of standard, auditable practices have been defi ned and backed by powerful organizations. In the United States, the Sarbanes- Oxley standard has had a profound infl uence on the operation of every major corporation in the country, leading to more common approaches to fi nancial systems operation. Commonality, as we discuss in the next chapter, is somewhat at odds with diversity.

This focus on maintaining common, standard operating envi- ronments should not come as a surprise. The rise of the Internet, for example, was driven largely by the common acceptance of a single protocol suite. Even the provision of Internet-based ser- vices such as websites and mail servers requires agreement among system administrators to follow common port assign- ments. Chaos would ensue if every administrator decided to

System

A

B

C

Company X

Company Y

Company Z

Off-the-shelf

Custom

Custom

IP

TDM

TDM

IP sec

None

None

C++

Java

Java

Windows

Unix

Unix

Vendor

Source

Deployment

Approach

Network

Connectivity

Targeted

Standards

Programming

Language

Operating

System Attributes

A and B:

Diverse

B and C:

Non diverse

Figure 4.1 Diverse and nondiverse components through attribute differences.

Diversity increases the number of assumptions an adversary has to make about the system and creates more potential for an adversary’s plan to fail.

Standardized operations are important for compliance but are at odds with diversity.

Chapter 4 DIVERSITY 75

assign random ports to their Internet services; end users would not be able to easily locate what they need, and the Internet would be a mess (although this would certainly complicate broad types of attacks). So, the result is general agreement on common computing confi gurations.

Another key motivation to avoid diversity for most system managers is the costs involved. Typical computing and net- working management teams have created programs focused on removing differences in enterprise systems in order to reduce operating expenses. Clearly, nondiverse information technology systems simplify platform deployment, end-user training, system administrative practices, and system documentation. For these cost-related reasons, diversity is generally not a prominent goal in most current national infrastructure settings. The result is less secure infrastructure.

Diversity and Worm Propagation The self-propagation of a computer worm is a good example of an attack that relies on a nondiverse target environment to function properly. The box shows how relatively simple an attack can be.

Diversity currently competes with commonality and cost savings.

Worm Functionality in Three Easy Steps The functionality of a typical, generic computer worm is quite straightforward (only three steps) and can be described in simple pseudo-code terms as follows:

Program: Worm Start

Step 1. Find a target system on the network for propagation of Program Worm . Step 2. Copy Program Worm to that target system. Step 3. Remotely execute Program Worm on that target system.

Repeat Steps 1 through 3.

As you can see, a worm program relies on the ability to fi nd common, reachable, interoperable systems on the network that will accept and execute a copy of the worm program. In the early days of the Internet, this would be accomplished by checking a local fi le that would include a list of systems that were reach- able. Today, it’s done by creating batches of Internet Protocol

76 Chapter 4 DIVERSITY

addresses. Also, in those early days, it was quite easy to copy and execute programs from one system to another, because no one had yet invented the fi rewall.

One would have hoped that the global deployment of fi re- walls would have stopped the ability of adversaries to create worms, but sadly it has not. Instead, vulnerabilities or services open through the fi rewalls are used as the basis for worms. Nondiversity in such setups is also the norm. This is unfortunate, because if a worm operates in a diverse environment, and thus cannot fi nd systems that consistently meet one or more of these criteria, then its propagation will cease more rapidly. This can be depicted in a simple reachability diagram showing the point of initiation for the worm through its propagation to the fi nal point at which the activity ceases as a result of diversity. As the worm tries to propagate, diversity attributes that reduce its ability to locate reachable systems, make copies, and remotely execute are the most effective (see Figure 4.2 ).

Obviously, all worms will eventually cease to propagate, regardless of the degree of diversity in a given network. The secu- rity advantage one gains with diversity is that the worm is likely to cease more quickly and perhaps without human intervention. Empirical experience in the global security community deal- ing with worms such as the SQL/Slammer and Blaster worms of 2003 and the Sasser worm of 2004 suggest that signifi cant human intervention is required to halt malicious operation. During the early hours of the SQL/Slammer worm, most of the security inci- dent response calls involved people trying to fi gure out what to

Unreachable

Worm

Initiated

Worm

Ceased

Unable to Accept

Copy of Worm

Unable to

Remotely

Execute

Unreachable

Unable to Accept

Copy of Worm

Unable to

Remotely

Execute

Figure 4.2 Mitigating worm activity through diversity.

A worm propagates by fi nding interoperable systems to target.

Chapter 4 DIVERSITY 77

do. Eventually, the most effective solution involved putting local area network blocks in place to shut down the offending traf- fi c. By the time the event died down, many millions of hours of global labor had been expended working on the problem. By increasing diversity, one should expect to reduce response costs around the world associated with fi ghting worms.

The real challenge here is that both the Internet and the networks and systems being run by companies and agencies charged with national infrastructure are simply not diverse— and there is little discussion in place to alter this situation. As we suggested earlier, this is driven largely by the goal to maximize interoperability. There are some exceptions in the broader com- puting community, such as digital rights management (DRM)- based systems that have tended to limit the execution of certain content applications to very specifi c devices such as the iPod ® and iPhone ® . The general trend, however, is toward more open, interoperable computing. What this means is that, for national infrastructure components that must be resilient against auto- mated attacks such as worms, the threat will remain as long as the networking environment is a monoculture.

Desktop Computer System Diversity Typical individual computer users in the home or offi ce, regard- less of their location in the world, are most likely to be using a commercial operating system running on a standard processor platform and utilizing one of a couple of popular browsers to perform searches on a popular search engine. This might seem an obvious statement, but in the early days of computing there were many users on home-grown or proprietary systems using all sorts of software that might only be known locally.

Today, however, the most likely confi guration would be a Windows ® -based operating system on an Intel ® platform with Internet Explorer ® being used for Google ® searches. We can say this confi dently, because almost all current estimates of market share list these products as dominant in their respective fi elds. Certainly, competing platforms and services from Apple ® and others have made inroads, but for the most part, especially in business and government environments, the desktop confi gura- tion is highly predictable (see Figure 4.3 ).

This dominant position for these few companies has admit- tedly led to a number of positive results. It has, for instance, pushed a deeper common understanding of computing among individuals around the world. Different people from different

Although introducing security can seem expensive, one should expect to save money on response costs with an effective diverse environment.

The average home PC user is working in a highly predictable computing environment.

78 Chapter 4 DIVERSITY

cultures around the world can share their experiences, recom- mendations, and suggestions about operating systems, search engines, CPUs, and browsers, and the likelihood of applicability is high. The dominant position of these respective products has also helped the software development industry by creating rich and attractive common target markets. Developers generally love to see a dominant platform confi guration, because it increases their potential profi ts through maximal usage. So, computing certainly has moved forward as a result of commonality; not much disagreement exists on this point.

The drawback from a national infrastructure perspective, how- ever, is that adversaries will have an easier time creating attacks with signifi cant reach and implication. Just as a game of dominoes works best when each domino is uniformly designed and positioned, so does common infrastructure become easier to topple with a single, uniform push. In some cases, the effect is signifi cant; the operat- ing system market on desktop PCs, for example, is dominated by Microsoft ® to the point where a well-designed Windows ® -based attack could be applicable to 90% of its desktop targets.

More likely, however, is the situation where the creation of a botnet becomes much easier given the nondiversity of PC con- fi gurations. When a botnet operator conceptualizes the design of a new botnet, the most important design consideration involves reach. That is, the botnet operator will seek to create malware that has the maximal likelihood of successfully infecting the larg- est number of target PCs. As such, the nondiversity of end-user confi gurations plays right into the hands of the botnet operator. Combine this with the typically poor system administrative prac- tices on most PCs, and the result is lethal. Worse, many security managers in business and government do not understand this risk. When trying to characterize the risk of attack, they rarely understand that the problem stems from a global set of nondiverse end-user PCs being mismanaged by home and offi ce workers.

Most Likely Configuration for

Home or Office PC User

Windows – 90%

Google – 81%

Intel – 79%

Internet Explorer – 67%

Operating System

Search Engine

CPU

Browser

Two-Thirds of All PCs

Nine-Tenths of All PCs

Figure 4.3 Typical PC confi guration showing nondiversity.

Targeting the most popular operating system software with a worm attack could bring the majority of PCs to a standstill.

Security managers are unlikely to consider the home PC user when assessing risk.

Chapter 4 DIVERSITY 79

In response to this threat, national infrastructure protection requires a deliberate and coordinated introduction of diver- sity into the global desktop computing environment. Enterprise attention is obviously different than that of individuals in homes, but the same principle applies. If the desktop computing assets that can reach a national asset must be maximally resilient, then desktop diversity is worth considering. The most obvious chal- lenge here is related to the consumer marketplace for PCs; that is, the reason why consumers use the same platform is because they prefer it and have chosen to purchase it. If Microsoft ® and Intel ® , for example, were not providing value in their products, then people would buy something else. The biggest hurdle, there- fore, involves enabling nondiversity without altering the ability of companies to provide products that people like to use. Perhaps this goal could be accomplished via diversity elements coming from within the existing vendor base.

Desktop Diversity Considerations Additional issues that arise immediately with respect to desktop diversity programs include the following: ● Platform costs —By introducing multiple, diverse platforms into a computing environment, the associated hardware

and software costs might increase. This is a common justifi cation by information technology (IT) managers for avoiding diversity initiatives. Certainly, the procurement of larger volumes of a given product will reduce the unit cost, but by introducing competition into the PC procurement arena increased costs might be somewhat mitigated.

● Application interoperability —Multiple, diverse platforms will complicate organizational goals to ensure common interoperability of key applications across all platforms. This can be managed by trying to match the desktop platform to local needs, but the process is not trivial. The good news is that most web-based applications behave similarly on diverse platforms.

● Support and training —Multiple, diverse platforms will complicate support and training processes by adding a new set of vendor concerns. In practical terms, this often means introducing a platform such as Mac OS ® to a more traditional Windows ® -based environment. Because many consumers are comfortable with both platforms, especially youngsters who tend to be more diverse in their selections, the problem is not as intense as it might be.

For national infrastructure protection, desktop diversity ini- tiatives that are focused on ensuring enterprise differences in companies and agencies have a good chance of success. Rewards and incentives can be put in place to mix up the desktop plat- forms in a given enterprise. The problem is that this will have only limited usefulness from the perspective of botnet design and recruitment. The real advantage would come from diversity in

80 Chapter 4 DIVERSITY

broadband-connected PCs run by consumers around the world. Unfortunately, this is not something that can be easily controlled via an initiative in any country, including the United States.

Interestingly, a related problem that emerges is the seem- ingly widespread software piracy one fi nds in certain areas of the globe. Software piracy on the desktop introduces the problem of security updates; that is, depending on the specifi cs of the theft, it is often diffi cult for pirated PCs to be properly protected with required patches. When many millions of PCs are in this state, the problem of nondiversity becomes all the more severe.

Diversity Paradox of Cloud Computing To better understand how diversity goals can be accomplished, it helps to introduce a simple model of desktop computing sys- tems. The model is represented as a linear spectrum of options related to the degree to which systems are either diverse or non- diverse. As such, the two ends of the model spectrum are easy to identify for a given environment. On one side of the spectrum would be the option of complete nondiversity, where every desk- top system in the organization, enterprise, or group is exactly the same. On the other side of the spectrum would be the option of complete diversity across the organization, where no two desk- top systems are the same. In the middle of the spectrum would be the usual types of settings, where some minor degree of diver- sity exists, but with a clearly dominant platform.

The model spectrum is useful because it allows illustration of our basic infrastructure security proposition around PCs— namely, as diversity increases, desktop attacks, including the use of worms to create a local denial of service condition, are more diffi cult to accomplish. One might also suggest that the creation and use of botnets would also be more diffi cult, but this benefi t might be more modest (see Figure 4.4 ).

Desktop Attack Difficulty (increases)

Typical Enterprise

(Mostly Same, Some Different)

Desktops

Different

Desktops

Same

Figure 4.4 Spectrum of desktop diversity options.

Global diversity in broadband-connected home PCs would stymie many botnet attacks.

Chapter 4 DIVERSITY 81

In fact, diverse desktops are tougher to uniformly compro- mise, because they are less conducive as a group to a scalable, self-propagating attack. For example, if a company has half of its PCs running Windows ® -based operating systems and half run- ning Mac OS ® -based operating systems, then this will clearly be more challenging for an automatically propagating attack. Hence, the level of diversity and the associated diffi culty of attack appear to correlate. A challenge with this view, however, is that it does not properly characterize the optimal choice in reducing desktop attack risk—namely, the removal of desktops from the target environment. After all, one cannot attack systems that are not even there. This suggests a new (and admittedly theoretical) diversity and attack diffi culty spectrum (see Figure 4.5 ).

This suggests that the ultimate (albeit impossible) option for making desktops more secure involves their removal. Obviously, this is not a practical goal, but computer security objectives are often made more tractable via clear statements of the ideal con- dition. So, while current enterprise or home computing architec- tures do not include the option of having no desktop computers, older readers will remember the days when desktops did not exist. Rather, people used computer terminals to access informa- tion on mainframes, and security benefi ts were certainly pres- ent in such a setup. This included no need for end-user software patching, as well as no end-user platform for targeted malware. One great irony in the present deployment of desktops to every man, woman, and child on the planet is that most people really do not need such computing power. It is likely that they would be just fi ne with a keyboard, screen, and mouse connected to network-resident applications that are ubiquitously available via the Internet.

In modern computing, the closest thing we have to this arrangement is virtualized, cloud-based computing. In such a setup, computing power and application intelligence move to a centralized complex of servers, accessible via light clients. In

Desktop Attack Difficulty

Desktops

Removed

Desktops

Different

Desktops

Same

Figure 4.5 Diversity and attack diffi culty with option of removal.

As the level of diversity increases, the level of diffi culty for an attack likewise increases.

The global proliferation of home PCs has increased the risk of malware attacks.

82 Chapter 4 DIVERSITY

fact, handheld mobile devices provide the equivalent of a desk- top computer in such a cloud environment. One should therefore presume, from the diagram in Figure 4.5 , that cloud computing would provide considerable security benefi ts by removing non- diverse desktops from the environment. This is most likely true, as long as the infrastructure supporting the cloud applications is properly secured, as per the various principles described in this book. If this is not the case, then one is simply moving nondiver- sity vulnerabilities from the desktops to the servers.

Network Technology Diversity Modern telecommunications network systems can be viewed as consisting of the following two basic types of technologies: ● Circuit-switched —This includes legacy, circuit-switched

systems that support traditional plain old telephone ser- vices (POTS) and related voice and data services. The public switched telephone network (PSTN) is the most signifi cant example of deployed circuit-switched technology.

● Packet-switched —This includes more modern, packet-switched systems that support Internet Protocol (IP) and related voice, data, and multimedia services. In addition to the Internet as the most obvious example of packet switching, the signal- ing network controlling the PSTN is itself a packet-switched system. For the most part, both logical and physical diversity naturally

exist between these two types of services, largely due to technol- ogy interoperability. That is, the vast majority of equipment, soft- ware, processes, and related infrastructure for these services are fundamentally different. Packets cannot accidentally or inten- tionally spill into circuits, and vice versa .

From a networking perspective, what this means is that a security event that occurs in one of these technologies will gen- erally not have any effect on the other. For example, if a network worm is unleashed across the Internet, as the global community experienced so severely in the 2003–2004 time frame, then the likelihood that this would affect traditional time-division multi- plexed (TDM) voice and data services is negligible. Such diversity is of signifi cant use in protecting national infrastructure, because it becomes so much more diffi cult for a given attack such as a worm to scale across logically separate technologies (see Figure 4.6 ).

Even with the logical diversity inherent in these different tech- nologies, one must be careful in drawing conclusions. A more

Cloud computing may offer home PC users the diverse, protected environment they cannot otherwise access.

Circuit-switched and packet-switched systems automatically provide diversity when compared to one another.

Chapter 4 DIVERSITY 83

accurate view of diverse telecommunications, for example, might expose the fact that, at lower levels, shared transport infrastruc- ture might be present. For example, many telecommunications companies use the same fi ber for their circuit-switched delivery as they do for IP-based services. Furthermore, different carriers often use the same right-of-way for their respective fi ber delivery. What this means is that in many locations such as bridges, tun- nels, and major highways, a physical disaster or targeted terror- ist attack could affect networks that were designed to be carrier diverse.

While sharing of fi ber and right-of-way routes makes sense from an operational implementation and cost perspective, one must be cognizant of the shared infrastructure, because it does change the diversity profi le. As suggested, it complicates any reli- ance on a multivendor strategy for diversity, but it also makes it theoretically possible for an IP-based attack, such as one pro- ducing a distributed denial of service (DDOS) effect, that would have negative implications on non-IP-based transport due to volume. This has not happened in practical settings to date, but because so much fi ber is shared it is certainly a possibility that must be considered (see Figure 4.7 ).

A more likely scenario is that a given national service tech- nology, such as modern 2G and 3G wireless services for citizens

End Users

(Phones,

Circuits)

Worm

Circulating

Network

Management

Electronic Switching

Switch

Signaling

IP Routing

Non-

Propagation

(Logical

Diversity)

Traditional

Circuit-Switched

Modern

Packet-Switched

End Users

(Computers,

Intranets)

Figure 4.6 Worm nonpropagation benefi t from diverse telecommunications.

Unfortunately, vulnerabilities will always be present in IP-based and circuit-switched systems.

84 Chapter 4 DIVERSITY

and business, could see security problems stemming from either circuit- or packet-switched-based attacks. Because a typical car- rier wireless infrastructure, for example, will include both a cir- cuit- and packet-switched core, attacks in either area could cause problems. Internet browsing and multimedia messaging could be hit by attacks at the serving and gateway systems for these types of services; similarly, voice services could be hit by attacks on the mobile switching centers supporting this functionality. So, while it might be a goal to ensure some degree of diversity in these technology dependencies, in practice this may not be possible.

What this means from a national infrastructure protection perspective is that maximizing diversity will help to throttle large-scale attacks, but one must be certain to look closely at the entire architecture. In many cases, deeper inspection will reveal that infrastructure advertised as diverse might actually have components that are not. This does not imply that suffi - cient mitigations are always missing in nondiverse infrastruc- ture, but rather that designers must take the time to check. When done properly, however, network technology diversity remains an excellent means for reducing risk. Many a security offi cer will report, for example, the comfort of knowing that circuit-switched voice services will generally survive worms, botnets, and viruses on the Internet.

End Users

(Phones,

Circuits)

Worm

Circulating

Network

Management

Electronic Switching

Switch

Signaling

IP

Routing

Possible Impact

(Physical Non-

Diversity)

End Users

(Computers,

Intranets)

Fiber

Figure 4.7 Potential for impact propagation over shared fi ber.

Diversity may not always be a feasible goal.

Chapter 4 DIVERSITY 85

Physical Diversity The requirement for physical diversity in the design of comput- ing infrastructure is perhaps the most familiar of all diversity- related issues. The idea is that any computing or networking asset that serves as an essential component of some critical func- tion must include physical distribution to increase its survivabil- ity. The approach originated in the disaster recovery community with primary emphasis on natural disasters such as hurricanes and fi res, but, as the security threat has matured, infrastructure managers have come to recognize the value of providing some degree of physical diversity. This reduces, for example, reliance on a single local power grid, which is a valued cyber attack target for adversaries. It also greatly reduces the chances of a physical or premise-based attack, simply because multiple facilities would be involved.

These issues are not controversial. In fact, for many years, procurement projects for national asset systems, in both govern- ment and industry, have routinely included the demand that the following physical diversity issues be considered: ● Backup center diversity— If any major center for system, net-

work, or application management is included in a given infrastructure component, then it is routinely required that a backup center be identifi ed in a physically diverse loca- tion. Few would argue with this approach; if properly applied, it would ensure that the two centers are in different weather patterns and power grid segments.

● Supplier/vendor diversity— Many organizations dictate that for critical infrastructure components, some degree of diversity must be present in the supplier and vendor mix. This reduces the likelihood that any given fi rm would have too much infl u- ence on the integrity of the infrastructure. It also reduces the likelihood of a cascading problem that might link back to some common element, such as a software routine or library, embedded in one vendor’s product portfolio.

● Network route diversity —When network infrastructure is put in place to support national infrastructure, it is not uncom- mon to demand a degree of network route diversity from the provider or providers. This helps reduce the likelihood of malicious (or nonmalicious) problems affecting connectiv- ity. As mentioned above, this is complicated by common use of bridges, tunnels, or highways for physical network media deployments from several different vendors.

Physical diversity adds another important layer of protection against cascading effects.

Physical diversity has been incorporated into the national asset system for many years.

86 Chapter 4 DIVERSITY

Achieving Physical Diversity via Satellite Data Services

A good example application that demonstrates physical diversity principles is the provision of certain types of SCADA systems using IP over satellite (IPoS). Satellite data services have traditionally had the great advantage of being able to operate robustly via the airwaves in regions around the globe where terrestrial network construction would be diffi cult. Generally, in such regions commercial wireless coverage is less ubiquitous or even completely unavailable. Some SCADA applications have thus taken advantage of this robust communication feature in satellite systems to connect remote end-user terminals to the SCADA host system, but the requirement remains that some degree of diversity be utilized. As suggested above, most of this diversity emphasis has been driven largely by concerns over natural and physical disasters, but a clear cyber security benefi t exists as well.

Generally, the setup for satellite-connected SCADA involves end users connecting to a collection of physically diverse hubs via IPoS. These diverse hubs are then connected in a distributed manner to the SCADA hosts. An adversary seeking to attack these hubs would have to use either logical or electronic means, and a great degree of logistic effort would be required, especially if the hubs are located in different parts of the world. The Hughes Corporation, as an example, has been aggressive in marketing these types of confi gurations for SCADA customers. Their recommended remote access confi guration for diverse SCADA system control is shown in Figure 4.8 .

Space

Terrestrial

Remote terminal

SCADA Hosts

Geographically Diverse Hubs

Infrastructure Component

Access

Network

Access

Network

Access

Network

Figure 4.8 Diverse hubs in satellite SCADA confi gurations.

Chapter 4 DIVERSITY 87

National Diversity Program The development of a national diversity program would

require coordination between companies and government agen- cies in the following areas: ● Critical path analysis —An analysis of national infrastructure

components must be made to determine certain critical paths that are required for essential services. For example, if a mili- tary group relies on a specifi c critical path to complete some logistic mission, then assurance should exist that this criti- cal path is supported by diverse vendors, suppliers, support teams, and technology.

● Cascade modeling —A similar analysis is required to identify any conditions in a national infrastructure component where a cascading effect is possible due to nondiversity. If, for exam- ple, 100% of the PCs in an organization are running in exactly the same confi guration, then this poses a risk. Admittedly, the organization might choose to accept the risk, but this should be done explicitly after a security investigation, rather than by default.

● Procurement discipline —The selection and procurement of technology by organizations charged with critical infrastructure should include a degree of diversity requirements. This generally occurs naturally in most large organizations, so the urgency here might not be as intense but the security benefi ts are obvious. The decision of whether to provide rewards and incentives

for diversity versus a stricter approach of requiring evidence of some targeted percentage of diversity must be driven by the local environment and culture. The threat environment in a mili- tary setting is considerably different than one might fi nd in tele- communications or transportation, so it would seem prudent to make such implementation decisions locally.

The advantage of diverse hubs is obvious; if any should be directly compromised, fl ooded, or attacked (physically or logically), then the SCADA hosts are still accessible to end users. In addition, attacks on local infrastructure components on which the SCADA operation depends, such as power, will not have a cascading effect. Such an approach only works, however, if all diverse components operate at a common service level. For example, if one service provider offers highly reliable, secure services with historical compliance to advertised service level agreements (SLAs), then introducing a diverse provider with poor SLA compliance might not be such a good idea. This is a key notion, because it is not considered reasonable to take a highly functioning system and make it diverse by introducing an inferior counterpart. In any event, this general concept of diverse relay between users and critical hosts should be embedded into all national infrastructure systems.

This page intentionally left blank

89 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00005-6 2011

COMMONALITY

The only truly secure system is one that is powered off, cast in a block of concrete, and sealed in a lead-lined room with armed guards—and even then I have my doubts .

Eugene Spafford, Executive Director of the Purdue University Center for Education and Research in Information Assurance and Security (CERIAS) 1

Now that we have outlined our proposal in the previous chap- ter for national infrastructure systems to include diversity, we can discuss the seemingly paradoxical requirement that infrastruc- ture systems must also demonstrate a degree of commonality . In particular, certain desirable security attributes must be present in all aspects and areas of national infrastructure to ensure maxi- mal resilience against cyber attack. Anyone who has worked in the security fi eld understands this statement and is likely to agree with its basis. The collection of desirable security attributes is usu- ally referred to collectively as security best practices . Example best practices include routine scanning of systems, regular penetration testing of networks, programs for security awareness, and integrity management checking on servers.

When security best practices are easily identifi ed and measur- able, they can become the basis for what is known as a security standard . A security standard then becomes the basis for a pro- cess known as a security audit , in which an unbiased third-party observer determines based on evidence whether the requirements in the standard are met. The key issue for national infrastructure protection is that best practices, standards, and audits establish a low-water mark for all relevant organizations (see Figure 5.1 ).

Organizations that are below a minimally acceptable security best practices level will fi nd that security standards audits intro- duce new practices, in addition to revisiting existing practices. The desired effect is that the pre-audit state will transition to an improved post-audit state for all practices. This does not always happen, especially for organizations that have a poor environment

5

1 Quoted in A. K. Dewdney, “Computer recreations: of worms, viruses and Core War,” Sci. Am. , 260(3), 90–93, 1989.

90 Chapter 5 COMMONALITY

for introducing new security practices, but it is the goal. For organi- zations that are already above the minimally acceptable level, per- haps even with world-class features, the audit will rarely introduce new practices but will instead revisit existing ones. The desired effect here is that these practices would be strengthened, but, again, this does not always work perfectly, especially if the auditors are less familiar with the world-class security features already in place. Some common security-related best practices standards that one will fi nd in national infrastructure settings are listed in the box.

Organization B

Organization A

Pre-Audit Post-Audit Pre-Audit Post-Audit

Audit

Audit

Security

Practices

World

Class

Minimally

Acceptable

Revisit

Existing

New Practices

Revisit Existing

Figure 5.1 Illustrative security audits for two organizations.

The purpose of a security audit is to raise the level of security features currently in place.

Common Security-Related Best Practices Standards ● Federal Information Security Management Act (FISMA) —FISMA sets minimal standards for security best practices in

federal environments. It is enforced by congressional legislation and involves an annual letter grade being assigned to individual agencies. The following departmental agencies received an “F” for their FISMA rating in 2007: Defense, Commerce, Labor, Transportation, Interior, Treasury, Veterans Affairs, and Agriculture (so did the Nuclear Regulatory Commission).

● Health Insurance Portability and Accountability Act (HIPAA) —Title II of HIPAA includes recommended standards for security and privacy controls in the handling of health-related information for American citizens. It is also enforced by congressional legislation.

● Payment Card Industry Data Security Standard (PCI DSS) —This security standard was developed by the PCI Security Council, which includes major credit card companies such as Visa ® Card, Discover ® Card, American Express ® , and MasterCard ® . It includes requirements for encrypting sensitive customer data.

Chapter 5 COMMONALITY 91

With such redundancy in security standards and compli- ance, one would guess that the principle of commonality would be largely met in national infrastructure protection. For example, some organizations might be required to demonstrate compli- ance to dozens of different security standards. One would expect that such intense and focused attention on security would lead to a largely common approach to security around the globe. Sadly, the belief here is that in spite of the considerable audit and com- pliance activity around the world, most of it does not address the type of security commonality that will make a positive differ- ence in national infrastructure protection. The activity instead tends to focus on requirements that have some value but do not address the most critical issues. In fact, most of these practices exist in the category of state-of-the-art security, far beyond the minimally acceptable levels addressed in most audits.

The audit problem stems from the inherent differences between meaningful and measurable security best practices. There’s an old dumb joke about a man looking for his lost money on 42nd and Eighth. When a passerby asks whether the money was actually lost at that spot, the man looks up and says that the money was actually lost over on 41st and Tenth but the light is much better here. Security audit of best practices is often like this; the only practices that can be audited are ones where the light is good and measurable metrics can be established. This does not, however, imply that such metrics are always meaning- ful (see Figure 5.2 ).

The example requirements shown in Figure 5.2 provide a hint as to the types of requirements that are likely to be included in each category. One can easily levy a measurable requirement on password length, for example, even though this is generally a less useful constraint. This could be viewed as an example that

● ISO/IEC 27000 Standard (ISO27K) —The International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC) evolved a British Security Standard known as BS-7799 into an internationally recognized set of auditable security best practices. Some security experts believe that the ISO27K family of security standards is the most global and generally agreed upon set of best practices. All of these standards, and the many additional ones that are not mentioned above, include a large subset of security

and functional requirements that are virtually the same. For example, each standard requires carefully documented policies and procedures, authentication and authorization controls, data collection systems, and embedded encryption. Each standard also requires management oversight, ongoing security monitoring, compliance scores issued by designated auditors, and some form of fi nes or punishment if the standard best practices are not met.

92 Chapter 5 COMMONALITY

is measurable but not meaningful. Conversely, one can levy the important requirement that a strong culture of security be pres- ent in an environment. This is a meaningful condition but almost impossible to measure. The example requirement that a security policy be present is both meaningful and measurable. It demon- strates that there are certainly some requirements that reside in both categories.

Meaningful Best Practices for Infrastructure Protection A provocative implication here is that the ability to audit a given best practice does not determine or infl uence whether it is useful for infrastructure protection. In fact, the primary motivation for proper infrastructure protection should not be one’s audit score; rather, the motivation should be success based and economic. The fact is that companies, agencies, and groups with responsi- bility for infrastructure protection will eventually fail if they do not follow the best available recommendations for security best practices. Unfortunately, the best recommendations come not from the security standards and audit community but from prac- tical experience.

If you do not agree, then please consider that security stan- dards backed by powerful and authoritative groups have existed

Meaningful

Requirements

Focus of Protection

Documented Security Policy

Focus of Auditor

Measurable

Requirements

Culture of

Security

Protection

Constraint

on Password

Length

Figure 5.2 Relationship between meaningful and measurable requirements.

Ideally, security practices are both meaningful and measurable.

A great audit score does not necessarily guarantee successful infrastructure protection.

Chapter 5 COMMONALITY 93

for many decades. In addition, security auditors have been in business for decades, performing diligent analysis and issuing embarrassing failure grades to security teams around the world. Our earlier reference to FISMA, for example, included failing grades for many prominent government agencies in the United States. In spite of all this activity and reporting, however, nothing truly material has changed during these past decades in the way computer and network systems are secured. In fact, one could easily make the claim that national infrastructure is more vul- nerable to attack today than it was 20 years ago. What makes one think that more stringent security standards and audit processes are going to change this now?

Based on this author’s experiences managing the security of major critical infrastructure components for many years, the answer lies in a two-step methodology: ● Step 1. Standard audit —The fi rst step is conventional, in that

it recommends that every organization submit to a standard audit to ensure that no group is operating below the mini- mally acceptable threshold. While most organizations would claim to already have this step ongoing, the goal here is to be given a desirable rating or score, rather than a failing one. So, even if a company or agency has ongoing audits, the goal here is to pass these audits. Any one of the major audit standards mentioned above is probably acceptable; they all roughly direct the same sort of minimal practices.

● Step 2. World-class focus —The second step involves a more intense focus on a set of truly meaningful national infrastruc- ture protection practices. These practices are derived largely from experience. They are consistent with the material pre- sented in this book, and they will only be present in pieces in most existing security audit standards. The greatest success will typically come from organizations self-administering this new focus, especially because these practices are not easy to measure and audit (see Figure 5.3 ). For the fi rst step, an important issue involves ensuring that

the audit does not cause more harm than good. For example, suppose that a competent and trustworthy system administra- tor has been charged with a bevy of responsibilities for an infra- structure component and that she has demonstrated excellent results over a long period of time, with no security problems. This is a common situation, especially in companies and agencies that take system administration seriously. Unfortunately, a security auditor would look at such a setup with horror and would deem it a clear violation of least privilege, separation of duties, and so on.

A successful protection strategy should start with at least a passing score on a standard security audit.

Sometimes security audit standards and best practices proven through experience are in confl ict.

94 Chapter 5 COMMONALITY

In the United States, if the component being administered was a fi nancial one in a public company, then this would be a violation of the Sarbanes-Oxley segregation of duties require- ments. The auditor would typically require that the single com- petent administrator be replaced by a bureaucratic process involving a team of potentially inferior personnel who would each only see a portion of the total task. It is not diffi cult to imag- ine the component being more poorly managed and, hence, less secure. This is the worst case in any audit and must be explicitly avoided for national infrastructure protection.

For the second step, the box lists specifi c meaningful secu- rity best practices, six in total, for national infrastructure protec- tion. These six best practices do not contradict current auditing processes and standards, but they are certainly not designed for easy audit application; for example, it is diffi cult to validate whether something is “appropriate” or “simplifi ed.” Nevertheless, our strong advice is that attentiveness to ensuring commonality across national infrastructure with these six practices will yield signifi cant benefi ts.

Existing Set of Practices

Minimally Acceptable Set of Practices

World-Class Infrastructure Protection

Standard Audit (FISMA, ISO, Etc.)

Self-Administer (Six Best Practices)

Figure 5.3 Methodology to achieve world-class infrastructure protection practices.

Six Best Practices for National Infrastructure Protection

● Practice 1. Locally relevant and appropriate security policy —Every organization charged with the design or operation of national infrastructure must have a security policy that is locally relevant to the environment and appropriate to the organizational mission. This implies that different organizations should expect to have different security policies. The good news is that this policy requirement is largely consistent with most standards and should be one of the more straightforward practices to understand.

● Practice 2. Organizational culture of security protection —Organizations charged with national infrastructure must develop and nurture a culture of security protection. The culture must pervade the organization and must include

Chapter 5 COMMONALITY 95

Readers familiar with standards and audits will recognize immediately the challenges with the subjective notions intro- duced in the box. For this reason, the only way they can be applied appropriately is for security managers to understand the purpose and intent of the requirements, and to then honestly self-administer a supporting program. This is not optimal for third-party assurance, but it is the only reasonable way to reach the level of world-class security best practices.

Locally Relevant and Appropriate Security Policy Any commercial or government organization that is currently developing or managing national infrastructure already has some sort of security policy. So the question of whether to develop a policy is not relevant; every organization has something . The real question instead for most organizations in national infra- structure roles is how to make the policy more relevant and

great incentives for positive behavior, as well as unfortunate consequences for negative. No security standard currently demands cultural attentiveness to security, simply because it cannot be measured.

● Practice 3. Commitment to infrastructure simplifi cation —Because complexity is arguably the primary cause of security problems in most large-scale environments, a commitment to simplifying infrastructure is critical to ensuring proper security. Determining what “simplifi cation” means is a subjective, local concept that is dependent on the specifi cs of the target environment. No current security standards demand infrastructure simplifi cation.

● Practice 4. Certifi cation and education program for decision-makers —A program of professional certifi cation and security education must be present for those who are making decisions about national infrastructure or who are directly charged with their implementation. Ideally, this should not have to include end users, because this greatly reduces the chances of proper coverage.

● Practice 5. Career path and reward structure for security teams —Those performing security in national infrastructure environments must have clearly defi ned career paths and desirable rewards as part of their professional journey. In the absence of these enticements, important security work is often handled by people who are untrained and poorly motivated. This requirement is generally more meaningful in larger organizations.

● Practice 6. Evidence of responsible past security practice —Just as most craftsmen go through a period of apprenticeship to learn and to demonstrate proper skills, so should an organization have to demonstrate a period of learning and attainment of proper skills before being charged with national infrastructure protection. It is amazing that existing security audits generally do not include a careful inspection of past security practices in dealing with live cyber attacks.

96 Chapter 5 COMMONALITY

appropriate to the local environment. Specifi cally, four basic security policy considerations are highly recommended for national infrastructure protection: ● Enforceable —Most security policies are easy to write down but

are not easy to enforce. Organizations must therefore spend a great deal of time on the issue of security policy enforce- ment. The local threat environment must be a consideration here, because the employees of some companies and agen- cies are more apt to follow security policy rules than others. Nevertheless, a policy is only as good as its degree of enforce- ability, so every organization should be able to explicitly describe their enforcement strategy.

● Small —Most security policies are too large and complex. If there is one exercise that would be the healthiest for national infrastructure teams, it would be to go through existing policy language to prune out old references, obsolete statements, and aged examples. Large, complex security policies with too much detail are to be avoided. A key issue is the direction in which one’s policy is headed; it is either staying the same (stagnant), getting more complex (unhealthy), or becoming smaller and more compact (healthy).

● Online —Policy language must be online and searchable for it to be truly useful in national infrastructure settings. Teams must be able to fi nd relevant requirements easily and should have the ability to cut and paste the relevant statements into their project or process documentation. The old days of printing and distributing a security policy with a fancy cover should be long gone.

● Inclusive —Policy must be inclusive of the proper computing and networking elements in the local national infrastructure environment. This can only be determined by an analysis. Unfortunately, this analysis can be somewhat time consuming and tedious, and without proper attention it could result in an overly complex policy. Considerable skill is required to write policy that is inclusive but not too complicated. These four requirements for security policies in groups

charged with national infrastructure can be subjected to a simple decision analysis that would help determine if the local policy is relevant and appropriate to the mission of the organization; this decision process is shown in Figure 5.4 .

It’s worth mentioning that, as will be seen in the next sec- tion, the culture of the local environment can really have an impact on the development of security policy. In an environment where technology change is not dramatic and operational skills are mature (e.g., traditional circuit-switched telephony), policy

The question is not whether to develop a security policy, but rather what that policy will entail.

Chapter 5 COMMONALITY 97

language can be less detailed and used to identify unexpected procedures that might be required for security. In an environ- ment where technology change is dramatic and operational skills might be constantly changing (e.g., wireless telephony), then pol- icy language might have to be much more specifi c. In either case, the issue is not whether the policy has certain required elements, but rather whether the policy is locally relevant and appropriate.

Culture of Security Protection Our second recommended common practice involves creation of an organizational culture of security protection. When an orga- nization has such a culture of security protection, the poten- tial for malicious exploitation of some vulnerability is greatly reduced for two reasons: First, the likelihood for the vulnerabil- ity itself to be present is reduced, as local diligence will weigh in favor of more secure decision-making. Second, real-time human vigilance in such a culture often helps avoid exploitation. Time after time, the alertness of human beings in a culture of security is effective in helping to avoid malicious attacks. (Readers will remember that the only effective security measures that took place on September 11, 2001, were the ones initiated by human beings.)

Here’s a simple test to determine if a given organization has a culture of security protection. Go to that organization’s local facility and observe how carefully the physical premises are policed for unauthorized entry. If an electronic door is used to authenticate entry, followed by a guard eyeballing every visitor, then chances are pretty good that the culture is one of protection.

Need to perform inventory analysis on locally relevant technologies, tools, systems, and processes

Need to spend significant time with local teams to fix this one – this is more a cultural than technical issue

Need to consider use of documentation tool to place requirements on line (good opportunity for pruning)

Need to go through policy to prune old references, simplify language, and remove obsolete statements

No

No

No

No

Is the policy tight and compact?

Does the policy address all relevent local issues?

Existing

Security

Policy

Much

Better

Policy

Is the policy on line?

Can the policy be enforced?

Figure 5.4 Decision process for security policy analysis.

98 Chapter 5 COMMONALITY

If, however, the person in front of you holds the door open for you to enter without bothering to check for your credentials or, worse, the door itself is propped open, then the culture is proba- bly more open. A culture of security certainly does not imply that things will be perfectly secure, but such a culture is essential in the protection of national assets.

Unfortunately, most of us tend to equate an organizational culture of security with a rigid, paranoid, authoritative, perhaps even military environment. Furthermore, a culture of security is generally associated with managers who avoid risks, stay away from the media, dislike remote access or telecommuting, and demonstrate little comfort with new technologies such as social networking. Similarly, one would equate a nonculture of secu- rity with a young, dynamic, creative, open, and egalitarian envi- ronment. In such a culture, managers are generally viewed to be comfortable with risk, open in speaking to outsiders about their work, in love with every new technology that comes along, and supportive of remote access and telecommuting.

The reality is that neither stereotype is accurate. Instead, the challenge in promoting a culture of security is to combine the best elements of each management approach, without the cor- responding weaknesses. The idea is to nurture any positive environmental attributes, but in a way that also allows for sen- sible protection of national assets; that is, each local environ- ment must have a way to adapt the various adjectives just cited to their own mission. For example, no group generally wants to be referred to as closed and paranoid, but a military intelligence group might have no choice. Similarly, no group wants to be referred to as being loose with security, but certain creative orga- nizations, such as some types of colleges and universities, make this decision explicitly.

As such, organizations must consider the spectrum of options in developing a suitable local culture. This spectrum acknowl- edges how straightforward it can be to assume an inverse rela- tionship between organizational rigidity and security. It’s easy to just make everything rigid and authoritative and hope that a culture of increased security will develop. The challenge, how- ever, lies in trying to break up this relationship by allowing open, creative activity in a way that does not compromise secu- rity. This might result in some aspects of the environment being more secure and others being less so. Such a combined cultural goal should be viewed as a common requirement for all groups involved with national assets (see Figure 5.5 ).

So an obvious question one might ask from the perspective of national infrastructure protection is why the highest level of

An organization with a culture of security is one in which standard operating procedures work to provide a secure environment.

An ideal security environment can marry creativity and interest in new technologies with caution and healthy risk aversion.

Chapter 5 COMMONALITY 99

security culture should not be required in all cases, regardless of any cultural goals of being open, creative, and willing to interact publicly. The U.S. military, for example, might exemplify such a level of rigid cultural commitment to security. One answer, as we’ve discussed above, is that it is diffi cult to require that a cul- ture be in place in an organization. Specifi c aspects of a culture might be required such as strong policy, tough enforcement, and so on, but to require the presence of a culture is easy to confi rm. Nevertheless, the premise is correct; that is, for national infra- structure, certain security standards are required that can only be met in an environment where a culture of security protection is met. This demands the uncomfortable situation in which local managers must honestly work to create the appropriate culture, which in some cases might require decades of attention.

An important element of security culture is the symbolism that management can create by its own behavior. This means that when senior executives are given passes that allow policy violations, this is a serious error as it detracts from the cultural objectives. Unfortunately, the most senior executives almost always outrank security staff, and this practice of senior exemp- tion is all too common. Perhaps major national infrastructure solicitations should include questions about this type of senior executive practice before contracts can be granted to an organi- zation. This might give the security team more concrete ammuni- tion to stop such exemptions.

Infrastructure Simplification Our third recommended common practice involves an explicit organizational commitment to infrastructure simplifi cation. Defi ning what we mean by simplifi cation in the context of

More secure

More rigid

Challenging

cultural option

Straightforward

cultural option

Less secure

More open

More inclusiveMore authoritative

Figure 5.5 Spectrum of organizational culture of security options.

Implementation of a true culture of security cannot happen overnight; it may take years to develop.

A true culture of security must be implemented at all levels of an organization—including the most senior executives.

100 Chapter 5 COMMONALITY

infrastructure requires that we use subjective language. Simpler infrastructure is easier to understand, less cumbersome, and more streamlined. As such, simplifi cation initiatives will be sub- jective and much more diffi cult to measure using some quan- titative metric. To illustrate this process of simplifi cation, let’s look at a typical sort of cluttered engineering schematic that one might use to describe network infrastructure. The chart shown in Figure 5.6 is derived from the design documentation embedded in an infrastructure project with which this author was recently involved. This diagram suffers from the typical sorts of issues that one fi nds in the design and operation of national infrastructure: ● Lack of generalization —Systems in the diagram are not viewed

in a generalized manner. The same thing is shown multiple times in different places in the diagram (e.g., servers), rather than just generalizing one component to depict both.

● Clouding the obvious —Interfaces in the diagram are not depicted obviously. Lines are cluttered across the drawing, and simple interfaces are clouded to avoid what is actually quite obvious connectivity.

● Stream-of-consciousness design —The diagram seems to be the product of fi rst-draft, stream-of-consciousness thinking rather than a carefully planned layout. Too often, infrastructure is put

Popular Public

Network (IP-Based)

System

Management Popular IP-Based

Public Network

. . . Multiple

Mgmt. Apps.

Server

Server

Figure 5.6 Sample cluttered engineering chart.

Chapter 5 COMMONALITY 101

in place in a fi rst draft without anyone taking the time to review and revise.

● Nonuniformity —Objects are not referred to uniformly; the ref- erences to an IP-based network are slightly different, and in fact should just reference the Internet anyway. If one applies some rational simplifi cation to the design in the

cluttered chart shown above, with attention to each of the ele- ments just mentioned, then the resultant functionally equivalent product is much easier to understand. The more improved dia- gram requires that you go back and confi rm that it really does describe the same function, but in fact it does (see Figure 5.7 ).

Analysis of how we simplifi ed the cluttered diagram into something more easily understood highlights some of the tech- niques that can be useful in simplifying a national infrastructure component environment (see box).

Internet

Multiple

Clients

Multiple

Servers

System

Management

Management

Applications

Figure 5.7 Simplifi ed engineering chart.

How to Simplify a National Infrastructure (or Otherwise Complex) Environment

● Reduction in size —The second diagram is smaller than the fi rst one. Relevance of such action to national infrastructure should be obvious. Simplifi cation should include reduction wherever possible. Less code, fewer interfaces, and reduced functionality are all healthy simplifi cation objectives that will almost certainly improve security. In fact, a requirement for national infrastructure should be demonstrable evidence of software removal or reduction initiatives. The only truly secure piece of code is the one that you have removed.

102 Chapter 5 COMMONALITY

The process of auditing these subjective goals will be chal- lenging, if not intractable, but this does not reduce the impor- tance of trying to attain each goal in national infrastructure. Infrastructure simplifi cation could, in fact, be argued to be the most important single goal in the protection of national assets. One bright spot here is that security managers will fi nd kindred spirits with most information technology managers, although it is the rare CIO who truly knows how to manage the simplifi cation and reduction of infrastructure. A good sign that the local orga- nization is trying would be some sort of initiative focused on the reduction or removal of software applications.

Certification and Education Our fourth recommended common practice involves certifi cation and education programs for key decision-makers. Most current computer security education initiatives tend to focus on teaching awareness to end users about proper selection of passwords, stor- age of data, handing of devices, and so on. These awareness ini- tiatives stem from the common belief that computer and network systems would be perfectly secure if end users would just take the time to learn and follow the security policy rules. The situation is reminiscent of doctors blaming their patients for their diseases.

Security auditors generally agree with this view of end-user responsibility, and they will often perform spot checks in target

Simplifi cation may be the fi rst and most tractable step toward creating a new, more secure infrastructure environment.

● Generalization of concepts —The second diagram generalizes concepts more effectively than the fi rst. This should be true in national infrastructure as well. Rather than managing dozens or hundreds or thousands of special cases, it is more effective to have a planned generalization strategy that allows for simpler management. Obviously, this requires some balancing with local diversity requirements.

● Cleaner interfaces —Perhaps the most obvious difference between the two diagrams is the much cleaner view of interfaces that the second one provides. Because national infrastructure will include complex interfaces between systems, initiatives to simplify these interfaces must be present to optimize security for national assets.

● Highlighting of patterns —The second diagram demonstrates functional and data fl ow patterns in an obvious manner. This simplifi es any changes that might have to be made to the architecture. Infrastructure should also be designed in a manner to highlight important patterns in data or processing.

● Reduction of clutter —The fi rst diagram is as cluttered as one can imagine, and this generally indicates a stream- of-consciousness design process. Too often, national infrastructure emerges in the same manner, with one system being put in place and then another, they are then connected to something else, and on and on. The result is usually not optimal from a security perspective.

Chapter 5 COMMONALITY 103

environments. This usually involves quizzing random individu- als about their knowledge and interpretation of the local secu- rity policy. When the inevitable bad grade occurs because high percentages of individuals do not know some of the policy rules, security teams are forced to increase the intensity of the aware- ness program with posters, videos, mandatory tests, and even punishments for end-user ignorance.

Based on decades of experience in performing these types of audits, supporting them, and also being subjected to them, the conclusion reached here is that the goal of reaching 100% end- user awareness of security is impractical. Certainly, security edu- cation for end users does not hurt, because everyone should be aware of the risks of any actions they might take that could dam- age security in the local environment. If end users are entrusted with proprietary information, for example, they need to under- stand the implications of allowing such information to be pro- vided to unauthorized sources.

For national infrastructure protection, however, a much more practical goal is to focus primarily on improving the security competence of decision-makers rather than on end users. The distinction here is subtle, but fundamental. Key decision-makers in national infrastructure settings include the following: ● Senior managers —These are the people who set fi nancial and

operational priorities affecting national infrastructure. They include the most senior managers in an organization or the highest ranking in the military.

● Designers and developers —These are the network, system, and application designers and developers who determine what security features and functionality are in the systems that peo- ple use. They often work in information technology groups.

● Administrators —These are the system and network adminis- trators who perform the day-to-day tasks of maintaining and running the systems that people use. Too often, these folks are underpaid and poorly trained.

● Security team members —These are the security staff charged with the organizational systems for protecting assets. An increasing number of organizations outsource aspects of this work. There is nothing wrong with this trend, as long as the arrangement is well managed and coordinated. These four types of key decision-makers are the people who

can make the most substantive difference in the security of an organization and for whom 100% coverage should be a trac- table goal. It doesn’t hurt that the size of the key decision-maker population in a company or agency will be much smaller than the total population. It also doesn’t hurt that they tend to be the

One hundred percent end- user awareness of security policies may remain an illusive goal.

104 Chapter 5 COMMONALITY

ones best trained to understand the importance of security. From an investment perspective, the returns on education invest- ment look quite different for end users and decision-makers (see Figure 5.8 ).

The message embedded in the ROI curves in Figure 5.8 is that a small initial investment in security certifi cation and edu- cation for end users produces a reasonable initial return. This return rapidly diminishes, however, because in a typical envi- ronment there is only so much an end user can do. In fact, in the best designed environments, the obligation for end users to make security decisions on their own is always minimized. For key decision-makers, the ROI is ongoing and steadily increas- ing throughout the investment lifecycle. Unlike end users, key decision-makers can consistently apply their increased secu- rity knowledge to infrastructure in a meaningful and scalable manner.

To summarize, our recommendation here is a twofold approach for security certifi cation and education in a national infrastructure environment: ● Key decision-makers —Focus on providing ongoing, lifecycle

programs for decision-makers in security certifi cation and

Cost/Size of Investment

Return on Investment (ROI)

End User Curve

High Initial ROI (End Users)

Increasing Ongoing ROI (Key Decision-Makers)

Key Decision-Makers

Curve

Recommended Investment Period for Key Decision-Makers

Recommended Investment Period for End Users

Figure 5.8 Return on investment (ROI) trends for security education.

Target the key decision- makers in your quest for organizational security policy awareness and competence.

Chapter 5 COMMONALITY 105

education. By focusing on key decision-makers, the returns will be consistent, increasing, and scalable.

● End users —Create low-cost, high-initial-return activities for certifying and educating end users. As a complement, systems must be designed that minimize the decisions end users make about security. The specifi c certifi cation and education programs for a given

environment should be locally determined and appropriately applied. They are not diffi cult to fi nd or create but can be mis- applied without some careful planning. Well-known security certifi cations, such as Certifi ed Information Systems Security Professional (CISSP), are excellent for system or network admin- istrators but totally unnecessary for end users. Similarly, aware- ness programs on selecting good passwords are fi ne for end users but will just annoy your system administrators.

Career Path and Reward Structure Our fi fth recommended common practice involves the creation and establishment of career paths and reward structures for security professionals. It should come as no surprise that organi- zations charged with national infrastructure should demonstrate some common form of career path and reward structure for security staff. This is particularly important, because to perform security tasks properly, some degree of longevity is desirable. Too often, important cyber security tasks are attempted by staff who are new to the security discipline and who are poorly compen- sated for their work.

Fixing this might seem obvious, but virtually no security stan- dards used for the purposes of audit include this in a meaning- ful way. Elements that should be commonly present in national infrastructure environments include the following: ● Attractive salaries —Infrastructure organizations should dem-

onstrate salary structure that takes into account the special- ized skills associated with cyber security. Salaries should be above industry averages, a metric that can be quantitatively audited. (Amazingly, I’ve never seen security staff salaries audited as part of any due diligence activity by an auditor.)

● Career paths —Opportunities for career advancement, promo- tion, and salary increase should be present in infrastructure organizations. Perhaps more than any other information tech- nology or network-related discipline, security engineering of national infrastructure requires years of experience in order to develop proper judgment. If these years do not include

Creating career paths and incentives is important in any fi eld, no less so in security management.

106 Chapter 5 COMMONALITY

attention to career issues, then the organization is unlikely to maintain the best staff.

● Senior managers —It is desirable for senior managers in infra- structure organizations to have some degree of heritage in the security community. This certainly will help with decision- making at the senior level, but more importantly it serves as a symbol for the security staff that senior level management is attainable from the security ranks. These career-related organizational attributes are rarely dis-

cussed in the context of determining whether proper security is in place in an organization. Auditors never discuss these issues. This is unfortunate, as good salaries and career paths for security staff are more relevant to the overall security posture of an orga- nization than checking for trivia such as password length, time- outs after bad login attempts, and other elements commonly found in security standards.

It is also worth noting that companies and agencies should not actively recruit and hire individuals who have a history of breaking laws on computers and networks. Hacking, in its origi- nal incarnation, was all about the desire to learn and share; when hackers demonstrate this type of perspective, they can easily blend into a company or agency and be productive. The associ- ated career and reward track for such individuals is rarely promo- tion or money but rather ongoing or increased access to the latest and greatest types of technologies.

Responsible Past Security Practice Our sixth recommended common practice involves two specifi c actions: The fi rst is that any company or agency being considered for national infrastructure work should be required to demon- strate past practice in live security incidents. The second is that companies and agencies must do a better job of managing their inventory of live incidents, including databases of key factors, root causes, and security learning from events. These two seem- ingly obvious actions are almost never performed explicitly, and most companies and agencies do not even maintain formal doc- umentation on past security incidents.

The good news is that most solicitations for national infra- structure project work do include some requirement for dem- onstrating past engineering practices, so there is certainly a base on which to improve matters for security. When federal agen- cies contract for engineering or technical work, for example, boilerplate language is usually embedded into the contract for

A strong indicator of a healthy security environment might be something that is often overlooked, such as heritage of the senior security managers in a company.

Companies and agencies should maintain a historical record showing clear incident response documentation.

Chapter 5 COMMONALITY 107

information on previous projects, similar work activities, and lists of reference clients. This practice is appropriate and valu- able, although it is usually treated too much as a generic type of information-gathering task.

For security, in particular, this practice currently involves requests for information on security policies, security architec- tural elements, and even specifi c techniques such as encryption. Such requests are important and should be highlighted for national infrastructure protection projects. The problem is that such inqui- ries simply do not go far enough. In particular, any organization being considered in a solicitation that involves national infrastruc- ture should provide evidence of at least the following past practices: ● Past damage —The organization should be able to provide

evidence of past security incidents that it dealt with that pro- duced real malicious damage to some valued asset. Although this might seem paradoxical, the reality is that no organization can claim true skill in securing large infrastructure if it has not dealt with a real incident in the past. Groups who are forth- coming in explaining these past incidents are also generally more mature in their current security processes.

● Past prevention —Similarly, the organization should be able to provide evidence of incidents prevented. This is tougher than one might think, because in many cases security protec- tions have a preventive effect that is not easily determined or measured. So only the truly skilled security organizations can provide this evidence of deliberate action that prevented an attack from succeeding. A good example might be the estab- lishment of real-time network fi ltering well in advance of any DDOS attack; if this fi ltering was actually used to stop an attack attempt, it demonstrates excellent judgment regarding the organizational priorities around security.

● Past response —This is the most commonly cited security experience component. Groups can generally point to their response functions as being invoked during worms, viruses, and other attacks. In any formal project solicitation, these requirements should

be highlighted and assigned high priority. Few requirements can properly highlight an organization’s ability to handle security situations in the future as their experiences dealing with similar matters in the past.

National Commonality Program The challenge in creating a new national program of ensur- ing commonality with state-of-the-art security practices in

A mature security organization will admit to successful attacks against them.

Providing evidence of successful preventive measures is a challenge for most organizations.

108 Chapter 5 COMMONALITY

infrastructure protection involves balancing several different concerns: ● Plethora of existing standards —Most organizations are already

frustrated with the number of standards and audits that must be covered. The implication is that the creation of a new national security standard commensurate with the six prac- tices described in this chapter would not be well received.

● Low-water mark versus world class —As we’ve discussed, the existing security standards and audits in place today are more focused on creating a common low-water mark, rather than pushing groups to reach for world-class status in security.

● Existing commissions and boards —The fi eld is already crowded with national commissions, working groups, and boards com- prised of business and government leaders who are working to create sets of recommendations for infrastructure security. They are unlikely to go away and must be factored into any implementation plan. While these may not be formal standards with associated

audit processes, affected organizations feel the pressure to review these works and to demonstrate some degree of acceptance, if not compliance. The solution to balancing these concerns lies in several implementation approaches and hints that are based on previous experiences with multiple standards and requirements, such as the Orange Book, Red Book, and associated “security rainbow series” in the 1980s. The fi rst is that government really should adopt a single standard for all commercial and govern- ment security audits. It really doesn’t even matter which audit standard is selected as long as it is only one . All subsequent gov- ernment solicitations and contracts should demand compliance with this standard. Commercial entities might gradually merge toward this standard.

Second, the world-class practices described here should be embedded into all government solicitations and contracts as func- tional requirements on companies and agencies. This would avoid the problems of audit compliance and would push the security components into the functional category along with performance, processing, storage, and networking. Government agencies could perhaps complement this by rewarding or providing incentives for the inclusion of these requirements in private deals between companies.

Do not try to work around the existing security commissions and boards; instead, factor them into your overall security plans and policies.

109 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00006-8 2011

DEPTH

Sun myth: If a person is wearing a foundation makeup with SPFs of #4 or #8, then she won’t need additional sunscreen or sunblock .

www.ultimate-cosmetics.com

The general security strategy of defense in depth is based on the observation that any given layer of protection can fail at any time. As such, defense in depth involves the deliberate introduc- tion of multiple layers of defense in order to increase the likeli- hood that a given attack will be stopped or at least slowed down. This likelihood is dependent upon the quality and relative attri- butes of the various defensive layers. Cost and end-user experi- ence issues usually create constraints on just how strong the various layers can actually be in practice. Most security experts understand this strategy of defense in depth, but evidence of its use in national infrastructure settings is often lacking. This is too bad, because the protection of national infrastructure lends itself naturally to multiple layers of defense.

The general schema associated with layered defense is that a series of protective elements is located between an asset and the adversary. Obviously, it would be best if the series is actually that—a serial collection of protective elements that must each be traversed successfully to gain access to a protected resource. Most of the time, however, the layering is not so effi cient and may include different combinations of elements between an asset and an adversary. The strategic goals in such cases are to detect and remove any single-layer access paths and, obviously, to avoid sit- uations where the layers might be confl icting. For national infra- structure, the goal is to place multiple security layers in front of all essential services (see Figure 6.1 ).

The security intent for any series of layers is to enforce policy across all possible access paths to the target asset. Thus, if an asset is accessible through a single entry point, then the layers only need to enforce policy at that point. If an asset is broadly accessible from a collection of different entry points, then the layered defense needs to fan out across these points to enforce policy. Defense in depth methods are said to fail if all of the

6

110 Chapter 6 DEPTH

layers do not either block or suffi ciently degrade attack attempts at the protected asset, resulting in security policy violations by an adversary. It is relatively easy to determine that a failure has occurred when an attack is detected; however, when an attack goes unnoticed or when the forensic analysis after an attack can- not determine the point of exploitation, then holes in layered defenses might remain indefi nitely.

Defense in depth implementations are sometimes inappropri- ately or even maliciously bypassed by presumably trusted users, generally insiders to a company or agency, including its employ- ees, contractors, and partners. For example, an infrastructure organization might create diverse layers of security functional- ity to ensure that intruders cannot compromise assets from an external environment such as the Internet. Problems arise, how- ever, if malicious insiders can directly access and compromise assets. This implies that great rigor and discipline are required to ensure that defense in depth truly surrounds an asset, both inter- nally to an organization, as well as externally on the Internet. This generally requires additional functional controls on the local enterprise network to protect assets from insiders.

Depth strategies sometimes involve the familiar military notion of one protection layer slowing down an intruder. It turns out that throttling does not always extrapolate well to cyber security. In practice, cyber security methods tend to be binary in their functionality; that is, a protection will either work or it will not. Debates thus arise around how long an approach will hold off attackers, as in the selection of cryptographic key length.

Security Policy

Attack

Slowed

Attack

Blocked

Attack Gets Through

(Layered Defense Failure) Attack

Unblocked

Series of Defensive Layers

Asset

Layer 2 . . . Layer nLayer 1

X X

X

Figure 6.1 General defense in depth schema.

If layered defenses are penetrated, it is crucial to identify the entry point used by the attacker.

Do not overlook the need for protection against both internal and external adversaries.

Chapter 6 DEPTH 111

Similarly, network attacks are often dealt with by throttling or rate-limiting the traffi c allowable into a target asset environment. These approaches might work to a degree, but they are the excep- tions, and it is recommended that cyber security architectures for national infrastructure not rely on any element having only a partial effect on a given attack.

Effectiveness of Depth Academics formally model the effectiveness of a collection of defensive layers using mathematical probability. Such an approach requires that one quantitatively measure the relative dependencies between the layers, as well as the probability of effectiveness for any given layer. Unfortunately, in any nontrivial environment, both of these estimates are unlikely to be more than just an educated guess. We know, for example, that the success of access controls for enterprise applications is dependent on the success of strong authentication for remote access. Trying to accurately quantify this dependency for probabilistic analysis is a waste of time and will not result in any estimate better than an expert guess.

Thus, from a practical perspective, and in the context of real national infrastructure protection, determining the effectiveness of a defense in depth scheme must be done via educated guesses. We can make this sound better by referring to it as informal sub- jective reasoning based on relevant security factors , but it is still just a guess. The relevant factors for estimating effectiveness of a layer include the following: ● Practical experience —One can certainly analyze practi-

cal experience and past results for a given security method. This is dangerous if taken too literally, because many attacks are missed, and seemingly correct, but actually vulnerable, defenses might be dormant for a period of time before an attack.

● Engineering analysis —Experienced security engineers will use their knowledge and expertise to provide excellent judgment on whether a given layer will be effective. Vendors and sales- people are to be avoided in this process, because they will invariably distort their product and service capability.

● Use-case studies —Providing some rigor to the engineering analysis is a good idea, and the familiar use-case methodology is especially appropriate for security layers. It is really a form of testing.

● Testing and simulation —Actual testing of a layer in a controlled setting will provide good information on its effectiveness.

Ideal defensive strategies will stop—not slow down—an adversary.

How can effectiveness of a security layer be measured or quantifi ed?

112 Chapter 6 DEPTH

Simulation is also a good idea in cases where a defensive layer protects against something not easily tested, such as a massive denial of service attack. To illustrate this approach, let’s start with a simple setup,

as shown in Figure 6.2 . Specifi cally, a single layer of protection depth is depicted and is estimated to have “moderate” effective- ness. We can assume that some subset of the factors described above was used to make this determination. Maybe some team of experts analyzed the protection, looked at its effectiveness in similar settings, and performed a series of tests and simulations. In any event, let’s assume that they decided that a given protec- tion would be moderately effective against the types of attacks to be expected in the local threat environment.

The determination that this single layer is “moderately” effec- tive is nothing more than a subjective guess in most cases. It is, however, an important piece of information for national infra- structure protection because it implies that the protection will not work in all cases; that is, the experts have determined that some types of attacks will bypass or break the protection and will thus expose the asset to malicious intruders. As a result, when a given protection layer does not address all known attacks, then we can conclude the following: ● Flaws —The protection might be fl awed. This could be some

minor issue such as an obscure bug that would allow certain types of attacks or it could be potentially front-page news with major implications. In either case, fl aws in protections require either that they be fi xed or that they be mitigated by a comple- mentary layer of protection.

● Suitability —The protection might be unsuited to the target environment; for example, it might be intended to prevent

Single Defensive Layer

(Moderate Effectiveness)

Moderately Effective

Layer Allows Access

Layer 1

X

X

X

Asset

Figure 6.2 Moderately effective single layer of protection.

A moderately effective defense strategy will stop most, but not all, attacks.

Chapter 6 DEPTH 113

events A and B in an environment where the real threat is event C. Such scenarios are commonly found during incident response, when some event has occurred and the presumed protections are discovered to have had little effect, simply because of a mismatch. This is fi xed by either changing the layer or complementing it with another. Whether the layer is fl awed or mismatched, the situation

is made worse if the adversary has knowledge of the situation. Regardless of the common argument by hackers that exposing problems in a protection method should always be reported, the reality is that such information generally does more harm than good. Certainly, if an organization is lax in fi xing a problem with broad implications, this is unacceptable, but the technique of extorting that group into taking immediate action is not always in everyone’s best interests. The hacker who exposes vulnerabili- ties in a moderately effective mobile telephony control, for exam- ple, without fi rst alerting the service provider, might be guilty of degrading essential communication services that might affect human lives.

Assuming an organization is diligent and chooses to improve or fi x a moderately effective protection, the result will be that the new estimate or guess might be “highly” effective. For example, suppose that some home-grown intrusion detection system is becoming diffi cult to maintain. The local team might thus deter- mine that it is only moderately effective and might replace it with a vendor-supported product. In most cases, the new system would now be viewed as highly effective (with the caveat that no intrusion detection systems ever seem to work as well as they should). The end result is that the layer has now been improved from moderately to highly effective. It should be obvious that even in a highly effective protection environment, there will always be exceptional conditions where the protection may fail (see Figure 6.3 ).

Improving one layer is not, however, the only option available. An alternative would be for the moderately effective control to be left in place and complemented with another layer of protection. This has certain advantages, including reducing the cost and risk of forklifting out a security protection layer and replacing it with a new one. The result of complementing one moderately effec- tive protection layer with another is that the end result should mitigate a larger set of attacks. This does introduce an odd sort of calculus to the security manager, where decisions are required around whether some number of moderately effective protec- tions is better or worse than a smaller number of stronger protec- tions (see Figure 6.4 ).

Multiple layers of protection will mitigate the effects of fl aws or protections that are unsuited to the target environment.

A protection layer can be improved to become “highly” effective, but no layer is 100% effective all of the time.

114 Chapter 6 DEPTH

The answer to whether multiple moderately effective layers outperform fewer highly effective ones will depend on aggrega- tion considerations. That is, if two moderate protections comple- ment each other by balancing each respective weakness, then the composite protection will be quite good. If, on the other hand, multiple moderate protections suffer from similar weak- nesses, then the weakness will remain in the aggregate protec- tion. In practice, security managers generally should look for a diverse set of protections that are as strong as possible and that balance weaknesses in some demonstrable manner. For national infrastructure protection, this will typically involve layers of pro- tection in authentication, malware protection, access controls, encryption, and intrusion detection.

Single Defensive Layer

(High Effectiveness)

Highly Effective Layer

Disallows Virtually

All Access

(always exceptions)

Layer 1

X

X X

X

Asset

Figure 6.3 Highly effective single layer of protection.

Layer 2

Two Defensive Layers

(Both Moderate Effectiveness)

Less Secure

Than One Highly

Effective Layer?

Layer 1

X

X

X

Asset

Figure 6.4 Multiple moderately effective layers of protection.

Diversity of protection layers—including diversity of weaknesses—is critical in maintaining successful protection against attacks.

Chapter 6 DEPTH 115

Layered Authentication Most information technology (IT) and security teams in govern- ment and industry are committed to reducing the number of passwords, passphrases, handheld tokens, certifi cates, biomet- rics, and other validation tokens that exist in their environment. These initiatives are generally met with great enthusiasm among end users, because they result in simpler, cleaner infrastructure and much less for end users to have to remember, protect, or write down. One cannot deny that such simplifi cation has a ben- efi cial impact on overall security. For these reasons, various pro- posals have been made for national authentication systems run by government and that would include every citizen.

Single sign-on (SSO) initiatives are generally used to accom- plish this authentication simplifi cation objective. SSO is accom- plished by the use of a single, common identifi cation and authentication system for all relevant applications. This com- mon system is then embedded into one identity management process so reported identities can be administered and protected uniformly. The simplifi cation inherent in SSO is desirable from a security perspective, because it reduces the likelihood of errors that result when multiple complex login systems are present. Common identity management is thus generally desirable from a security perspective, especially in enterprise settings.

Problems can arise, however, in national infrastructure pro- tection environments if the process of streamlining authenti- cation goes too far. Even the staunchest advocate of SSO must agree that, for certain applications, a properly managed, properly designed, and diverse series of authentication challenges that are reliant on separate proof factors will be more secure than a com- parable SSO system. The diverse series of authentication steps will certainly be less convenient for end users but, if run cor- rectly, will be more secure. This is because such a scheme avoids the nightmarish scenario where a single login provides an adver- sary with common access across multiple national infrastructure systems. This attack scenario is so unacceptable at the national level that it dictates special consideration.

Specifi cally, for national infrastructure management, organi- zations can acceptably maintain the goal of balancing the risks and rewards of SSO for all enterprise-grade applications such as business e-mail, routine applications, and remote access. As long as no national assets can be directly compromised with SSO access, this is fi ne. Companies and agencies charged with national infrastructure can and should move to an SSO scheme with corresponding identity management. For critical national

End users will embrace authentication simplifi cation initiatives, and these are certainly easier to monitor from a security management standpoint.

Single sign-on initiatives may be embraced by end users but may not provide the ideal level of security protection.

116 Chapter 6 DEPTH

services and applications, however, a more complex, defense in depth scheme is highly recommended for end-user authentica- tion (see box).

Single sign-in access can be part of a multilayered defense in depth strategy.

Factors of a Successful National Infrastructure SSO Access System

Critical national infrastructure services need a defense in depth scheme that is developed with the following considerations: ● Diversity with single sign-on —Authentication systems for national asset protection must be different from the SSO

scheme used for enterprise access. This implies that a separate technology, vendor, and management process should be considered between enterprise SSO and national infrastructure authentication. The goal is to ensure that fl aws in one authentication system are not present in the other.

● Diversity of proof factors —Similarly, the familiar proof factors: ● “Something you know” ● “Something you have” ● “Something you embody (biometrics)” ● “Somewhere you are” should be diverse for national assets from any SSO proof factors. This implies that employees should not be handed a single handheld authenticator that can be used to gain access to e-mail and also to some critical infrastructure operational component.

● Emphasis on security —While it is acceptable to emphasize usability in enterprise SSO initiatives, the emphasis of national infrastructure protection should shift squarely toward security. The only relevant end-user issues are ones that simplify usage to reduce errors. Convenience should not necessarily be a major goal, as long as the authentication scheme does not drive bad behavior such as sharing tokens or writing down passwords.

A resultant typical defense in depth scheme for national infra- structure organizations would include SSO for enterprise-grade applications and access and a subsequent, diverse authentica- tion process for all national assets. The result is that end users would need to be authenticated twice before gaining access to a critical asset. Correspondingly, intruders would have to break through two authentication systems to gain malicious access to the target asset. End users probably would not like this and the costs are higher, but the increased security is worth the trouble (see Figure 6.5 ).

For multiple critical national assets in an infrastructure envi- ronment, the depth strategy should include maximal diversity for each asset. That is, the general computing characteristics

Chapter 6 DEPTH 117

and source of the authentication functionality should be diverse. Furthermore, the factors used in establishing proof of identity for critical assets should be stronger than simple passwords; handheld authentication or biometrics would be recommended. An implica- tion here is that the underlying infrastructure be operated with the greatest precision and correctness. Administrative procedures for obtaining an authentication token, restoring access when a token or password is lost, and providing assistance to confused end users must be carefully designed to avoid social engineering attacks. At the national level, this would require frequent testing.

A key modern consideration for enterprise authentication is the degree to which mobile access to infrastructure poten- tially changes security posture. As an example, consider that most organizations go to great lengths to ensure that several lay- ers of authentication reside between remote workers and sensi- tive applications such as enterprise e-mail. In fact, see the box to follow the experience most people have when trying to get their enterprise e-mail from a remote location using a laptop.

The example in the box also highlights the importance of rec- ognizing trends in technology as national infrastructure protec- tion initiatives are considered. For the enterprise, the old notion of protected perimeter thus disappears with the advent of mobile access across wireless carrier infrastructure. One still fi nds archi- tectures where users must “hairpin” their mobile access to the enterprise and then through a fi rewall to the target application, but this practice is likely to wane (see Figure 6.6 ).

Diverse

Authentication

Two Diverse Authentication Layers

SSO

X

X

Critical

Asset

Enterprise

Assets

Enterprise

Figure 6.5 Schema showing two layers of end-user authentication.

Unfortunately, mobile devices eliminate the multi- layered protection most companies build into their remote network access.

118 Chapter 6 DEPTH

For applications such as enterprise e-mail, this type of conve- nient bypass might be perfectly fi ne. In fact, for enterprise e-mail specifi cally, it would be unreasonable to expect that workers in national infrastructure settings should not be allowed mobile access. For more sensitive national infrastructure applications, however, such as those that provision or control critical systems, a threat analysis would be required before any alternative paths with mobile devices are allowed. Classifi ed information would be another example asset that requires multiple layers with- out mobile access bypass. These types of requirements should fi nd their way into any type of national infrastructure support contracts.

Exposing critical national assets to mobile access (even by trusted personnel) opens a gateway for an adversarial attack.

Multi-Layered Protection: Five Steps to Remote E-Mail Access

A typical remote worker will need to follow these steps to access their enterprise e-mail account: ● Authentication layer 1 . The user must fi rst login to the computer. Presumably, this is done using a password that is

set by the enterprise information technology or security group. ● Authentication layer 2. The user must then login to the local WiFi or broadband access network.

Sometimes this is free; other times it requires a credit card, which can be viewed as an added identifi cation step.

● Authentication layer 3. The user must then login to the remote access server, probably over a virtual private network (VPN). Most of the time, companies and agencies require a personal identifi cation number (PIN), password, or handheld token to authenticate VPN access.

● Authentication layer 4. The user must then login to the enterprise network, probably with some sort of domain password. This is also controlled by the local information technology or security group.

● Authentication layer 5. The user must fi nally login to the specifi c e-mail application being used by the enterprise. Sometimes this requires another password, but often it just requires access. On the surface, this would seem like the ultimate in layered authentication with no less than fi ve layers! The

problem is that many organizations provide their employees with means to remotely access applications such as e-mail with a handheld device. Consider, in this case, the experience most people have when trying to retrieve their enterprise e-mail using a mobile device: ● Authentication layer 1. The user must simply login to the mobile device, click on the e-mail icon, and then read or

create mail. This is obviously only one layer of authentication for mobile devices, and it demonstrates the importance of

recognizing that users might fi nd more convenient paths around presumed layers of authentication.

Chapter 6 DEPTH 119

Layered E-Mail Virus and Spam Protection Commercial environments are increasingly turning to virtual- ized, in-the-cloud solutions for their gateway fi ltering of e-mail viruses and spam. This decision allows the organization to remove the gateway fi lters or to simply offl oad the work those fi lters must perform. This is a healthy decision, because a gen- eral security principle is that attacks should be stopped as close as possible to their source. The network is certainly closer than the attack target’s ingress point, so virtual fi ltering is desirable. It is also helpful to the carrier, because it reduces the junk fl oat- ing around network infrastructure, which helps carriers perform their tasks more effi ciently in support of national services.

Managers of commercial environments have also come to recognize that their computing end points cannot rely solely on gateway or in-the-cloud processing. As such, the state of the practice in e-mail virus and spam protection involves a defense in depth deployment of fi lters to each laptop, netbook, personal computer, and server in the enterprise. The approach is even beginning to fi nd its way to the mobile handheld device, where the threat of viruses and spam is increasing. As such, a given virus or spam e-mail sent from a malicious source will have to fi nd its way through at least two layers of fi ltering in order to reach its intended source (see Figure 6.7 ).

This cloud fi ltering arrangement found in most compa- nies is acceptable for organizations charged with national

Layer 1

(Computer

or Device)

Layer 2

(WiFi)

Layer 3

(VPN)

Layer 4

(Enterprise)

Layer 5

(E-mail)

Direct Mobile Access

Figure 6.6 Authentication options including direct mobile access.

Mobile devices are susceptible to viruses and spam, yet spam is more of a nuisance than an actual threat to national infrastructure.

120 Chapter 6 DEPTH

infrastructure. For the most critical applications, it is recom- mended that a depth approach involving both in-the-cloud and perimeter processing be employed. In addition, for key execu- tives in these companies and agencies who might be directly targeted by adversaries, additional desktop and application fi l- tering might be prudent. Practical experience suggests that spam is more a nuisance than signifi cant threat to national asset man- agement, so the likelihood of attackers using spam to interrupt national services is only moderate. In addition, antivirus soft- ware has become less relevant in recent years, simply because so many software threats such as well-coded bots are not easily detected by antivirus software. Research into better techniques for detecting the presence of malware should become an imme- diate national priority.

Layered Access Controls Access controls determine who can access what resources under which conditions. They are one of the most common and most mature security protection methods, dating back to the earliest electronic computers. If some asset is protected by a single set of access controls, then this is similar to using a single combina- tion lock to protect a physical asset. That is, if an individual has the correct combination, then access is allowed. Common access controls include access control lists (ACLs) on Windows ® -based operating systems and permissions vectors in UNIX ® -based operating systems. These are implemented as software data structures that determine access based on some defi ned policy.

One approach to using defense in depth to protect a software application involves embedding one type of access control into the application environment and then hosting the application

Layer 1 Layer 2

Security

Software

E-mail

Source Cloud

Security

Provider

Internet Enterprise

Figure 6.7 Typical architecture with layered e-mail fi ltering.

Antivirus software, while still necessary, is not likely to detect such threats as a botnet attack.

Chapter 6 DEPTH 121

on an operating system that utilizes a different type of access control. In such a setup, access to the application can only be obtained by successfully negotiating the following layers: ● Access control layer 1 . The user must be permitted entry to the

operating system via the operating system access controls. This might be UNIX ® permissions, Windows ® ACLs, or some- thing similar.

● Access control layer 2 . The user must be permitted entry to the application via the application access controls. This is likely to be a password embedded in the application environment and controlled by the application owner. In cases where an operating system and application cannot

be remotely reached, these two layers can be augmented with additional diverse controls such as guarded access to the physi- cal premise or to a locked data center. This implies that access to an application would require fi rst obtaining physical access to a console before access to the operating system and application can even be attempted. These two layers of authentication are important and should be tested in every national infrastructure environment, especially ones employing supervisory control and data acquisition (SCADA), where computer security techniques have a more short-lived legacy. A caution, however, is that insid- ers are likely to possess both types of access, so the layers will not be helpful in stopping most forms of sabotage.

In cases where remote access is allowed, then the use of a fi re- wall is the most common method to ensure policy compliance for those permitted access. Such policy is almost always based on the source Internet protocol (IP) address of the requesting party. This is not the strongest of access control methods, sim- ply because IP addresses are so easily spoofed. Also, to maintain such a scheme, a complex and potentially error-prone or socially engineered bureaucracy must be put in place that accepts and maintains access requests. When used in conjunction with additional access control layers such as operating system and application controls, the result might be acceptable in some environments (see Figure 6.8 ).

For national infrastructure protection, critical assets should be covered by as many layers of access control as deemed fea- sible. As with authentication, the issue of end-user convenience must be viewed as lower priority if critical national services are at stake. Some general heuristics for protecting national infrastruc- ture with layered access controls include the following: ● Network-based fi rewalls —Using cloud fi rewalls offers an addi-

tional blanket layer of control. This technique is useful as a complement to existing enterprise controls, especially because

Some form of access control is present in any network connection (e.g., your personal password to access your e-mail account).

Restricting physical access to assets always adds another layer of protection from outsiders, but not from internal saboteurs.

The implementation of layered access controls places greater emphasis on protection than on end- user convenience.

122 Chapter 6 DEPTH

carrier-based systems will generally differ from whatever fi re- walls and related systems might be deployed in the enterprise.

● Internal fi rewalls —This provides yet another layer of protec- tion within the enterprise to ensure that individuals with access to resource X only gain access to that resource and no other. Routers can often provide a simple packet-fi ltering capability as part of their native processing suite, which sim- plifi es architecture and minimizes cost.

● Physical security —Excellent facility and premise-access secu- rity provides an additional tangible layer of protection and is essential for any national infrastructure protection initiatives. This must be complemented by selecting suitable applica- tions and systems that can never be accessed remotely or even across a local area network. When multiple access control systems are in place, the benefi t

of layering is reduced when the underlying administration function is performed by one team using a common set of tools. When this involves a protected and carefully managed security operations center the situation is acceptable, but when the management is ad hoc and poorly controlled the layering might be undermined by an attacker who successfully infi ltrates the administration systems.

Layered Encryption Encryption is an effective and well-known security control for protecting information. While mathematicians and computer scientists have created hundreds of different taxonomies for categorizing symmetric and public key systems, the box shows specifi c methods that are useful for the protection of national infrastructure.

Layer 1

(Firewall Based on

Source IP Address)

Layer 2, 3

(Application and OS

Access Controls)

Enterprise Remote Access

Application

Figure 6.8 Three layers of protection using fi rewall and access controls.

Multiple access control systems must be well managed so as not to allow an internal attacker successful infi ltration to the systems.

Chapter 6 DEPTH 123

The good news is that, for the most part, these fi ve encryption methods will not collide in practice. They can be used in combi- nation and in cooperation, with no great functional or admin- istrative problems expected. It is also perfectly fi ne to encrypt information multiple times, as long as the supporting adminis- trative tools are working properly. As such, one can easily imag- ine scenarios where all fi ve systems are in place and provide fi ve different layers of information protection. Not all will typically reside in a perfect series, but all can be in place in one infrastruc- ture setting providing layered security (see Figure 6.9 ).

The bad news, however, is that each will typically require its own user administration and key management systems. The result is a disparate view of cryptography across the enterprise that can be seen in the somewhat scattered arrangement in

Five Encryption Methods for National Infrastructure Protection

1. Mobile device storage —Mobile smart phones and laptops should have native encryption to protect against loss or theft and the resulting information compromise. The encryption will never be perfect but should provide useful protection in the fi eld. Several vendors offer this type of encryption as an add-on service, but this should eventually become a native function in all mobile devices and laptops.

2. Network transmission —Any sensitive data being transmitted within an enterprise or between knowing partners should be encrypted. The traditional means for such encryption has been symmetric and embedded in hardware devices. More recently, the associated cryptography is often software based and involves public keys supported by public key infrastructure (PKI) tools. When network transmission occurs in an ad hoc manner, the practical consideration is that shared cryptography simply does not exist between organizations due to complexity. This makes it diffi cult to encrypt network traffi c without coordinating things in advance.

3. Secure commerce —If an organization offers electronic commerce services over the Internet, the use of common encryption techniques such as Secure Sockets Layer (SSL) is presumed. The associated cryptography here will be public key based.

4. Application strengthening —E-mail is the most obvious application that can introduce secrecy and authentication properties via the use of encryption. As noted above, federating this cryptography, almost always public key based, between organizations has not been done on a wide scale to date.

5. Server and mainframe data storage —Encryption on servers and mainframes has received considerable attention in recent years but should be viewed with suspicion. Data at rest is poorly protected by cryptography because the associated key management systems, which require a long life, can have obvious holes. In the worst case, sloppy key management can make data less secure. Note that smart phones and laptops are different from servers because they are moving .

Information can be encrypted multiple times to achieve layered protection.

124 Chapter 6 DEPTH

Figure 6.9 . This is unfortunate, because it increases complexity, which increases the chances of error or compromise, especially to underlying infrastructure. Regardless, the use of cryptography in national infrastructure protection should be encouraged, even if the layers are not optimally coordinated.

Layered Intrusion Detection Intrusion detection was once viewed as the most promising of large-scale security techniques. Even the provocative and hopeful name “intrusion detection” suggests a powerful technology that can be inserted into an environment to alert security teams when an intrusion is imminent. While this goal has not been fully met in practice, intrusion detection does provide a useful means for detecting indicators of potentially harmful behavior. These indi- cators are sometimes used for early warning, but more often are used to correlate with other types of available information during an incident.

Access Access

AccessAccess

Trans Trans

Crypt

E-mail E-mail Ecom Ecom

EcomEcom

Layer

Enterprise Perimeter

Layer

Layer

LayerLayer

Layer

Layer

External User

External User

Mainframe

Commerce

Server

Partner

Gateway

Partner

Gateway

Laptop

Internal User

Internal UserExternal User

Figure 6.9 Multiple layers of encryption.

Chapter 6 DEPTH 125

Because intrusion detection is typically performed offl ine, it lends itself to multiple layers of monitoring. Obviously, if the intrusion detection includes an active response—which is referred to collectively as intrusion prevention —the layered arrangement could be more complex, but for now let’s analyze strategies for passive, offl ine monitoring of attack. Most organiza- tions accomplish this task using commercial systems that include three components: monitors that are placed in strategic locations to collect data, transmission systems that move alarm informa- tion to a central location, and a master monitoring function that processes incoming data and provides some sort of correlated summary, usually in the form of an alarm to a console. When this type of intrusion detection system is in place in an enterprise, it can be viewed as an explicit layer of protection. In fact, many auditors will accept intrusion detection as a complementary con- trol when some other protection displays weaknesses.

One can conceptualize an alternate layer of intrusion detec- tion being put in place at a broader level, perhaps coordinated by some government or industry group. The components of the system would be the same, but differences from the enterprise would include diverse monitor placement, different signatures of attack, and a broader base on which to perform correlation of data. An issue with this alternative layer is that the protection would likely involve network paths that are largely separate from those in specifi c enterprise settings. For example, an intrusion aimed at some government agency would not be detected by the intrusion detection system located within a separate enterprise. There are, however, three specifi c opportunities for different intrusion detection systems to provide layered protection: ● In-band detection —If two intrusion detection systems both

have monitoring access to the same attack stream, or a related one, then they might both have the opportunity to detect the condition. Thus, if one system fails, it is possible that another might not. This is the essence of defense in depth, but it only works if the response processes for each detection system are coordinated.

● Out-of-band correlation —During an incident, the operators of an intrusion detection system might benefi t from information that might become available from other operators. This can be intelligence about sources, methods, or techniques being used by attackers. It is usually best used if made available in real time.

● Signature sharing —A special case of the above correlation involves sharing of specifi c attack signatures by one opera- tor that can be keyed into the systems being run by other

Intrusion detection with data security is similar to physical security intrusion detection: monitoring, an alarm system, and a central console.

126 Chapter 6 DEPTH

operators. Military organizations, for example, sometimes develop signatures that could be shared with industrial groups to improve their security. In each of these cases, diverse intrusion detection systems can

be viewed as providing a defense in depth for target assets. The result is a potentially coordinated series of intrusion detection layers that will help protect national infrastructure. This coor- dination usually requires sharing between different monitoring and analysis centers; that is, if one intrusion detection system notices an attack such as a botnet, then it might share this infor- mation with another system that might not have detected the condition (see Figure 6.10 ).

This idea of coordinated intrusion detection systems is cer- tainly not new; for example, government cyber security commis- sions and groups have advocated the notion of signature sharing between government and industry for years. For whatever rea- son, however, such coordination has rarely occurred, but for national infrastructure protection to reach its full potential such cooperation must be encouraged and rewarded.

National Program of Depth Creating a coordinated program of defense in depth using mul- tiple layers of security for national infrastructure can only be

Asset A

Monitoring Probe

Detects Botnet

Not Detect Botnet

Monitoring Probe

Alarm Feed

Intrusion

Detection

System B

Intrusion

Detection

System A

Botnet

Asset B

Internet

Private Network

Private

Network

A Shares

Detection

Info with B

Figure 6.10 Sharing intrusion detection information between systems.

A certain amount of information sharing between government agencies may serve to increase intrusion detection effectiveness.

Chapter 6 DEPTH 127

ensured via careful architectural analysis of all assets and pro- tection systems. The architectural analysis should result in a mapping, perhaps represented as a matrix, where each critical national asset is shown to be protected by certain multiple lay- ers of security. For each layer, subjective determination of its effectiveness is also required. Once this is done, simple calcula- tions can be performed to determine the diffi culty of penetra- tion through the various layers. This task is easier than it sounds; some of the more practical considerations that arise in such an exercise include the following: ● Identifying assets —This is a required step for several of our

recommended national infrastructure protection principles, including, for example, deception. It is particularly important for defense in depth, because the analysis of depth effective- ness can only be measured from the specifi cally identifi ed assets.

● Subjective estimations —The challenges inherent in this step were explained in detail above; certainly, in practice, cer- tain conventions could arise that would help security experts arrive at common estimations of effectiveness. In the 1980s, the U.S. Department of Defense created a set of criteria (infor- mally called the Orange Book) for measuring the effectiveness of security in systems. Perhaps some elements of this criteria approach could be introduced to provide assistance in subjec- tive estimations of the effectiveness of a layer.

● Obtaining proprietary information —If a company or agency has some defense in place (or, more importantly, per- haps some defense that may be missing) for some essential national service, then obtaining this information for broad analysis may be diffi cult. The goal would be to demonstrate value for organizations sharing detailed information, even if it is bad news.

● Identifying all possible access paths —Perhaps the toughest part of any cyber security exercise involves trying to deter- mine means for accessing some target. If this is not done properly, then the defense in depth strategy will fall apart, so this important step requires special consideration. These considerations can introduce signifi cant challenges in

practice. It does not help that most existing security teams, even in large-scale settings, rarely go through a local exercise of iden- tifying defense in depth conditions. As a result, most national infrastructure protection teams would be working this exercise for the fi rst time in the context of a national program.

Reviewing systems and strategies to identify existing layers of protection will create a “map” of the current depth of defensive protection.

This page intentionally left blank

129 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00007-X 2011

DISCRETION The British spook said it on the way to the pub—a seemingly random confession that stood out in contrast to the polite evasions that were Ellis’s standard form of reply. Public key cryptography? “You did a lot more with it than we did,” he said .

Steven Levy 1

A belief found occasionally in the hacking community is that all information should be free and that anyone trying to suppress information fl ow is evil. The problem with this view is that it sug- gests that sensitive personal data should be exposed to the world. As such, this extreme view is commonly modifi ed by hackers as follows: All information associated with organizations , especially government, should be free, but private data about individuals should never be disclosed. From a logical perspective, this is a curious distinction, because large organizations are comprised of individuals, but in practice the view makes perfect sense. Hackers are almost universally concerned with protecting the rights of the individual; this view of information establishes a charter for the hacking community to make public anything that might degrade individual rights.

The result is a hacking culture where it is considered accept- able to expose proprietary information from government and industry in hacking magazines, on websites, at conferences, and across the Internet. Hackers often claim that reporting commer- cial and national vulnerabilities is a useful public service that prompts a more rapid security fi x. This certainly does not jus- tify leaking proprietary information that has nothing to do with vulnerabilities, but it does offer some value—albeit in an overly forceful manner. Regardless of the motivation, the fact is that proprietary information in companies and agencies will most defi nitely be widely exposed if discovered by hackers. Perhaps worse, terrorists and information warriors are also interested in

7

1 S. Levy, The open secret: public key cryptography—the breakthrough that revolutionized email and ecommerce—was fi rst discovered by American geeks. Right? Wrong, Wired , 7(4), 1999.

130 Chapter 7 DISCRETION

this information, but for more malicious purposes—and they will rarely make their intentions public in advance.

The result is that national infrastructure protection initia- tives must include means for protecting sensitive information from being leaked. The best approach is to avoid vulnerabilities in the fi rst place, as this information is the most urgently sought and valuable for public disclosure. More practically, however, national infrastructure includes a wide spectrum of information ranging from innocuous tidbits and gossip to critically sensitive data about infrastructure. This spectrum requires a customized protection program focused primarily on the most critical infor- mation. Any practical implementation should therefore com- bine mandatory, functional security controls with programs that dictate the use of discretion by individuals possessing important information. Mandatory controls can be implemented centrally, but discretion must be embedded in the local culture and fol- lowed in a distributed and individualized manner.

Trusted Computing Base The nearest the computer security community has come to rec- ognizing the importance of human discretion lies in an archi- tectural construct introduced in the 1980s called a trusted computing base (TCB). The defi nition of TCB is the totality of hardware, software, processes, and individuals whose correct operation and decision-making are considered essential to the overall security of the system. In an operating system, this would include the system fi les and processes in the underlying kernel. In an organization, this would include the system and security administrators who operate the critical protection systems. For an organization, it would also include all constructs for manag- ing and storing personally identifi able information (PII) about employees and customers. Candidates for exclusion from a TCB include anything whose malfunction or public disclosure would not create a signifi cant or cascading problem. In modern infra- structure, the TCB generally extends to the systems and networks of partner and supplier groups. This greatly complicates the pro- tection of TCB assets because it extends the TCB perimeter to an environment that is more diffi cult to control.

The primary goal of any program of discretion in national infrastructure protection should be to ensure that informa- tion about TCB functionality, operations, and processes is not exposed inappropriately to anyone not properly authorized and to avoid disclosure to anyone who does not possess a clear

Exposure of vulnerabilities can force a quick response, but that same exposure might lead adversaries directly to private data.

A modern TCB extends beyond a single organization, making protection all the more diffi cult.

Chapter 7 DISCRETION 131

business need for that information. Such a program will combine two distinct components: ● Mandatory controls —These are the functional and procedural

mechanisms that are put in place to ensure that information is protected from unauthorized access. Other than key adminis- trators within the TCB, no individual in any organization should be able to bypass mandatory controls, which will typically include fi rewalls, intrusion detection systems, and honey pots.

● Discretionary policy —These are the rules, recommenda- tions, and guidelines that are put in place by an organization to protect its information, especially with respect to the TCB. The discretion here is generally driven by practical concerns; for example, no functional mechanism can control what peo- ple mention informally to colleagues or customers. The only way to ensure protection here is the discretionary guidance afforded by the local culture. This can certainly be comple- mented with severe punishments if someone clearly violates the spirit of protection for TCB-related information. As one might expect, the TCB is easiest to protect if its size

and complexity are minimized. Having fewer people that must be trusted to support security, for example, is better than having to trust many different people and groups. Similarly, the fewer the systems one must trust in some base, and the less complex these systems are, the better off an organization will be from a security perspective. So, the minimization of a TCB is an excellent goal, albeit one that is often ignored in practice. Security practice has all too often involved the introduction of some new security system that is large and complex and requires full trust (see Figure 7.1 ).

Trusted

Computing

Base

Trusted

Computing

Base

Small Base of

Less Critical

Functions

(Less Desirable)

Large Base Requiring

Protection and Trust

(Less Desirable)

Smaller Base Requiring

Protection and Trust

(More Desirable)

Large Base of

Less Critical

Functions

(More Desirable)

Figure 7.1 Size comparison issues in a trusted computing base.

A smaller, less complex TCB is much easier to protect.

132 Chapter 7 DISCRETION

A major consideration in the protection of national infra- structure thus becomes how to manage, promote, and ensure proper human discretion around critical information related to TCB assets. This requires that policies, procedures, and even functional controls be put in place to assist in exercising such discretion. The idea is that, before any TCB-related informa- tion is disclosed that could have an impact on the security of some national asset, the following types of questions must be considered: ● Assistance —Could this information assist an adversary in

attacking some aspect of national infrastructure? For example, if terrorists or country-sponsored information warriors had this information, could they mount a malicious campaign against services such as emergency 911?

● Fixes —Does disclosure of this information assist in identifying a timelier or more effective security fi x? For example, will this disclosure provide someone with information that can reduce the time required to fi x the problem?

● Limits —Can the information disclosure be limited to those in a position to design a security fi x? More specifi cally, can the disclosure be done quietly and in private to a targeted group such as the vendor or service provider that can directly solve the problem?

● Legality —Is disclosure of this information a legal or contrac- tual requirement in the local environment? Or, is this disclo- sure being done for some other reason—perhaps personal gain or pent-up anger with some organization for moving too slowly?

● Damage —Is any individual or group harmed or damaged by protection and nondisclosure of this information?

● Need —Do others need this information to protect their own systems or infrastructure? As suggested, proper human discretion in the interpretation

of these questions, along with subsequent decision-making, is critical to protecting national assets. In many cases, government organizations will demand information related to some national infrastructure component or service, especially if the information relates to some trusted computing base. This is fi ne, as long as the purpose of sharing is reasonable and focused on improving the situation. When such information is demanded by a govern- ment group for unspecifi ed purposes (or, at worst, for the pur- pose of power or gossip), then such sharing is not recommended.

In any event, regardless of the security process, architectures, and systems put in place to protect assets, humans will remain a critical link in the chain. In fact, in many environments, they

Asking the right questions can help determine the impact of TCB security- related disclosures.

Before sharing critical information, consider who is requesting it and what the purpose is behind their request.

Chapter 7 DISCRETION 133

may be the weakest link. This is why the exercising of discretion in sharing information is such an important principle.

Security Through Obscurity A barrier to proper discretion is the much maligned and poorly understood notion of security through obscurity . Ask any security expert what they think of this concept, and you will receive a reli- gious argument, especially from cryptographers, that deliberately hiding information to ensure security will not work. Their claim is that anyone trying to hide design, implementation, or opera- tional detail is probably just trying to conceal fl aws. Furthermore, all information presumably fi nds its way public, they will argue, and any dependencies on suppression will eventually topple. The most objectionable applications of security through obscurity can be described in the following two scenarios: ● Long-term hiding of vulnerabilities —This involves the opera-

tors of a system concealing the existence of some exploitable fl aw as their primary, long-term means for securing the sys- tem, as opposed to the more desirable approach in which the vulnerability would be removed.

● Long-term suppression of information —This involves the operators of a target system deliberately suppressing general information about a system to make things more diffi cult for adversaries, hackers, and third parties to discover potential fl aws in a system. In each of these scenarios, the primary control involves hid-

ing information. Most would agree that this is not a reliable long- term method, because suppressed information has a tendency to eventually become public. The situation can be depicted as a knowledge time line, where zero information is initially made public about some system. With time, a gradual increase will occur in available public knowledge. If this increase reaches the point where suffi cient information is available to mount an exploit, then the security through obscurity scheme has failed. Obviously, disruptive events such as hacker announcements can create abrupt increases in knowledge (see Figure 7.2 ).

Although security through obscurity is not recommended for long-term protection as a primary control, it remains an excel- lent complementary control in many cases, as well as being an essential requirement in the short term for many types of security problems in infrastructure. For example, there are no compelling reasons for information about some organization’s security architecture to be made public. As long as the security

There are many opponents of security through obscurity as a meaningful protection strategy.

134 Chapter 7 DISCRETION

design receives expert local treatment, it is best left not publi- cized. Certainly, no one should recommend this as a primary control, and it should not be used to hide fl aws, but such discre- tion raises the bar against adversaries and might be the differ- ence between an attack that succeeds and one that fails.

Correspondingly, when some exploitable fl aw is discovered locally that requires immediate attention, the worst thing that can happen is for that information to be shared broadly. When this occurs, perhaps as a result of a posting to the Internet, the local response becomes distorted by concerns related to public relations, imminent threat, and legal concerns. Engineering solu- tions would be much improved if the fl aw can be analyzed care- fully and embedded into proper development and operations lifecycles. In addition, suppose that the steady state for some sys- tem is that suffi cient security exists to ensure proper operation, and any vulnerability that might exist is suffi ciently obscure as to make the technology reasonably dependable. If a severe vulner- ability is then found, the result is that the new steady state could jump to an unacceptably high risk state, and the integrity and dependability of the operation could be in jeopardy. This is sim- ply not acceptable, even for short periods of time, for essential national services.

The familiar argument that hackers often make here is that by exposing the vulnerability, a fi x is rushed into place. In addition, when the fi x is embedded into the original system, the integrity of that system has, by defi nition, been increased, simply because an existing fl aw has been removed. This is a powerful argument and is in fact a correct one. The problem is that for essential ser- vices, the vulnerability period—during which risk grows higher than some tolerable threshold—must be avoided. Cold logic generally goes out the window when a service must be in place

Security through obscurity should not be a primary protective strategy but can certainly be part of a defense package.

Essential national services cannot afford to be in a high risk state, even for a short period of time.

Attack Threshold

System Inception

Gradual

Increase

Security Through

Obscurity Fails Here

Sufficient Knowledge

Available to Mount Attacks

System is

VulnerableAvailable

Public

Knowledge

High

Low Time

Figure 7.2 Knowledge lifecycle for security through obscurity.

Chapter 7 DISCRETION 135

to ensure that a heart attack victim receives aid, or that tenants in an inner city receive electricity and heat, or that operators of a nuclear power plant can avoid dangerous emergency situations that could create serious health disasters (see Figure 7.3 ).

Regardless of the specifi c steady-state attack intensities and acceptable thresholds, the requirement is that the individuals charged with protecting vulnerability information must exercise proper discretion to ensure a level of obscurity for their systems. Without such discretion and obscurity, the chances are great that attack intensities can exceed desired levels, thus leading to seri- ous problems. In general, the practice should be to avoid public disclosure of vulnerabilities until a responsible fi x has been put in place. This suggests that disclosure of vulnerability information must be minimized and confi ned to those in a position to design and embed a proper solution.

Information Sharing Sensitive information can be exposed in different ways, includ- ing deliberate leaks, stray comments, document theft, and hacker disclosure. Each of these occurrences can be jolting for a security team, and their potential creates a general feeling of unease, espe- cially in national infrastructure settings. An additional path for the exposure of sensitive information involves willful information sharing with some controlled, authoritative group. While this is a predictable event, and the recipients are usually delighted with the information, the group doing the sharing is rarely pleased with the overall process.

Fixes Put

in Place

Increase in

Attack Intensity

Time

Threshold 1

Attack

Intensity

Threshold 2

Steady State

Vulnerability Information

Disclosed New Steady State

Security

Progress

Figure 7.3 Vulnerability disclosure lifecycle.

Information sharing may be inadvertent (stray comments), secretive (document theft), or willful (federal regulations or audits).

136 Chapter 7 DISCRETION

Government agencies are the most aggressive in promoting information sharing. Obviously, where legal requirements dictate reporting of data, there is no reason for debate. Law enforcement groups and federal regulators, for example, regularly demand information, but this is done under extremely controlled condi- tions and rarely, if ever, results in vulnerability-related data being disclosed to an adversary. For cyber security, however, govern- ment agencies request that industry share sensitive information for the following reasons: ● Government assistance to industry —In theory, attack signa-

tures and related security data could be provided by govern- ment to industry, as long as government is fully aware of the vulnerabilities that might reside in commercial infrastruc- ture. This requires information sharing from companies to government.

● Government situational awareness —For government to prop- erly assess cyber security risk at the national level, informa- tion sharing from industry is required, as such a large portion of national infrastructure resides in industry.

● Politics —Government groups are political by nature, and sensitive information provided by industry serves as a type of “power currency” that is used to push political objectives within government. This is rarely stated, but no government offi cial would deny its validity. In practice, information sharing between industry and gov-

ernment tends to provide spotty results for both parties. The idea of government providing direct cyber security assistance to industry, for example, is mostly theoretical. Valid scenarios can easily be imagined, especially for attack signatures that might be known by a military or intelligence group, but the practical real- ization of this is rarely seen. Similarly, the idea of government using shared information to form an aggregate view of national cyber security risk sounds great, but has never been done—at least in any public way. In contrast, the political objective has been the primary driver for most information sharing initiatives, which helps explain the enthusiasm that remains in government for this activity. This is a shame, because of all the motivations this one is the least important to the operator sharing data. In fact, an inverse relationship seems to exist between the respec- tive measures of value to the sharing and receiving parties (see Figure 7.4 ).

The relationship illustrated in Figure 7.4 shows that whereas government primarily seeks political power with informa- tion, industry cares the least about this; correspondingly, where industry would benefi t most from government assistance, this

Government and industry are not mutually invested in information sharing for the same reason.

Chapter 7 DISCRETION 137

is an area where government is in the weakest position to help. Both government and industry would agree that it is moderately important that government maintain situation awareness of vul- nerabilities, but neither would list this as their primary objec- tive. It is this inverse relationship that helps one understand why information sharing initiatives have rarely worked. It also goes without saying that any cases where information has been shared with government and is then sloppily handled, perhaps even leaked to the press, just makes matters worse.

The recommendation here is that any energy available for expenditure in this area should focus on fl attening the two curves somewhat. Government should be less focused on politics, and industry should be less concerned with getting something in return for sharing. The end result is that sharing objectives will nat- urally converge to an agreed-upon situational awareness objective, which is important but certainly not so important as to warrant all the attention this issue brings to the cyber security discussion.

Information Reconnaissance Reconnaissance activity performed by an adversary is another means by which sensitive information can be exposed. This is

High

Medium

Low

Most Important

Least Important

Industry

Industry

Government

Industry

Value to

Respective

Party

Government

Government

Government

Assistance

to Industry

Government

Situational

Awareness

Government

Politics

Figure 7.4 Inverse value of information sharing for government and industry.

Certainly, poor handling of sensitive or private information lessens industry’s trust in government when sharing information on vulnerabilities.

138 Chapter 7 DISCRETION

important to recognize because attacks on national infrastruc- ture will always include some form of reconnaissance. It can be done at arm’s length using remote access over the Internet; it can be done using compromised or planted insiders with access to critical local data; it can be done using social engineering tech- niques; it can be done via deliberate theft, remote hacking, or quiet sabotage, and so on. Regardless of the technique or vantage point, reconnaissance is used to plan and prepare for attacks on infrastructure.

Adversarial attacks are rarely spontaneous; some amount of planning goes into each attack.

This three-stage model suggests that at each layer of informa- tion collection by an adversary the opportunity exists for security engineers to introduce information obscurity. The purpose of the

v

Reconnaissance Planning Levels Three levels of reconnaissance are followed in most instances of cyber attack planning: 1. The fi rst level involves broad, wide-reaching collection from a variety of possible sources. This can include web

searches, personal contact, and business interaction. 2. The second level of reconnaissance involves targeted collection, often involving automation to provide assistance.

Network scanning is the most common functional support for this second level of reconnaissance. 3. The third level involves direct access to the target. A successful hacking break-in to some system, followed by the

collection of targeted data, is an example. One possible scenario that strings the three phases together might involve broad reconnaissance, where something

found on the Internet would prompt more targeted reconnaissance, which would involve the scanning activity to fi nd something that could then be used in the third phase for direct access to a target (see Figure 7.5 ).

Access

Stage 1:

Broad

Stage 2:

Targeted

Stage 3:

Direct

Adversary

Target

Personal

Internet

Business

Media

All-Source

Tools

Scanning

Monitoring

Figure 7.5 Three stages of reconnaissance for cyber security.

Chapter 7 DISCRETION 139

obscurity would be to try to prevent a given type of information from being disclosed through the reconnaissance activity. The specifi c types of security-related national infrastructure informa- tion that should be obscured are as follows: ● Attributes —This is information about seemingly nonsecurity-

related features, functions, and characteristics of the com- puting, networking, applications, and software associated with national infrastructure. It could include equipment type, vendor name, size and capacity, and supported functional- ity. Adversaries often covet this type of information because it helps provide context for a given attack.

● Protections —This is information related to the security pro- tection of a national asset. It can range from technical confi g- uration or setup data about systems to nontechnical contact information for key security administrative staff. The value of this information should be obvious; when obtained, it pro- vides a roadmap for the type of countermeasures an adversary must consider in planning a successful attack.

● Vulnerabilities —This is information related to exploitable holes in national infrastructure. It can range from well-known bugs in commercial operating systems to severe vulnerabili- ties in some national asset. Adversaries will seek this type of information from any possible source. This can include the national infrastructure management team, relevant technology or service vendors, or even the general public. The hacking community is also a rich source of vulnerability information, especially as it relates to national assets. Of these three attributes, vulnerability information tends to

dominate most discussions about the types of information an adversary might desire. Go to the technical section of any book- store, for example, and you can fi nd thick tomes chronicling the exploitable holes in virtually any technology you can imagine. This gives you some idea of how diffi cult it really is to obscure vulnerability information. This should not discourage the opera- tors of national infrastructure; when serious problems are dis- covered that can degrade essential services, the only responsible action is to work toward some sort of fi x with the responsible parties before the information is shared to the rest of the world, which obviously includes the adversary.

Obscurity Layers One conceptual approach to managing discretion in protect- ing national infrastructure information involves obscurity layers .

Although truly obscuring vulnerability information is likely an impossibility, security managers should strive for discretion and privacy on this point whenever possible.

140 Chapter 7 DISCRETION

These layers are intended to reduce the likelihood that critical information is disclosed to unauthorized individuals. Techniques for introducing layers of obscurity range from common-sense human discretion to more structured processes for controlling information fl ow. If designed properly, obscurity layers should make unauthorized disclosure possible only if multiple, diverse obscurity techniques are somehow bypassed. In this sense, obscurity layers can be viewed as an instance of defense in depth.

In the best case, obscurity layers provide diverse, complemen- tary, and effi cient coverage around national asset information. That is, an asset might fi rst be protected by an obscurity layer that includes data markings to remind individuals of their obli- gation to use discretion. A second obscurity layer might involve some mandate that no technical information about local net- works, software, or computing platforms be shared beyond the team of trusted administrators. A third layer of obscurity might then involve the mandate that, if information does somehow leak out about critical infrastructure, the organization will never com- ment publicly on any aspect of the leak.

These three example layers are complementary and provide guidance to individuals on how to exercise discretion in what information to share and what information to suppress. As such, they can be viewed as an effective discretionary tool for protect- ing assets (see Figure 7.6 ).

Layering the methods of obscurity and discretion adds depth to a defensive security program.

Asset

Data Markings

Sharing Policy

Leakage Policy

Information Leaks

Through Only

a Single Layer

Information Leak

Not Countered

by Any Layer

First Obscurity Layer

Second Obscurity Layer

Third Obscurity Layer

Figure 7.6 Obscurity layers to protect asset information.

Chapter 7 DISCRETION 141

Leaks through obscurity layers might make their way through each layer or might be countered by one or more layers. For example, in Figure 7.6 , an information leak that would not be countered by any layer might involve someone exercising poor discretion by ignoring data markings (through the fi rst layer), violating information sharing policies (through the second layer), and being ignorant of policies for disclosure after an incident (through the third layer). This demonstrates the human element in the use of discretion to protect critical infrastructure informa- tion. Additional examples of obscurity layers in national infra- structure protection include the following: ● Public speaking —A policy might be in place that would delib-

erately prevent anyone with responsibility for national infra- structure from speaking publicly without explicit public relations preparation and planning.

● Approved external site —A ubiquitous mechanism, such as a website, might be in place to constantly and consistently pro- vide organizationally approved information about infrastruc- ture that might be desired by external entities.

● Search for leakage —Search engines might be used via ethi- cal hacking techniques to determine the degree and scope of inappropriate information that might already be located on websites or in a cache. This can be complemented by modern data leakage protection (DLP) tools. As suggested above, the purpose of these discretionary con-

trols is not to suppress information for the purposes of hid- ing incompetence or inappropriate behavior. The purpose is to responsibly control the type of information made available to a malicious adversary.

Organizational Compartments An information protection technique used successfully by the U.S. federal government, especially in the military and intelli- gence communities, involves the compartmentalization of indi- viduals and information. These compartments can be thought of as groups for which some set of policy rules uniformly apply. Typically, individuals are put through a background check to determine their level of trustworthiness. They are then given a designated security clearance . Information is similarly put through an analysis to determine its level of criticality; it is then given a designated security classifi cation .

The specifi cs of how clearances and classifi cations work are beyond the scope of this book, but a key notion is that each

Even with layered obscurity, asset information may leak through to an adversary.

Government clearance levels and information classifi cation are techniques used to protect data by limiting accessibility.

142 Chapter 7 DISCRETION

combines some notion of hierarchical level (e.g., Top Secret, Secret, Confi dential, Unclassifi ed) with a corresponding notion of “need to know” categories (e.g., Navy, Air Force). The cross-prod- uct of some set of classifi ed information with the corresponding individuals cleared to access that information is called a compart- ment . Policy rules for accessing data, such as classifi ed documents, from a compartment can then be implemented (see Figure 7.7 ).

The examples in Figure 7.7 show an individual cleared to Top Secret in categories Navy and Air Force being successful in read- ing a document that is classifi ed to the same level and categories. In addition, an individual cleared to Top Secret in category Navy is successful reading a document cleared to the same level and categories. On the other hand, an individual cleared to Top Secret in category Air Force is denied access to a document whose cat- egory is only Navy. This type of approach is especially powerful in an actual government setting, because information leaks can be interpreted as violations of federal law. In the most intense case, such violations could be interpreted as espionage, with all the associated punishment that comes with such action. The result is a mature environment in most government settings for reduc- ing the chances that national security-related information will be leaked.

Clearly, the protection of national services is not just the responsibility of government. Thus, industry needs a correspond- ing approach to policy-based access control. The good news is that translation of government compartments to a corporate setting is relatively straightforward. Clearance and classifi cation levels can be mapped to company-defi ned organizational levels such as “supervisor” and “senior manager.” Categories can be mapped to specifi c projects in a company. Thus, a compartment in some company might correspond to the senior manager level, within some project A and project B (see Figure 7.8 ).

Certain secure government data can only be accessed by a few top-level offi cials.

(Top Secret (Navy, Air Force))

(Top Secret (Navy))

(Top Secret (Air Force))

(Top Secret (Navy, Air Force))

(Top Secret (Navy))

Read

Individual (Clearance) Document (Classification)

Read

Deny

Read

"Need to Know"

Category X

Figure 7.7 Using clearances and classifi cations to control information disclosure.

Chapter 7 DISCRETION 143

The bottom line with compartmentalization is that it should be used to help defi ne boundaries around which information can or cannot be accessed. This helps guide decisions that require human discretion. Too often, in computer security settings today, the underlying goal of many projects and in the management of many critical systems is to avoid the use of information boundar- ies, often in the interest of openness and sharing. These concepts are valuable for many types of standards, information, data, soft- ware, and services, but unfortunately openness and sharing are not always consistent with protecting security-related informa- tion about national infrastructure.

National Discretion Program To implement a national program of information obscurity and discretion, several management and security engineering tasks will be required: ● TCB defi nition —Although it could be diffi cult to do so, effort

should be directed by suitable national authorities toward try- ing to defi ne a nationwide trusted computing base. This will require coordination between government and industry, but the resulting construct will help direct security management decisions.

● Reduced emphasis on information sharing —Government must immediately reduce its emphasis on demanding that information be shared by industry. Any information sharing initiatives that do maintain such an emphasis should focus only on providing government with situation status.

● Coexistence with hacking community —The national infra- structure community in government and industry would benefi t by creating an improved spirit of cooperation with

Private companies can mirror government clearance levels by classifying data and limiting access.

(Senior Manager (Projects A, B))

(Manager (Project B))

(Manager (Project A))

(Senior Manager (Projects A, B))

(Manager (Project B))

Read

Individual (Level/Authority) Document Marking

Read

Deny

Read

"Need to Know"

Category

X

Figure 7.8 Example commercial mapping of clearances and classifi cations.

144 Chapter 7 DISCRETION

the hacking industry. This could come in the form of fi nan- cial support from government for hacking groups and forums, or it could be more explicit in terms of actual tasking on real programs.

● Obscurity layered model —A national obscurity layer should also be put in place to guide decisions about human discre- tion in protecting sensitive national infrastructure-related information.

● Commercial information protection models —Industry should be provided with incentives and rewards for demonstrating some degree of embedded policy-based access control similar to the military model. Certainly, to increase the chances that these tasks are success-

ful, a culture of human discretion around sensitive information must be created. Senior managers must reinforce this culture by not exercising their right to bypass discretionary controls; for example, all documents, even those created by senior managers, should be marked appropriately. Similarly, if violations of basic information discretion do occur, the consequences should be similarly applied, regardless of organizational position or level.

145 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00008-1 2011

COLLECTION It is important to have a fairly clear understanding of what you are looking for and what events you are interested in, because you cannot collect or detect everything .

Stephen Northcutt 1

A basic tenet of computer security is that diligent and ongoing observation of computing and networking behavior can high- light malicious activity. This works best when the observer has a good frame of reference for what constitutes normal behavior. Algorithms and human judgment can then be used to compare profi les with observations to identify activity that might be sus- picious. Follow-up analysis can then be used to partition sus- picious activity into benign and malicious categories. All this processing and analysis can only be done in the context of an existing program of data collection .

At the national level, security-relevant data must fi rst be col- lected at the local or regional level by individual asset manag- ers and a subset then selected for broader aggregation into a national collection system. In some cases, local and regional col- lection can be directly connected to national programs. Larger- scale collection points on wide-area networks, perhaps run by carriers or government agencies, can also be embedded into the collection scheme and combined with local, regional, and aggre- gated data (see Figure 8.1 ).

Such a national collection process does not exist today in any organized manner. To build one will require considerable resolve. From a technical perspective, each collection point requires that decisions be made about which data is gathered, what methods will be used for collection, how it will be used, and how it will be protected. It is not reasonable for any organization to collect any sort of data without having specifi c answers to these simple questions. Improper collection of data, where no clear justifi ca- tion exists for its aggregation, could lead to serious legal, policy,

8

1 S. Northcutt, Network Intrusion Detection: An Analyst’s Handbook , New Riders Publishing, Berkeley, CA, 1999, p. 34.

Data collection should not be attempted until an organized plan is in place to analyze and protect the data.

146 Chapter 8 COLLECTION

or even operational problems for organizations charged with protecting some national asset.

As an illustration, many government groups have done a ter- rible job in the past protecting data once it has been aggregated. Several years ago, for example, sensitive information collected from chemical companies in the New York area was published by a government agency on its website. This information was then collected by reporters and reproduced as an article in a New York City newspaper, replete with a map showing which types of dan- gerous chemicals were present and their exact location, as well as noting the health and safety implications of these chemicals. This type of information is of great interest, obviously, to terror- ists. Dissemination of this information could also have a nega- tive impact on business operations and the reputations of these companies.

At both local and national levels, data collection decisions for national infrastructure should be based on the following three security goals: ● Preventing an attack —Will the data collected help stop a pres-

ent or future attack? This implies that the recipient of col- lected data must justify its role in stopping the attack. If the recipient manages some critical infrastructure component, such as a backbone network, that can be used to throttle or stop the attack, then the justifi cation is obvious. If, however, the recipient is a government agency, then the justifi cation might be more diffi cult.

National Level Data Collection Program

Wide-Area Collection

(Backbone, Shared Services)

Regional Collection

(Geographic Coverage)

Regional

Aggregation

Enterprise

Aggregation

Wide-Area

Aggregation

Local Collection

(Enterprise, Agency)

Figure 8.1 Local, regional, and national data collection with aggregation.

Chapter 8 COLLECTION 147

● Mitigating an attack —Will the data collected assist in the response to an ongoing attack? The implication here is that the recipient of data should be able to help interpret what is happening or should be able to direct resources toward a solu- tion. One of the most relevant questions to be answered about an ongoing attack, for example, is how widespread the attack might be. Collecting information from a broad distribution will help to answer this question.

● Analyzing an attack —Will the data collected assist in the forensic analysis of an attack after it has occurred? This goal is important but can be easily abused, because it could jus- tify collection of any sort of data available. Forensic analysts generally maintain that their task is made easier in the pres- ence of large volumes of data. Care must therefore be taken to ensure that inappropriate data collection does not occur simply because a forensic analyst might claim to need the information. These three requirements should direct the scope, coverage,

and degree of detail associated with a data collection program for every national infrastructure component. In fact, they provide a suitable template for determining exactly what sort of data should be collected and aggregated. At the local, regional, wide area, and national levels, data collection should only proceed if affi rmative answers to these questions can be made (see Figure 8.2 ).

The decision to not collect data might be among the most dif- fi cult for any organization, especially a government agency. One of the great axioms of government computer security has been that more data is always better, especially if a path exists to per- form such collection. The reality, however, is that improper data collection not only is unnecessary but could also actually weaken national infrastructure.

Initiate Data Collection

Mitigate

Present

Attack?

Analyze

Past

Attack?

Prevent

Future

Attack?

No

Yes Yes Yes

No No No Data

Collection

Required

Figure 8.2 Justifi cation-based decision analysis template for data collection.

Data collection must be justifi ed as to who is collecting the data and why.

Beware the “more is better” axiom regarding data collection; focus on quality, not quantity.

148 Chapter 8 COLLECTION

Collecting Network Data Perhaps the most useful type of data for collection in national infrastructure is network metadata . Also referred to as netfl ow , metadata provides many security-relevant details about net- work activity. In a Transmission Control Protocol (TCP)/Internet Protocol (IP) environment, metadata allows the security analyst to identify source address, destination address, source port, des- tination port, protocol, and various header fl ags in a given ses- sion. This information is security relevant because it provides a basis for analyzing activity. A nontechnical analogy would be that metadata is akin to the information that postal workers can see in the mail they process. The size, weight, color, texture, and addressing information on the envelopes and wrappers are apparent, whereas the contents are not.

The collection of metadata involves the placement of equip- ment or software into the target network for the purpose of pro- ducing metadata records. These records are collected and stored for analysis. Obviously, to make this collection feasible, certain functional considerations must be made. There must be legal jus- tifi cation for collecting the data, there must be suffi cient storage capacity for maintaining collecting data, and there must be ana- lysts with proper capability to make effective interpretations about the data. Perhaps the most important consideration, however, is whether the collection functionality is suffi ciently powerful to keep up with the target network bandwidth capacity (see Figure 8.3 ).

One issue with large-scale versions of this collection approach is that many metadata collection systems were deployed in car- rier backbones during the early part of the century, with the intention of pulling security data from 10-Gbps backbones.

Local Storage

and Reduction

Collection Equipment

(Hardware and Software)

Transport from other

Collection Points

Processing

and Analysis

Storage

Transport Target

Network

Figure 8.3 Generic data collection schematic.

Metadata is information about the data, not what the data is about.

Chapter 8 COLLECTION 149

The challenge is that carrier backbones have begun to grow to 40- and even 100-Gbps capacities. If the collection systems are not increased at a commensurate rate, then the ability to collect metadata could decrease by as much as a factor of ten.

One solution many security analysts use to deal with increas- ing network capacity is to sample the data. This technique involves grabbing some of the data at predetermined intervals so the inbound fl ow matches the ability to process. Sampled data is generally acceptable for broad analysis of network activity, but it is not as effective for detailed forensics as unsampled metadata. In an unsampled environment, analysts can often detect tiny anomalies in massive amounts of data. This design consideration affects the overall collection process.

As an example, several years ago unsampled metadata on an IP backbone allowed analysts in a global carrier environment to detect that a small number of packets of an unusual proto- col type were beginning to show up. Packets of this type had not been seen on the backbone for years, so this was clearly an anomaly to be investigated. Suspicious packets from this unusual event were collected and observed for four days, until a key equipment vendor contacted the carrier to report a serious secu- rity fl aw in their operating system software. Interestingly, exploits of this vulnerability involved traffi c being sent over precisely the protocol type being observed. The collection point thus detected network activity evidence of a security issue that had not even been publicly reported (see Figure 8.4 ).

Thousands of

Packets Detected

Vendor Notification of

Vulnerability on this Protocol

Zero Packets

Detected

# of Packets

of Unusual

Protocol Type

Day

1,000

2,000

3,000

4,000

5,000

n n+4 n+5n+3n+2n+1 n+6 n+7

Figure 8.4 Collection detects evidence of vulnerability in advance of notifi cation.

Data collection systems need to keep pace with growth of carrier backbones.

Sampling data is less time consuming, yet unsampled data may reveal more vulnerabilities in the system.

150 Chapter 8 COLLECTION

The key observation here is that, under normal conditions, no instances of this type of protocol packet were observed on the carrier backbone. When suddenly the unusual protocol type essentially came alive, there was no easy way to determine why this was the case other than that some sort of anomaly was tak- ing place. When the vendor reported the problem on this proto- col, analysts were able to put together this information to solve the riddle of why this anomaly had occurred. This illustrates the importance of integrating all-source information into any data collection environment. National infrastructure protection must include this type of collection and associated analysis to be fully effective in protecting essential services.

Collecting System Data National infrastructure protection initiatives have not tradition- ally included provision for collecting data from mainframes, serv- ers, and PCs. The justifi cation for such omission is usually based on the scaling and sizing issues inherent in the massive amount of data that would have to be processed from these computers, along with the common view that such systems probably do not provide much security-relevant data. An additional consideration is the potential for privacy abuses, an issue that the citizens of most nations have come to recognize as being important to their lives. As a result, no serious national infrastructure protection initiative to date has included a proposal or plan for this type of functionality.

Regarding scaling and sizing issues, the computing infrastruc- ture required for collection of data from every mainframe, server, and PC deemed part of national infrastructure services would certainly be complex. That said, computing historians know that it is not unprecedented for the complex requirements of one gen- eration to become routine features in another. Furthermore, the tactical approach of identifying a workable subset of the relevant computers in a nation is possible. For example, the mainframes, servers, and PCs running in companies and agencies charged with national infrastructure could be targeted for collection, and this is a tractably sized challenge.

On the issue of whether mainframes, servers, and PCs provide suitable security-relevant information for national infrastructure protection, many critical incidents are best identifi ed through collection of data at this level. Operating system logs, mainframe event summaries, and PC history records provide excellent evi- dence that malicious activity might be ongoing. Engineering metrics such as memory utilization or processor load can also

Analysis of unsampled metadata can reveal concerning data traffi c patterns that would otherwise go unnoticed.

We may not currently have the capacity to collect data from all relevant computers, but it is an important goal to try to reach.

Chapter 8 COLLECTION 151

provide valuable signals about security issues. For example, when a server shows increases in processor usage as a result of an attack, this condition is often easiest to identify using monitoring tools embedded in the operating system of the computer.

System monitoring is important to national infrastructure protection because it is often the only indicator that some secu- rity event is under way—even in the presence of fi rewalls, intru- sion detection systems, and other security tools. As a result, national infrastructure protection initiatives will have to include provision for the gathering and processing of data from main- frames, servers, and PCs. This data will have to be selected, collected, transmitted with suitable protection, stored in an envi- ronment properly sized for large amounts of data, and processed in real time. Four specifi c types of information that should be collected include those listed in the box below.

System monitoring provides an overview of activity that may reveal troubling patterns.

Top Four Data Collection Areas 1. Utilization —One of the most important metrics in determining whether an attack is ongoing is the utilization profi le

of servers in the local environment. National asset managers must identify which servers are relevant for monitoring and should instrument an associated program of data collection. This will require cooperation between government and industry, as well as the inclusion of appropriate functional requirements in infrastructure support contracts.

2. Usage —Patterns of usage on the mainframes, servers, and PCs in a given nation are important to establish for protection of infrastructure. If certain mainframes are never touched after hours, for example, then this will help to identify smaller attacks during unusual times. Detecting small, active usage events is often easier in a quiet environment than in a noisy environment; however, detecting usage drops is often easier in a noisy environment than in a quieter one.

3. Applications —Collecting data about the applications resident on system infrastructure provides useful hints about possible cyber attacks. A common metric is a “top ten” list of most commonly used applications. If the mix changes in some meaningful way, then this could signal an attack. Network gateway systems including proxies are excellent candidates for collecting this type of data for an enterprise. Carriers could provide this type of data in a wide area network or across a given region.

4. Outages —Information about outages is important for security, because events that are presumed to have been benign might actually be part of a cyber attack. It is not uncommon for system managers to ignore this possibility; hence, data collection around outages is important. As an example, root-cause analyses after serious outages should be viewed as important information for gathering and analysis.

Two techniques are useful at embedding system management data into cyber security infrastructure. First, an inventory process is required to identify the systems that are considered critical in

152 Chapter 8 COLLECTION

a given environment. This process might require engineering analysis across relevant government and industrial infrastructure to determine if a given system resides in the critical path of some national service. Alternatively, the decision might be made to try to collect information from every system that is available for col- lection. Second, for those systems deemed worthy of data collec- tion, a process of either instrumenting or reusing data collection facilities must be identifi ed. This could involve the use of operat- ing system audit trails or it could involve the installation of some sort of application-level logging program.

Regardless of the approach, data would fl ow from the tar- get computers of interest across some network medium to vari- ous aggregation points. Regional and enterprise networks would probably have to introduce an aggregation function for their organization before the data is shared externally. One would expect that network carriers could easily step into this role of pro- viding different types of aggregations; that is, customers of DSL and cable services could agree, under suitable incentives, to allow for collection of data related to the presence of malware, viruses, and other compromising software. Encryption could be used to help protect the confi dentiality of the data in transit and storage.

There would also have to be some sort of fi ltering or data reduction to focus the collection on specifi c systems of inter- est and to limit data to only that which is likely to be useful. For example, if a nation tried to collect security-related data from hundreds of thousands or millions of PCs every day, the resultant daily datafl ow would be measured in the multiple terabyte range. Commercial databases would probably be insuffi cient for stor- ing this volume, so customized databases would be required. The volume of collected data would ultimately be made available to a security processing and interpretive system that could be used for national infrastructure purposes.

Although more creative overall architectures could be imag- ined, such as peer-to-peer, the centralized collection approach would be more likely to be implemented in practice. It also lends itself quite well to the establishment and operation of a national security operations center (see Figure 8.5 ).

Readers might cringe at the idea of collecting data in this manner, especially from end-user PCs scattered across a nation, but this practice is more common than one might think. Every large enterprise and government agency, for example, routinely embeds integrity management software, such as tripwire func- tionality, into their mainframes and servers. Furthermore, almost every enterprise and agency uses software agents on PCs to col- lect relevant security and management data. Perhaps ironically,

Aggregation points would allow for regional collection of data.

Chapter 8 COLLECTION 153

botnet operators have also perfected the idea of collecting data from massive numbers of end-user computers for the pur- pose of attack. The idea that this general schema would be extended to benevolent national infrastructure protection seems straightforward.

The challenge is that this sort of scheme can be abused. Computer scientists lament software running with high privilege on their systems, and citizens resist the notion of an unknown monitor pulling data from their system to some unknown col- lection facility, possibly violating privacy principles. Both con- cerns are valid and need to be debated publicly. If an acceptable compromise is reached between government and its busi- nesses and citizenry, then the result can be incorporated into the design of an appropriate national system. At minimum, such a compromise would have to include demonstrable evidence that mainframes, servers, and PCs provide only harmless com- puter security-related information such as scan data, security state, and signature-based malware detection. Anything more penetrating that might allow, for example, remote access and execution from a centralized control station would probably be unacceptable, even though organizations do this routinely with their employee base.

Another possibility might be some sort of citizen-sponsored, citizen-run, grassroots data collection effort for PCs and servers,

National Level Collection Infrastructure

Aggregation Layer

Data Transport Layer

Enterprise

Mainframes

and ServersEnterprise PCsCommunity PCs

Carriers

Figure 8.5 Collecting data from mainframes, servers, and PCs.

A national data collection center may not differ much from current enterprise and agency data collection.

A national data collection program would have to be sensitive to citizens’ concerns for privacy.

154 Chapter 8 COLLECTION

where participants agree to provide security information to a massive distributed system of peers. Such a system would not perfectly match the geographic or political perimeter of a nation, and many citizens would refuse to participate based on prin- ciple. Few members, however, of massive peer-to-peer networks for music or video complain about the privacy implications of running such software, often questionable or illegal, on their local machine. They just enjoy getting free content. The idea that a similar construct could be used to help secure national infra- structure would require demonstrating some sort of benefi t to participants. This may not be possible, but the effort is worth- while from a security perspective because data collected from a massive deployment of computers across a given nation would provide a valuable and unmatched window into the security pos- ture of national infrastructure.

Security Information and Event Management The process of aggregating system data from multiple sources for the purposes of protection is referred to in the computer secu- rity community as security information and event management (SIEM). Today, SIEM tools can be purchased that allow collec- tion of a diverse set of technologies from different vendors. This typically includes fi rewalls, intrusion detection systems (IDS), servers, routers, and applications. Just about every commercial enterprise and government agency today includes some sort of SIEM deployment. One could easily imagine this functional- ity being extended to include collection feeds from mainframes, servers, and PCs (see Figure 8.6 ).

The SIEM system will include translation functions to take proprietary outputs, logs, and alarm streams from the differ- ent vendors into a common format. From this common collec- tion format, a set of common functions can thus be performed, including data storage, display, sharing, and analysis. National infrastructure protection must include rational means for inter- preting SIEM data from components, if only because many orga- nizations will already have a SIEM system in place for processing their locally collected data. This interpretation of SIEM data from multiple feeds will be complicated by the fact that most existing SIEM deployments in different companies, sectors, and govern- ment agencies are mutually compatible. A more critical prob- lem, however, is the reluctance among most security managers to instrument a real-time connection from their SIEM system to a national collection system. A comparable problem is that service

Citizens who see the benefi t of a national data collection system would likely be willing to participate voluntarily.

Security managers will be reluctant to link their SIEM system to a national collection system.

Chapter 8 COLLECTION 155

providers do not currently feed the output of their consumer cus- tomers’ data into a regional SIEM system.

In any event, the architecture for a national system of data collection using SIEM functionality is not hard to imagine. Functionally, each SIEM system could be set up to collect, fi lter, and process locally collected data for what would be considered nationally relevant data for sharing. This fi ltered data could then be sent encrypted over a network to an aggregation point, which would have its own SIEM functionality. Ultimately, SIEM func- tions would reside at the national level for processing data from regional and enterprise aggregation points. In this type of archi- tecture, local SIEM systems can be viewed as data sources, much as the fi rewalls, intrusion detection systems, and the like are viewed in a local SIEM environment (see Figure 8.7 ).

Unfortunately, most local infrastructure managers have not been comfortable with the architecture shown in Figure 8.7 for several reasons. First, there are obviously costs involved in setting up this sort of architecture, and generally these funds have not been made available by government groups. Second, it is possible that embedded SIEM functionality could introduce functional problems in the local environment. It can increase processor uti- lization on systems with embedded SIEM hooks, and it can clog up network environments, especially gateway choke points, with data that might emanate from the collection probes.

A much more critical problem with the idea of national SIEM deployment is that most enterprise and government agency

Storage Display Sharing Analysis

Firewall IDS Server Router Apps . . .

. . .

Common

Functions

Translation Foods

Data Collection

Figure 8.6 Generic SIEM architecture.

Local and regional SIEM systems would work as fi lters to feed only relevant data to a national collection point.

Will a national data collection system put an increased fi nancial burden on private agencies and enterprises?

156 Chapter 8 COLLECTION

security managers will never be comfortable with their sensitive security data leaving local enterprise premises. Certainly, a man- aged security service provider might be already accepting and processing security data in a remote location, but this is a virtual private arrangement between a business and its supplier. The data is not intended for analysis other than to directly benefi t the originating environment. Furthermore, a service level agreement generally dictates the terms of the engagement and can be termi- nated by the enterprise or agency at any time. No good solutions exist for national SIEM implementation, other than the gener- ally agreed-upon view that national collection leads to better national security, which in turn benefi ts everyone.

Large-Scale Trending The most fundamental processing technique used for data that is collected across national infrastructure involves the identifi ca- tion of trends . In many cases, trends in collected data are obvi- ous, as in simple aggregate volume increases, such as packets delivered on a network. In other cases, however, trends might not be so obvious. For instance, when the collection process or monitored systems are experiencing change, the trend identifi ca- tion might not be easy. Suppose, for example, that a monitored network is growing, but the collection system is not. The result is that critical data might be missed, which could be misleading.

There are still too many unanswered questions about the security of sensitive data leaving private enterprises.

Storage Display Sharing Analysis

SIEM 1 SIEM 2 SIEM 3 SIEM 4 SIEM n . . .

. . .

Common National

Functions

Local SIEM

Translation

SIEM

Foods

National Data Collection (National SIEM)

Figure 8.7 Generic national SIEM architecture.

Chapter 8 COLLECTION 157

Similarly, if a change is made to the underlying collection system, perhaps involving a new technology or vendor, then this could infl uence the trends presumably being observed.

At the simplest level, a trend involves some quantitative attri- bute going up (growth), going down (reduction), staying the same (leveling), or doing none of the above (unpredictability). When data jumps around, for example, it might not be easy to draw a conclusion; however, the fact that it is jumping around might itself be an important and useful conclusion. Perhaps the most common question infrastructure managers ask with respect to security is whether attacks are increasing, decreasing, or stay- ing the same with respect to some component in question. This question about attack trends is a favorite among CEOs and national legislators. It can only be answered accurately in the context of collected data.

As a concrete example, over a nine-month period from June 2006 to March 2007, a stable collection system embedded in a global service provider’s backbone detected an increase in behav- ior consistent with malicious bots. As was outlined in the fi rst chapter, a bot is a piece of software inserted into a target system, usually a broadband-connected PC, for malicious or question- able purposes. The bot might be used to attack some target, it might be used to send spam, or it might be used to steal personal information. The detection of bot behavior comes from collect- ing traffi c information for the purpose of identifying communi- cation between a number of end-user PCs and a smaller number of servers on the Internet.

By collecting evidence of bot behavior and rendering the results in a simple histogram, the growth of bots can be seen clearly, and local management decisions can be made accord- ingly (see Figure 8.8 ).

Most managers shown the growth trend in Figure 8.8 would conclude that bots represented an increasing threat during this time period; however, proper national infrastructure protection requires a more thorough analysis before any real conclusions are drawn. The following are some basic practical considerations that must be made by security analysts before the trend in any data collection chart can be trusted: ● Underlying collection —Amazingly, trend data such as that

shown in Figure 8.8 is often provided in the context of a collec- tion architecture that might be changing. For example, if a collec- tion system for bots is getting more accurate through algorithmic improvements or better coverage, then the observed growth in bots might simply refl ect a more effective use of detection tools.

Tracking trends may tell us whether adversarial attacks are increasing, decreasing, or staying the same.

Collected data must be analyzed to determine what it can accurately tell us about trends.

158 Chapter 8 COLLECTION

● Volunteered data —It is common for government organiza- tions to use data volunteered from commercial entities as the basis for drawing conclusions about trends. This can be dan- gerous, because weak or nonexistent controls are in place regarding how the information is collected and managed. It is also possible that data might be volunteered that is incorrect or tampered with for some malicious or mischievous purpose.

● Relevant coverage —The amount of coverage across the envi- ronment in which the data is collected will affect the validity of an observed trend. Suppose, for example, that a small orga- nization with an Internet connection uses that connection to draw conclusions about traffi c trends. This certainly would be a less attractive vantage point than a global Internet carrier making the same determination. These issues highlight the importance of national infrastruc-

ture managers taking a mature approach to the interpretation of collected data. This is especially important because trend infor- mation so often drives the allocation of critical resources and funding. At the national level, for example, experienced security experts can point to dozens of cases where some sort of trend is used to advance the case for the funding of an initiative. This often involves hype about the rise of some virus or worm.

The Confi cker worm, for example, reportedly included some sort of embedded attack that would occur on April 1, 2009. Confi cker

0.5M

1.0M

1.5M

2.0M

2.5M

3.0M

# Bots

07/06 08/06 09/06 10/06 11/06 12/06 01/07 02/07 03/07

Figure 8.8 Growth trend in botnet behavior over 9-month period (2006–2007).

Trends must be interpreted carefully before they are used to justify changes in funding levels.

Chapter 8 COLLECTION 159

was especially relevant—and still is—because its operation involved several million bots. This makes it one of the more potentially pow- erful botnets known to the security community. Most security experts understood that there was nothing in the Confi cker code to suggest such an event on that particular date, but predicted attack dates are convenient for attracting attention and are thus common. National infrastructure protection begs a more mature approach to the public interpretation of collected data.

Tracking a Worm Data collection provides an excellent means for tracking a worm. Recall that a worm is a program that does three things: (1) it fi nds network-visible computers that can accept a copy of the worm program, (2) it sends a copy of itself to one of the identifi ed network-visible machines, and (3) it initiates remote execution of the new remote instance of the program on the network-visible target. This starts a chain reaction in which the identifying, copy- ing, and remote execution continue indefi nitely. By collecting network metadata while this is all happening, security analysts can generally determine what the worm is doing and how seri- ous the event might be. In the best possible cases, the collection might even provide hints that can be used to stop a worm from developing, which is obviously attractive for national infrastruc- ture security.

In 2003 and 2004, the Internet experienced an unusually large number of worm events. This was due primarily to the poor pro- cesses that were in place at the time for operating system and application-level software patching. This patching problem was true for both enterprise systems and home broadband users. During this time period, one worm after another seemed to rage across the Internet, and most observers viewed these events as largely spontaneous; that is, the general consensus was that worms would spread in just a few minutes, and that data col- lection was useless. If a worm was going to get you, the thinking went, it would get you fast, and there was nothing you could do in advance to stop the event.

The reality of the situation was actually more subtle. The SQL/Slammer worm of January 2003, for example, was one that appeared to have a spontaneous impact on traffi c. In the minutes during which the worm appeared to have spread signifi cantly, packets of User Datagram Protocol (UDP) traffi c went from small, predictable volumes with few anomalies to an immedi- ately spiked upward volume. On fi rst glance, this happened in a

Collecting network metadata allows security analysts to track a worm’s progress and predict its course.

160 Chapter 8 COLLECTION

manner that suggested no warnings, no time for preparation, and no time for incident response (see Figure 8.9 ).

The spike in packet volume due to the SQL/Slammer worm certainly appeared to be immediate and without warning. Upon much closer examination, however, one fi nds that the UDP data leading up to this event might have carried some indications and warning value from a security perspective. In particular, starting in early January 2003, UDP volumes on the specifi c SQL port used by the worm were displaying anomalous behavior. On January 2, 2003, the fi rst spike occurred, and this was followed by three weeks of similarly odd behavior. While it might be a stretch to absolutely conclude that these odd spikes were early attempts at producing a worm, no one can argue that they suggested a serious change in UDP behavior on the Internet (see Figure 8.10 ).

The suggestion here is that a more detailed inspection of UDP behavior on the SQL port before the SQL/Slammer worm achieved its aim could have given valuable data to security engineers. In particular, the vulnerability exploited by the SQL/ Slammer worm was known at the time, although most secu- rity managers were lax to install the patch. If the information in Figure 8.10 had been widely disseminated at the time, then any- one wise enough to heed the warning and install the patch would have been immune from the SQL/Slammer worm. The implica- tions of this situation should be obvious from the perspective of national infrastructure protection.

Collecting and analyzing data are important steps; the next is acting on the data in a timely manner.

SQL/Slammer Worm Released

(01/25/03)

Small Baseline

Normal UDP

Traffic Volume

Dramatic Spike

in UDP Activity

Across Internet

UDP Packets

(Internet

Backbone)

"Coarse View"

Figure 8.9 Coarse view of UDP traffi c spike from SQL/Slammer worm. (Figure courtesy of Dave Gross and Brian Rexroad.)

Chapter 8 COLLECTION 161

National Collection Program Implementing a program of national data collection for infra- structure security will require a combination of public outreach initiatives before any large-scale structures can be put in place. The citizenry and business community must fully understand the purpose, usage, and controls associated with a collection sys- tem. Mechanisms for preventing privacy abuses must be para- mount in the discussion and embedded into any architecture that might be proposed. The specifi cs of how this debate might be infl uenced are beyond the scope of this book, but it goes with- out saying that no national collection program can be successful without this requisite step.

Once general acceptance has been obtained for the creation of a national data collection program, the following technical and architectural issues must be addressed: ● Data sources —National attention is required to defi ne which

data sources are deemed valuable for providing security infor- mation to the broad collection function. Important main- frames and servers in organizations and agencies charged with infrastructure protection would seem the most obvious to include. End-user PCs owned and operated by private citi- zens would probably be the most diffi cult to include.

SQL/Slammer Worm Released

(01/25/03)

Various Spikes in

Traffic Volume Before

Worm is Successful Dramatic Spike

in UDP Activity

on SQL Port

Across Internet

UDP Packets on

SQL Port

(Internet

Backbone)

"Fine View"

Figure 8.10 Fine view of UDP traffi c spike due to SQL/Slammer worm. (Figure courtesy of Dave Gross and Brian Rexroad.)

A successful national data collection program must address the concerns of citizens and the business community regarding protection of private data.

162 Chapter 8 COLLECTION

● Protected transit —Security-relevant data collected from iden- tifi ed sources would need to be transmitted via suitable net- works with suffi cient encryption. Sizing consideration could dictate limits on the amount of information that could be pulled from a particular source.

● Storage considerations —The amount of information collected is obviously controllable, but the appetite for data from secu- rity analysts is usually unlimited. As such, pressure would exist to maximize the amount of information stored, as well as the length of time the data is available for analysis.

● Data reduction emphasis —Across the entire national initiative for data collection, time and energy should be directed toward reducing the amount of data being handled. Obviously, this is critical if a given collection method should inadvertently grab more information than is needed or might include informa- tion that has no relevance to the security challenge. While each of these issues represents a technical challenge,

particularly in terms of sizing and scaling, they can be com- bined into a reasonable system if engineered properly. The over- all approach will benefi t from stepwise refi nement methods that start with a tractable subset of data sources initially which gradu- ally increases with time.

A planned, stepwise approach to national data collection could create a system that would be of immense value in the quest to protect our national infrastructure.

163 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00009-3 2011

CORRELATION A benefit of anomaly detection is that it can potentially recognize unforeseen attacks. A limitation is that it can be hard to distinguish normal from abnormal behavior .

Dorothy Denning 1

Computer and network security experts understand that cor- relation is one of the most powerful analytic methods available for threat investigation. Intrusion detection systems, for exam- ple, are only useful when the alarm streams that result from sig- nature or profi le-based processing can be correlated with data from other areas. When alarms are viewed in isolation, they are of only limited use. This limitation in processing alarms is directly related to the complexity of the target environment; that is, deci- sion makers in more complex environments will be more reliant on correlating collected data than in more limited environments. Proper national infrastructure protection is therefore highly dependent upon a coordinated program of information correla- tion from all available sources.

From a foundational perspective, four distinct analytic meth- ods are available for correlating cyber security information: profi le- based , signature-based , domain-based , and time-based correlation . Profi le-based correlation involves comparison of a normal profi le of target activity with observed patterns of activity. Presumably, if a substantive difference exists between normal and observed, this could signal a possible intrusion. Obviously, many situations exist where observed activity is not normal but does not signal an intru- sion. Websites running specials or supporting some limited-time engagement, for example, will see traffi c spikes during these peri- ods that do not match normal patterns. Nevertheless, anomalies with activity profi les are worthy of attention from a security per- spective (see Figure 9.1 ).

Signature-based correlation involves comparing a signature pattern of some known malicious condition to observed activity. If the two match, then high confi dence exists that an intrusion

9

1 D. Denning, Information Warfare and Security , Addison-Wesley, New York, 1999, p. 362.

Data in a vacuum is irrelevant; it must be compared with other data to determine its relevance and importance.

Comparing data determines what is normal and what is an anomaly.

164 Chapter 9 CORRELATION

is under way. The challenge is when observed activity shares characteristics with a signature but does not exactly match. This requires diligence from the security team to stay focused. Most signature-based correlation patterns involve some sequence of events, such as commands, which are defi ned as a discrete sig- nature, and comparison against logs of observed activity. For example, antivirus software, antispam algorithms, and intrusion detection systems all operate in this manner (see Figure 9.2 ).

Domain-based correlation involves comparing data from one domain with data collected in an entirely different con- text. Relevant differences in the data collection environments include computing environment, software architecture, network- ing technology, application profi les, and type of business being supported. For example, data collected by a power company about an attack could easily differ from data collected by a fed- eral civilian agency on the same incident. Similarly, two targets of a botnet attack could report different isolated views that could

Activity

Measure

Day 1 Day 2 Day 3 Day 4

Normal Profile

Observed Activity

Profile

Anomaly

(Starting Day 3)

Figure 9.1 Profi le-based activity anomaly.

Command

Activity Log

Day 4Day 3Day 2Day 1

Command V

Command B

Command A

Command X

Command Z

Command D

Command M

Command E

Command A

Command X

Command Y

Command Y

Command S

Command R

Command H

Command A

Command X

Command Y

Command Y

Attack Signature:

Command K

Command F

Command X

Command F

Command S

Command S

Command R

Command F

Command X

Command S

Command D

Command A

Command Q

Attack Signature Matches

Figure 9.2 Signature-based activity match.

Data comparison, especially from different domains, creates a clearer picture of adversary activity.

Chapter 9 CORRELATION 165

be correlated into a single common view. This requires a prear- ranged transport, collection, and analysis approach leading to a common correlated output (see Figure 9.3 ).

Time-based correlation compares data collected during one time period with data collected at a different time. This can involve the same or different data source but is obviously more effective if the data source is the same, because this removes one variable from the correlative analysis. Many types of attacks will not be time sensitive and are thus not well suited to this type of correlation; for example, a single break-in, during which malware is embedded in a target system, might not be a good candidate for time-based correlation. Attacks that are multistage, however, such as many “low and slow” approaches, are quite well suited to the approach. Botnet attacks are increasingly being designed by adversaries in this manner, with the distributed program attack- ing its target in a slower and more deliberate manner than via a single bombardment. Detection of such an event is well suited to time-based correlation, because potentially signifi cant time periods could exist between successive steps in an attack. Time- based correlation would be required to connect relevant steps and to fi lter out noisy, irrelevant activity (see Figure 9.4 ).

The essence of correlation in cyber security involves com- parison of various pieces of data to determine whether an intru- sion is under way. In the most desirable set of circumstances, this involves comparing two pieces of data for which every associ- ated, relevant attribute is the same except for one . Such a scenario allows the analyst to focus in on that one attribute. Time-based

Target A

Target B

CarriersBots Correlated

View

Domain-Specific Views from

A and B

Network

Manager B

Correlation of

Event Views from

A and B

Internet

Botnet Attack Aimed at Targets A and B

Network

Manager A

Figure 9.3 Domain-based correlation of a botnet attack at two targets.

Changes that appear over time may indicate a slowly building, deliberate attack.

166 Chapter 9 CORRELATION

correlation works nicely when the collection environment is exactly the same but the data is collected at different times. The analyst does not have to worry about whether changes in other factors are affecting the data, as only the time changes. In the most complex case, however, multiple pieces of data are col- lected from environments where the associated, relevant attri- butes differ. The analyst thus must juggle concerns about which attributes in which environments might be affecting the data. This greatly complicates the correlation task (see Figure 9.5 ).

Target A

CarriersBots Time-Correlated

View

Time

Domain-Specific Views from A

(Different Times)

Correlation of

Event Views from

A at Different Times

Internet

Botnet Attack Aimed at Target A

Network

Manager A

T0 T1 T2

Figure 9.4 Time-based correlation of a botnet attack.

Magnitude of Difference in Data

Collection Environment Attributes

Number of

"Pieces of Data"

to Be

Compared

2

4

6

Easiest to Correlate

Two Pieces of Data, Similar Attributes

Similar Diverse

Many Pieces of Data, Diverse Attributes

Hardest to Correlate

Figure 9.5 Taxonomy of correlation scenarios.

Chapter 9 CORRELATION 167

This data collection attribute taxonomy is important to national infrastructure, because most practical cases tend to be very complex cases that are diffi cult to correlate. Information that becomes available during an incident usually originates from wildly different sources with diverse collection methods, processing tools, network views, and so on. Worm events on the Internet, for example, are often observed with consider- able scrutiny by some organizations (perhaps with bad conse- quences), whereas other groups might not even notice that a security event is ongoing. Only the most mature correlation ana- lysts will have the ability to factor these differences in viewpoint into an accurate broad conclusion about security. To date, this has required experienced human beings with considerable train- ing. Additional research is required before dependable tools will be available to perform accurate correlation on multiple, diverse inputs.

Conventional Security Correlation Methods The current state of the practice in day-to-day network security correlation in existing national infrastructure settings is based on a technique known as threat management . In this approach, data aggregated from multiple sources is correlated to identify patterns, trends, and relationships. The overall approach relies on a security information and event management (SIEM) system for the underlying collection and aggregation of relevant data. A SIEM system does the best it can in terms of identifying correla- tion instances, using the best available algorithms for profi le, sig- nature, domain, and time-based analysis, subject to the practical limitations mentioned above. Four of the primary feeds into a typical commercially available SIEM tool for threat management are listed in the box.

We currently rely on human analysis of data across different domains and during different time periods; no software or program can factor in all relevant elements.

Information Feeds for SIEM Threat Management ● Firewall audit trails —Firewalls generate audit records when certain types of security-relevant events occur such as

denied connection requests. These records are of limited use in isolation but are often useful for correlation with other data. Other static information about a fi rewall, such as its inbound and outbound policy, is also important for correlation.

● Intrusion detection and prevention system alarms —Intrusion detection and prevention systems are designed specifi cally to generate alarm data when suspicious activity is observed. The problem is that it is not always easy to determine if something suspicious is truly malicious. Generally, correlation with other data is required to make this determination.

168 Chapter 9 CORRELATION

The interplay between the various security devices in a local threat management system is sometimes straightforward. If an intrusion detection system generates an alarm signaling some sort of problem involving a given Internet protocol (IP) source address and corresponding destination port, and if the local envi- ronment also allows inbound traffi c to this destination port, then the correlation process could generate a recommendation that the local fi rewall block either this source address or this port. Many commercial fi rewalls and intrusion detection systems pro- vide this capability today, although the reality is that many net- work managers do not make use of this type of protection. This is usually due to a lack of familiarity with the process, as well as a common lack of local knowledge about the egress and ingress traffi c through an enterprise gateway or perimeter. This is a shame, because when it is done properly the protection achieved can be quite powerful (see Figure 9.6 ).

The example shown above demonstrates the natural feedback loop that can occur when data is correlated—that is, as interpre- tation resulting from the correlation task is fed back to the fi re- wall as a new data input. This in turn affects processing and will eventually change the correlation function output. This feedback loop will cease when the resultant interpretations are no longer new and have no changes to report back to the fi rewall. Security managers often confi gure their intrusion detection systems to suppress output when this steady-state condition occurs. This reduces operator burden but great care must be taken to ensure that valid indicators are not being missed.

The correlation function can extend to different parts of the same organization with different networks, servers, applications, and management groups. Surprisingly, many correlation activities

Many security managers underutilize the commercial fi rewalls at their disposal.

Exercise caution in suppressing output once a steady-state condition has been achieved; otherwise, valid indicators may be missed.

● Operating system or application logs —Output log fi les generated by activity on an operating system or software application can provide useful indications and warnings for security. The fi rst step in forensics, for example, involves examination of log fi les for evidence. (Good hackers know not to leave such obvious tracks, of course.) In addition to logs, the specifi c attributes of the operating system and application are also important for correlation. This can include version, vendor, and confi guration data.

● Network device metadata —Information about network behavior is quickly becoming recognized by cyber security experts as possibly being the most powerful tool available for threat management. Metadata showing source and destination information about addresses and ports, as well as information about protocol, direction of fl ow, and status of protocol fl ags and settings, gives security analysts a view into network activity unavailable through any other means.

Chapter 9 CORRELATION 169

are complicated by such decentralization. To illustrate, suppose that two groups in a company experience similar security problems. The root-cause data from each group should be correlated toward the optimal interpretation. If, for example, each group found simi- lar malware in their systems, then this observation could signal the source of the attack, such as a common software vendor. This fact might not be easy to determine by either group in isolation.

Quality and Reliability Issues in Data Correlation To create a proper security correlation process for national infrastructure protection in an environment of large, cross- organizational size, scope, and scale, several technical and opera- tional factors must be considered. The most important such con- siderations involve the quality and reliability of the data sources. This calls into question any national-level initiative for which these attributes cannot be controlled.

Regarding the quality of data, the best situation involves a service level agreement between the data source and correla- tion function. Managed security services are useful, because the provider will ensure that data quality exists within well-defi ned parameters. When data originates from a mix of organizations with no service level agreements, the potential exists for inaccu- rate, misleading, or invalid data to be made available. This can only be dealt with by automated or human fi ltering in which the data source and attributes are factored into the analysis. This is troublesome when correlation relies on information volunteered across the Internet. Grass roots efforts to collect volunteered data will always have an issue with guaranteed data quality.

A similar concern exists with the reliability of a data source, especially for volunteered feeds. When data is important for

Alarm

Stream

Intrusion

DetectionFirewall

Correlation

Function

Adjust Policy Based on Alarms

Perimeter Network

Internet Firewall

Policy

Figure 9.6 Correlating intrusion detection alarms with fi rewall policy rules.

Service level agreements help guarantee quality of data.

170 Chapter 9 CORRELATION

regular analysis, perhaps based on a profi le, its continued reli- ability is essential; for example, if a data stream experiences gaps or changes, perhaps at the whim of the feed source owner, this could easily confuse the correlation process. Gaps, in particu- lar, make it tough to match observed activity against the desired patterns. This issue is especially diffi cult to manage when data is being volunteered by varied sources. Thus, in addition to qual- ity issues, correlation based on any imperfect collection process, including the use of volunteered data, will also face inherent challenges related to reliability (see Figure 9.7 ).

Many national initiatives today rely on data sources agree- ing to provide data on a best effort basis. These initiatives must be viewed with great suspicion, because the conclusions being drawn will be based on a subset of relevant data. This includes initiatives where participants send intrusion detection alarms, highly sampled netfl ow summaries, and log fi les. If the delivery is not consistent, predictable, and guaranteed, then the correla- tion result is suspect; for example, attack signature patterns can be missed, profi les can be incorrectly matched or missed, and so on. National infrastructure managers should thus only collect data that is associated with a consistent service level agreement.

Correlating Data to Detect a Worm Network service providers have a particularly good vantage point for correlating data across multiple companies, agencies, groups, individuals, and regions. All government, business, and con- sumer traffi c must traverse a provider backbone at some point,

Due to limited oversight of volunteered data, its quality and reliability cannot be guaranteed.

Without consistent, predictable, and guaranteed data delivery, correlations are likely to be incorrect and data is likely missing.

AttackAttack

Real Activity:

Reported Activity:

Result: No Attack Pattern ( ) Detected

X0 X1 X0 X3 X1 X0 X1 X0 X2 X1

X0 X0

X0 X1 X0

X0 X1 X0

X3 X0 X1 X2 X1

Imperfect Data Collection Process

Data Correlation Process (Signature-Based)

Attack Signature:

Figure 9.7 Incorrect correlation result due to imperfect collection.

Chapter 9 CORRELATION 171

so this becomes an excellent source of correlation information. Obviously, if this is done, great care must be taken to ensure full compliance with applicable laws with a deep respect for privacy considerations. The effort is worth the time, because service pro- viders collecting netfl ow information on a broad scale can gen- erally correlate observed activity with known patterns to detect large-scale events such as worms. This is typically done with greater accuracy than existing computer and network security techniques using fi rewalls and intrusion detection systems.

As an illustration, consider that security devices such as intru- sion detection systems are put in place to detect worms and viruses. Unfortunately, many worms and viruses are not so easy for an intrusion detection system to detect. The Nachi worm is such an example; it raged across the Internet during the summer of 2003, using the Internet Control Messaging Protocol (ICMP) as one of its mechanisms for operation. Some speculate that the worm was actually intended to fi nd infected systems on the Internet and go fi x them. What happened instead was that the ICMP packet fl ows got out of hand, which is the main reason why this worm caused more damage than perhaps had been intended by its designer.

Most intrusion detection systems were not set up well to detect this problem, because an intrusion detection system is typically not interested in changes to some service port. In con- trast, any network system that was monitoring ICMP fl ows would see that something was amiss. On one service provider’s back- bone this increase was evident as the Nachi worm began to oper- ate. By simply counting ICMP packets crossing gateways on its backbone, the provider could quickly see the spike in traffi c fl ows due to the worm across several key network gateways. The resul- tant time-based correlation of collected ICMP data over several hours revealed the impending worm event (see Figure 9.8 ).

One might conclude from the above example that by monitoring broad network traffi c collected across organizations a much more accurate security picture can be drawn. A complementary conclu- sion that can be drawn from this example is that the network ser- vice provider clearly plays a key role in the detection of large-scale attacks. Over the past decade, so much security responsibility has been distributed to end users and organizational managers that no common strategy exists for infrastructure protection. Instead, when a problem occurs, all vulnerable endpoints must scramble to determine a suitable means of addressing it, and this can involve confl icting approaches. One group might choose to ignore and drop all packets associated with an attack, whereas another group might choose to collect, process, and send responses to the sources

Network service providers are in a unique position to collect information across multiple venues.

Network service providers have unique views of network activity that allow them to see when something is amiss.

172 Chapter 9 CORRELATION

of attack packets. This distribution of security implies that national infrastructure protection should include some degree of centralized operations. For large-scale network service, this can only be reason- ably managed by the service provider.

Correlating Data to Detect a Botnet The most insidious type of attack one fi nds today in any large- scale, distributed, Internet-connected network environment is the botnet. The way a botnet works is that an attacker rounds up a collection of Internet-connected computers to be used as bots; these computers are generally PCs attached to some home broadband service and are generally poorly administered by the home user. Such improper system administration allows for easy insertion of malware, perhaps through fi shing or other social engineering means.

Once the bots have been confi gured with suitable malware, they are commanded by a series of bot controllers located around the Internet. These controllers generally utilize some familiar pro- tocol such as Internet Relay Chat (IRC) simply for convenience, although they could certainly use any sort of communication protocol to interact with their bots. The idea is that the controller commands the bots to perform an attack task aimed at a target predetermined by the botnet operator. This works to the advan- tage of the attacker, because the bots are generally distributed across a broad geographic spectrum, and their bandwidth capac- ity might be substantive when viewed as a collective capability.

A botnet uses home-based PCs to distribute an attack.

Several Hour Period for Initiation of Nachi Worm of 2003

Key Service

Provider Gateways

Relatively Insignificant

Change

Obviously Dramatic

Change in ICMP Count

ICMP

Packet

Count

0Snapshot at Time T 1Snapshot at Time T 2Snapshot at Time T

Figure 9.8 Time-based correlation to detect worm.

Chapter 9 CORRELATION 173

If two bots can generate 1 Mbps of attack traffi c, then a target with a 1-Gbps inbound connection can be fi lled up by 2000 bots, which turns out to be a modestly sized botnet. Following this logic, a much larger botnet, perhaps with hundreds of thou- sands or even millions of bots, can be viewed as a particularly substantive problem for national infrastructure that requires attention. The correlation issue in this case is that no single end- point will have a suitable vantage point to determine the size, scope, or intensity of a given botnet. One might suggest that the only reasonable chance one has of actually performing the proper correlation relative to a botnet is in the context of carrier infrastructure.

Steps for Botnet Detection The steps involved in the detection of a botnet via correlative analysis by a network carrier are roughly as follows:

1. Broad data collection —The detection of a botnet requires a broad enough vantage point for collecting data from both broadband-connected PCs as well as enterprise servers visible to the Internet. The type of information needed is essentially netfl ow-type metadata, including source, destination, and traffi c types.

2. One-to-many communication correlation —From the collected data, the correlative analysis must focus on identifying the typical one-to-many fan-out pattern found in a distributed botnet. This pattern can include several botnet controllers, so multiple one-to-many relations typically overlap in a botnet.

3. Geographic location correlation —It is helpful to match up the bots and controllers to a geographic location using the associated IP address. This does not provide pinpoint accuracy, but it offers a general sense of where the bots and controllers are located.

4. Vigilant activity watch —The security analysis should include close, vigilant watch of activity from the bots and servers. The most important activity to be identifi ed would be a distributed attack from the bots to some target.

The steps in the box above allow for the construction of a logical map of a botnet, showing the geographic locations of the bots, their associated service provider (usually a local broadband carrier), the set of servers used as botnet controllers, and a gen- eral depiction of any relevant activity. Typical activity found in a botnet includes recruitment of new bots, as well as attacks from the bots toward some designated target (see Figure 9.9 ).

The botnet diagram demonstrates some of the conclusions that can be drawn immediately from such an analysis. The typi- cal pattern of bot clumping that one fi nds in a botnet might give hints as to the type of social engineering or lure used to drop malware onto the target PCs. Useful hints might also be gathered from regions where the botnet seems to have gathered no bots.

Botnets can have a far- reaching geographic distribution.

174 Chapter 9 CORRELATION

One area where correlative analysis is often not useful is trying to determine correlations between the geographic locations of bot- net controllers. This generally results in no useful information, as botnet controllers tend to be scattered across the globe, driven by opportunistic hacking.

It goes without saying that national infrastructure protection requires the real-time capability to monitor botnet confi gura- tion and activity. The risk of botnets has grown so much in recent years partly because they have been able to exist under the radar of most government and commercial organizations. The fi rst step in reducing this risk involves the creation of a national capabil- ity to collect information about botnets and to advise the par- ticipants on how best to avoid being either duped into hacking someone else or directly targeted for an attack.

Large-Scale Correlation Process For national infrastructure protection, large-scale correlation of all-source data by organizations with a broad vantage point is complicated by several technical, operational, and business fac- tors, including the following: ● Data formats —Individual national asset environments will

most likely collect data in incompatible formats due to a lack of standards in security data collection tools. As a result,

Disseminating information about botnet tactics may help consumers avoid future lures.

Bot

Controller

No Geographic

Correlation

Between

Controllers

Regional Bot Clumping No Bots in Region

Figure 9.9 Correlative depiction of a typical botnet.

Chapter 9 CORRELATION 175

almost all security-relevant data is collected in a proprietary or locally defi ned format. This represents a signifi cant chal- lenge for any large-scale collection from multiple sources.

● Collection targets —Individual asset environments will likely collect data from different types of events and triggers. Some, for example, might collect detailed information about net- works and only limited information from systems, whereas others might do the opposite. This obviously complicates the comparison of aggregated data from multiple sources.

● Competition —Various commercial groups collecting relevant data might be in direct business competition. (Most govern- ment groups will admit to their share of mutual competition as well.) This competitive profi le implies that any aggregated information and any interpretation that would result from correlative analysis must be carefully protected and associ- ated with suitable anonymity. To deal with these challenges on a large scale, a deliberate

correlation process must be employed. The process must break down each component of the correlation task into discrete enti- ties with well-defi ned inputs and outputs. This process is best viewed in aggregate as consisting of fi ve different passes leading from collected data to actionable information (see Figure 9.10 ).

Collected Data

For Security Action

Feedback Loop

Pass 1 Pass 2 Pass 3 Pass 4 Pass 5

Source 1

Source 2

Source 3

Source n

Correlation Engine Components

Resolve Data

Formats

Level Data

Types

Compare Data

Attributes

Store and Protect Output

Interpret and

Filter

Correlation Back End

Figure 9.10 Large-scale, multipass correlation process with feedback.

Large-scale data correlation initiatives must overcome challenges posed by competition, incompatible data formats, and differing collection targets.

176 Chapter 9 CORRELATION

National Correlation Program Implementation of a national correlation program is likely to follow two specifi c directions. First, steps might be taken to encourage individual organizations with national infrastructure responsibility to create and follow a local program of data corre- lation. This can be done by embedding correlation requirements into standard audit and certifi cation standards, as well as within any program solicitations for government-related infrastructure

Data collection can be encouraged by making it a requirement of contracted government- related projects.

Five Passes Leading to Actionable Information 1. The fi rst pass in this process schema involves resolution of all incompatible data formats from the different sources.

In addition to the data generated by familiar security devices, these inputs can also include human-generated data that could be obtained through telephony or even social processes. The resolution must be automated via fi lters that produce a common output. Amazingly, very little work has been done in the computer security community to standardize relevant formats.

2. The second pass in the schema involves a leveling of the various types of data collected. The most common task in this pass is to categorize similar data into the appropriate set of categories. This must be done because different organizations routinely refer to the same security-relevant events by different names. Commercial tools also tend to refer to the same attacks by different names and alarm types. Large-scale correlation thus requires a common understanding of the semantics associated with activity of interest. Small-scale analysis methodologies using a common threat management tool from one vendor can skip this step; large-scale analysis from multiple, diverse sources cannot.

3. The third pass involves the actual comparison of collected data to relevant attributes. Computer security experts often refer to this pass itself as correlation. This pass is where security algorithms for profi le, signature, domain, and time-based correlation are incorporated into the analysis. It typically involves a combination of automated processing using tools, with the interpretation of human experts. In the best case, this pass in the process occurs rapidly, almost in real time, but the reality is that the analysis step can take considerable time in the most complex scenarios. This pass, along with the fi rst two passes, can be viewed collectively as the correlation engine .

4. The fourth pass involves storage and protection of the output. This is likely to include interpretation of the data once it has been aggregated and compared. Insights are often evident at this stage of the process, and these can be represented as either deliberately stored information in a database or simply as information known to security analysts involved in the overall process. In either case, the information must be protected. For large-scale applications, the size of the information collected can be massive, which implies that special database technology with the ability to scale might be required.

5. The fi fth and last pass in the process involves fi ltering and dissemination of the information. This might result in a feedback loop where output recommendations become input to a new series of fi ve correlation passes. Alternatively, it can be used by appropriate parties for immediate action such as real-time incident response. This pass, along with the storage pass, can be viewed collectively as the correlation back end .

Chapter 9 CORRELATION 177

work. The likelihood of success of this approach is high and is thus recommended for immediate adoption at the national pol- icy level.

Second, national-level programs might be created to try to correlate collected data at the highest level from all available sources. This approach is much more challenging and requires addressing the following technical and operational issues: ● Transparent operations —The analysis approach used for

correlation should be fully known to all participants. Thus, whether profi les, signatures, or the like are used, the process should be clearly explained and demonstrated. This will allow participants to help improve such aspects of the process as data feed provision, data reduction algorithms, and back-end interpretation.

● Guaranteed data feeds —Any participant providing data to the correlation process must be held to a guaranteed service level. Obviously, this level can change but only under controlled conditions that can be factored into the analysis. Without such guarantees, correlation algorithms will not work.

● Clearly defi ned value proposition —Participants should rec- ognize a clearly defi ned value proposition for their provision of data. The worst situation involves a “black hole” collection process where the output recommendations from the correla- tion activity are not generally shared.

● Focus on situational awareness —The output of the process should certainly be action oriented but should also recognize the limitations inherent in broad correlation. It is unlikely that any national-level correlation function will be able to give a real-time silver bullet to any participant. More likely, the out- put will provide situational awareness that will help in the interpretation or response to an event. By addressing these issues, the technical and operational feasi-

bility of a successful, national-level correlation function increases dramatically. Unfortunately, many legal, social, and political issues—considered outside the general scope of this book—will complicate the creation of such a function.

This page intentionally left blank

179 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00010-X 2011

AWARENESS Intelligence, the information and knowledge about an adversary obtained through observation, investigation, analysis, or under- standing, is the product that provides battlespace awareness .

Edward Waltz 1

Situational awareness refers to the collective real-time under- standing within an organization of its security risk posture. Security risk measures the likelihood that an attack might pro- duce signifi cant consequences to some set of locally valued assets. A major challenge is that the factors affecting security risk are often not locally controlled and are often deliberately obscured by an adversary. To optimize situation awareness, con- siderable time, effort, and even creativity must be expended. Sadly, most existing companies and agencies with responsibility for national infrastructure have little or no discipline in this area. This is surprising, as a common question asked by senior leader- ship is whether the organization is experiencing a security risk or is “under attack” at a given time.

Awareness of security posture requires consideration of sev- eral technical, operational, business, and external or global fac- tors. These include the following: ● Known vulnerabilities —Detailed knowledge of relevant vul-

nerabilities from vendors, service providers, government, academia, and the hacking community is essential to effec- tive situational awareness. Specifi c events such as prominent hacking conferences are often a rich source of new vulnerabil- ity data.

● Security infrastructure —Understanding the state of all active security components in the local environment is required for proper situational awareness. This includes knowledge of security software versions for integrity management and anti-malware processing, signature deployments for security devices such as intrusion detection systems, and monitoring

10

1 E. Waltz, Information Warfare: Principles and Operations , Artech House, Norwood, MA, 1998.

Consider attending a hacking conference to learn more about potential vulnerabilities.

180 Chapter 10 AWARENESS

status for any types of security collection and processing systems.

● Network and computing architecture —Knowledge of network and computing architecture is also important to understand- ing an organization’s situational security posture. An accurate catalog of all inbound and outbound services through exter- nal gateways is particularly important during an incident that might be exploiting specifi c ports or protocols.

● Business environment —Security posture is directly related to business activities such as new product launches, new proj- ect initiation, public relations press releases, executive action involving anything even mildly controversial, and especially any business failures. Any types of contract negotiations between management and employee bases have a direct impact on the local situational security status.

● Global threats —Any political or global threats that might be present at a given time will certainly have an impact on an organization’s situational security posture. This must be mon- itored carefully in regions where an organization might have created a partnership or outsourcing arrangement. Because outsourcing tends to occur in regions that are remote to the organization, a global threat posture has become more signifi cant.

● Hardware and software profi les —An accurate view of all hardware and software currently in place in the organization is also essential to situational awareness. A common prob- lem involves running some product version that is too old to properly secure through a program of patching or security enhancement. A corresponding problem involves systems that are too new to properly characterize their robustness against attack. In practice, an optimal period of product oper- ation emerges between the earliest installation period, when a product or system is brand new, and the latter stages of deployment, when formal support from a vendor might have lapsed (see Figure 10.1 ). Each of these factors presents a set of unique challenges for

security teams. An emerging global confl ict, for example, will probably have nothing to do with the vulnerability profi le of soft- ware running locally in an enterprise. There are, however, clear dependencies that arise between factors in practice and will improve situational awareness. For example, when vulnerabili- ties are reported by a hacking group, the organization’s security posture will depend on its local hardware, software, and security infrastructure profi le. As a result, it is generally reasonable for an organization to combine the value of all situational status factors

The increase in global outsourcing requires awareness of how international political events may impact your vendors.

Chapter 10 AWARENESS 181

into one generic measure of its security posture. This measure should be able to provide a rough estimate of the broad, orga- nizational security risk at a given time. It should then weigh the likelihood and potential consequences of serious attack against the normal, everyday level of risk that an organization lives with every day. Presumably, risk on a day-to-day basis should be lower than during a serious incident, so it stands to reason that a rough metric could capture this status, perhaps as a high, medium, and low risk characterization (see Figure 10.2 ).

Unfortunately, the public perception of categorizing high, medium, and low security risks is that it does not provide use- ful information. This is certainly true for such measures as the public threat metric, which was used previously by the U.S. Department of Homeland Security to characterize risk. The prob- lem with this metric was that it dictated no concrete actions to be taken by citizens. If risk was characterized as low, citizens were

System Component Lifecycle Phase

Period of Optimal Usage

Brand

New

Freshly

Used

Familiar,

Mature

Slightly

Outdated

Not

Supported

High

Medium

Level of

Optimal

Use

Low

Figure 10.1 Optimal period of system usage for cyber security.

Active Attack to an

Essential Service

Ordinary Risk to all

Essential Services

Each Value Dictates a

Specific Response Action

Real-Time

High Medium Low

Figure 10.2 Rough dashboard estimate of cyber security posture.

Factoring in all elements of situational awareness and any related challenges should create an overview of an organization’s current security risk.

182 Chapter 10 AWARENESS

warned to remain vigilant and on guard; if risk was characterized as medium or even high, the advice was essentially the same. Citizens were told to go on with their normal lives, but to be somehow more careful. Obviously, this type of advice causes con- fusion and is to be avoided in national infrastructure protection.

The only way a posture metric can be useful is if it is driven by real-time events and is connected directly to an explicit inci- dent response program. When this is done, an ongoing rhythm develops where the situational status helps direct security man- agement activity. This could involve some serious fl aw being detected in an organization (which would drive the threat level upward), followed by detection of a real exploit in the wild (which would drive the threat level further upward), followed by a patch activity that fi xes the problem (which would drive the threat level back down) (see Figure 10.3 ).

Regardless of public perception with respect to previous gov- ernment threat metrics, any program of situational awareness for cyber security must include a broad characterization of real-time risk. The attributes of this broad characterization will be based on a much more detailed understanding of the real-time posture. Collectively, this posture is referred to as situational awareness and is based on an understanding of whether or not the infra- structure is under attack, which vulnerabilities are relevant to the local infrastructure, what sort of intelligence is available, the out- put of a risk management process, and information being gener- ated by a real-time security operations center. These elements are described in the sections that follow.

Most Vulnerable Period

(Threat at High)

Analysis Period

(Threat at Medium) Patch Deployed Over Time Period

(Threat to Low)

Evidence of

Product Exploit in Wild

(Threat to High) Flaw Found in Product

Used by Organization

(Threat to Medium)

Medium Threat

High Threat

Low Threat

Figure 10.3 Security posture changes based on activity and response.

Descriptors such as high, medium, and low to describe security risk are too vague to be helpful.

Security risk levels should be set to correlate with actionable items.

Chapter 10 AWARENESS 183

Detecting Infrastructure Attacks The process of determining whether an attack on national infra- structure is under way is much more diffi cult than it sounds. On the surface, one would expect that, by observing key indicators, making the determination that an attack has begun or is ongoing would seem straightforward. Correlating observed activity with profi les, signatures, and the like can provide a strong algorithmic basis, and products such as intrusion detection systems offer a means for implementation. These factors are misleading, how- ever, and the truth is that no security task is more diffi cult and complex than the detection of an ongoing attack, especially if the adversary is skilled.

To illustrate this challenge, suppose you notice that an impor- tant server is running in a somewhat sluggish manner, but you cannot diagnose the problem or explain why it is occurring. Obviously, this is suspicious and could be an indicator that your server has been attacked, but you cannot state this with any certainty. There could be a million reasons why a server is run- ning slowly, and the vast majority of them have nothing to do with security. Suppose, however, that you discover a recently installed directory on the server that is fi lled with unfamiliar, strange-looking fi les. This will clearly raise your suspicion higher, but there are still numerous explanations that do not signal an attack. Perhaps, fi nally, someone in the enterprise steps forward and admits to running some sort of benign test on the server, thus explaining all of the errant conditions. The point is that con- fi dence that a target is under attack will rise and fall, depending on the specifi cs of what is being observed. Obviously, there is a threshold at which the confi dence level is suffi ciently high in either direction to make a sound determination. In many prac- tical cases, analysis never leads to such a confi dence threshold, especially in complex national infrastructure environments (see Figure 10.4 ).

In our example, you eventually became confi dent that no attack was under way, but many scenarios are not terminated so cleanly. Instead, events expose a continuing stream of ongoing information that can have a positive, negative, or neutral effect on determining what is actually going on. In many cases, infor- mation that is incorrect or improperly interpreted has the effect of confusing the process. Relatively new technologies, such as mobile wireless services, tend to exhibit this property, especially in cases where a particular incident has never been seen before. The primary disadvantage of never determining the root cause of an attack is that the security posture cannot be accurately

There are many tools for detecting attacks, yet no single tool is comprehensive or foolproof.

184 Chapter 10 AWARENESS

measured. This is especially troublesome when the attack is severe and targets essential national infrastructure services.

Managing Vulnerability Information A common cynical view of computer security is that its experts are engaged in nothing more than a game of trivial pursuit around attack and vulnerability information. Support for this view is evident in the security books published to date, most of which contain page after page of esoteric attack specifi cs that are often long-since irrelevant. It is also evident in social circles at security and hacking conferences, where the discussion rarely addresses foundational topics of software engineering or system design but instead focuses on such trivia as which systems have which bugs in which versions on which hardware. Some security experts and hackers have become walking encyclopedias of such knowledge, even viewing information as the driver of power and skill. Anyone not possessing suffi ciently detailed knowledge is thus tagged a newbie, lamer, or perhaps worse—a manager .

In spite of this odd phenomenon, situational awareness for national infrastructure protection does require a degree of atten- tiveness to daily trivia around vulnerability information. We refer to the information as trivia simply because, once addressed and fi xed, the value of the information drops very close to zero. Nevertheless, it is important information to collect, and most national infrastructure teams use the default approach of active opportunism , where a set amount of effort is expended to gather

Determination of security risk level is a fl uid process; it changes as new information is revealed or as situations change.

Neutral

High

Confidence

of Attack

High

Confidence

of Non-Attack

Increased Confidence

Attack is Under way Additional Confidence

Required for Certainty

Event Explained

(No Attack)

Moderate Confidence

Attack is Under way

Event 1

(Server Overload)

Event 2

(Strange Files)

Event 3

(Insider Admits to Testing)

Figure 10.4 Attack confi dence changes based on events.

Chapter 10 AWARENESS 185

as much data as possible and anything else that comes in is wel- comed. The problem with active opportunism is that it will never be complete and cannot be depended upon for accurate man- agement decisions. For example, the question of whether a given vulnerability has been coded into an exploit and made available on the Internet can be researched by one, two, or 50 people. If no evidence of such an exploit is found, then the weak conclusion can be drawn that it does not exist. Obviously, information about the vulnerability could be tucked away in some IRC discussion or on an obscure hacking site, but unless it is found or volunteered the security team will never know for sure.

The best one can hope for is to create as active and complete a vulnerability information-gathering process as possible. See the box for practical heuristics that have been useful for infrastruc- ture protection in the past.

Practical Heuristics for Managing Vulnerability Information

● Structured collection —The root of all vulnerability management processes must be some sort of structured collection approach with means for assuring proper delivery of information, validating the source, cataloguing the information in a suitable taxonomy, and maintaining a useful database for real-time reference with provision for indexing and crawling vulnerability data in real-time. This structured approach should be integrated into all day-to-day cyber security activities so that accurate vulnerability information is available across the entire security infrastructure and team. Filters should exist to assure incoming data, as well as to ensure that external entities only obtain appropriate information (see Figure 10.5 ).

● Worst case assumptions —Many situations arise where a security team cannot determine whether some important piece of vulnerability-related information has actually been disclosed or has become known to an adversary group. The most mature and healthy approach in such scenarios is to assume the worst possible case. Most experts would agree that if the possibility arises that some vulnerability might be known externally, then it probably is known.

● Nondefi nitive conclusions —Making defi nitive statements about national infrastructure security is not recommended. Too many cases exist where a security team draws the confi dent conclusion that a system is secure only to later obtain vulnerability-related information to the contrary. Experienced managers understand, for example, that they should always include caveats in security posture reports given to senior leaders in government or industry.

● Connection to all sources —Managing vulnerability information should include connections to all possible sources such as industry groups, vulnerability-reporting services, hacking conferences, internal employee reports, and customer data. Sometimes the most critical piece of vulnerability information comes from the most unlikely source.

Collecting daily trivia around vulnerability information should not be dismissed as unimportant but should be considered one of many methods of achieving situational awareness.

186 Chapter 10 AWARENESS

Following the heuristics listed in the box above will help to ensure that the best available data is collected, stored, and used, but these heuristics can never provide assurance that the vulner- ability management process is perfect. Instead, managers are strongly advised to follow three basic rules: (1) always assume that the adversary knows as much or more about your infrastruc- ture than you do, (2) assume that the adversary is always keeping vulnerability-related secrets from you, and (3) never assume that you know everything relevant to the security of your infrastruc- ture. Such complete knowledge is unattainable in large, complex national infrastructure settings.

Cyber Security Intelligence Reports A technique commonly used in government intelligence commu- nity environments, but almost never in most enterprise settings, involves the creation and use of a regularly published (usually daily) intelligence report. For cyber security, such a report gener- ally includes security-related metrics, indicators, attack-related information, root-cause analysis, and so on for a designated period. It is typically provided to senior management, as well as all decision-makers on the security and infrastructure teams. The report should also be indexed for searches on current and previ- ous information, although this is not a common practice.

Although the frequency and content of intelligence reports should be tailored to the needs of the local environment, some

Daily cyber security intelligence reports that are standard in government agencies would be equally useful in enterprise settings.

Vulnerability

Data Sources

Internal Organizational Boundry

Internal

Security

Team

External

Entities

Vulnerability

Database

Index

Assured

Request

Crawl

Look-Up

Export Raw

Raw

Raw

Information

Sharing

Filter

Source

Validation

and

Delivery

Assurance

Figure 10.5 Vulnerability management structure.

Chapter 10 AWARENESS 187

types of information that one would expect in any daily intelli- gence report include the following: ● Current security posture —The situational status of the cur-

rent security risk would be required in any intelligence report, especially one issued over a daily or weekly interval (monthly intervals create too long a gap for information to be consid- ered “intelligence”).

● Top and new security risks —Characterization of the top risks, as well as any new risks, is also important to include in an intelligence report. Visualization and other techniques are often helpful to highlight changes in risk posture.

● Automated metrics —Security systems that generate metrics should provide input to the intelligence report, but care must be taken to avoid the creation of a voluminous document that no one will read. Also, raw output from some devices is indis- cernible and should be either summarized or avoided in the report.

● Human interpretation —Ultimately, the most useful cyber security intelligence includes analysis by experienced and expert human beings who can interpret available security data and recommend suitable action plans. It is unlikely that this interpretation function will be automated in the near future. The activity associated with the realization of a cyber secu-

rity intelligence report can be viewed as an ongoing and iterative process made up of three tasks (see box).

Tasks for Creating a Cyber Security Intelligence Report

1. The fi rst task involves intelligence gathering of available vulnerability and security posture data. This can be automated but should allow for manual submission from people who might have useful information to share. Many organizations do this gathering in the early morning hours, before the bulk of the business activity begins (a luxury that does not exist for global companies).

2. The second task involves interpretation and publication of the gathered data, not unlike similar processes in daily news publications. The interpretation should focus on the audience, never assuming too much or too little knowledge on the part of the reader. It is during this task that the human interpretive summary of the collected data is written.

3. The third task involves protected dissemination and archiving of the report for use by end users with a need to know. Report transmission is generally protected by encryption, and report archives and storage are protected by access controls (see Figure 10.6 ).

Human interpretation is bound to catch vulnerabilities that automated algorithms will miss.

188 Chapter 10 AWARENESS

One byproduct of creating an intelligence report is that it helps guide the local culture toward greater attentiveness to real-time security considerations. Everyone knows that, during an incident, response activity summaries will fi nd their way to senior managers which tends to heighten concentration on the accuracy and completeness of the report. In addition, when an incident occurs that does not fi nd its way into the report, manag- ers can justifi ably question the completeness of reporting around the incident.

Risk Management Process Managers of essential national services must understand the security risks associated with their underlying infrastructure. Although this can be done using all sorts of fancy risk taxono- mies, tools, and methodologies, the recommended approach is to simply maintain a prioritized list. Depending on the severity of the risks in the list, managers can decide to focus on a subset of the top ones, perhaps the top 10 or 20. Funding and resource allocation decisions for cyber security can then be driven by the security risk profi le of the organization, keeping in mind that the list of risks will change with any adjustments in threat environ- ment, technology deployment, or reported vulnerabilities.

Automated

Intelligence

Sources

Task 1 Task 2 Task 3

Intelligence

Recipients

Intelligence Report

Creation

Encrypted

Local

Intelligence

Gathering

Interpretation

and

Publication

Cyber

Intelligence

Report

Dissemination

and Archiving

Internet

Intelligence

Archive

Figure 10.6 Cyber security intelligence report creation and dissemination.

Security risks must be tracked (listed) and prioritized to drive appropriate funding and resource allocation.

Chapter 10 AWARENESS 189

The generally agreed-upon approach to measuring the secu- rity risk associated with a specifi c component begins with two estimations: ● Likelihood —This is an estimate of the chances an attack

might be successfully carried out against the specifi c compo- nent of interest.

● Consequences —This is an estimate of how serious the result might be if an attack were carried out successfully. These two estimates must be performed in the context of an

agreed-upon numeric range. The actual values in the range matter less than the relative values as the estimates increase and decrease. The simplest and most common values used are 1, 2, and 3, cor- responding to low, medium, and high for both estimates. Once the likelihood and consequences have been estimated, risk is obtained by multiplying the values. Thus, if some component has a high likelihood of attack (value 3) and medium consequences result- ing from an attack (value 2), then the associated risk is 3 times 2, or 6. If security measures are put in place to reduce the likelihood of an attack to medium (value 2), then the risk is now 2 times 2, or 4. Again, the absolute value of risk is less important than the rel- ative value based on security decisions that might be made.

A useful construct for analyzing security decisions in infra- structures compares relative security risk against the costs asso- ciated with the recommended action. The construct allows managers to consider decision paths that might increase, decrease, or leave unaffected the security risk, with the balancing consideration of increased, decreased, or unaffected associated costs (see Figure 10.7 ).

To interpret the choices in the decision path structure, start at the middle of the diagram and consider the effects of each path labeled A through H. The path labeled G shows a security deci- sion that increases costs in order to reduce risk. This is a normal management decision that is generally considered defensible as long as suffi cient budget is available. Similarly, the path labeled C is also normal, as it accepts increased risk in order to reduce costs, which is unfortunately a common enough decision.

Interestingly, any decision path in the area shaded on the fi g- ure will be generally acceptable in most cases because the rela- tionship between cost and risk is reasonable. The decision paths in the unshaded portion of the graph, however, are generally con- sidered unacceptable because of the odd balance between the two factors. Decision path H, for example, increases costs with no impact on security risk. This case corresponds to the situa- tion encountered all too often where a security safeguard is put in place that actually has zero impact on the risk profi le.

The actual numeric value of a security risk is less important than its overall relative risk.

Increasing risks likely incur increased costs; assessing relative risk will help determine the value of investing in risk reduction.

190 Chapter 10 AWARENESS

To summarize, all decisions about national infrastructure pro- tection should be made in the context of two explicit manage- ment considerations: (1) maintaining a prioritized list of security risks to the system of interest, and (2) justifying all decisions as corresponding to paths in the shaded portion of the decision path structure shown in Figure 10.7 . If these two simple consid- erations were mandatory, considerable time, effort, and money would be immediately saved for many infrastructure manage- ment teams.

Security Operations Centers The most tangible and visible realization of real-time security situational awareness is the security operations center (SOC), also referred to as a fusion center . The most basic model of SOC opera- tions involves multiple data, information, and intelligence inputs being fed into a repository used by human analysts for the pur- pose of operations such as interpretation, correlation, display, storage, archival, and decision-making. The SOC repository is constructed by active solicitation or passive acceptance of input information, and information processing combines human analy- sis with automated processing and visual display (see Figure 10.8 ).

Most SOC designs begin with a traditional centralized model where the facility is tied closely to the operations of the center.

Infrastructure

Security Cost

Infrastructure Security Risk

Path A (Worst)

Increase Risk,

Increase Cost

Path C (Normal)

Increase Risk,

Decrease Cost

Path G (Normal)

Increase Cost,

Decrease Risk

Path E (Best)

Decrease Risk,

Decrease Cost

C

D

A

B

E

F

G

H

"Shaded

Portion

Desireable"

"Unshaded

Portion

Undesireable"

Figure 10.7 Risk versus cost decision path structure.

Chapter 10 AWARENESS 191

That is, methods and procedures are created that presume SOC resources, including all personnel, are located in one place with no need for remote coordination. All data is stored in a local repository that can be physically protected in one location. This approach has its advantages, because it removes so many coor- dination-related variables from the management equation. That said, an SOC can be created from distributed resources in geo- graphically dispersed locations. Repositories can be distributed, and analysis can be performed using remote coordination tools. Generally speaking, this approach requires more work, but the main benefi t is that more expert analysts can be recruited to such an approach, especially if the requirement is that 24/7 operations be supported. Experts can be hired across the globe in a “follow- the-sun” support arrangement.

Typical operational functions supported in an SOC include all human interpretation of data by experts, management of spe- cifi c incidents as they arise, support for 24/7 contact services in case individuals have security-relevant information to share, and processing of any alarms or tickets connected to a threat management or intrusion detection system. The 24/7 aspect of SOC operation is particularly useful to national-level situational awareness, because key infrastructure protection managers will know that they can obtain a security posture status at any time from a human being on call in the SOC. Government procure- ment efforts for national services should include requirements for this type of coverage in the SOC.

Consoles

Display

ProcessingRepository

Active Solicit

Passive Accept

Human

AnalystsInformation

Sources

Response

Partners

Response

Process

Figure 10.8 Security operations center (SOC) high-level design.

The advantage to global dispersal of SOC resources is an around-the- clock real-time analysis of security threats.

192 Chapter 10 AWARENESS

National Awareness Program The goal of supporting a national-level view of security posture should not be controversial to most security and infrastructure managers. Everyone will agree that such a view is necessary and useful for supporting national infrastructure protection-related management decisions. The challenge, however, lies with the following important practical considerations: ● Commercial versus government information —To achieve full

situational awareness at the national level will require consid- erable support from both commercial and government entities. Groups supplying security status information must be pro- vided with incentives and motivations for such action. Patriotic justifi cation helps, but global companies must be more delib- erate in their sharing of information with any government.

● Information classifi cation —When information becomes clas- sifi ed, obviously the associated handling requirements will increase. This can cause problems for data fusion. In fact, the essence of data compartmentalization for classifi ed informa- tion is to prevent and avoid any type of fusion, especially with unclassifi ed data. The result is that situational awareness at the national level will probably include two views: one unclas- sifi ed and public, the other based on more sensitive views of classifi ed information.

● Agency politics —Government agencies are famous for using information as a basis for political agendas, including support for project funding, hiring plans, and facility expansion. This tendency is counter to the goal of information sharing for sit- uation awareness and must therefore be managed carefully.

● SOC responsibility —If a national SOC is to be realized, then some organization must be designated to run it. The deci- sion as to whether this should be a defense- or civilian-related initiative is beyond the scope of this book, but most security experts agree that current defense-related awareness initia- tives provide many of the elements required in a fully func- tioning SOC. If these challenges are not addressed properly, the risk is

that inaccurate views of situational awareness could arise. If an agency, for example, fi nds out about a vulnerability but decides to not share this information, then a hole emerges in any national- level risk estimation. Similarly, if a commercial organization is unable to receive and process classifi ed information, then their view of current security risk posture will not be accurate. Attentiveness to managing these issues on a case-by-case basis, perhaps as part of a national SOC, would seem the best approach.

193 Cyber Attacks. DOI: © Elsevier Inc. All rights reserved.

10.1016/B978-0-12-384917-5.00011-1 2011

RESPONSE

11

1 K. van Wyk and R. Forno, Incident Response , O’Reilly Media, Sebastopol, CA, 2001.

Incident response is a vital part of any successful IT program and is frequently overlooked until a major security emergency has occurred, resulting in untold amounts of unnecessary time and money spent, not to mention the stress associated with responding to a crisis .

Kenneth van Wyk and Richard Forno 1

The most familiar component of any cyber security program is the incident response process. This process includes all security- related activities that are initiated as a result of an attack that is imminent, suspected, under way, or completed. Incident response will generally be optimized to the local environment in an organization, but in most cases it will include at least the fol- lowing four distinct process phases: 1. Incident trigger —Some warning or event must trigger the inci-

dent response process to be initiated. Obviously, if the trigger involves a system that has already been maliciously attacked, then the response must be focused on reconstitution and disaster recovery. If the trigger involves an early warning, then it is possible that the incident response process could avoid visibly negative effects.

2. Expert gathering —This involves a gathering together of the appropriate experts to analyze the situation and make recom- mendations. Most organizations have a base set of incident response staff that work all incidents and manage a repository of information related to all previous incidents. In addition, each incident will dictate that certain subject matter experts be brought into the process to work the details. These experts will also provide a local information base relevant to the inci- dent at hand.

3. Incident analysis —Analysis of the incident is the primary task for the experts gathered during incident response. This can include detailed technical forensics, network data analysis,

194 Chapter 11 RESPONSE

and even business process examination. Generally, the most diffi cult part of any analysis involves fi guring out the under- lying cause of the incident. Once this has been determined, developing the best solution is the key goal.

4. Response activities —The output of any incident response pro- cess will be a set of management recommendations on how to deal with the incident. These often include rebuilding sys- tems, working around problems, informing customers, and the like. Providing this information to the correct individuals and organizations requires that the incident response teams be properly plugged into the specifi cs of which groups are responsible for which relevant functions. Specifi c incident response processes will vary from organi-

zation to organization, but virtually every company and agency process is based on some version of these four elements and includes incident response processes local to an organization or that might exist as a special response resource for citizens, busi- nesses, or government groups (see Figure 11.1 )

In spite of the commonality inherent in the incident response processes found in various companies and agencies, great differ- ences exist in their respective success patterns. The biggest dif- ferences reside in the relative effectiveness of incident response

Incident

triggers

Incident

response

staff

Generic

information base

(Previous incidents)

Feedback

Applicable

infrastructure

components

Feedback

Subject

matter

experts

Specific

information base

(Current incident)

Expert

Gathering

Incident

Analysis

Response

Actions

Figure 11.1 General incident response process schema.

Most organizations have some form of incident response process in place that generally incorporates the same elements.

Chapter 11 RESPONSE 195

in avoiding, rather than simply responding to, serious infrastruc- ture problems. To optimize the early-warning aspect of incident response, certain key considerations must be well understood. These include a focus on pre- versus post-attack responses, detailed understanding of what constitutes a valid indication or warning, proper construction of how an incident response team should be managed, best practices in forensic analysis, optimal interactions with law enforcement, and good processes for recov- ering from disasters. These elements are explained in more detail below, with an emphasis on how national infrastructure response processes must be constructed and operated.

Pre- Versus Post-Attack Response The most critical differentiating factor between incident response processes involves the two fundamental types of triggers that ini- tiate response. The fi rst type involves tangible, visible effects of a malicious attack or incident. These effects are usually noticed by end users in the form of slow application performance, clogged gateway performance, inability to get e-mail, slow or unavailable Internet access, and so on. Incident response in this case is usu- ally urgent and is affected by the often vocal complaints of the user base. The second type of trigger involves early warning and indications information, usually embedded in some system or network management information. These triggers are usually not visible to end users but are prone to high levels of false positive responses, where the warning really does not connect to a mali- cious action.

Incident response processes can thus be categorized into two specifi c approaches, based on the degree to which these triggers are addressed: ● Front-loaded prevention —This includes incident response

processes that are designed specifi cally to collect indications and warning information for the purpose of early prevention of security attacks. The advantage is that some attacks might be thwarted by the early focus, but the disadvantage is that the high rate of false positive responses can raise the costs of incident response dramatically.

● Back-loaded recovery —This includes incident response pro- cesses that are designed to collect information from various sources that can supply tangible, visible information about attacks that might be under way or completed. This approach reduces the false positive rates but is not effective in stopping attacks based on early warning data.

Early warning triggers are generally not visible to end users and are prone to high levels of false positives.

Effective incident response is critical, but avoiding infrastructure problems in the fi rst place will reduce the work required of the incident response team.

196 Chapter 11 RESPONSE

Hybrid incident response processes that attempt to do both front-end and back-end processing of available information are certainly possible, but the real decision point is whether to invest the time, resources, and money necessary for front-loaded pre- vention. These two types of processes can be illustrated on the time line of information that becomes available to the security team as an attack proceeds. For front-loaded prevention, the associated response costs and false positive rates are high, but the associated risk of missing information that could signal an attack is lower; for a back-loaded response, these respective val- ues are the opposite (see Figure 11.2 ).

Back-loaded incident response might be acceptable for smaller, less-critical infrastructure components, but for the pro- tection of essential national services from cyber attack the only reasonable option is to focus on front-end prevention of prob- lems. By defi nition, national infrastructure supports essential ser- vices; hence, any process that is designed specifi cally to degrade these services misses their essential nature. The fi rst implica- tion is that costs associated with incident response for national infrastructure prevention will tend to be higher than for typical enterprise situations. The second implication is that the famil- iar false positive metric, found so often in enterprise settings as a cost-cutting measure, must be removed from the vocabulary of national infrastructure protection managers.

Time

Back-Loaded RecoveryFront-Loaded Prevention

Response Cost

False Positive

Rate

Risk of Missing

Attack

Reports from Early Warnings and Indications

Systems

Reports from Users of Visible Effects from

Attack

LowHigh

LowHigh

HighLow

Figure 11.2 Comparison of front-loaded and back-loaded response processes.

Combining front-loaded prevention with back- loaded recovery creates a comprehensive response picture; however, an emphasis on front-loaded prevention may be worth the increased cost.

It is worth suffering through a higher number of false positives to ensure protection of essential national assets.

Chapter 11 RESPONSE 197

Indications and Warning Given the importance in national infrastructure protection of front-loaded prevention based on early indications and warning information, it becomes urgent to clarify the types of early trig- gers that should be used to initiate response processes. Because these triggers will vary between organizations due to obvious dif- ferences between their respective environments, the best that can be done is to categorize the various types of triggers into a broad taxonomy. Some of the elements of the taxonomy will be obvious and consistent with most current methodologies, whereas others will be quite different from current practice and will require process enhancements.

Taxonomy of Early Warning Process Triggers

The taxonomy of early warning process triggers includes: ● Vulnerability information —Knowledge of any new vulnerability is an obvious trigger for front-loaded prevention.

The vulnerability might never be exploited, but response teams should still analyze the possibilities and work toward developing proactive steps to ensure that an exploit cannot occur. In many cases, the vulnerability will be reported by a vendor, which implies that they will have to become part of the local incident response process.

● Changes in profi led behavioral metrics —Incident response teams should use meaningful changes in any measured behavioral metric as a trigger for process initiation. This can include changes in network behavior, changes in processor utilization, or changes in some application profi le. Initiation of incident response as a result of behavioral change represents a dramatic departure from current incident response processes in most organizations.

● Match on attack metric pattern —Similarly, if a signature or attack metric pattern is detected on some application, system, or network, then preventive incident response dictates that analysis be performed on the data for security implications. This is also a departure from current incident response approaches.

● Component anomalies —Any anomalous behavior detected in an infrastructure component is a candidate trigger for incident response. More intense behavioral anomalies found on more critical components will clearly trigger greater response processes.

● External attack information —Information that comes from external sources about attacks that might be locally relevant could trigger an incident response process. For national infrastructure protection, this is even more important if the information comes from a credible source regarding systems or technology having some local signifi cance.

One way to view the difference between the front-loaded and back-loaded methods is in the context of the trigger inten- sity required to initiate a response process. For the trigger approaches listed above, the information should be suffi cient to

198 Chapter 11 RESPONSE

cause the incident response team to take immediate action. In more conventional and familiar contexts, these triggers would not be suffi cient for such action (see Figure 11.3 ).

The triggers for front-loaded response share one important aspect—namely, they provide partial information that may signal a possible attack but that also could be explained in a non-security context. Thus, the major obligation of the incident response team in front-loaded prevention is to piece together all partial infor- mation into as complete a view as possible; from this view, for national infrastructure protection, the most conservative recom- mendation should be made. That is, it should be presumed that an attack is ongoing even if the team is not sure. This increases costs and decreases convenience to the local staff, but it errs on the side of caution and is thus appropriate for protecting essential services.

Incident Response Teams The optimal incident response team for national infrastructure protection includes two different components. First, a core set of individuals will manage the incident response process, main- tain relevant repository information, document all incident- related data, provide briefi ngs to anyone interested in the process (including senior management), and interact with other incident response teams. Second, a more dynamically allocated set of sub- ject matter experts will be brought into the incident response activity when an attack is targeting systems they understand best.

Back-Loaded

Recovery

Threshold

High Confidence

Attack On-Going

Front-Loaded

Prevention

Threshold

Lesser Confidence

Attack On-Going Not Enough

for Either

Response

Process

Minor

Vulnerability

Reported

Users

Detect

E-mail Virus

No

Internet

Access

Change in

Internet

Port Activity

Report of

External

Worm

Trigger

Intensity

Front-Loaded

Response

Back-Loaded

Response

Figure 11.3 Comparison of trigger intensity thresholds for response.

Front-loaded prevention responses have a high sensitivity to triggers; that is, response is initiated more often than with a back-loaded recovery response.

Erring on the side of caution is worth the extra time and expense when it comes to protecting our national assets.

Chapter 11 RESPONSE 199

In complex settings, the core incident response team is likely to be working multiple incidents simultaneously, generally with different sets of subject matter experts. Thus, response triggers will spawn new cases, which are worked in parallel to successful completion. In smaller environments, it is rare for multiple cases to be ongoing, but for larger, more complex critical infrastructure it is unusual to fi nd times when multiple incident response cases are not being worked simultaneously. This leads to the unique incident response obligation for national infrastructure protec- tion of ensuring that concurrent response activities do not mutu- ally confl ict (see Figure 11.4 ).

The notion of managing simultaneous response cases is largely unexplored in conventional computer security. This is unfortunate, because every large organization eventually comes to the realization that this is not only possible but is generally the norm. Furthermore, those national attack scenarios with the most serious potential consequences to infrastructure routinely include multiple concurrent attacks aimed at the same company or agency. Response teams in a national setting must therefore plan for the possibility of multiple, simultaneous management of different incident response cases. Some considerations that help plan properly for this possibly include the following: ● Avoidance of a single point of contact individual —If a single

individual holds the job of managing incident response pro- cesses, then the risk of case management overload emerges.

Requires Assurance that no Conflicts Between A and B

Trigger A

Trigger B

Subject Matter

Experts: A

Case B

Case A

Core Response

Team

Subject Matter

Experts: B

Spawn Case A

Spawn Case B

Figure 11.4 Management of simultaneous response cases.

Individuals on incident response teams need to ensure they are not working at cross-purposes with their colleagues.

It is unlikely that a large organization would not have simultaneous attack scenarios to face.

200 Chapter 11 RESPONSE

This might seem like a minor management detail, but given the importance of response, especially in a recovery scenario, avoidance of such weaknesses is a requirement.

● Case management automation —The use of automation to manage, log, and archive incident response cases will improve the productivity of the core incident response team and can lead to streamlined analysis, especially if previous case infor- mation is available for online, automated query and search.

● Organizational support for expert involvement —The entire organization must readily agree to provide experts for incident response when requested. This is not controversial when the process follows a back-loaded recovery method, because every- one is visually aware of the consequences of the incident. It is more challenging, however, when a front-loaded prevention approach is used and the triggers that initiate incident response are more subtle.

● 24/7 operational support —Without full 24/7 coverage every day of every year, the success likelihood of managing multiple, concurrent incident response cases drops considerably. Most organizations integrate their incident response function into an SOC to ensure proper management coverage. An interesting recent trend in infrastructure management

involves the outsourcing of certain security operations to a third party. For status monitoring of security devices such as fi rewalls and intrusion detection systems, this is a reasonably mature activ- ity and will have no materially negative effect on local security protection efforts (unless the outsourcing fi rm is incompetent). Even for certain SOC operations, outsourcing is often an excel- lent idea, especially because collection and correlation are always more effective if the vantage point is large. Outsourced SOC oper- ations can also provide the security team with access to technical skills that may not reside locally.

Incident response processes, however, can easily become awkward for full outsourcing because of the embedded nature of prevention and recovery efforts for local infrastructure. Certainly, an outsourcing provider or vendor can and should be of assis- tance, and third-party SOC experts might offer excellent guid- ance and advice. Ultimately, however, incident response must be a local management function, and the organization will have no choice but to expend time, energy, and resources to ensure that the correct local management decisions are made. Third par- ties can never prioritize actions or tailor recovery procedures to the local environment as well as the organization itself. Instead, they should be used to augment local functions, to provide expert guidance, to automate processes, to manage equipment

Companies cannot avoid complete responsibility for incident response by outsourcing the entire process; prioritizing and tailoring recovery procedures must be done locally.

Outsourcing some aspects of security operations may make good business sense.

Chapter 11 RESPONSE 201

and networks, to support data collection and correlation, and to assist in recovery.

Forensic Analysis Forensic analysis involves those activities required to investigate, at both a high level and a detailed lower level, the root cause and underpinnings of some event. Typical questions addressed dur- ing the forensic analysis process include: ● Root cause —How specifi cally was the target system attacked? ● Exploits —What vulnerabilities or exploits were used in the

attack? ● State —Is the system still under an active state of attack by an

adversary? ● Consequences —What components of the system were read,

stolen, changed, or blocked? ● Action —What actions will stop this attack (if ongoing) or pre-

vent one in the future? To answer these diffi cult questions during incident response,

forensic analysis requires the ability to drive deeply into a target system of interest, gathering relevant information but doing so in a manner than never destroys, affects, or changes key evidence. This is a critical requirement, because clumsy forensic analy- sis might overwrite important fi les, change important stamped dates on system resources, or overwrite portions of memory that include critical evidence. Forensic analysis is a diffi cult activ- ity requiring great skill and competency, as well as the ability to investigate a system both manually and with the assistance of special tools (see Figure 11.5 ).

The forensic process is performed on a computer to deter- mine how, when, and where some event on that computer might have occurred as the result of hardware, software, human, or network action. Corporate security groups, for example, often perform forensic analysis on a computer when the owner is sus- pected of violating some guideline or requirement. Law enforce- ment groups perform similar actions on computers seized from suspected criminals. Forensics can, however, be performed on a target much broader than a computer. Specifi cally, for the pro- tection of essential national services, the organization must have the ability to perform forensic analysis on the entire supporting infrastructure.

The individual technical skills required to perform such broad forensic analysis are easy to write down, but qualifi ed person- nel are not always so easy to recruit and hire. This problem is so

Great care must be taken during forensic analysis not to change or destroy fi les or other critical evidence.

Forensic analysis can be specifi c (one computer) or broad based (entire supporting infrastructure).

202 Chapter 11 RESPONSE

severe for most large organizations that it is not uncommon for a company or agency to have no local expert with suffi cient skills to properly lead the investigation of a widespread infrastructure attack. This is unacceptable, because the only options for that organization are to locate such talent externally, and this will result in a less intimate evaluation process. Long-term employees who are committed to a career in an organization will always be more knowledgeable than consultants or third parties; further- more, they will be suitably trusted to investigate an incident into the deep recesses of the local environment.

As such, the irony of forensic analysis is that most businesses and agencies would be wise to begin building and nurturing a base of talent with these skills. Typically, to maintain and satisfy forensic experts requires several things: ● Culture of relative freedom —Most good forensic analysts are

creative individuals who learned their craft by exploring. They tend to maintain their skills by continuing to explore, so orga- nizations must give them the freedom to seek and analyze systems, networks, applications, and other elements of inter- est. When they are working an incident, the target is obvious, but when they are not then managers must offer them the freedom to explore as they see fi t. This is not easy for some managers, especially in relatively mature organizations with (ahem) long legacies of tight employee controls.

● Access to interesting technology —A related aspect of the local environment required to keep forensic analysts happy is con- stant access to interesting, changing, and emerging tech- nology. What this means is that assigning your best forensic analysts to day-to-day operations around a single technology might not be the best idea.

● Ability to interact externally —Forensic analysts will also need the freedom to interact with their peer community and to

Forensic Process

Cannot Make

Changes to

Target System

Considerable

Skill and

Experience

Required

Forensic

Data

Manual

Target System

(High-Level View)

Forensic

Tools

Forensic

Analysts

Target System

(Lower-Level View)

Figure 11.5 Generic high-level forensic process schema.

An internal expert will be the one most likely to properly lead a company investigation, but few company employees have the requisite skills.

Chapter 11 RESPONSE 203

learn from experts outside the organization. This must be per- mitted and encouraged. These environmental elements are not unique to forensic

experts, but of all the skill sets required in a national infrastruc- ture protection setting forensic analysis is the one that is the most diffi cult for an organization to obtain. Good forensic ana- lysts can command the highest premium on the market and are thus diffi cult to keep, especially in a relatively low-paying govern- ment job. As such, attention to these quality-of-work-life attri- butes becomes more than just a good idea; instead, it becomes a requirement if the organization chooses to have the ability to perform forensic analysis as part of the overall incident response process.

Law Enforcement Issues A common issue faced by response teams is whether a given incident should be turned over to law enforcement for support. Most countries have laws that obligate response teams to contact law enforcement groups in the event of certain crimes; incident response teams must be familiar with these laws and must obey them without question. They must, in fact, be burned into inci- dent response processes with full review by legal council in the organization. The issue of law enforcement involvement is also driven, however, by emotional considerations, especially when great time and effort have been directed toward dealing with some incident. The team often wishes to see tangible retribution, perhaps involving the bad guys actually going to jail.

In the end, however, interaction with law enforcement for infrastructure protection should follow a more deliberate and routine process. National infrastructure protection has a singu- lar goal—namely, to ensure the continued and accurate delivery of essential services to the citizenry and businesses of a nation. This does not include the goal of catching bad guys and throwing them in jail, as much as security teams might like this result. The result is that discretionary law enforcement involvement should only be considered when the local security team believes that such enforcement could help with a current incident, perhaps through offering some relevant data or hints, or could help pre- vent a future incident by putting away some group that appears to be a repeat offender. A decision process for law enforcement involvement emerges as shown in Figure 11.6 .

This decision process does recognize and support the clear requirement that crimes must be reported, but the fi gure also

Investing in a good forensic analyst will be expensive but worthwhile for the protection of national security assets.

Carefully review local, regional, and national laws regarding when law enforcement must be contacted during a security incident.

204 Chapter 11 RESPONSE

highlights a particularly fuzzy aspect of cyber security—namely, detecting suspicious behavior on a computer network usually does not constitute suffi cient evidence of a crime being commit- ted. Even if evidence of a break-in to a given system is observed, the argument could be made that no crime has occurred, espe- cially if the break-in is the result of some automated process as one fi nds in a botnet attack.

The result is that national infrastructure protection teams will need to understand the decision process for law enforcement and follow it carefully during every incident. They will also need to create a local process for determining whether a crime has been committed in the context of their infrastructure. The result not only will optimize the interface between an organization and law enforcement but will also minimize the inevitable resource demands that will arise for the local team if law enforcement gets involved.

Disaster Recovery The process of disaster recovery after a security attack is more mature than other aspects of incident response. This stems from the commonality that exists between recovery from attack and recovery from natural disasters such as fl oods, tornados, fi res, and the like. Unfortunately, many large organizations charged with responsibility for national infrastructure do not properly address their obligation to include disaster recovery in their plan- ning. Specifi cally, disaster recovery programs have three fun- damental components, whether they are driven by concerns of malicious attack or natural disaster (see box).

Decisions on a Per-Incident Basis:

Requires

Traceability

Information

to Source

Report Incident to Law Enforcement

Evidence of a

Mandatory Reported

Crime?

View that Law

Enforcement Might

Provide Useful Data?

Evidence of

High-Likelihood

Repeat Offender?

Forensic

Analysts

Figure 11.6 Decision process for law enforcement involvement in forensics.

Incident response teams should report relevant information to law enforcement, even if it does not result in arrest.

Chapter 11 RESPONSE 205

Three Components of a Disaster Recovery Program ● Preparation —The decision to prepare in advance for disaster recovery is easy to make but much more diffi cult to

support in practice. Operational funding is usually the stumbling block, because the process of preparing for disaster in advance involves more than just writing down a list of potential actions. Instead, it often requires architectural changes to avoid single points of potential failure. It could require installation of safe, redundant means for communication between recovery teams, and it could even require upgrades to cyber security systems to ensure proper protection through a disaster.

● Planning —An essential element in a disaster recovery program is an explicit plan that is written down and incorporated into all operational methods and procedures. The plan can be continually improved as the organization deals with real disasters. For example, many organizations who relied on the use of commercial airplanes to shuttle equipment to disaster sites found that this did not work well in the aftermath of 9/11.

● Practice —The decision to practice for disasters is also an expensive one, requiring that teams of experts be funded to support mock drills. The best way to practice for a disaster is to create a realistic scenario and work through the specifi cs of the written plan. Usually, this will involve the use of spare computing or networking capacity that is set aside in a hot confi guration (see Figure 11.7 ).

Create "Mock"

Disaster Analyzed and Repaired

(If Real Incident)

Critical Infrastructure

Component "Hot" Spare

Component Provisioned

During Exercise

Now Live

Component

Preparation for Exercise Exercise Configuration

Figure 11.7 Disaster recovery exercise confi gurations.

Realistically, very few organizations actually practice for disas- ters. It requires a discipline that is generally missing from most enterprise system and network teams and can only work if the senior leadership team makes it a priority. Sadly, the only time disasters are considered is after they occur, especially after they have some impact on the local environment. This familiar process

206 Chapter 11 RESPONSE

of taking disasters seriously only after they occur is something we have all witnessed in our society, especially as it relates to natural disasters and terrorism. For proper protection of national infra- structure from cyber attack, this attitude must be adjusted.

National Response Program The most important function in any national response program involves emergency coordination among government, business, citizens, and other nations during a cyber attack incident. The respective interfaces must be identifi ed and managed as part of response planning. National programs can provide central- ized coordination, but intrasector coordination should also be encouraged (see Figure 11.8 ).

This coordination function would seem obvious, but most existing national emergency response programs and computer emergency response team (CERT) programs tend to focus on dis- semination of vulnerability-related information. This is useful, especially for smaller organizations that have no security team, but this focus tends to leave a gap in national-level coordination should a major national incident occur. Amazingly, at the time of this writing, such a major national incident has yet to occur, but if one should happen soon then national coordination in the United States is unlikely to be smooth. This is unacceptable and requires immediate attention to properly protect national infra- structure from the effects of cyber attack.

Global

Programs

(International)

Business

Enterprise

Interface

Domestic

Citizen

Interface

Intra-Sector Response Coordination

Government

Agency

Interface

Infrastructure

Sector

Interface

Telecom

Transportation

Banking

Energy

Other...

Military

Intelligence

Civilian

State

Local

National Response Program

(National-Level Coordination)

Figure 11.8 National response program coordination interfaces.

Proper planning for disaster response and recovery requires time and discipline, but the outcome is well worth the effort.

207

SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

Any discussion of computer security necessarily starts with a statement of requirements .

U.S. Department of Defense “Orange Book” ( Trusted Computer System Evaluation Criteria , DoD 5200.28-STD)

Readers of this book associated with enterprise organizations should consider translating the security material presented in this book into an action plan relevant to their local environment. For the majority, this will involve creating a set of new security requirements for infrastructure protection. Valid scenarios might include public offi cials embedding new security requirements for collection or correlation into a Request for Proposal (RFP). They might also include enterprise security engineers writing new types of security policy requirements on the fi ltering of distributed denial of service attack (DDOS) aimed at their company, as well as researchers and product engineers embedding new security requirements for deceptive honey pots into their innovation plans.

Obviously, the best way for any requirement to properly match a target environment is for it to be tailored specifi cally to that environment. This requires that the security engineer writ- ing the requirement must possess an understanding of local con- straints, along with suffi cient insight into the purpose and intent of the new security control. For this reason, it is not practical to propose in this appendix a set of requirements for national infra- structure protection that will perfectly address all cases. Instead, presented below is a set of sample base requirements that pro- vide context and illustration for readers determined to embed some of the ideas presented in this book into their local program.

Each sample requirement is written as a constraint on how a given organization might operate. In fact, the terminology for our samples follows the general format: “ The organization must …,”

APPENDIX

208 Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

and readers should be capable of tailoring the statement to a locally specifi c format: “… do such and such under conditions XYZ in environments ABC for purposes IJK .” The requirements are not comprehensive, so the idea of cutting and pasting the exam- ples to create a new policy would be a bad idea. The require- ments are also not complete, so one should not presume that the sample requirements on correlation, for example, provide a suf- fi cient summary of the material presented on that topic. Instead, this appendix is offered as a pedagogical tool to help readers create requirements that not only may be effective at protect- ing infrastructure but are also meaningful and practical in local environments.

Sample Deception Requirements (Chapter 2) In this section, we provide three sample deception requirements that will reduce the risk of attack to a given target infrastructure component by introducing uncertainty to the attacker and by creating a means for analyzing live attacks. Readers are warned, however, that locally relevant legal issues must be carefully attended to before deploying any sort of honey pot. Attorneys who are knowledgeable in the area of practical deception deploy- ment are unfortunately quite scarce.

The organization must … (DEC-1) … operate deceptive honey pot functionality, in con- nection with infrastructure supporting essential services, that is attractive and locally accessible to malicious insiders. This requirement ensures that effort has been made to operate trap functionality focused on catching insiders, including employ- ees, consultants, visitors, and contractors, who might dem- onstrate malicious intent to commit system sabotage or data exfi ltration. The deployment does not have to be extensive but should be tailored to the size and scope of the infrastructure component being protected. The decision to run in stealth or non-stealth mode can be a local decision. Demonstration of compliance should include a test in which the honey pot is shown to be accessible from an internal network and is designed well enough to be reasonably convincing to a typical internal adversary. (DEC-2) … operate deceptive honey pot functionality in con- nection with supporting essential services that is attractive and externally accessible to malicious outsiders. This requirement ensures that effort has been made to operate trap function- ality that is focused on malicious outsiders who might target

Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS 209

organizational resources via the Internet or some other exter- nal access point, such as an extranet or virtual private network (VPN). The deployment also does not have to be extensive but should be tailored to the target environment and can be done in a stealth or non-stealth manner, depending on local needs. Penetration testing from the Internet can be used to check compliance. (DEC-3) … provide evidence that all deceptive honey pot func- tionality is associated with explicit management and support systems for the purpose of initiating response. This require- ment ensures that the organization has suffi cient back-end systems for the deception to be effective, especially if it is done in stealth mode. This requirement is best met by a honey pot alarm notifi cation system connected to human beings trained to interpret the results and direct a response action. Without such back-end support, the deception is unlikely to operate properly. It is also generally important that the back- end systems include means for dealing with innocent autho- rized users who gain access to a honey pot inadvertently.

Sample Separation Requirements (Chapter 3) In this section, we introduce six sample requirements on how fi re- walls, access controls, and fi lters can be effectively used to help protect national infrastructure. The inclusion of fi ltering require- ments for mobile wireless systems will be controversial in the vast majority of environments simply because such functionality might not be locally accessible or available through any practical means.

The organization must … (SEP-1) … proactively redirect or fi lter live DDOS traffi c before it reaches local network ingress points. This requirement ensures that the organization takes steps to reduce the risk associated with inbound DDOS attacks. A service provider operating fi lters on a large capacity backbone is a good option here. The amount of fi ltering should be expressed as a multi- ple of inbound ingress capacity. Obviously, this multiple must be greater than one—and the greater, the better. The fi lters must not operate on the ingress gateway, for obvious reasons (do the math). Testing of this capability should be done care- fully, as the creation of a live, inbound attack can easily get out of hand. (Disaster recovery procedures should always be in place before creation of a live test attack on anything.) (SEP-2) … provide evidence that inbound attacks on externally accessible applications cannot produce an amplifi cation-based

210 Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

DDOS attack on local network egress points. This requirement ensures that the organization is addressing the risk of inbound amplifi cation attacks on poorly designed, externally acces- sible applications. The evidence required here most likely will be the result of a set of functional tests, code reviews, and requirements audits. The idea is to check for evidence that small questions at an application cannot produce very large responses to the external requestor. In the most com- plex application environments, this effort will not remove the egress DDOS risk, but the evidence gathering process should identify obvious problems. It should also raise general aware- ness of the risk for developers creating new applications. (SEP-3) … fl exibly enforce network access controls (fi rewalls) between designated groups of insiders, especially those work- ing in direct support of essential services. This requirement ensures that the organization is using internal fi rewalls to cre- ate trusted internal domains. Casual insiders, including the majority of a typical employee base, should not have the abil- ity to view, change, or block a broad set of internal resources as a result of their special access as employees, contractors, or visitors. This is especially true for access to the systems sup- porting essential services. Certainly, employees working on a specifi c component might have the ability to cause local prob- lems, but this effect should be limited to the local component. This can be accomplished with fi rewalls, access lists on rout- ers and switches, encryption-based systems, and other types of security mechanisms. Penetration testing from Intranet locations can be used to establish compliance. (SEP-4) … fl exibly enforce network access controls (fi rewalls) between organizational resources and any untrusted external network, including any mobile wireless infrastructure. The famil- iar part of this requirement involves fi rewalls between an orga- nization and external, untrusted networks such as the Internet. Almost every organization on the planet will be able to meet this requirement, albeit with a ragged perimeter model that might include hundreds or even thousands of access exceptions in their rule base. The unfamiliar part of the requirement is that mobility-based access over wireless carrier networks is included in the mediation. Because most companies and agencies do not have easy access to an enterprise mobile policy enforcement engine, cooperation with the mobile network carrier may be required (and don’t be surprised if your carrier has some trouble supporting your needs, as this is very new territory). (SEP-5) … stop inbound e-mail and web-based viruses, spam, and other malware before they reach network ingress points

Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS 211

to any local infrastructure supporting essential services. This requirement ensures that unwanted inbound traffi c is col- lected before it hits the ingress point to a target network— most likely the existing organizational Intranet. This greatly reduces the risk of a volume-based attack using these ser- vices and simplifi es gateway security requirements. Effi ciency and cost reduction concerns are a good byproduct in this approach, even though they are not the primary motivations for inclusion. Compliance can be established here through simple injection of inbound test traffi c aimed at the target. (SEP-6) … demonstrate that separation controls based on orga- nizational security policy are in place for cloud-hosted appli- cations, resources, and systems in support of essential services. This requirement ensures that the use of cloud infrastructure for applications, hosting, storage, and other components sup- porting essential services is properly protected based on orga- nizational security policy rules. With increasing adoption of cloud services for software, storage, applications, and systems, this requirement is likely to gradually develop into a set of best practices for cloud-based security. Integration of existing orga- nizational identity management functionality is likely to be one of the most challenging aspects of cloud security.

Sample Diversity Requirements (Chapter 4) In this section, we introduce two diversity requirements that will not be easy to implement in most environments. The practical reality is that most chief information offi cers (CIOs) have inten- tionally created networks that are lacking diversity, standard- ized, and cost effective but that are also susceptible to failure by a single vendor and do not have suffi cient backup by an alter- nate vendor. The desktop operating system is a good example. These requirements remain, nevertheless, powerful weapons for reducing the cascading effect of certain attacks and should thus be championed by the security team. The decision not to follow a diverse path should be made only after careful debate and con- sideration of all available options. A compromise might involve restricting the diversity to highly focused areas associated with especially critical systems.

The organization must … (DIV-1) … provide evidence that no single vendor failure or com- promise can produce a cascading effect across any combination of application, computing, or network components support- ing essential services. This requirement reduces the risk that a

212 Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

cascading chain of failure can occur in critical infrastructure because of a common vendor thread in a technology. As sug- gested above, this is a tough requirement for most organiza- tions to meet on the desktop, given the pervasiveness of a common architecture and set of applications. It is nevertheless critical that the cascading problem be addressed through atten- tion to diversity. Compliance can be checked manually. (DIV-2) … utilize at least one live, alternative backup vendor in a substantive manner for all software, PCs, servers, and net- work components in support of essential services. This require- ment implies one possible component of meeting DIV-1 by dictating a live, alternative backup. The requirement might be tailored in practice to a target percentage; for example, the requirement might state that at least 10% of all organizational PC operating systems in support of essential services be pro- vided by a backup vendor. Compliance can be checked manu- ally here as well.

Sample Commonality Requirements (Chapter 5) In this section, two commonality requirements are included that should be well suited for integration into most environments. The goal for most organizations will be to improve policy and compliance strategies rather than to create them from scratch.

The organization must … (COM-1) … have a written security policy, with training pro- grams for decision-makers, auditable mechanisms for com- pliance, and written processes for punishing violators. The familiar part of this requirement is that a security policy must be in place with mechanisms for compliance. The unfamiliar part involves training emphasis for decision-makers and writ- ten processes for dealing with violations. (COM-2) … demonstrate full organizational compliance to at least one recognized information security standard verifi ed by an external auditor. This requirement ensures that the orga- nization has targeted at least one reasonably well-known and accepted security standard for compliance. While there are some differences between standards, the reality is that the recognized ones all include a basic core set of requirements that dictate essentially the same sort of controls. Thus, it really doesn’t matter, in the vast majority of cases, which standard is selected as long as at least one is being used.

Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS 213

Sample Depth Requirements (Chapter 6) In this section, three sample requirements are offered that are intended to improve the depth of security mechanism layers for essential services. The requirements focus on access, fail- ure, integrity, and encryption; readers can easily extrapolate this emphasis to other technical or security areas.

The organization must … (DEP-1) … provide evidence that no individual, inside or out- side the organization, can directly access systems supporting an essential service without at least two diverse security authenti- cation challenges. This requirement ensures that two types of authentication are used in accessing essential infrastructure. Local interpretation would be required to determine if the selected methods are suffi ciently diverse. A personal iden- tifi cation number (PIN) with a handheld authenticator, for example, is often touted as providing suffi cient two-factor validation of identity; however, a group might locally interpret these two factors as a single challenge. (DEP-2) … provide evidence that failure of any one protection system cannot lead to a direct compromise in any application, computing, or networking functionality supporting essential services. This requirement ensures that failure of a single pro- tection system cannot compromise the overall mission. This might be achieved via network-based security, duplication and distribution, or some other means. Firewalls, in particu- lar, are often single points of defense that require a corre- sponding alternative protection method. (DEP-3) … provide evidence that all information, whose unau- thorized disclosure could directly affect the integrity and opera- tion of any essential service, is subjected to at least two diverse levels of encryption during both storage and transmission. This requirement ensures that the most critical information for essential services be encrypted twice using diverse means. The interpretation of what constitutes “most critical informa- tion” can be negotiated locally and should truly correspond to only that information whose confi dentiality is critical to the operation of an essential service. Thus, one could imag- ine this being a very small portion of information in limited types of circumstances. End-to-end plus link-level encryp- tion is an example of how this requirement might be met in some cases.

214 Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

Sample Discretion Requirements (Chapter 7) This section introduces two sample requirements on how infor- mation is handled. The requirements are reasonably conven- tional, and many organizations will fi nd that complying with these examples is not challenging. The real goal, however, is to create a culture of discretion in an organization; unfortunately, it is not a trivial exercise to write objective requirements that mea- sure how well a culture is suited to critical infrastructure protec- tion. The sample requirements below at least provide a start.

The organization must … (DIS-1) … provide evidence that organizational information related to any essential service is properly marked and that such markings are suitably enforced. This requirement should include evidence, perhaps through an unplanned audit, that all documents contain the proper markings. So many orga- nizations violate this simple principle, and this is a shame, because it provides powerful protection against certain types of data leakage. Data leakage tools, for example, can be tuned to detect properly marked documents that might be inadver- tently transmitted to an untrusted destination. (DIS-2) … provide evidence that organizational staff asso- ciated with essential services is fully trained in local poli- cies for how information is handled and shared externally. Compliance analysis here might require some interpretation, as most organizations will have some level of training. The key question is whether this training is considered accept- able, as well as how well the organization ensures that all staff completes the training on a regular basis. Obviously, if any of the requirements described here are inserted in the organiza- tional policy, then training must be updated accordingly.

Sample Collection Requirements (Chapter 8) This section introduces two sample collection requirements that help establish more formal, written policies on how information is collected for security purposes. A key goal in establishing any type of collection policy is to ensure that basic privacy consider- ations are fully attended to, in a manner that allows the security team suffi cient access to data to identify early indicators and to perform proper security forensic analysis.

The organization must … (COL-1) … provide evidence that a set of criteria has been established for which types of information in which contexts

Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS 215

should be collected and stored by the organization. The crite- ria for collection and storage must certainly include attention to various factors, including privacy, but the focus here is on security. The intent is to require that a set of criteria is in place for what is collected and stored, with the goal of ensuring that relevant attacks and indicators will be detectable in the col- lected data. If the collection process involves loose sampling of packets on a small percentage of network links, for exam- ple, then the likelihood of detecting an important security indicator might be low. (COL-2) … provide evidence that collection systems are in place to gather in real time and store in a secure manner all desired information from applications, systems, and networks. Having real-time access to data enables early warnings of attack. If the collection process is done in a casual batch mode, perhaps employing daily, weekly, or even monthly reports, then great damage could occur to an essential service before a corre- sponding trend might even be noted.

Sample Correlation Requirements (Chapter 9) This section introduces two sample correlation requirements that help focus an organization on a structured approach to ensuring that proper data analysis is being performed in an envi- ronment that allows for awareness and response.

The organization must … (COR-1) … provide evidence that effective algorithms are in place to correlate relevant information in real-time toward actionable results. Correlation algorithms do not have to be complex; rather, they must be properly in place and effec- tive in their operation, and they should lead to some sort of actionable security processes. In truth, complex algorithms might not even be desirable. I’ve seen simple packet counters provide more relevant information than complex pattern- matching signature-based intrusion prevention systems. (COR-2) … provide evidence that correlative output (presum- ably in a security operations center) is connected to organiza- tional awareness and response functions. This requirement ensures that the correlation, collection, awareness, and response functions are all connected. The presumption that this be done in a security operations center is based on the practical observation that most organizations that try to ensure such connections do so in a security operations center.

216 Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS

Sample Awareness Requirements (Chapter 10) This section introduces three sample requirements that focus on embedding situational awareness capabilities into the day-to-day infrastructure of an organization. Particular emphasis is placed on real-time, comprehensive capabilities.

The organization must … (AWA-1) … provide evidence that cyber security intelligence information related to essential services is collected on a regu- lar basis and disseminated to decision makers on a timely basis. This requirement ensures that an intelligence collecting process is in place. It should include mechanisms for safely obtaining raw data, for interpreting the data in a way that pro- tects its integrity as well as any privacy constraints, and for creating a set of actionable intelligence information for cyber security control. (AWA-2) … provide evidence that a real-time security oper- ations function exists that coordinates any preventive or response actions based on collected information and correlative analysis (presumably in an operations center). This require- ment ensures that the organization has a means for taking action as a result of collected intelligence. Too many groups create fancy security intelligence collection processes, render interesting patterns on large glowing wallboards, and can pro- vide anecdotes about all sorts of previous security attacks. It is far less common, however, to fi nd an organization that truly coordinates action based on available information. (AWA-3) … provide evidence that a real-time and comprehen- sive threat and vulnerability management process is in place and that relevant data is used to minimize security risk to all equipment and software supporting essential services. It is not uncommon to fi nd a threat and vulnerability management process in place in an organization. Assurance is not always available, however, that the process is real-time or comprehen- sive. That is, the data might become available in an arbitrary manner, and coverage issues for all equipment and software supporting essential services might not be properly considered.

Sample Response Requirements (Chapter 11) This section introduces two sample requirements on how an organization performs incident response. The emphasis is on enhancing existing response processes to include more proactive

Appendix: SAMPLE NATIONAL INFRASTRUCTURE PROTECTION REQUIREMENTS 217

responses to indicators as well as improved documentation and metrics.

The organization must … (RES-1) … provide evidence that the organization has the abil- ity to respond to indicators and warning signals in advance of an attack on any critical resource. This requirement ensures that an incident response can be initiated based on indica- tors, rather than just outages. Every organization has the abil- ity to notice that things have gone completely haywire (e.g., network down, e-mail not working), but only more aggres- sive groups have built processes for responding to more sub- tle data, perhaps from network devices, servers, or security systems. (RES-2) … provide evidence that the organization maintains documentation and metrics on the root cause of past security problems, as well as the effectiveness of response activities for past and present security incidents. Amazingly, few solicita- tions include this requirement; in fact, as a member of a ser- vice provider security team for the past 25 years, I’ve only seen this partially required a couple of times in major government procurements. More generic past practice requirements are often specifi ed by government buyers (rarely, if ever, in com- mercial settings), but the specifi cs regarding root cause met- rics, documented effectiveness, and other statistics are rarely dictated.

This page intentionally left blank

INDEX 219

INDEX

Access control lists (ACLs) LAN controls , 14 layered access controls ,

120 Access controls

functional separation , 55 layered , 120–122 , 122f national infrastructure SSO ,

116 remote e-mail , 118 separation principle , 14

Access paths, national depth program , 127

Access techniques, separation principle , 13–14

Accuracy data collection , 170–171 intelligence reports , 188 national infrastructure

fi rewalls , 57 SCADA systems , 62

ACLs , see Access control lists (ACLs)

Actionable information , 176 , 182

Active opportunism, vulnerability information management , 184–185

Actual assets and honey pot , 35 separation techniques , 54

Administrators, as infrastructure decision-makers , 103

Adversary separation , 53 Adversary types , 4–5 , 5f , 9–11 Aggregation

data collection , 146f system data collection , 152

Air gapping , 63–64 Alarm feed, intrusion detection

information , 126f Alarms

and actionable information , 176

false , 44 intrusion detection , 167 , 168 layered intrusion detection ,

125 SIEM threat management ,

167 SOC , 191

Alarm streams and correlation , 163 , 169f , 170 SIEM , 154–155

Algorithms and actionable information ,

176 antispam , 163–164 attack detection , 183 and awareness , 25 and collection , 23 , 145 , 157 and correlation , 24–25 ,

167–168 , 177 DDOS fi ltering , 60 and discretion , 21 vs. human interpretation , 187

Amplication approach, DDOS fi ltering , 60–61

Analysis objective, as deception , 32

Antivirus software botnet detection , 120 and botnets , 7–8 and correlation , 163–164 , 171 relevance , 119–120 as safeguard , 2 and separation , 15 , 51 system data collection , 152

Apple® PC diversity , 77 Application interoperability, and

desktop diversity , 79 Applications metric, data

collection , 151 Application strengthening,

encryption methods , 123 Approach factor, separation

techniques , 54

Approved external site, and obscurity layers , 141

Architectural separation layered e-mail fi ltering , 120f trusted networks , 15

Architecture issues, and awareness , 180

Asset identifi cation, national depth program , 127

Asset separation DDOS and CDN-hosted

content , 69f overview , 68–70

Assistance questions, TCB , 132 Attack confi dence, event

dependence , 184f Attack entry points, and

deception , 33–34 Attack metric pattern, early

warning process , 197 Attack motivations, national

infrastructure , 5 Attention objective, as

deception , 31 Attribute differences, and

diversity , 73–74 , 74f Attributes, information

reconnaissance , 139 Audits

and best practices , 18 and collection principle ,

22–23 defi nition , 89 importance , 18 meaningful vs. measurable

best practices , 91 national infrastructure

simplifi cation , 102 organization examples , 90f purpose , 90 security , see Security audit SIEM threat management ,

167

220 INDEX

Authentication layered , 115–118 , 117f remote e-mail access , 118 separation principle , 13

Authorized services, deception scanning stage , 36

Automated attacks botnets , 48–50 , 203–204 propagation , 74 , 81 worms , 77

Automated control systems , 1–2 , 2f

Automated metrics, intelligence reports , 187

Automation and actionable information ,

176 case management , 200 data collection , 21–22 data correlation , 169 in deception , 37 fusion center , 23–24 vs. human interpretation , 187 incident response cases , 200 intelligence gathering , 187 ,

188f reconnaissance , 138 situation awareness , 26f SOC , 190 statistics generation , 22

Availability deception exploitation stage ,

43–44 as security concern , 4

Awareness principle , see also Situational awareness

cyber security methodology , 11

implementation , 25–26 large-scale infrastructure

protection , 26 process fl ow , 26f sample requirements , 216 and security policy , 103 , 104

Back-end insiders, deception exploitation stage , 44 Back-loaded response ,

195 , 196f

Backup centers, diversity issues , 85

Behavioral metrics, early warning process , 197

Bell-La Padula disclosure, and MLS , 70–72

Best practices , see also Commonality principle

common standards , 90–91 consistency principle , 17–18 examples , 89 meaningful vs. measurable ,

91 , 92f national commonality

program , 108 national infrastructure

protection , 94–95 vs. security audit , 93

Biba integrity model , 70–72 Blaster worm , 76–77 Bogus vulnerabilities, and

deception discovery stage , 39–40 exploitation stage , 43–44 open ports , 38f and real assets , 44 scanning stage , 36

Botnets and antivirus software , 120 attack components , 7 bot defi nition , 6–7 controller defi nition , 7 correlation-based detection ,

172–174 , 174f and correlation principle ,

24–25 data collection trends , 157 ,

158f DDOS attack , 8f and deception , 12 domain-based correlation ,

164–165 , 165f operator defi nition , 7 and PC diversity , 78 real-time analysis , 48 as security concern , 4 and separation techniques ,

56–57 software drop defi nition , 7

system data collection , 152–153

target defi nition , 7 threat , 6–9 time-based correlation , 165 ,

166f Boundary scanning, air-gapped

networks , 64 British Security Standard, best

practices , 91 BS-7799, best practices

standards , 91 Business environment, and

awareness , 180

Career path basic considerations , 105–106 security teams , 95

Carrier-centric network-based fi rewalls , 59f

Cascade modeling, national diversity program , 87

Case studies deception discovery stage , 40 depth effectiveness , 111

CDNs , see Content distribution networks (CDNs)

Centralized mediation functional separation ,

56 , 56f smart device security , 58–59

CERT , see Computer emergency response team (CERT)

Certifi cation/education programs

best practice recommendations , 102–105

national infrastructure protection , 95

ROI trends , 104 , 104f Certifi ed Information Systems

Security Professional (CISSP) , 105

Circuit-switched technology , 82 Citizen-based data collection ,

153–154 Clark-Wilson integrity model ,

67

INDEX 221

Classifi cation commercial vs. government

information , 192 and information disclosure ,

142f organizational compartments ,

141 Clearance

commercial mapping , 143f organizational compartments ,

141 Clear policy, air-gapped

networks , 64 Cloud computing

diversity paradox , 80–82 fi rewalls , 59 layered e-mail virus/spam

protection , 119–120 network-based fi rewalls , 15

Clutter engineering chart example ,

100f national infrastructure

simplifi cation , 102 Collection of bots , 7 Collection principle , see also

Data collection and awareness , 25 cyber security methodology , 10 defi nition , 145 implementation , 21–23 large-scale trending , 156–159 national infrastructure , 22f national program , 161–162 sample requirements ,

214–215 security goals , 146–147 SIEM , 154–156 , 155f , 156f

Commercial databases data collection , 152 size , 3 system data collection , 152

Commercial fi rewalls and correlation , 168 national infrastructure , 57 and SCADA , 63 tailored separation , 53

Commercially motivated attack , 5

Commercial operating systems Mac OS® , 81 PC diversity , 77 UNIX® , 120 vulnerabilities , 139 Windows® , 74 , 77 , 81 , 120

Commercial organizations , see also Industry environments

botnet attacks , 174 clearances/classifi cations ,

143f competition issues , 175 e-mail protection , 119 government assistance , 136 information sharing , 28–29 insider separation , 66 intrusion detection , 125 national awareness program ,

192 national services , 1 and PII , 20–21 security audits , 108 security policy , 95–96 SIEM , 154 volunteered data , 158 vulnerability reports ,

129–130 Commercial tools

actionable information , 176 and deception exposing stage ,

46–47 large-scale offerings , 56 national infrastructure

protection , 14 , 15f satellite data services , 86 SIEM , 167–168 threat management systems ,

2–3 Commissions and boards

cyber security principle implementation , 28

national commonality program , 108

Commonality principle career path , 105–106 certifi cation/education ,

102–105 culture of security , 99f

engineering chart cluttered , 100f simplifi ed , 101f

infrastructure simplifi cation , 99–102

national infrastructure protection best practices , 94–95

national program , 107–108 overview , 89 past security practice ,

106–107 reward structure , 105–106 sample requirements , 212 security education ROI trends ,

104f security policy , 95–97 , 97f security protection culture ,

97–99 world-class infrastructure

protection , 94f Compartmentalization

and discretion principle , 141–143

information classifi cation , 192

separation techniques , 54f Competition

large-scale correlation , 175 PC diversity , 79

Complex environments, simplifi cation , 101–102

Complex networks, fi rewalls , 52f

Component distribution, as separation objective , 53

Computer emergency response team (CERT) , 206

Computer-to-human interface, and deception , 47–48

Confi cker worm , 158–159 Confi dentiality

as security concern , 4 vulnerability risk , 6

Consistency principle cyber security methodology , 10 implementation , 17–19

Consumer entertainment systems , 1

222 INDEX

Content condition, deceptive documents , 41

Content distribution networks (CDNs) , 69 , 69f

Correlation principle actionable information , 176 analytical methods , 163 and awareness , 25 basic considerations , 165–166 botnet detection , 172–174 ,

174f collection issues , 170f conventional methods ,

167–169 cyber security methodology ,

10–11 domain-based example , 165f implementation , 23–25 , 24f improvement steps , 24 intrusion detection and

fi rewalls , 169f large-scale , 174–176 , 175f national program , 176–177 network service providers ,

171 overview , 163 profi le-based example , 164f quality/reliability issues ,

169–170 sample requirements , 215 scenario taxonomy , 166f , 167 signature-based example ,

164f time-based example , 166f worm detection , 170–172

Cost issues cyber security principle

implementation , 29 and diversity principle , 75 platform diversity , 79 vs. risks , 190f

Country-sponsored warfare , 5 Coverage

data collection trends , 158 national infrastructure

fi rewalls , 57 SCADA systems , 62

Critical applications, as security concern , 4

Critical path analysis, national diversity program , 87

Culture of security basic considerations , 94–95 ,

97–99 implementation , 99 options , 99f

Cyber attack example , 2f Cyber security methodology

awareness principle , 25–26 collection principle , 21–23 , 22f components , 9–11 consistency principle , 17–19 correlation principle , 23–25 ,

24f deception , 11–13 depth principle , 19–20 discretion principle , 20–21 diversity principle , 16–17 intelligence reports , 186–188 national principle

implementation , 28–29 response principle , 26–28 , 27f separation , 13–15

Cyber security scale, large vs. small , 3f

Databases actionable information , 176 asset separation , 68–69 as national infrastructure , 1 , 3 national infrastructure vs.

commercial , 3 past security practices , 106 system data collection , 152 vulnerability information

management , 185 , 186f Data collection , see also

Collection principle with aggregation , 146f botnet behavior , 158f botnet detection , 173 and correlation issues , 170f decision analysis template ,

147f examples , 153f generic example , 148f network metadata , 148–150 network service providers , 171

system data , 150–154 top areas , 151 vulnerability detection , 149f vulnerability information

management , 185 worm tracking , 159–160 , 160f ,

161f Data feeds

and correlation principle , 24 national correlation program ,

177 quality issues , 169–170

Data formats, large-scale correlation , 174–175

Data leakage protection (DLP) systems , 67 , 141

Data marking enforcement , 66–67

Data sampling technique , 149 Data services , 82 , 86–87 Data sources, national collection

program , 161 Data storage

encryption methods , 123 national collection program ,

162 DDOS , see Distributed denial of

service attack (DDOS) Deception principle

adversary considerations , 33 and botnets , 12 cyber security methodology , 9 deliberately open ports , 37–39 discovery stage , 39–40 documents , 41–42 , 42f example use , 32f exploitation stage , 42–44 exposing stage , 46–47 , 46f honey pots and software bugs ,

37 human-computer interfaces ,

47–49 , 48f implementation , 11–13 interface components , 12f national asset service

interface , 36f national program , 49–50 objectives , 31–32 overview , 31

INDEX 223

procurement tricks , 45–46 , 45f

sample requirements , 208–209

scanning stage , 35–37 stages , 34 stages for national

infrastructure , 34f Decision-makers

back-end insiders , 44 and certifi cation/education ,

95 , 102 , 104–105 , 104f and data correlation , 163 and intelligence reports , 186 and security policy , 104 and TCB , 132

Decision process data collection , 147f forensic analysis , 204f risk vs. cost , 190f risk management , 189 security policy , 97f

Decomposition, asset separation , 69

Defense in depth cyber security methodology ,

10 effectiveness , 111–114 , 112f ,

114f end-user authentication , 117f general schema , 110f implementation , 19–20 , 19f ,

110 intrusion detection

information sharing , 126f layered access controls ,

120–122 layered authentication ,

115–118 layered e-mail fi ltering , 120f layered e-mail virus/spam

protection , 119–120 layered encryption , 122–124 ,

124f layered intrusion detection ,

124–126 national infrastructure , 19f national program , 126–127 overview , 109

remote e-mail access , 118 sample requirements , 213

Depth principle , see Defense in depth

Designers, as decision-makers , 103

Desktop computer systems , see Personal computers (PCs)

Developers, as decision-makers , 103

Digital rights management (DRM), and worms , 77

Disaster recovery process exercise confi gurations , 205f process , 204–206 program components , 205

Disclosure clearance/classifi cation

control , 142f deception exploitation stage ,

43–44 as security concern , 4

Discovery phase defi nition , 34 overview , 39–40

Discretion principle clearance/classifi cation

commercial mapping , 143f

cyber security methodology , 10

implementation , 20–21 information disclosure

control , 142f information reconnaissance ,

137–139 information sharing , 135–137 ,

137f national program , 143–144 obscurity layers , 139–141 ,

140f organizational compartments ,

141–143 overview , 129 sample requirements , 214 “security through obscurity” ,

133–135 and TCB , 131 top-secret information , 20

trusted computing base , 130–133 , 131f

vulnerability disclosure lifecycle , 135f

Distributed denial of service attack (DDOS)

and authentication and identity management , 13

botnets , 8f CDN-hosted content , 69 , 69f national separation program ,

72 network-based fi rewalls , 14 network technology diversity ,

83 separation principle , 9

Distributed denial of service attack (DDOS) fi lter

challanges , 61 inbound attacks , 61f separation techniques ,

60–61 Distributed mediation, functional

separation , 56 , 56f Diversity principle

and attribute differences , 73–74 , 74f

cloud computing , 80–82 cyber security methodology , 10 desktop computer systems ,

77–80 , 80f enforcing , 17 implementation , 16–17 national infrastructure , 16f national program , 87 network technology , 82–84 overview , 73 PC nondiversity example , 78f physical diversity , 85 proof factors , 116 sample requirements , 211–212 satellite data services , 86–87 ,

86f with SSO , 116 and worms , 75–77 , 76f , 83f

DLP systems , see Data leakage protection (DLP) systems

DNS , see Domain name system (DNS)

224 INDEX

Domain-based correlation , 164–165 , 165f

Domain name system (DNS) and CDNs , 69–70 cyber security principle

implementation , 29 deceptive open ports , 37–38

DRM , see Digital rights management (DRM)

Dual-homing , 64–65 , 65f Duplication

deception discovery stage , 40 honey pot design , 40 , 41f

Duty controls, and best practices , 18

E-mail layered fi ltering , 120f layered virus protection ,

119–120 remote access authentication ,

118 Emergency response

as national infrastructure , 1 national program , 206

Encryption best practices , 91 data collection , 152 deceptive documents , 42 intelligence reports , 187 layered , 122–124 , 124f national infrastructure , 2 ,

114 , 123 past security practice , 107 protected transit , 162

End user education , 105 Energy objective, as deception ,

31 Enforceability, security policy , 96 Engineering analysis, depth

effectiveness , 111 Engineering chart

cluttered , 100f simplifi ed example , 101f

Engineering standards, quality levels , 19

Enterprise security and deception principle , 12 desktop diversity options , 80f

layered authentication , 116 and PC diversity , 79 separation principle , 13–14 well-known techniques , 2–3

Expert gathering , 193 Exploitation points

deceptive open ports , 39 defi nition , 4–5 forensic analysis , 201 and honey pots , 34 national infrastructure , 5f scanning stage , 35–36 and “security through

obscurity” , 134 Exploitation stage

defi nition , 35 overview , 42–44 pre- and post-attack stages , 43f

Exposing stage defi nition , 35 overview , 46–47 , 46f

External adversary , 4–5

False positives deception exploitation stage , 43 rate , 28 and response principle , 27 response to , 43

Federal Information Security Management Act (FISMA) , 18 , 90

Fiber routes network technology diversity ,

83 worm propagation , 84f

Field control systems , 62 Filtering

DDOS , 60–61 e-mail, layered , 119–120 , 120f packet fi ltering routers , 15 system data collection , 152 vulnerability information

management , 185 Financial applications, as

national infrastructure , 1 Financially driven criminal

attack , 5 Financial networks, insider

separation , 66

Firewalls , see also Separation principle

approaches , 51–53 carrier-centric network-

based , 59f cloud computing , 59 intrusion detection

correlation , 169f large-scale networks , 51 layered access controls ,

121–122 , 122f national infrastructure ,

57–60 network-based , see Network-

based fi rewalls SCADA architecture , 63f separation enhancements ,

14 , 15f separation principle , 14 SIEM threat management , 167 simple/complex networks , 52f and worms , 76

FISMA , see Federal Information Security Management Act (FISMA)

Fix questions, TCB , 132 Flaws

and defense in depth , 112 and security posture , 182f

Forensic analysis decision process , 204f incident response , 201–203 ,

202f Front-loaded response , 195 ,

196f , 198 Functional controls, and

defense layers , 19–20 Functional separation

distributed vs. centralized mediation , 56f

overview , 55–57 Fusion centers , see also Security

operations centers (SOC) and correlation principle , 23–24 and response principle , 28 situational awareness , 190

Generalization, and infrastructure simplifi cation , 100 , 102

INDEX 225

Geographic location, botnet detection , 173

Global threats, and awareness , 180 Google® , 77 Government agencies/

environments audits , 18 botnets , 174 cloud computing , 15 , 15f commissions/boards , 108 competition , 175 data collection , 22f , 145 , 146 ,

151–153 data markings , 66–67 and deception , 12 discretion issues , 20 , 129 fi rewalls , 15 information sharing , 28–29 ,

136 , 137f infrastructure best practices ,

92–93 insider separation , 66 intelligence reports , 186 intrusion detection , 125–126 known vulnerabilites , 179 layered authentication , 115 layered intrusion detection ,

125–126 MLS , 70 national awareness program ,

192 national commonality

program , 108 national discretion program ,

143 national diversity program , 87 national response program ,

206 , 206f national separation program ,

71–72 national services , 1 network violations , 64 organizational compartments ,

141 PC diversity , 77 physical diversity , 85 politics , 192 response issues , 194 security policy , 95–96

separation program , 71–72 SIEM , 154 SOC , 191 system data collection ,

152–153 system size issues , 2–3 TCB , 132 volunteered data , 158 vulnerability information

management , 185 vulnerability reporting , 6 worm detection , 170–171

Hacking and awareness , 180–181 and discretion , 129 motivation , 5 national discretion program ,

143–144 and “security through

obscurity” , 134–135 Hardening (servers), and

deliberately open ports , 37

Hardware profi les, and awareness , 180

Health Insurance Portability and Accountability Act (HIPAA) , 90

Hidden probes, deception exposing stage , 44 , 47

HIPAA , see Health Insurance Portability and Accountability Act (HIPAA)

HMI , see Human-machine interface (HMI)

Honey pot and actual assets , 35 and deception , 32 defi nition , 11 duplication , 40 , 41f environmental mimics , 33 and exploitable points , 34 insider separation , 66 monitoring , 35 in normal server complex , 39f and real assets , 13 and software bugs , 37

testing , 12 vulnerability mimics , 40

HTTP , see Hypertext transfer protocol (HTTP)

Human-computer interfaces, and deception , 47–49

Human-human interfaces, and deception , 47–48 , 48f

Human interpretation, automated metrics , 187

Human-machine interface (HMI) , 62

Hypertext transfer protocol (HTTP), deceptive open ports , 37–38 , 38f

ICMP , see Internet Control Messaging Protocol (ICMP)

Identity management, separation principle , 13

Identity theft, as security concern , 4

IEC , see International Electrotechnical Commission (IEC)

Implied statement, and deception principle , 13

In-band detection , 125 Incident analysis , 193–194 Incident response , see also

Response principle defi nition , 194 disaster recovery , 204–206 ,

205f early warning triggers , 197 forensic analysis , 201–203 , 202f front- vs. back-loaded , 196f indications/warnings ,

197–198 law enforcement issues ,

203–204 , 204f national program , 206 , 206f phases , 193–194 pre- vs. post-attack , 195–196 process , 194f security teams , 198–201 simultaneous cases , 199f trigger intensity thresholds ,

198f

226 INDEX

Incident trigger defi nition , 193 early warning , 197

Inclusiveness, security policy , 96

Indications and warnings defi nition , 42–43 early triggers , 197 incident response , 197–198 response principle , 26–27

Industry environments , see also Commercial organizations

access control , 142 authentication issues , 115 career path/salary , 105 data collection , 22f , 151 and hackers , 129–130 information sharing , 28–29 ,

136 , 137f intrusion detection ,

125–126 national discretion program ,

143 physical diversity , 85 system size issues , 2–3 vulnerability information

management , 185 Information management,

vulnerabilities , 184–186 , 186f

Information reconnaissance information types , 138–139 overview , 137–139 planning levels , 138

Information sharing commercial vs. government ,

192 cyber security principle

implementation , 28–29 and discretion , 135–137 by government agencies , 136 ,

137f hacker perspective , 129 and incident response , 197 and intrusion detection , 126f national discretion program ,

143 occurrences , 135

and “security through obscurity” , 133

Infrastructure protection and awareness , 179–180 and meaningful best

practices , 92–95 Infrastructure simplifi cation

commitment , 95 , 99–102 national infrastructure , 101–102

Insider separation, basic considerations , 65–68

Integrity and best practices , 18 deception exploitation stage ,

43–44 as security concern , 4

Intelligence community daily briefs , 26 and discretion principle , 20 intelligence reports , 186

Intelligence reports creation , 187 creation/dissemination , 188f for cyber security , 186–188

Interfaces and deception principle ,

11–12 , 12f human-computer , 47–49 national infrastructure

simplifi cation , 102 national response program ,

206f Internal adversary , 4–5 Internal fi rewalls

insider separation , 66 layered access controls , 122

Internal separation as fi rewall approach , 52–53 national separation program ,

72 International Electrotechnical

Commission (IEC) , 91 International Organization for

Standardization (ISO) , 91 Internet Control Messaging

Protocol (ICMP), worm detection , 171

Internet Explorer®, PC diversity , 77

Internet Protocol (IP) intrusion detection , 168 layered access controls , 121 packet-switched technology ,

82 separation principle , 14

Internet Protocol over Satellite (IPoS) , 86

Internet Relay Chat (IRC), and botnets , 7–8 , 172

Intrusion detection with data security , 125 fi rewall policy correlation ,

169f information sharing , 126f layered , 124–126 SIEM threat management ,

167 Intrusion prevention , 125 Inventory processes, system

data collection , 151–152

Investment considerations and centralized security , 59 ROI and security education ,

104 , 104f IP , see Internet Protocol (IP) iPhone® , 77 iPod® , 77 IPoS , see Internet Protocol over

Satellite (IPoS) IRC , see Internet Relay Chat

(IRC) ISO , see International

Organization for Standardization (ISO)

LAN controls , see Local area network (LAN) controls

Large-scale correlation example , 175f factors , 174–176

Large-scale cyber security fi rewall protection , 51 , 56 vs. small-scale , 3 , 3f

Large-scale trending, data collection , 156–159

Law enforcement issues databases as infrastructure , 1

INDEX 227

incident response , 203–204 Layered access controls ,

120–122 Layered authentication

end-user , 117f overview , 115–118 remote e-mail access , 118

Layered encryption multiple layers , 124f national infrastructure , 123 overview , 122–124

Layered intrusion detection , 124–126

Layer of protection defense in depth overview ,

109 effectiveness , 112f , 114f

Legality questions, TCB , 132 Likelihood, risk management ,

189 Limits questions, TCB , 132 Local area fi rewall aggregation

example , 58f technique , 57

Local area network (LAN) controls , 14

Log fi les and collection principle ,

22–23 SIEM threat management ,

168 Logical access controls,

separation principle , 14 Logical diversity, network

technology , 82 Low radar actions , 42–43

MAC , see Media access control (MAC)

Mac OS®-based operating systems , 81

Mainframe data storage encryption methods , 123 system data collection , 150–

151 , 153 , 153f Malware

and awareness , 179–180 botnets , 7–8 , 172 and cloud computing , 81

and correlation , 165 , 168–169 and data collection , 152 and depth , 114 , 119–120 and open ports , 37 and PC diversity , 78 and separation , 56–57 , 65

Mandatory controls, and TCB , 131

Master terminal unit (MTU) , 62 Meaningful best practices

for infrastructure protection , 92–95

vs. measurable , 91 , 92f Measurable best practices vs.

meaningful , 91 , 92f Media access control (MAC),

separation principle , 14 Metadata

data collection , 148 sampling , 149 SIEM threat management ,

168 Military support services, as

national infrastructure , 1 Misinformation, in deception ,

11 MLS , see Multilevel security

(MLS) Mobile devices

encryption methods , 123 layered authentication issues ,

117 virus/spam issues , 119

Mobile telecommunications, as national infrastructure , 1

Monitoring deception exposing stage ,

46–47 honey pot , 35 intruders , 37 by network service providers ,

171 MTU , see Master terminal unit

(MTU) Multilevel security (MLS)

example , 71f for separation of assets , 70–72

Multiple access control systems, management , 122

Nachi worm , 171 National infrastructure

adversaries and exploitation points , 4–5 , 5f

attack detection , 183–184 attack motivations , 5 and awareness , 25–26 , 216 and collection , 21–23 , 22f ,

214–215 and commonality , 212 common security concerns , 4 and consistency , 17–19 and correlation , 23–25 , 24f ,

174–176 , 215 cyber attack vulnerability ,

1–2 , 2f cyber security methodology

components , 9–11 cyber threats, vulnerabilities,

attacks , 4–6 database size , 3 data collection , 145 data correlation issues , 170 DDOS fi ltering , 60 and deception , 11 , 34f , 43 , 46 ,

208–209 deceptive documents ,

41–42 and depth , 19–20 , 19f , 213 disaster recovery , 204–205 and discretion , 20–21 , 214 and diversity , 16–17 , 16f , 73 ,

211–212 exploitation points , 5f fi rewalls , 15f , 57–60 functional separation

techniques , 55–56 insider separation , 65–68 layered access controls ,

121–122 layered authentication , 115 layered encryption , 122–123 network technology diversity ,

84 obscurity layers , 139–140 overview , 1 and past security practice ,

106–107 PC diversity , 78

228 INDEX

National infrastructure –(contd.) physical attack vulnerability ,

1–2 , 2f and protection , 94–95 , 207–208 and response , 26–28 , 27f , 196 ,

203 , 216–217 and separation , 13–15 , 51 ,

209–211 service interface with

deception , 36f simplifi cation , 101–102 small- vs. large-scale security ,

2f , 3 smart device management , 58 SSO access system , 116 system data collection , 150 TCB assets , 132 well-known computer security

techniques , 2 National programs

awareness , 192 collection , 161–162 commonality , 107–108 correlation , 176–177 deception , 49–50 depth , 126–127 discretion , 143–144 diversity , 87 implementation of principles ,

28–29 response , 206 , 206f separation , 71–72

Need questions, TCB , 132 Netfl ow , 148 Network-based fi rewalls

and cloud computing , 15 DDOS fi ltering , 60 as fi rewall approach , 52 layered access controls ,

121–122 national separation program ,

71–72 separation enhancements , 14 simple/complex , 52f

Network data collection , 148–150 SIEM threat management , 168

Network perimeter, and defense layers , 20

Network routes, diversity issues , 85

Network service providers data collection , 171 network monitoring , 171

Network technology diversity , 82–84 and worms , 83f

Network transmission, encryption methods , 123

Nondefi nitive conclusions, vulnerability information management , 185

Nonuniformity, and infrastructure simplifi cation , 101

Obscurity layers discretion principle , 139–141 examples , 140f leaks , 141 national discretion program ,

144 Obviousness, and infrastructure

simplifi cation , 100 Off-the-shelf fi rewalls , 14 One-to-many communication,

botnet detection , 173 Online access, security policy ,

96 Open solicitations, deception

discovery stage , 40 Operational challenges

and collection principle , 22–23

and deception principle , 46 incident response , 200 smart device security , 59

Operational confi gurations, and best practices , 18

Operational costs, cyber security principle implementation , 29

Organizational culture incident response , 200 security implementation , 99 security options , 99f security protection , 94–95 ,

97–99

Outages metric, data collection , 151

Out-of-band correlation , 125 Outsourcing

and global threats , 180 incident response , 200 insider separation , 65–66 security operations , 200 security team members , 103 supply chains , 4–5

Packet fi ltering routers, separation principle , 15

Packet-switched technology , 82 Parceling, separation principle ,

15 Past security practice,

responsible , 95 , 106–107 Patching (software and systems),

and best practices , 18 Patterns, national infrastructure

simplifi cation , 102 Payment Card Industry Data

Security Standard (PCI DSS), best practices standards , 90

PCI DSS , see Payment Card Industry Data Security Standard (PCI DSS)

PCs , see Personal computers (PCs)

Permissions vectors, UNIX® , 120

Personal computers (PCs) botnet attacks , 6–7 botnet detection , 172 DDOS attacks , 8 diversity considerations ,

77–80 , 80f and diversity principle , 17 ,

77–80 nondiversity example , 78f system data collection , 150–

151 , 153 , 153f Personally identifi able

information (PII) and discretion principle ,

20–21 and TCB , 130

INDEX 229

Physical attacks, national infrastructure vulnerability , 1–2 , 2f

Physical diversity issues , 85 network technology , 82 satellite data services , 86–87

Physical security, layered access controls , 121 , 122

Physical separation dual-homing example , 65f technique , 63–65

PII , see Personally identifi able information (PII)

PKI tools , see Public key infrastructure (PKI) tools

Plain old telephone services (POTS) , 82

Planning, disaster recovery program , 205

Platforms, diversity costs , 79 Politics

and awareness , 180 and information sharing , 136 national awareness program ,

192 Port scanners, deceptive open

ports , 38 Post-attack vs. pre-attack

response , 195–196 POTS , see Plain old telephone

services (POTS) Power control networks, as

national infrastructure , 1 Practical experience, depth

effectiveness , 111 Practice, disaster recovery

program , 205 Pre-attack vs. post-attack

response , 195–196 Predators, deception-based

tools , 11 Preparation, disaster recovery

program , 205 Prevention

data collection security , 146 front-loaded , 195 past security practice , 107

Privacy policy, and collection principle , 22–23

Procedural controls, and defense layers , 19–20

Process allowance, deception exploitation stage , 44

Process coordination, deception exploitation stage , 44

Procurement discipline and deception principle ,

45–46 national diversity program , 87

Profi le-based correlation defi nition , 163 example , 164f

Proof factors, diversity , 116 Proprietary information

and discretion , 129–130 national depth program , 127

Protected transit, national collection program , 162

Protection condition, deceptive documents , 41

Protections, information reconnaissance , 139

PSTN , see Public switched telephone network (PSTN)

Public key infrastructure (PKI) tools, encryption methods , 123

Public speaking, and obscurity layers , 141

Public switched telephone network (PSTN) , 82

Published case studies, deception discovery stage , 40

Quality issues data collection , 147 data correlation , 169–170 defense in depth , 109 engineering standards , 19

Real assets and bogus assets , 38f , 44 and deception , 31 , 32f , 43 honey pot connection , 201 interfaces and deception , 12f

Real-time analysis botnet attacks , 48 honey pots , 32–33

Real-time awareness implementation , 25 process fl ow , 26f

Real-time observations, deception exposing stage , 47

Real-time risk, situational awareness , 182

Real vulnerabilities, deception scanning stage , 36

Reliability issues, data correlation , 169–170

Remote terminal unit (RTU) , 62 Removal option, PC diversity ,

81 , 81f Replication, asset separation ,

68–69 Requests for Information (RFIs),

deception discovery stage , 40

Requests for Proposals (RFPs), deception discovery stage , 40

Response principle , see also Incident response

cyber security methodology , 11

implementation , 26–28 , 27f past security practice , 107 sample requirements ,

216–217 Return on investment (ROI),

security education , 104 , 104f

Reward structure basic considerations ,

105–106 security teams , 95

RFIs , see Requests for Information (RFIs)

RFPs , see Requests for Proposals (RFPs)

Right-of-way routes, network technology diversity , 83

Risk management process , 188–190

230 INDEX

Risk reduction adversary separation , 53 by asset separation , 68–69 and botnet detection , 174 and cloud computing , 81 cyber security methodology ,

9–11 DDOS attacks , 60 , 69f and deception , 13 , 32 and depth , 19 , 113 by insider separation , 66–67 national separation program ,

72 by network technology

diversity , 84 by physical diversity , 85 by physical separation , 65 principles, national

implementation , 28 Root cause, forensic analysis ,

201 RTU , see Remote terminal unit

(RTU)

Salary , 105 Sarbanes-Oxley controls

consistency principle , 17–18 diversity principle , 74 internal separation , 72

Sasser worm , 76–77 Satellite data services

physical diversity , 86–87 SCADA confi gurations , 86f

SCADA , see Supervisory control and data acquisition (SCADA) systems

Scaling issues, system data collection , 150

Scanning stage deceptive open ports , 38 defi nition , 34 overview , 35–37

Search for leakage, and obscurity layers , 141

“Secret,” and MLS , 70 Secure commerce, encryption

methods , 123 Secure Sockets Layer (SSL),

encryption methods , 123

Security audit vs. best practices , 93 and certifi cation/education ,

102–103 defi nition , 89 infrastructure protection

relationship , 92 meaningful best practices , 92 meaningful vs. measurable

best practices , 91 national commonality

program , 108 organization examples , 90f purpose , 90

Security information and event management (SIEM)

defi nition , 154–156 generic architecture , 154–155 ,

155f generic national architecture ,

156f threat management , 167–168

Security information management system (SIMS), and collection principle , 21–22

Security operations centers (SOC) , see also Fusion centers

high-level design , 191f incident response , 200 responsibility , 192 situational awareness ,

190–191 Security policy

awareness , 103 and certifi cation/education ,

102 and decision-makers , 104 decision process , 97f intrusion detection

correlation , 169f locally relevant and

appropriate , 94–97 Security posture

and activity/response , 182f estimation , 181f intelligence reports , 187

Security standard

defi nition , 89 national commonality

program , 108 Security teams

career path/reward structure , 95

incident response , 198–201 as infrastructure decision-

makers , 103 “Security through obscurity”

and asset vulnerability , 21 defi nition , 133–135 and discretion principle , 21 exploitable fl aws , 134 knowledge lifecycle , 134f objectionable applications ,

133 primary vs. complementary

control , 133–134 Segregation, asset separation ,

69 Segregation of duties

defi nition , 67 work functions , 68f

Senior managers and career path , 106 as infrastructure decision-

makers , 103 Sensitive information, as

security concern , 4 Separation principle , see also

Firewalls asset separation , 68–70 carrier-centric network-based

fi rewalls , 59f cyber security methodology , 9 DDOS fi ltering , 60–61 distributed vs. centralized

mediation , 56f enhancements for national

infrastructure , 14 fi rewall approaches , 51–53 fi rewall enhancements , 15f functional separation , 55–57 implementation , 13–15 insider separation , 65–68 MLS , 70–72 , 71f national infrastructure

fi rewalls , 57–60

INDEX 231

national program , 71–72 objectives , 53–55 overview , 51 physical separation , 63–65 ,

65f sample requirements ,

209–211 SCADA architecture , 62–63 ,

63f techniques , 54–55

Separation vs. segregation of duties , 67

Server complex, honey pots , 39f Server data storage

encryption methods , 123 system data collection , 150–

151 , 153 , 153f Service level agreements (SLAs)

and data quality , 169 national infrastructure

fi rewalls , 59–60 Service ports

bogus assets , 38f and deception , 33–34 , 36f ,

37–39 post-scanning , 38

SIEM , see Security information and event management (SIEM)

Signature-based correlation defi nition , 163 example , 164f

Signature sharing , 125–126 Simple networks, fi rewalls , 52f Simplifi cation , see Infrastructure

simplifi cation SIMS , see Security information

management system (SIMS)

Single sign-on (SSO) initiatives and diversity , 116 layered authentication , 115 national infrastructure , 116

Situational awareness attack confi dence , 184f cyber security posture , 181f defi nition , 179 implementation , 25 and information sharing , 136

infrastructure attack detection , 183–184

intelligence reports , 187 , 188f , 186–188

national correlation program , 177

national program , 192 optimal system usage , 181f real-time risk , 182 risk categorization , 181–182 risk vs. cost decision paths ,

190f risk management process ,

188–190 security operations centers ,

190–191 , 191f vulnerability information

management , 184–186 , 186f

Sizing issues national infrastructure

simplifi cation , 101 security policy , 96 system data collection , 150

SLAs , see Service level agreements (SLAs)

Small-scale vs. large-scale cyber security , 3f , 3

Smart devices fi rewall issues , 58 national infrastructure

protection , 58 protection complexity , 58

SMTP, deceptive open ports , 37–38

SOC , see Security operations centers (SOC)

Software bugs, and honey pots , 37

Software engineering standards , 19

Software lifecycle, and best practices , 18

Software profi les, and awareness , 180

Spam, layered protection , 119–120

Sponsored research, deception discovery stage , 40

SQL/Slammer worm , 76–77 tracking , 159–160 , 160f , 161f

SSL , see Secure Sockets Layer (SSL)

SSO , see Single sign-on (SSO) initiatives

Stalkers, deception-based tools , 11

Standard audit, infrastructure protection , 93

State, forensic analysis , 201 Stream-of-consciousness

design, and infrastructure simplifi cation , 100–101

Subjective estimations, national depth program , 127

Suffi cient detail, deception exposing stage , 47

Suitability, and defense in depth , 112–113

Supervisory control and data acquisition (SCADA) systems

architecture , 62–63 , 63f insider separation , 66 IPoS , 86 , 86f layered access controls , 121 as national infrastructure , 1 national infrastructure

fi rewalls , 57 off-the-shelf fi rewalls , 14 separation principle , 9 tailored separation , 53 , 72

Supplier adversary deception techniques ,

45 , 45f defi nition , 4–5

Suppliers, diversity issues , 17 , 85

Supply chain , 4–5 Support and training, and

desktop diversity , 79 System administration, and best

practices , 18 System administration and

normal usage , 4–5 System data, collection ,

150–154

232 INDEX

Tailored separation as fi rewall approach , 53 national separation program ,

72 Target factor

large-scale correlation , 175 separation techniques , 54

Tarpit , 49–50 TCP/IP , see Transmission

Control Protocol/Internet Protocol (TCP/IP)

TDM , see Time-division multiplexed (TDM) services

Telecommunications collection systems , 22–23 insider separation , 66 as national infrastructure , 1

Terrorist attacks motivation , 5 9/11, physical attack

vulnerability , 1–2 Testing and simulation, depth

effectiveness , 111–112 Theft

deception exploitation stage , 43–44

as security concern , 4 Threat factor

insider separation , 65–66 separation techniques , 54

Threat management and best practices , 18 conventional security

correlation , 167–168 SIEM , 167–168

Time-based correlation defi nition , 165 example , 166f worm detection , 172f

Time-division multiplexed (TDM) services, diversity , 82

Tools and methods, national deception program , 49

Top-secret information disclosure control , 142 , 142f and discretion principle , 20 and MLS , 70

Transmission Control Protocol/ Internet Protocol (TCP/ IP), metadata collection , 148

Transparency and deception , 44 national correlation program ,

177 Transportation infrastructure,

insider separation , 66 Trap functionality, and deception

principle , 11–12 Trap isolation, deception

exploitation stage , 44 Trends, data collection ,

156–159 Trusted computing base (TCB)

basic questions , 132 defi nition , 130–133 discretion program goals ,

130–131 national discretion program ,

143 size issues , 131 , 131f

UDP , see User Datagram Protocol (UDP)

Uncertainty objective, as deception , 32

UNIX®-based operating systems , 120

Usage metric data collection , 151 optimal for security , 181f

Use-case studies, depth effectiveness , 111

User Datagram Protocol (UDP), worm tracking , 159–160 , 160f , 161f

Utilization metric, data collection , 151

Value proposition, national correlation program , 177

Vantage point, centralized security , 58–59

Vendors diversity issues , 85 and diversity principle , 17

Vigilant watch, botnet detection , 173

Violation issues access policies , 20 air-gapped networks , 64 and depth , 109–110 information leaks as , 142 infrastructure protection best

practices , 93 Viruses

attack initiation , 1–2 layered protection , 2f , 119–120 past security practice , 107 and response , 198f and trending , 158 and voice services , 84

Voice services , 83–84 Volunteered data , 24 , 158 , 169 ,

184–185 Vulnerability issues

and awareness , 179 and culture of security , 97 and data collection , 149f and deception , 32 , 36 , 42–44 and deceptive open ports , 38 and defense in depth , 19 disclosure lifecycle , 135f early warning process , 197 honey pot mimics , 40 information management ,

184–186 , 186f information reconnaissance ,

139 national infrastructure , 4–6 and “security through

obscurity” , 133–135 Vulnerability risk advisory , 6

Well-known computer security techniques

and exploitation points , 6 and national infrastructure , 2

Wide area fi rewall aggregation example , 58f technique , 57

Windows®-based operating systems

access control lists , 120 and diversity principle , 74

INDEX 233

PC diversity , 77 , 81 Work functions, segregation of

duties , 67 , 68f World-class focus

infrastructure protection , 93 methodology , 94f national commonality

program , 108 Worms

attack initiation , 1–2 , 2f

cloud computing , 80 and correlation , 167 , 170–172 ,

172f and diversity , 75–77 , 76f functionality , 75 Microsoft® Windows® target ,

78 and network diversity , 82 , 83f past security practice , 107 propagation , 75–77 , 76f , 84f

protection against , 4 and response , 198f as security concern , 4 tracking , 159–160 , 160f , 161f and trending , 158

Worst case assumptions, vulnerability information management , 185

  • Cyber Attacks: Protecting National Infrastructure
  • Copyright Page
  • Contents
  • Preface
  • Acknowledgment
  • Chapter 1 Introduction
    • National Cyber Threats, Vulnerabilities, and Attacks
    • Botnet Threat
    • National Cyber Security Methodology Components
    • Deception
    • Separation
    • Diversity
    • Consistency
    • Depth
    • Discretion
    • Collection
    • Correlation
    • Awareness
    • Response
    • Implementing the Principles Nationally
  • Chapter 2 Deception
    • Scanning Stage
    • Deliberately Open Ports
    • Discovery Stage
    • Deceptive Documents
    • Exploitation Stage
    • Procurement Tricks
    • Exposing Stage
    • Interfaces Between Humans and Computers
    • National Deception Program
  • Chapter 3 Separation
    • What Is Separation?
    • Functional Separation
    • National Infrastructure Firewalls
    • DDOS Filtering
    • SCADA Separation Architecture
    • Physical Separation
    • Insider Separation
    • Asset Separation
    • Multilevel Security (MLS)
  • Chapter 4 Diversity
    • Diversity and Worm Propagation
    • Desktop Computer System Diversity
    • Diversity Paradox of Cloud Computing
    • Network Technology Diversity
    • Physical Diversity
    • National Diversity Program
  • Chapter 5 Commonality
    • Meaningful Best Practices for Infrastructure Protection
    • Locally Relevant and Appropriate Security Policy
    • Culture of Security Protection
    • Infrastructure Simplification
    • Certification and Education
    • Career Path and Reward Structure
    • Responsible Past Security Practice
    • National Commonality Program
  • Chapter 6 Depth
    • Effectiveness of Depth
    • Layered Authentication
    • Layered E-Mail Virus and Spam Protection
    • Layered Access Controls
    • Layered Encryption
    • Layered Intrusion Detection
    • National Program of Depth
  • Chapter 7 Discretion
    • Trusted Computing Base
    • Security Through Obscurity
    • Information Sharing
    • Information Reconnaissance
    • Obscurity Layers
    • Organizational Compartments
    • National Discretion Program
  • Chapter 8 Collection
    • Collecting Network Data
    • Collecting System Data
    • Security Information and Event Management
    • Large-Scale Trending
    • Tracking a Worm
    • National Collection Program
  • Chapter 9 Correlation
    • Conventional Security Correlation Methods
    • Quality and Reliability Issues in Data Correlation
    • Correlating Data to Detect a Worm
    • Correlating Data to Detect a Botnet
    • Large-Scale Correlation Process
    • National Correlation Program
  • Chapter 10 Awareness
    • Detecting Infrastructure Attacks
    • Managing Vulnerability Information
    • Cyber Security Intelligence Reports
    • Risk Management Process
    • Security Operations Centers
    • National Awareness Program
  • Chapter 11 Response
    • Pre-Versus Post-Attack Response
    • Indications and Warning
    • Incident Response Teams
    • Forensic Analysis
    • Law Enforcement Issues
    • Disaster Recovery
    • National Response Program
  • Appendix: Sample National Infrastructure Protection Requirements
    • Sample Deception Requirements (Chapter 2)
    • Sample Separation Requirements (Chapter 3)
    • Sample Diversity Requirements (Chapter 4)
    • Sample Commonality Requirements (Chapter 5)
    • Sample Depth Requirements (Chapter 6)
    • Sample Discretion Requirements (Chapter 7)
    • Sample Collection Requirements (Chapter 8)
    • Sample Correlation Requirements (Chapter 9)
    • Sample Awareness Requirements (Chapter 10)
    • Sample Response Requirements (Chapter 11)
  • Index
    • A
    • B
    • C
    • D
    • E
    • F
    • G
    • H
    • I
    • L
    • M
    • N
    • O
    • P
    • Q
    • R
    • S
    • T
    • U
    • V
    • W

course_text_books/nistspecialpublication800-39.pdf

NIST Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

JOINT TASK FORCE TRANSFORMATION INITIATIVE

I N F O R M A T I O N S E C U R I T Y

Computer Security Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899-8930

March 2011

U.S. Department of Commerce Gary Locke, Secretary

National Institute of Standards and Technology Patrick D. Gallagher, Director

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Reports on Computer Systems Technology

The Information Technology Laboratory (ITL) at the National Institute of Standards and Technology (NIST) promotes the U.S. economy and public welfare by providing technical leadership for the nation’s measurement and standards infrastructure. ITL develops tests, test methods, reference data, proof of concept implementations, and technical analyses to advance the development and productive use of information technology. ITL’s responsibilities include the development of management, administrative, technical, and physical standards and guidelines for the cost-effective security and privacy of other than national security-related information in federal information systems. The Special Publication 800-series reports on ITL’s research, guidelines, and outreach efforts in information system security, and its collaborative activities with industry, government, and academic organizations.

PAGE ii

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Authority

This publication has been developed by NIST to further its statutory responsibilities under the Federal Information Security Management Act (FISMA), Public Law (P.L.) 107-347. NIST is responsible for developing information security standards and guidelines, including minimum requirements for federal information systems, but such standards and guidelines shall not apply to national security systems without the express approval of appropriate federal officials exercising policy authority over such systems. This guideline is consistent with the requirements of the Office of Management and Budget (OMB) Circular A-130, Section 8b(3), Securing Agency Information Systems, as analyzed in Circular A-130, Appendix IV: Analysis of Key Sections. Supplemental information is provided in Circular A-130, Appendix III, Security of Federal Automated Information Resources.

Nothing in this publication should be taken to contradict the standards and guidelines made mandatory and binding on federal agencies by the Secretary of Commerce under statutory authority. Nor should these guidelines be interpreted as altering or superseding the existing authorities of the Secretary of Commerce, Director of the OMB, or any other federal official. This publication may be used by nongovernmental organizations on a voluntary basis and is not subject to copyright in the United States. Attribution would, however, be appreciated by NIST.

NIST Special Publication 800-39, 88 pages

(March 2011)

Certain commercial entities, equipment, or materials may be identified in this document in order to describe an experimental procedure or concept adequately. Such identification is not intended to imply recommendation or endorsement by NIST, nor is it intended to imply that the entities, materials, or equipment are necessarily the best available for the purpose.

There may be references in this publication to other publications currently under development by NIST in accordance with its assigned statutory responsibilities. The information in this publication, including concepts and methodologies, may be used by federal agencies even before the completion of such companion publications. Thus, until each publication is completed, current requirements, guidelines, and procedures, where they exist, remain operative. For planning and transition purposes, federal agencies may wish to closely follow the development of these new publications by NIST.

Organizations are encouraged to review all draft publications during public comment periods and provide feedback to NIST. All NIST publications, other than the ones noted above, are available at http://csrc.nist.gov/publications.

National Institute of Standards and Technology Attn: Computer Security Division, Information Technology Laboratory

100 Bureau Drive (Mail Stop 8930) Gaithersburg, MD 20899-8930 Electronic mail: sec-cert@nist.gov

PAGE iii

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Compliance with NIST Standards and Guidelines

In accordance with the provisions of FISMA,1 the Secretary of Commerce shall, on the basis of standards and guidelines developed by NIST, prescribe standards and guidelines pertaining to federal information systems. The Secretary shall make standards compulsory and binding to the extent determined necessary by the Secretary to improve the efficiency of operation or security of federal information systems. Standards prescribed shall include information security standards that provide minimum information security requirements and are otherwise necessary to improve the security of federal information and information systems.

• Federal Information Processing Standards (FIPS) are approved by the Secretary of Commerce and issued by NIST in accordance with FISMA. FIPS are compulsory and binding for federal agencies.2 FISMA requires that federal agencies comply with these standards, and therefore, agencies may not waive their use.

• Special Publications (SPs) are developed and issued by NIST as recommendations and guidance documents. For other than national security programs and systems, federal agencies must follow those NIST Special Publications mandated in a Federal Information Processing Standard. FIPS 200 mandates the use of Special Publication 800-53, as amended. In addition, OMB policies (including OMB Reporting Instructions for FISMA and Agency Privacy Management) state that for other than national security programs and systems, federal agencies must follow certain specific NIST Special Publications.3

• Other security-related publications, including interagency reports (NISTIRs) and ITL Bulletins, provide technical and other information about NIST's activities. These publications are mandatory only when specified by OMB.

• Compliance schedules for NIST security standards and guidelines are established by OMB in policies, directives, or memoranda (e.g., annual FISMA Reporting Guidance).4

1 The E-Government Act (P.L. 107-347) recognizes the importance of information security to the economic and national security interests of the United States. Title III of the E-Government Act, entitled the Federal Information Security Management Act (FISMA), emphasizes the need for organizations to develop, document, and implement an organization-wide program to provide security for the information systems that support its operations and assets. 2 The term agency is used in this publication in lieu of the more general term organization only in those circumstances where its usage is directly related to other source documents such as federal legislation or policy. 3 While federal agencies are required to follow certain specific NIST Special Publications in accordance with OMB policy, there is flexibility in how agencies apply the guidance. Federal agencies apply the security concepts and principles articulated in the NIST Special Publications in accordance with and in the context of the agency’s missions, business functions, and environment of operation. Consequently, the application of NIST guidance by federal agencies can result in different security solutions that are equally acceptable, compliant with the guidance, and meet the OMB definition of adequate security for federal information systems. Given the high priority of information sharing and transparency within the federal government, agencies also consider reciprocity in developing their information security solutions. When assessing federal agency compliance with NIST Special Publications, Inspectors General, evaluators, auditors, and assessors consider the intent of the security concepts and principles articulated within the specific guidance document and how the agency applied the guidance in the context of its mission/business responsibilities, operational environment, and unique organizational conditions. 4 Unless otherwise stated, all references to NIST publications in this document (i.e., Federal Information Processing Standards and Special Publications) are to the most recent version of the publication.

PAGE iv

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Acknowledgements This publication was developed by the Joint Task Force Transformation Initiative Interagency Working Group with representatives from the Civil, Defense, and Intelligence Communities in an ongoing effort to produce a unified information security framework for the federal government. The National Institute of Standards and Technology wishes to acknowledge and thank the senior leaders from the Departments of Commerce and Defense, the Office of the Director of National Intelligence, the Committee on National Security Systems, and the members of the interagency technical working group whose dedicated efforts contributed significantly to the publication. The senior leaders, interagency working group members, and their organizational affiliations include:

U.S. Department of Defense Office of the Director of National Intelligence Teresa M. Takai Adolpho Tarasiuk Jr. Assistant Secretary of Defense for Networks and Assistant Director of National Intelligence and Information Integration/DoD Chief Information Intelligence Community Chief Information Officer (Acting) Officer

Gus Guissanie Charlene P. Leubecker Deputy Assistant Secretary of Defense (Acting) Deputy Intelligence Community Chief Information Officer

Dominic Cussatt Mark J. Morrison Senior Policy Advisor Director, Intelligence Community Information

Assurance

Barbara Fleming Roger Caslow Senior Policy Advisor Chief, Risk Management and Information

Security Programs Division

National Institute of Standards and Technology Committee on National Security Systems Cita M. Furlani Teresa M. Takai Director, Information Technology Laboratory Acting Chair, CNSS

William C. Barker Eustace D. King Cyber Security Advisor, Information Technology Laboratory CNSS Subcommittee Co-Chair

Donna Dodson Peter Gouldmann Chief, Computer Security Division CNSS Subcommittee Co-Chair

Ron Ross Lance Dubsky FISMA Implementation Project Leader CNSS Subcommittee Co-Chair

Joint Task Force Transformation Initiative Interagency Working Group

Ron Ross Gary Stoneburner Jennifer Fabius-Greene Kelley Dempsey NIST, JTF Leader Johns Hopkins APL The MITRE Corporation NIST

Deborah Bodeau Cheri Caddy Peter Gouldmann Arnold Johnson The MITRE Corporation Intelligence Community Department of State NIST

Peter Williams Karen Quigg Richard Graubart Christian Enloe Booz Allen Hamilton The MITRE Corporation The MITRE Corporation NIST

In addition to the above acknowledgments, a special note of thanks goes to Peggy Himes and Elizabeth Lennon for their superb technical editing and administrative support and to Bennett Hodge, Cassandra Kelly, Marshall Abrams, Marianne Swanson, Patricia Toth, Kevin Stine, and Matt Scholl for their valuable insights and contributions. The authors also gratefully acknowledge and appreciate the significant contributions from individuals and organizations in the public and private sectors, both nationally and internationally, whose thoughtful and constructive comments improved the overall quality, thoroughness, and usefulness of this publication.

PAGE v

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

DEVELOPING COMMON INFORMATION SECURITY FOUNDATIONS

COLLABORATION AMONG PUBLIC AND PRIVATE SECTOR ENTITIES

In developing standards and guidelines required by FISMA, NIST consults with other federal agencies and offices as well as the private sector to improve information security, avoid unnecessary and costly duplication of effort, and ensure that NIST publications are complementary with the standards and guidelines employed for the protection of national security systems. In addition to its comprehensive public review and vetting process, NIST is collaborating with the Office of the Director of National Intelligence (ODNI), the Department of Defense (DoD), and the Committee on National Security Systems (CNSS) to establish a common foundation for information security across the federal government. A common foundation for information security will provide the Intelligence, Defense, and Civil sectors of the federal government and their contractors, more uniform and consistent ways to manage the risk to organizational operations and assets, individuals, other organizations, and the Nation that results from the operation and use of information systems. A common foundation for information security will also provide a strong basis for reciprocal acceptance of security assessment results and facilitate information sharing. NIST is also working with public and private sector entities to establish mappings and relationships between the security standards and guidelines developed by NIST and the International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC).

PAGE vi

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

CAUTIONARY NOTE

INTENDED SCOPE AND USE OF THIS PUBLICATION

The guidance provided in this publication is intended to address only the management of information security-related risk derived from or associated with the operation and use of information systems or the environments in which those systems operate. The guidance is not intended to replace or subsume other risk-related activities, programs, processes, or approaches that organizations have implemented or intend to implement addressing areas of risk management covered by other legislation, directives, policies, programmatic initiatives, or mission/business requirements. Rather, the information security risk management guidance described herein is complementary to and should be used as part of a more comprehensive Enterprise Risk Management (ERM) program.

PAGE vii

________________________________________________________________________________________________

   

   

   

   

           

     

   

         

   

 

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Table of Contents

CHAPTER ONE INTRODUCTION............................................................................................ 1 1.1 PURPOSE AND APPLICABILITY .................................................................................................. 3 1.2 TARGET AUDIENCE.................................................................................................................. 3 1.3 RELATED PUBLICATIONS.......................................................................................................... 4 1.4 ORGANIZATION OF THIS SPECIAL PUBLICATION.......................................................................... 5

CHAPTER TWO THE FUNDAMENTALS ................................................................................... 6 2.1 COMPONENTS OF RISK MANAGEMENT ...................................................................................... 6 2.2 MULTITIERED RISK MANAGEMENT............................................................................................. 9 2.3 TIER ONE—ORGANIZATION VIEW............................................................................................ 11 2.4 TIER TWO—MISSION/BUSINESS PROCESS VIEW ...................................................................... 17 2.5 TIER THREE—INFORMATION SYSTEMS VIEW ........................................................................... 21 2.6 TRUST AND TRUSTWORTHINESS ............................................................................................ 23 2.7 ORGANIZATIONAL CULTURE ................................................................................................... 28 2.8 RELATIONSHIP AMONG KEY RISK CONCEPTS ........................................................................... 29

CHAPTER THREE THE PROCESS........................................................................................ 32 3.1 FRAMING RISK ...................................................................................................................... 33 3.2 ASSESSING RISK ................................................................................................................... 37 3.3 RESPONDING TO RISK ........................................................................................................... 41 3.4 MONITORING RISK................................................................................................................. 45

APPENDIX A REFERENCES.............................................................................................. A-1 APPENDIX B GLOSSARY ................................................................................................. B-1 APPENDIX C ACRONYMS................................................................................................. C-1 APPENDIX D ROLES AND RESPONSIBILITIES ..................................................................... D-1 APPENDIX E RISK MANAGEMENT PROCESS TASKS ........................................................... E-1 APPENDIX F GOVERNANCE MODELS .................................................................................F-1 APPENDIX G TRUST MODELS ........................................................................................... G-1 APPENDIX H RISK RESPONSE STRATEGIES ...................................................................... H-1

PAGE viii

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Prologue

“... Through the process of risk management, leaders must consider risk to U.S. interests from adversaries using cyberspace to their advantage and from our own efforts to employ the global nature of cyberspace to achieve objectives in military, intelligence, and business operations...”

“... For operational plans development, the combination of threats, vulnerabilities, and impacts must be evaluated in order to identify important trends and decide where effort should be applied to eliminate or reduce threat capabilities; eliminate or reduce vulnerabilities; and assess, coordinate, and deconflict all cyberspace operations...”

“... Leaders at all levels are accountable for ensuring readiness and security to the same degree as in any other domain...”

-- THE NATIONAL STRATEGY FOR CYBERSPACE OPERATIONS OFFICE OF THE CHAIRMAN, JOINT CHIEFS OF STAFF, U.S. DEPARTMENT OF DEFENSE

PAGE ix

________________________________________________________________________________________________

I

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

CHAPTER ONE

INTRODUCTION THE NEED FOR INTEGRATED ORGANIZATION-WIDE RISK MANAGEMENT

nformation technology is widely recognized as the engine that drives the U.S. economy, giving industry a competitive advantage in global markets, enabling the federal government to provide better services to its citizens, and facilitating greater productivity as a nation.

Organizations5 in the public and private sectors depend on technology-intensive information systems6 to successfully carry out their missions and business functions. Information systems can include diverse entities ranging from high-end supercomputers, workstations, personal computers, cellular telephones, and personal digital assistants to very specialized systems (e.g., weapons systems, telecommunications systems, industrial/process control systems, and environmental control systems). Information systems are subject to serious threats that can have adverse effects on organizational operations (i.e., missions, functions, image, or reputation), organizational assets, individuals, other organizations, and the Nation by exploiting both known and unknown vulnerabilities to compromise the confidentiality, integrity, or availability of the information being processed, stored, or transmitted by those systems. Threats to information and information systems can include purposeful attacks, environmental disruptions, and human/machine errors and result in great harm to the national and economic security interests of the United States. Therefore, it is imperative that leaders and managers at all levels understand their responsibilities and are held accountable for managing information security risk—that is, the risk associated with the operation and use of information systems that support the missions and business functions of their organizations.

Organizational risk can include many types of risk (e.g., program management risk, investment risk, budgetary risk, legal liability risk, safety risk, inventory risk, supply chain risk, and security risk). Security risk related to the operation and use of information systems is just one of many components of organizational risk that senior leaders/executives address as part of their ongoing risk management responsibilities. Effective risk management requires that organizations operate in highly complex, interconnected environments using state-of-the-art and legacy information systems—systems that organizations depend on to accomplish their missions and to conduct important business-related functions. Leaders must recognize that explicit, well-informed risk- based decisions are necessary in order to balance the benefits gained from the operation and use of these information systems with the risk of the same systems being vehicles through which purposeful attacks, environmental disruptions, or human errors cause mission or business failure. Managing information security risk, like risk management in general, is not an exact science. It brings together the best collective judgments of individuals and groups within organizations responsible for strategic planning, oversight, management, and day-to-day operations—providing both the necessary and sufficient risk response measures to adequately protect the missions and business functions of those organizations.

5 The term organization describes an entity of any size, complexity, or positioning within an organizational structure (e.g., a federal agency or, as appropriate, any of its operational elements) that is charged with carrying out assigned mission/business processes and that uses information systems in support of those processes. 6 An information system is a discrete set of information resources organized for the collection, processing, maintenance, use, sharing, dissemination, or disposition of information. In the context of this publication, the definition includes the environment in which the information system operates (i.e., people, processes, technologies, facilities, and cyberspace).

CHAPTER 1 PAGE 1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

The complex relationships among missions, mission/business processes, and the information systems supporting those missions/processes require an integrated, organization-wide view for managing risk.7 Unless otherwise stated, references to risk in this publication refer to information security risk from the operation and use of organizational information systems including the processes, procedures, and structures within organizations that influence or affect the design, development, implementation, and ongoing operation of those systems. The role of information security in managing risk from the operation and use of information systems is also critical to the success of organizations in achieving their strategic goals and objectives. Historically, senior leaders/executives have had a very narrow view of information security either as a technical matter or in a stovepipe that was independent of organizational risk and the traditional management and life cycle processes. This extremely limited perspective often resulted in inadequate consideration of how information security risk, like other organizational risks, affects the likelihood of organizations successfully carrying out their missions and business functions. This publication places information security into the broader organizational context of achieving mission/business success. The objective is to:

• Ensure that senior leaders/executives recognize the importance of managing information security risk and establish appropriate governance structures for managing such risk;

• Ensure that the organization’s risk management process is being effectively conducted across the three tiers of organization, mission/business processes, and information systems;

• Foster an organizational climate where information security risk is considered within the context of the design of mission/business processes, the definition of an overarching enterprise architecture, and system development life cycle processes; and

• Help individuals with responsibilities for information system implementation or operation better understand how information security risk associated with their systems translates into organization-wide risk that may ultimately affect the mission/business success.

To successfully execute organizational missions and business functions with information system- dependent processes, senior leaders/executives must be committed to making risk management a fundamental mission/business requirement. This top-level, executive commitment ensures that sufficient resources are available to develop and implement effective, organization-wide risk management programs. Understanding and addressing risk is a strategic capability and an enabler of missions and business functions across organizations. Effectively managing information security risk organization-wide requires the following key elements:

• Assignment of risk management responsibilities to senior leaders/executives;

• Ongoing recognition and understanding by senior leaders/executives of the information security risks to organizational operations and assets, individuals, other organizations, and the Nation arising from the operation and use of information systems;

• Establishing the organizational tolerance for risk and communicating the risk tolerance throughout the organization including guidance on how risk tolerance impacts ongoing decision-making activities;8 and

• Accountability by senior leaders/executives for their risk management decisions and for the implementation of effective, organization-wide risk management programs.

7 The aggregation of different types of risk across the organization is beyond the scope of this publication. 8 The evaluation of residual risk (which changes over time) to determine acceptable risk is dependent on the threshold set by organizational risk tolerance.

CHAPTER 1 PAGE 2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

1.1 PURPOSE AND APPLICABILITY NIST Special Publication 800-39 is the flagship document in the series of information security standards and guidelines developed by NIST in response to FISMA. The purpose of Special Publication 800-39 is to provide guidance for an integrated, organization-wide program for managing information security risk to organizational operations (i.e., mission, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation resulting from the operation and use of federal information systems. Special Publication 800-39 provides a structured, yet flexible approach for managing risk that is intentionally broad-based, with the specific details of assessing, responding to, and monitoring risk on an ongoing basis provided by other supporting NIST security standards and guidelines. The guidance provided in this publication is not intended to replace or subsume other risk-related activities, programs, processes, or approaches that organizations have implemented or intend to implement addressing areas of risk management covered by other legislation, directives, policies, programmatic initiatives, or mission/business requirements. Rather, the risk management guidance described herein is complementary to and should be used as part of a more comprehensive Enterprise Risk Management (ERM) program.

This publication satisfies the requirements of FISMA and meets or exceeds the information security requirements established for executive agencies9 by the Office of Management and Budget (OMB) in Circular A-130, Appendix III, Security of Federal Automated Information Resources. The guidelines in this publication are applicable to all federal information systems other than those systems designated as national security systems as defined in 44 U.S.C., Section 3542. The guidelines have been broadly developed from a technical perspective to complement similar guidelines for national security systems and may be used for such systems with the approval of appropriate federal officials exercising policy authority over such systems. State, local, and tribal governments, as well as private sector organizations are encouraged to consider using these guidelines, as appropriate.

1.2 TARGET AUDIENCE This publication is intended to serve a diverse group of risk management professionals including:

• Individuals with oversight responsibilities for risk management (e.g., heads of agencies, chief executive officers, chief operating officers);

• Individuals with responsibilities for conducting organizational missions/business functions (e.g., mission/business owners, information owners/stewards, authorizing officials);

• Individuals with responsibilities for acquiring information technology products, services, or information systems (e.g., acquisition officials, procurement officers, contracting officers);

• Individuals with information security oversight, management, and operational responsibilities (e.g., chief information officers, senior information security officers,10 information security managers, information system owners, common control providers);

9 An executive agency is: (i) an executive department specified in 5 U.S.C., Section 101; (ii) a military department specified in 5 U.S.C., Section 102; (iii) an independent establishment as defined in 5 U.S.C., Section 104(1); and (iv) a wholly owned government corporation fully subject to the provisions of 31 U.S.C., Chapter 91. In this publication, the term executive agency is synonymous with the term federal agency. 10 At the agency level, this position is known as the Senior Agency Information Security Officer. Organizations may also refer to this position as the Chief Information Security Officer.

CHAPTER 1 PAGE 3

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

• Individuals with information system/security design, development and implementation responsibilities (e.g., program managers, enterprise architects, information security architects, information system/security engineers; information systems integrators); and

• Individuals with information security assessment and monitoring responsibilities (e.g., system evaluators, penetration testers, security control assessors, independent verifiers/validators, inspectors general, auditors).

1.3 RELATED PUBLICATIONS The risk management approach described in this publication is supported by a series of security standards and guidelines necessary for managing information security risk. In particular, the Special Publications developed by the Joint Task Force Transformation Initiative11 supporting the unified information security framework for the federal government include:

• Special Publication 800-37, Guide for Applying the Risk Management Framework to Federal Information Systems: A Security Life Cycle Approach;

• Special Publication 800-53, Recommended Security Controls for Federal Information Systems and Organizations;

• Special Publication 800-53A, Guide for Assessing the Security Controls in Federal Information Systems and Organizations; and

• Draft Special Publication 800-30, Guide for Conducting Risk Assessments.12

In addition to the Joint Task Force publications listed above, the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC) publish standards for risk management and information security including:

• ISO/IEC 31000, Risk management – Principles and guidelines;

• ISO/IEC 31010, Risk management – Risk assessment techniques;

• ISO/IEC 27001, Information technology – Security techniques – Information security management systems – Requirements; and

• ISO/IEC 27005, Information technology – Security techniques – Information security risk management systems.

NIST’s mission includes harmonization of international and national standards where appropriate. The concepts and principles contained in this publication are intended to implement for federal information systems and organizations, an information security management system and a risk management process similar to those described in ISO/IEC standards. This reduces the burden on organizations that must conform to both ISO/IEC standards and NIST standards and guidance.

11 An overview of each Joint Task Force Transformation Initiative publication, similar to an Executive Summary, can be obtained through appropriate NIST ITL Security Bulletins at http://csrc.nist.gov. 12 Special Publication 800-39 supersedes the original Special Publication 800-30 as the source for guidance on risk management. Special Publication 800-30 is being revised to provide guidance on risk assessment as a supporting document to Special Publication 800-39.

CHAPTER 1 PAGE 4

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

1.4 ORGANIZATION OF THIS SPECIAL PUBLICATION The remainder of this special publication is organized as follows:

• Chapter Two describes: (i) the components of risk management; (ii) the multitiered risk management approach; (iii) risk management at the organization level (Tier 1); (iv) risk management at the mission/business process level (Tier 2); (v) risk management at the information system level (Tier 3); (vi) risk related to trust and trustworthiness; (vii) the effects of organizational culture on risk; and (viii) relationships among key risk management concepts.

• Chapter Three describes a life cycle-based process for managing information security risk including: (i) a general overview of the risk management process; (ii) how organizations establish the context for risk-based decisions; (iii) how organizations assess risk; (iv) how organizations respond to risk; and (v) how organizations monitor risk over time.

• Supporting appendices provide additional risk management information including: (i) general references; (ii) definitions and terms; (iii) acronyms; (iv) roles and responsibilities; (v) risk management process tasks; (vi) governance models; (vii) trust models; and (viii) risk response strategies.

CHAPTER 1 PAGE 5

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

CHAPTER TWO

THE FUNDAMENTALS BASIC CONCEPTS ASSOCIATED WITH RISK MANAGEMENT

This chapter describes the fundamental concepts associated with managing information security risk across an organization including: (i) the components of risk management; (ii) the multitiered risk management approach; (iii) risk management at Tier 1 (organization level); (iv) risk management at Tier 2 (mission/business process level); (v) risk management at Tier 3 (information system level); (vi) risk related to trust and trustworthiness; (vii) the effects of organizational culture on risk; and (viii) the relationships among key risk management concepts.

2.1 COMPONENTS OF RISK MANAGEMENT Managing risk is a complex, multifaceted activity that requires the involvement of the entire organization—from senior leaders/executives providing the strategic vision and top-level goals and objectives for the organization; to mid-level leaders planning, executing, and managing projects; to individuals on the front lines operating the information systems supporting the organization’s missions/business functions. Risk management is a comprehensive process that requires organizations to: (i) frame risk (i.e., establish the context for risk-based decisions); (ii) assess risk; (iii) respond to risk once determined; and (iv) monitor risk on an ongoing basis using effective organizational communications and a feedback loop for continuous improvement in the risk-related activities of organizations. Risk management is carried out as a holistic, organization- wide activity that addresses risk from the strategic level to the tactical level, ensuring that risk- based decision making is integrated into every aspect of the organization.13 The following sections briefly describe each of the four risk management components.

The first component of risk management addresses how organizations frame risk or establish a risk context—that is, describing the environment in which risk-based decisions are made. The purpose of the risk framing component is to produce a risk management strategy that addresses how organizations intend to assess risk, respond to risk, and monitor risk—making explicit and transparent the risk perceptions that organizations routinely use in making both investment and operational decisions. The risk frame establishes a foundation for managing risk and delineates the boundaries for risk-based decisions within organizations. Establishing a realistic and credible risk frame requires that organizations identify: (i) risk assumptions (e.g., assumptions about the threats, vulnerabilities, consequences/impact, and likelihood of occurrence that affect how risk is assessed, responded to, and monitored over time); (ii) risk constraints (e.g., constraints on the risk assessment, response, and monitoring alternatives under consideration); (iii) risk tolerance (e.g., levels of risk, types of risk, and degree of risk uncertainty that are acceptable); and (iv) priorities and trade-offs (e.g., the relative importance of missions/business functions, trade-offs among different types of risk that organizations face, time frames in which organizations must address risk, and any factors of uncertainty that organizations consider in risk responses). The risk framing component and the associated risk management strategy also include any strategic-level decisions on how risk to organizational operations and assets, individuals, other organizations, and the Nation, is to be managed by senior leaders/executives.

13 Integrated, enterprise-wide risk management includes, for example, consideration of: (i) the strategic goals/objectives of organizations; (ii) organizational missions/business functions prioritized as needed; (iii) mission/business processes; (iv) enterprise and information security architectures; and (v) system development life cycle processes.

CHAPTER 2 PAGE 6

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

The second component of risk management addresses how organizations assess risk within the context of the organizational risk frame. The purpose of the risk assessment component is to identify: (i) threats to organizations (i.e., operations, assets, or individuals) or threats directed through organizations against other organizations or the Nation; (ii) vulnerabilities internal and external to organizations;14 (iii) the harm (i.e., consequences/impact) to organizations that may occur given the potential for threats exploiting vulnerabilities; and (iv) the likelihood that harm will occur. The end result is a determination of risk (i.e., the degree of harm and likelihood of harm occurring). To support the risk assessment component, organizations identify: (i) the tools, techniques, and methodologies that are used to assess risk; (ii) the assumptions related to risk assessments; (iii) the constraints that may affect risk assessments; (iv) roles and responsibilities; (v) how risk assessment information is collected, processed, and communicated throughout organizations; (vi) how risk assessments are conducted within organizations; (vii) the frequency of risk assessments; and (viii) how threat information is obtained (i.e., sources and methods).

The third component of risk management addresses how organizations respond to risk once that risk is determined based on the results of risk assessments. The purpose of the risk response component is to provide a consistent, organization-wide, response to risk in accordance with the organizational risk frame by: (i) developing alternative courses of action for responding to risk; (ii) evaluating the alternative courses of action; (iii) determining appropriate courses of action consistent with organizational risk tolerance; and (iv) implementing risk responses based on selected courses of action. To support the risk response component, organizations describe the types of risk responses that can be implemented (i.e., accepting, avoiding, mitigating, sharing, or transferring risk). Organizations also identify the tools, techniques, and methodologies used to develop courses of action for responding to risk, how courses of action are evaluated, and how risk responses are communicated across organizations and as appropriate, to external entities (e.g., external service providers, supply chain partners).15

The fourth component of risk management addresses how organizations monitor risk over time. The purpose of the risk monitoring component is to: (i) verify that planned risk response measures are implemented and information security requirements derived from/traceable to organizational missions/business functions, federal legislation, directives, regulations, policies, and standards, and guidelines, are satisfied; (ii) determine the ongoing effectiveness of risk response measures following implementation; and (iii) identify risk-impacting changes to organizational information systems and the environments in which the systems operate.16 To support the risk monitoring component, organizations describe how compliance is verified and how the ongoing effectiveness of risk responses is determined (e.g., the types of tools, techniques, and methodologies used to determine the sufficiency/correctness of risk responses and if risk mitigation measures are implemented correctly, operating as intended, and producing the desired effect with regard to reducing risk). In addition, organizations describe how changes that may impact the ongoing effectiveness of risk responses are monitored.

14 Organizational vulnerabilities are not confined to information systems but can include, for example, vulnerabilities in governance structures, mission/business processes, enterprise architecture, information security architecture, facilities, equipment, system development life cycle processes, supply chain activities, and external service providers. 15 Supply chain risk management guidance is provided in NIST Interagency Report 7622. 16 Environments of operation include, but are not limited to: the threat space; vulnerabilities; missions/business functions; mission/business processes; enterprise and information security architectures; information technologies; personnel; facilities; supply chain relationships; organizational governance/culture; procurement/acquisition processes; organizational policies/procedures; organizational assumptions, constraints, risk tolerance, and priorities/trade-offs).

CHAPTER 2 PAGE 7

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

As indicated in the four components of risk management described above, organizations also consider external risk relationships, as appropriate. Organizations identify external entities with which there is an actual or potential risk relationship (i.e., organizations which could impose risks on, transfer risks to, or communicate risks to other organizations, as well as those to which organizations could impose, transfer, or communicate risks). External risk relationships include, for example, suppliers, customers or served populations, mission/business partners, and/or service providers. For organizations dealing with advanced persistent threats (i.e., a long-term pattern of targeted, sophisticated attacks) the risk posed by external partners (especially suppliers in the supply chain) may become more pronounced. Organizations establish practices for sharing risk- related information (e.g., threat and vulnerability information) with external entities, including those with which the organizations have a risk relationship as well as those which could supply or receive risk-related information (e.g., Information Sharing and Analysis Centers [ISAC], Computer Emergency Response Teams [CERT]).

Figure 1 illustrates the risk management process and the information and communications flows among components. The black arrows represent the primary flows within the risk management process with risk framing informing all the sequential step-by-step set of activities moving from risk assessment to risk response to risk monitoring. For example, one of the primary outputs from the risk framing component is a description of the sources and methods that organizations use in acquiring threat information (e.g., open source, classified intelligence community reports). The output regarding threat information is a primary input to the risk assessment component and is communicated accordingly to that component. Another example is illustrated in the primary output from the risk assessment component—that is, a determination of risk. The output from the risk assessment component is communicated to the risk response component and is received as a primary input for that component. Another primary input to the risk response component is an output from the risk framing component—the risk management strategy that defines how the organization should respond to risk. Together, these inputs, along with any additional inputs, are used by decision makers when selecting among potential courses of action for risk responses.

Information and Communications Flows

Information and Communications Flows

FRAME

ASSESS

RESPONDMONITOR

FIGURE 1: RISK MANAGEMENT PROCESS

The bidirectional nature of the arrows indicates that the information and communication flows among the risk management components as well as the execution order of the components, may

CHAPTER 2 PAGE 8

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

be flexible and respond to the dynamic nature of the risk management process. For example, new legislation, directives, or policies may require that organizations implement additional risk response measures immediately. This information is communicated directly from the risk framing component to the risk response component where specific activities are carried out to achieve compliance with the new legislation, directives, or policies, illustrating the very dynamic and flexible nature of information as it moves through the risk management process. Chapter Three provides a complete description of the organization-wide risk management process including specifications for inputs/preconditions, activities, and outputs/post conditions.

2.2 MULTITIERED RISK MANAGEMENT To integrate the risk management process throughout the organization, a three-tiered approach is employed that addresses risk at the: (i) organization level; (ii) mission/business process level; and (iii) information system level. The risk management process is carried out seamlessly across the three tiers with the overall objective of continuous improvement in the organization’s risk-related activities and effective inter-tier and intra-tier communication among all stakeholders having a shared interest in the mission/business success of the organization. Figure 2 illustrates the three- tiered approach to risk management along with some of its key characteristics.

STRATEGIC RISK

TIER 1 ORGANIZATION

TIER 2 MISSION / BUSINESS PROCESSES

TIER 3 INFORMATION SYSTEMS

- Inter- Tier and Intra-Tier Communications

- Feedback Loop for Continuous Improvement

- Traceability and Transparency of Risk-Based Decisions

- Organization-Wide Risk Awareness

TACTICAL RISK

FIGURE 2: MULTITIERED ORGANIZATION-WIDE RISK MANAGEMENT

Tier 1 addresses risk from an organizational perspective. Tier 1 implements the first component of risk management (i.e., risk framing), providing the context for all risk management activities carried out by organizations. Tier 1 risk management activities directly affect the activities carried out at Tiers 2 and 3. For example, the missions and business functions defined at Tier 1 influence the design and development of the mission/business processes created at Tier 2 to carry out those missions/business functions. Tier 1 provides a prioritization of missions/business functions which in turn drives investment strategies and funding decisions, thus, affecting the development of enterprise architecture (including embedded information security architecture) at Tier 2 and the allocations and deployment of management, operational, and technical security controls at Tier 3.

CHAPTER 2 PAGE 9

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Other examples of Tier 1 activities that affect Tier 2 and Tier 3 activities include the selection of common controls, the provision of guidance from the risk executive (function)17 to authorizing officials, and the establishment of the order of recovery for information systems supporting critical missions and business operations. Section 2.3 provides a more detailed description of the specific activities associated with Tier 1.

Tier 2 addresses risk from a mission/business process perspective and is informed by the risk context, risk decisions, and risk activities at Tier 1. Tier 2 risk management activities include: (i) defining the mission/business processes needed to support the missions and business functions of organizations; (ii) prioritizing the mission/business processes with respect to the strategic goals and objectives of organizations; (iii) defining the types of information needed to successfully execute the mission/business processes, the criticality/sensitivity of the information, and the information flows both internal and external to organizations; (iv) incorporating information security requirements18 into the mission/business processes; and (v) establishing an enterprise architecture19 with embedded information security architecture20 that promotes cost-effective and efficient information technology solutions consistent with the strategic goals and objectives of the organization and measures of performance. Tier 2 activities directly affect the activities carried out at Tier 3. For example, the information security architecture portion of the enterprise architecture developed at Tier 2 influences and guides the allocation of information protection needs which, in turn, influences and guides the allocation of the security controls to specific components of organizational information systems at Tier 3. Enterprise architecture decisions at Tier 2 affect the design of information systems at Tier 3 including the types of information technologies acceptable for use in developing those systems. The activities carried out at Tier 2 can also provide useful feedback to Tier 1, possibly resulting in revisions to the organizational risk frame or affecting risk management activities carried out at Tier 1, for example those performed by the risk executive (function). Section 2.4 provides a more detailed description of the specific activities associated with Tier 2.

Tier 3 addresses risk from an information system perspective and is guided by the risk context, risk decisions and risk activities at Tiers 1 and 2. Tier 3 risk management activities include: (i) categorizing organizational information systems; (ii) allocating security controls to organizational information systems and the environments in which those systems operate consistent with the organization’s established enterprise architecture and embedded information security architecture; and (iii) managing the selection, implementation, assessment, authorization, and ongoing monitoring of allocated security controls as part of a disciplined and structured system development life cycle process implemented across the organization. At Tier 3, information system owners, common control providers, system and security engineers, and information system security officers make risk-based decisions regarding the implementation, operation, and

17 The risk executive (function) is described in Section 2.3.2. 18 Information security requirements can be obtained from a variety of sources (e.g., legislation, policies, directives, regulations, standards, and organizational mission/business/operational requirements). Organization-level security requirements are documented in the information security program plan or equivalent document. 19 Federal Enterprise Architecture Reference Models and Segment and Solution Architectures are defined in the OMB Federal Enterprise Architecture (FEA) Program, FEA Consolidated Reference Model Document, Version 2.3, October 2003, and OMB Federal Segment Architecture Methodology (FSAM), January 2009, respectively. 20 The information security architecture describes the security-related aspects of the enterprise architecture that are incorporated into the enterprise architecture definition as an integral part of the architecture development—that is a sub-architecture derived from the enterprise architecture, not a separately defined layer or architecture.

CHAPTER 2 PAGE 10

________________________________________________________________________________________________

                                                                                   

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

monitoring of organizational information systems. Based on these day-to-day operational risk- based decisions, authorizing officials make follow-on risk-based decisions on whether or not the information systems are initially authorized to operate within the designated environments of operation or continue to receive authorization to operate on an ongoing basis. These ongoing risk- based decisions are informed by the risk management process with guidance from the risk executive (function) and the various architectural considerations supporting the mission/business processes. In addition, the activities at Tier 3 provide essential feedback to Tiers 1 and 2. New vulnerabilities discovered in an organizational information system, for example, may have systemic implications that extend organization-wide. Those same vulnerabilities may trigger changes to the enterprise architecture and embedded information security architecture or may require an adjustment to the organizational risk tolerance. Section 2.5 provides a more detailed description of the specific activities associated with Tier 3.

Since mission and business success in organizations depends on information systems, those systems must be dependable. To be dependable in the face of sophisticated threats, the information systems must be used wisely in accordance with the degree of protection and resilience achieved.

2.3 TIER ONE—ORGANIZATION VIEW Tier 1 addresses risk from an organizational perspective by establishing and implementing governance structures that are consistent with the strategic goals and objectives of organizations and the requirements defined by federal laws, directives, policies, regulations, standards, and missions/business functions. Governance structures provide oversight for the risk management activities conducted by organizations and include: (i) the establishment and implementation of a risk executive (function); (ii) the establishment of the organization’s risk management strategy including the determination of risk tolerance; and (iii) the development and execution of organization-wide investment strategies for information resources and information security.

2.3.1 Governance In general, governance is the set of responsibilities and practices exercised by those responsible for an organization (e.g., the board of directors and executive management in a corporation, the head of a federal agency) with the express goal of: (i) providing strategic direction; (ii) ensuring that organizational mission and business objectives are achieved; (iii) ascertaining that risks are managed appropriately; and (iv) verifying that the organization’s resources are used responsibly.21 Risks and resources can be associated with different organizational sectors (e.g., legal, finance, information technology, regulatory compliance, information security). Different sectors require specialized expertise in order to manage the risks associated with that sector. Thus, governance within organizations frequently is organized by sector.22 The five outcomes of governance related to organization-wide risk management are:

21 This definition is adapted from the IT Governance Institute. The Chartered Institute of Management Accountants and the International Federation of Accountants also adopted this definition in 2004. 22 While governance is frequently organized by sectors, organizations are well served by establishing a single aligned governance approach. A unified governance approach can coordinate the individual sector governance activities and provide a consistent governance approach, organization-wide.

CHAPTER 2 PAGE 11

________________________________________________________________________________________________

                                             

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

• Strategic alignment of risk management decisions with missions and business functions consistent with organizational goals and objectives;

• Execution of risk management processes to frame, assess, respond to, and monitor risk to organizational operations and assets, individuals, other organizations, and the Nation;

• Effective and efficient allocation of risk management resources;

• Performance-based outcomes by measuring, monitoring, and reporting risk management metrics to ensure that organizational goals and objectives are achieved; and

• Delivered value by optimizing risk management investments in support of organizational objectives.23

As part of organizational governance, senior leaders/executives in consultation and collaboration with the risk executive (function), determine: (i) the types of risk management decisions that are reserved for specific senior leadership roles (e.g., heads of agencies or chief executive officers, chief financial officers, chief information officers, chief information security officers);24 (ii) the types of risk management decisions that are deemed to be organization-wide and the types of decisions that can be delegated to subordinate organizations or to other roles in the organization (e.g., systems and security engineers, mission/business owners, enterprise architects, information security architects, common infrastructure or service providers, authorizing officials); and (iii) how risk management decisions will be communicated to and by the risk executive (function). Three different types of governance models (i.e., centralized, decentralized, and hybrid) are described in Appendix F. Regardless of the governance model(s) employed, clear assignment and accountability for accepting risk is essential for effective risk management.

Strong governance is the best indicator of senior leadership commitment to effective, consistent risk management across the organization to achieve ongoing mission/business success.

2.3.2 Risk Executive (Function) The risk executive is a functional role established within organizations to provide a more comprehensive, organization-wide approach to risk management. The risk executive (function) serves as the common risk management resource for senior leaders/executives, mission/business owners, chief information officers, chief information security officers, information system owners, common control providers,25 enterprise architects, information security architects, information systems/security engineers, information system security managers/officers, and any other stakeholders having a vested interest in the mission/business success of organizations. The risk executive (function) coordinates with senior leaders/executives to:

• Establish risk management roles and responsibilities;

23 Information security governance outcomes adapted from IT Governance Institute, Information Security Governance: Guidance for Boards of Directors and Executive Management, 2nd Edition, 2006. 24 There is no implication by listing various titles within an organization of any particular relationship (peer or otherwise) or lines of authority. 25 A common control provider is an organizational official responsible for the development, implementation, assessment, and monitoring of common controls (i.e., security controls inherited by information systems).

CHAPTER 2 PAGE 12

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

• Develop and implement an organization-wide risk management strategy that guides and informs organizational risk decisions (including how risk is framed, assessed, responded to, and monitored over time); 26

• Manage threat and vulnerability information with regard to organizational information systems and the environments in which the systems operate;

• Establish organization-wide forums to consider all types and sources of risk (including aggregated risk);

• Determine organizational risk based on the aggregated risk from the operation and use of information systems and the respective environments of operation;

• Provide oversight for the risk management activities carried out by organizations to ensure consistent and effective risk-based decisions;

• Develop a greater understanding of risk with regard to the strategic view of organizations and their integrated operations;

• Establish effective vehicles and serve as a focal point for communicating and sharing risk- related information among key stakeholders internally and externally to organizations;

• Specify the degree of autonomy for subordinate organizations permitted by parent organizations with regard to framing, assessing, responding to, and monitoring risk;27

• Promote cooperation and collaboration among authorizing officials to include security authorization actions requiring shared responsibility (e.g., joint/leveraged authorizations);28

• Ensure that security authorization decisions consider all factors necessary for mission and business success; and

• Ensure shared responsibility for supporting organizational missions and business functions using external providers receives the needed visibility and is elevated to appropriate decision- making authorities.

The risk executive (function) presumes neither a specific organizational structure nor formal responsibility assigned to any one individual or group within the organization. Heads of agencies or organizations may choose to retain the risk executive (function) or to delegate the function. The risk executive (function) requires a mix of skills, expertise, and perspectives to understand the strategic goals and objectives of organizations, organizational missions/business functions, technical possibilities and constraints, and key mandates and guidance that shape organizational operations. To provide this needed mixture, the risk executive (function) can be filled by a single individual or office (supported by an expert staff) or by a designated group (e.g., a risk board,

26 Organizational risk decisions include investment decisions (see Section 2.3.4). Organizational risk tolerance is determined as part of the risk framing component (see Section 2.3.3) and defined in the risk management strategy. 27 Because subordinate organizations responsible for carrying out derivative or related missions may have already invested in their own methods of framing, assessing, responding to, and monitoring risk, parent organizations may allow a greater degree of autonomy within parts of the organization or across the entire organization in order to minimize costs. When a diversity of risk management activities is allowed, organizations may choose to employ, when feasible, some means of translation and/or synthesis of the risk-related information produced from those activities to ensure that the output of the different activities can be correlated in a meaningful manner. 28 NIST Special Publication 800-37 provides guidance on joint and leveraged authorizations.

CHAPTER 2 PAGE 13

________________________________________________________________________________________________

                                             

                                                   

                       

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

executive steering committee, executive leadership council).29 The risk executive (function) fits into the organizational governance structure in such a way as to facilitate efficiency and to maximize effectiveness. While the organization-wide scope situates the risk executive (function) at Tier 1, its role entails ongoing communications with and oversight of the risk management activities of mission/business owners, authorizing officials, information system owners, common control providers, chief information officers, chief information security officers, information system and security engineers, information system security managers/officers, and other stakeholders at Tiers 2 and 3.

To be effective, organization‐wide risk management programs require the strong commitment, direct involvement, and ongoing support from senior leaders/executives. The objective is to institutionalize risk management into the day‐to‐day operations of organizations as a priority and an integral part of how organizations conduct operations in cyberspace—recognizing that this is essential in order to successfully carry out missions in threat‐laden operational environments.

2.3.3 Risk Management Strategy An organizational risk management strategy, one of the key outputs of risk framing, addresses how organizations intend to assess, respond to, and monitor risk—the risk associated with the operation and use of organizational information systems. The risk management strategy makes explicit the specific assumptions, constraints, risk tolerances, and priorities/trade-offs used within organizations for making investment and operational decisions. The risk management strategy also includes any strategic-level decisions and considerations on how senior leaders/executives are to manage information security risk to organizational operations and assets, individuals, other organizations, and the Nation. An organization-wide risk management strategy includes, for example, an unambiguous expression of the risk tolerance for the organization, acceptable risk assessment methodologies, risk response strategies, a process for consistently evaluating risk across the organization with respect to the organization’s risk tolerance, and approaches for monitoring risk over time. The use of a risk executive (function) can facilitate consistent, organization-wide application of the risk management strategy. The organization-wide risk management strategy can be informed by risk-related inputs from other sources both internal and external to the organization to ensure the strategy is both broad-based and comprehensive.

An important Tier 1 risk management activity and also part of risk framing, is the determination of risk tolerance. Risk tolerance is the level of risk or degree of uncertainty that is acceptable to organizations and is a key element of the organizational risk frame. Risk tolerance affects all components of the risk management process—having a direct impact on the risk management decisions made by senior leaders/executives throughout the organization and providing important constraints on those decisions. For example, risk tolerance affects the nature and extent of risk management oversight implemented in organizations, the extent and rigor of risk assessments performed, and the content of organizational strategies for responding to risk. With regard to risk assessments, more risk-tolerant organizations may be concerned only with those threats that peer organizations have experienced while less risk-tolerant organizations may expand the list to include those threats that are theoretically possible, but which have not been observed in operational environments. With regard to risk response, less risk-tolerant organizations are likely

29 Organizations emphasize the need for inclusiveness within the risk executive (function) by senior leaders/executives in mission/business areas to help ensure proper information security planning, resourcing, and risk management.

CHAPTER 2 PAGE 14

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

to require additional grounds for confidence in the effectiveness of selected safeguards and countermeasures or prefer safeguards and countermeasures that are more mature and have a proven track record. Such organizations may also decide to employ multiple safeguards and countermeasures from multiple sources (e.g., antivirus software at clients and servers that are provided by different vendors). Another example illustrating the impact of risk tolerance on risk response is that risk tolerance can also affect the organizational requirements for trustworthiness provided by specific information technologies. Two organizations may choose the same information technologies, but their relative degree of risk tolerance may impact the degree of assessment required prior to deployment.

There is no correct level of organizational risk tolerance. Rather, the degree of risk tolerance is: (i) generally indicative of organizational culture; (ii) potentially different for different types of losses/compromises; and (iii) highly influenced by the individual subjective risk tolerance of senior leaders/executives. Yet, the ramifications of risk decisions based on risk tolerance are potentially profound, with less risk-tolerant organizations perhaps failing to achieve needed mission/business capabilities in order to avoid what appears to be unacceptable risk; while more risk-tolerant organizations may focus on near-term mission/business efficiencies at the expense of setting themselves up for future failure. It is important that organizations exercise due diligence in determining risk tolerance—recognizing how fundamental this decision is to the effectiveness of the risk management program.

2.3.4 Investment Strategies Investment strategies30 play a significant role in organizational risk management efforts. These strategies generally reflect the long-term strategic goals and objectives of organizations and the associated risk management strategies developed and executed to ensure mission and business success. Underlying all investment strategies is the recognition that there is a finite amount of resources available to invest in helping organizations effectively manage risk—that is, effectively addressing risk to achieve on-going mission/business success.

Mission and Risk Priorities Organizations generally conduct a variety of missions and are involved in different types of business functions. This is especially true for large and complex organizations that have different organizational components, each of which is typically focused on one or two primary missions. While all of these organizational components and associated missions/business functions are likely to be important and play a key role in the overall success of organizations, in reality they are not of equal importance. The greater the criticality of organizational missions and business functions, the greater the necessity for organizations to ensure that risks are adequately managed. Such missions and business functions are likely to require a greater degree of risk management investments than missions/business functions deemed less critical. The determination of the relative importance of the missions/business functions and hence the level of risk management investment, is something that is decided upon at Tier 1, executed at Tier 2, and influences risk management activities at Tier 3.

Anticipated Risk Response Needs There is a great variation in the nature of potential threats facing organizations, ranging from hackers attempting to merely deface organizational Web sites (e.g., cyber vandalism), to insider

30 Investment strategies can include organizational approaches to: (i) replacing legacy information systems (e.g., phasing items in gradually, replacing entirely); (ii) outsourcing and using external providers of information systems and services; and (iii) internal development vs. acquisition of commercially available information technology products.

CHAPTER 2 PAGE 15

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

threats, to sophisticated terrorist groups/organized criminal enterprises seeking to exfiltrate sensitive information, to a nation state’s military attempting to destroy or disrupt critical missions by attacking organizational information systems.31 The strategic investments required to address the risk from more traditional adversaries (e.g., hackers conducting small-group activities with limited capabilities) are considerably different than the investments required to address the risk associated with advanced persistent threats consistent with more advanced adversaries (e.g., nation states or terrorist groups with highly sophisticated levels of expertise and resources that seek to establish permanent footholds in organizations for purposes of impeding aspects of the organizational missions). To address less sophisticated threats, organizations can focus their efforts at Tier 3—investing to ensure that needed safeguards and countermeasures (e.g., security controls, security services, and technologies) are obtained, implemented correctly, operating as intended, and producing the desired effect with regard to meeting information security policies and addressing known vulnerabilities. In addition to these basic investments, organizations can also invest in continuous monitoring processes to ensure that the acquired security controls, services, and technologies are operating effectively throughout the system development life cycle.

When organizations need to address advanced persistent threats, it is likely that adequately addressing related risks at Tier 3 is not feasible because necessary security solutions are not currently available in the commercial marketplace. In those instances, organizations must purposefully invest beyond Tier 3 for significant response capabilities at Tier 2, and to some extent at Tier 1. At Tier 3, the nature of investment is likely to change from implementation of existing solutions to an added strategic focus on investing in leading-edge information security technologies (essentially experimenting with innovative security solutions/technologies and being an early adopter) or investing in information security research and development efforts to address specific technology gaps.32 Information security investments to address advanced persistent threats may require expenditures over the course of several years, as new security solutions and technologies transition from research to development to full deployment. The long-term view of strategic investing in the risk response needs for organizations can help to reduce the continuing focus on near-term vulnerabilities discovered in information systems—vulnerabilities that exist due to the complexity of the information technology products and systems and the inherent weaknesses in those products and systems.

Limitations on Strategic Investments The ability of organizations to provide strategic information security investments is limited. Where the desired strategic investment funding or strategic resources33 are not available to address specific needs, organizations may be forced to make compromises. For example, organizations might extend the time frame required for strategic information security objectives to be accomplished. Alternatively, organizations might prioritize risk management investments, opting to provide resources (financial or otherwise) to address some critical strategic needs sooner than other less critical needs. All investment decisions require organizations to prioritize risks and to assess the potential impacts associated with alternative courses of action.

31 The threats described above are a subset of the overarching threat space that also includes errors of omission and commission, natural disasters, and accidents. 32 This investment strategy is a change from vulnerability and patch management to a longer-term strategy addressing information security gaps such as the lack of information technology products with the trustworthiness necessary to achieve information system resilience in the face of advanced persistent threats. 33 In some instances, the limitations may not be financial in nature, but limitations in the number of individuals with the appropriate skills/expertise or limitations regarding the state of technology.

CHAPTER 2 PAGE 16

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

2.4 TIER TWO—MISSION/BUSINESS PROCESS VIEW Tier 2 addresses risk from a mission/business process perspective by designing, developing, and implementing mission/business processes that support the missions/business functions defined at Tier 1. Organizational mission/business processes guide and inform the development of an enterprise architecture that provides a disciplined and structured methodology for managing the complexity of the organization’s information technology infrastructure. A key component of the enterprise architecture is the embedded information security architecture that provides a roadmap to ensure that mission/business process-driven information security requirements and protection needs are defined and allocated to appropriate organizational information systems and the environments in which those systems operate.

2.4.1 Risk-Aware Mission/Business Processes The risk management activities at Tier 2 begin with the identification and establishment of risk- aware mission/business processes to support the organizational missions and business functions. A risk-aware mission/business process is one that explicitly takes into account the likely risk such a process would cause if implemented. Risk aware processes are designed to manage risk in accordance with the risk management strategy defined at Tier 1 and explicitly account for risk when evaluating the mission/business activities and decisions at Tier 2.34 Implementing risk- aware mission/business processes requires a thorough understanding of the organizational missions and business functions and the relationships among missions/business functions and supporting processes. This understanding is a prerequisite to building mission/business processes sufficiently resilient to withstand a wide variety of threats including routine and sophisticated cyber attacks, errors/accidents, and natural disasters. An important part of achieving risk-aware processes is the understanding of senior leaders/executives of: (i) the types of threat sources and threat events that can adversely affect the ability of organizations to successfully execute their missions/business functions); (ii) the potential adverse impacts/consequences on organizational operations and assets, individuals, other organizations, or the Nation if the confidentiality, integrity, or availability of information or information systems used in a mission/business process is compromised; and (iii) the likely resilience to such a compromise that can be achieved with a given mission/business process definition, applying realistic expectations for the resilience of information technology.

A key output from the Tier 2 definition of mission/business processes is the selected risk response strategy35 for these processes within the constraints defined in the risk management strategy. The risk response strategy includes identification of information protection needs and the allocation of those needs across components of the process (e.g., allocation to protections within information systems, protections in the operational environments of those systems, and allocation to alternate mission/business execution paths based on the potential for compromise).

2.4.2 Enterprise Architecture A significant risk-related issue regarding the ability of organizations to successfully carry out missions and business functions is the complexity of the information technology being used in information systems. To address this complexity and associated potential risk, organizations need a disciplined and structured approach for managing information technology assets supporting

34 The identification of organizational mission/business processes includes defining the types of information that the organization needs to successfully execute those processes, the criticality and/or sensitivity of the information, and the information flows both internal and external to the organization. 35 Risk response strategies are described in Appendix H.

CHAPTER 2 PAGE 17

________________________________________________________________________________________________

                                                                         

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

their mission/business processes. Providing greater clarity and understanding of the information technology infrastructure of organizations including the design and development of the associated information systems is a prerequisite for maximizing the resilience and wise use of these systems in the face of increasingly sophisticated threats. This type of clarity and understanding can be effectively achieved through the development and implementation of enterprise architecture.

Enterprise architecture is a management practice employed by organizations to maximize the effectiveness of mission/business processes and information resources in helping to achieve mission/business success. Enterprise architecture establishes a clear and unambiguous connection from investments (including information security investments) to measurable performance improvements whether for an entire organization or portion of an organization. Enterprise architecture also provides an opportunity to standardize, consolidate, and optimize information technology assets. These activities ultimately produce information systems that are more transparent and therefore, easier to understand and protect. In addition to establishing a roadmap for more efficient and cost-effective usage of information technology throughout organizations, enterprise architecture provides a common language for discussing risk management issues related to missions, business processes, and performance goals—enabling better coordination and integration of efforts and investments across organizational and business activity boundaries. A well-designed enterprise architecture implemented organization-wide, promotes more efficient, cost-effective, consistent, and interoperable information security capabilities to help organizations better protect missions and business functions—and ultimately more effectively manage risk.

The Federal Enterprise Architecture (FEA) defines a collection of interrelated reference models including Performance, Business, Service Component, Data, and Technical as well as more detailed segment and solution architectures that are derived from the enterprise architecture.36 Organizational assets (including programs, processes, information, applications, technology, investments, personnel, and facilities) are mapped to the enterprise-level reference models to create a segment-oriented view of organizations. Segments are elements of organizations describing mission areas, common/shared business services, and organization-wide services. From an investment perspective, segment architecture drives decisions for a business case or group of business cases supporting specific mission areas or common/shared services. The primary stakeholders for segment architecture are mission/business owners. Following closely from segment architecture, solution architecture defines the information technology assets within organizations used to automate and improve mission/business processes. The scope of solution architecture is typically used to develop and implement all or parts of information systems or business solutions, including information security solutions. The primary stakeholders for solution architectures are information system developers and integrators, information system owners, information system/security engineers, and end users.

The FEA concepts that define needs‐driven, performance‐based business processes are applied by organizations, recognizing that effectively managing risk arising from operating in a cyberspace environment with sophisticated, high‐end threats is a key need and measure of performance.

36 The Federal Enterprise Architecture is described in a series of documents published by the OMB FEA Program Management Office. Additional information on the FEA reference models and the segment and solution architectures can be found in the FEA Consolidated Reference Model Document and FEA Practice Guidance, respectively.

CHAPTER 2 PAGE 18

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Enterprise architecture also promotes the concepts of segmentation, redundancy, and elimination of single points of failure—all concepts that can help organizations more effectively manage risk. Segmentation is important because it allows organizations to separate missions/business functions and operations and the information systems, system components, or subsystems supporting those missions, functions, and operations from other functions and operations and supporting systems. Segmentation helps to define more manageable components and to potentially reduce the degree of harm from a successful threat exploitation of a vulnerability. Segment architecture supports the concept of segmentation at the highest levels of organizations and the concept is carried forward through solution architecture (including decomposition of information systems and networks into subsystems and subnetworks, as appropriate).

The concept of redundancy is also very important in enterprise architecture. With the high probability of breaches or compromises when threats exploit vulnerabilities in organizational information systems, the failure or degradation of one or more information system components is inevitable. To enhance information system resilience as part of risk response, organizational information systems provide a failover mode that helps to ensure that failed components trigger appropriate backup components with similar capability. This type of capability is essential to address the advanced persistent threat in situations where organizations might be required to operate while under cyber attack in a degraded mode but still providing a sufficient level of capability to achieve mission/business success. Segment and solution architectures support the concept of redundancy by establishing a disciplined and structured approach to developing and implementing key architectural considerations that facilitate replication of critical information system components, where appropriate.

Finally, the concept of single point of failure and the elimination of such failure points is easily supported by enterprise architecture. Having the essential visibility and transparency provided in the architectural design at the organization level exposes potential single points of failure early in the development process. Thus, single points of failure are effectively addressed by segment and solution architectures. Failure to address potential single points of failure early in the architectural design can result in severe or catastrophic effects when those failure points are propagated to information systems and the actual failure causes a loss of mission/business capability.

2.4.3 Information Security Architecture The information security architecture is an integral part of the organization’s enterprise architecture. It represents that portion of the enterprise architecture specifically addressing information system resilience and providing architectural information for the implementation of security capabilities.37 The primary purpose of the information security architecture is to ensure that mission/business process-driven information security requirements are consistently and cost- effectively achieved in organizational information systems and the environments in which those systems operate consistent with the organizational risk management strategy.38 The information security architecture also incorporates security requirements from legislation, directives, policies, regulations, standards, and guidance into the segment architecture. Ultimately, the information security architecture provides a detailed roadmap that allows traceability from the highest-level strategic goals and objectives of organizations, through specific mission/business protection needs, to specific information security solutions provided by people, processes, and technologies.

37 In general, a version of an information security architecture exists for each of the enterprise architecture reference models; including Performance, Business, Service Component, Data, and Technical. 38 Organizations employ sound system and security engineering principles and techniques to ensure that information security requirements are effectively implemented in organizational information systems.

CHAPTER 2 PAGE 19

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Information security requirements defined in the segment architecture are implemented in the solution architecture in the form of management, operational, and technical security controls. The security controls are employed within or inherited by the individual information systems and the environments in which the systems operate. The allocation39 of security controls is consistent with the information security architecture as well as concepts such as defense-in-depth and defense-in- breadth. Figure 3 illustrates the process of integrating information security requirements into the enterprise architecture and the associated information systems supporting the mission/business processes of organizations.

Environments of Operation

INFORMATION SYSTEM

INFORMATION SYSTEM

INFORMATION SYSTEM

ORGANIZATION RISK MANAGEMENT STRATEGY

Mission / Business Process

Mission / Business Process

Mission / Business Process

ENTERPRISE ARCHITECTURE (Reference Models, Segment Architecture, Solution Architecture)

INFORMATION SECURITY ARCHITECTURE (Security Requirement and Control Allocation)

INFORMS

INFORMS

INFORMS

INFORMS

FIGURE 3: INFORMATION SECURITY REQUIREMENTS INTEGRATION

To summarize, risk management considerations can be addressed as an integral part of the enterprise architecture by:

• Developing a segment architecture linked to the strategic goals and objectives of organizations, defined missions/business functions, and associated mission/business processes;

• Identifying where effective risk response is a critical element in the success of organizational missions and business functions;

• Defining the appropriate, architectural-level information security requirements within organization-defined segments based on the organization’s risk management strategy;

• Incorporating an information security architecture that implements architectural-level information security requirements;

39 Security control allocation occurs down to the information system component level, employing security controls in selected system components assigned to provide a specific security capability. Specific guidance on how to incorporate information security requirements into enterprise architecture is provided in the FEA Security and Privacy Profile.

CHAPTER 2 PAGE 20

________________________________________________________________________________________________

                                                                                                                   

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

• Translating the information security requirements from the segment architecture into specific security controls for information systems/environments of operation as part of the solution architecture;

• Allocating management, operational, and technical security controls to information systems and environments of operation as defined by the information security architecture; and

• Documenting risk management decisions at all levels of the enterprise architecture.40

Enterprise architecture provides a disciplined and structured approach to achieving consolidation, standardization, and optimization of information technology assets that are employed within organizations. Risk reduction can be achieved through the full integration of management processes41 organization-wide, thereby providing greater degrees of security, privacy, reliability, and cost-effectiveness for the missions and business functions being carried out by organizations. This integrated approach of incorporating the organization’s risk management strategy into enterprise architecture gives senior leaders/executives the opportunity to make more informed risk-based decisions in dynamic operating environments—decisions based on trade-offs between fulfilling and improving organizational missions and business functions and managing the many types and sources of risk that must be considered in their risk management responsibilities.

The use of enterprise architecture can greatly enhance an organization’s risk posture by providing greater transparency and clarity in design and development activities—enabling a more consistent application of the principle of ‘wise use’ of technologies across the organization; optimizing the trade‐offs between value gained from and the risk being incurred through the information systems supporting organizational missions/business functions.

2.5 TIER THREE—INFORMATION SYSTEMS VIEW All information systems, including operational systems, systems under development, and systems undergoing modification, are in some phase of the system development life cycle.42 In addition to the risk management activities carried out at Tier 1 and Tier 2 (e.g., reflecting the organization’s risk management strategy within the enterprise architecture and embedded information security architecture), risk management activities are also integrated into the system development life cycle of organizational information systems at Tier 3. The risk management activities at Tier 3 reflect the organization’s risk management strategy and any risk related to the cost, schedule, and performance requirements for individual information systems supporting the mission/business functions of organizations. Risk management activities take place at every phase in the system development life cycle with the outputs at each phase having an effect on subsequent phases.

40 The activities required to effectively incorporate information security into enterprise architecture are carried out by key stakeholders within organizations including mission/business owners, chief information officers, chief information security officers, authorizing officials, and the risk executive (function). 41 A management process is a process for planning and controlling the performance or execution of organizational activities (e.g., programs, projects, tasks, processes). Management processes are often referred to as performance measurement and management systems. 42 There are typically five phases in system development life cycles: (i) initiation; (ii) development/ acquisition; (iii) implementation; (iv) operation/maintenance; and (v) disposal. Organizations may use a variety of system development life cycle processes including, for example, waterfall, spiral, or agile development.

CHAPTER 2 PAGE 21

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

For example, requirements definition43 is a critical part of any system development process and begins very early in the life cycle, typically in the initiation phase. The latest threat information that is available to organizations, or current organizational assumptions concerning threat, may significantly influence information system requirements and the types of solutions that are deemed by organizations to be acceptable (from a technological and operational perspective) in the face of such threats. Information security requirements are a subset of the functional requirements levied on information systems and are incorporated into the system development life cycle simultaneously with the other requirements. The information security requirements define the needed security functionality44 for information systems and the level of trustworthiness for that functionality (see Section 2.6 on the trustworthiness of information systems).

Organizations also address risk management issues during the development/acquisition phase of the system development life cycle (e.g., system design, system development/integration, and demonstration). Whether in response to specific and credible threat information or assumptions about the threat, potential design-related vulnerabilities in organizational information systems can be mitigated during this phase by choosing less susceptible alternatives. Supply chain risk during the acquisition phase of the information system is also an area of concern for organizations. To address supply chain risk during the development/acquisition phase, organizations implement specific security controls as deemed necessary by the organization. Organizations also consider risk from the standpoint of the environment in which the information systems are intended to operate when selecting the most appropriate security controls. To be effective, controls must be mutually supporting, employed with realistic expectations for effectiveness, and implemented as part of an explicit, information system-level security architecture that is consistent with the security architecture embedded in the organization’s enterprise architecture. For example, when certain technical controls are less than effective due to achievable levels of trustworthiness in organizational information systems, management and operational controls are employed as compensating controls—thus providing another opportunity to manage risk.

Subsequent to initiation, development, and acquisition, the implementation phase of the system development life cycle provides an opportunity for the organization to determine the effectiveness of the selected security controls employed within or inherited by the information systems under development prior to the commencement of actual operations. Expectations generated during this phase can be compared with actual behavior as information systems are implemented. Given the current threat information that is available to organizations and organizational assumptions about the threat, the information discovered during effectiveness assessments, and the potential adverse impacts on organizational missions/business functions, it may be necessary to modify or change the planned implementation of the information system. Risk-related information can be developed to justify the proposed changes.

Once approved for operation, information systems move into the operations/maintenance phase of the system development life cycle. The monitoring of security control effectiveness and any changes to organizational information systems and the environments in which those systems operate ensure that selected risk response measures are operating as intended on an ongoing basis. Ongoing monitoring is paramount to maintaining situational awareness of risk to organizational missions and business functions—an awareness that is critical to making the necessary course

43 Information security requirements can be obtained from a variety of sources (e.g., legislation, policies, directives, regulations, standards, and organizational mission/business/operational requirements). 44 Security functionality is the set of security controls employed within or inherited by an information system or the environment in which the system operates. The security controls, described in NIST Special Publication 800-53, are implemented by a combination of people, processes, and technologies.

CHAPTER 2 PAGE 22

________________________________________________________________________________________________

                                                                         

                             

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

corrections when risk exceeds organizational risk tolerance. During the disposal phase of the system development life cycle, it is standard procedure for organizations to verifiably remove prior to disposal, any information from information systems that may cause adverse impacts, if compromised, and also assess any risk associated with these activities.45

Early integration of information security requirements into the system development life cycle is the most cost-effective method for implementing the organizational risk management strategy at Tier 3.46 Incorporating risk management into the system development life cycle ensures that the risk management process is not isolated from the other management processes employed by the organization to develop, acquire, implement, operate, and maintain the information systems supporting organizational missions and business functions. To support system development life cycle integration, risk management (including information security considerations) is also incorporated into program, planning, and budgeting activities to help ensure that appropriate resources are available when needed—thus facilitating the completion of program and project milestones established by organizations. To incorporate risk management into program, planning and budgeting activities, risk and information security professionals are an integral part of the teams and structures used to address information system and organizational requirements.

The overall resilience of organizational information systems (i.e., how well systems operate while under stress) is a key factor and performance measure in determining the potential survivability of missions/business functions. The use of certain information technologies may introduce inherent vulnerabilities into these information systems—resulting in risk that may have to be mitigated by reengineering the current mission/business processes. The wise use of information technologies during the design, development, and implementation of organizational information systems is of paramount importance in managing risk.

Making information security‐related requirements and activities an integral part of the system development life cycle ensures that senior leaders/executives consider the risks to organizational operations and assets, individuals, other organizations, and the Nation resulting from the operation and use of information systems and take appropriate actions to exercise the organization’s due diligence.

2.6 TRUST AND TRUSTWORTHINESS Trust is an important concept related to risk management. How organizations approach trust influences their behaviors and their internal and external trust relationships. This section introduces some conceptual ways of thinking about trust, defines the concept of trustworthiness, and shows how the concept of trustworthiness can be used in developing trust relationships. Appendix G describes several trust models that can be applied in an organizational context, and

45 While the presentation of the system development life cycle is expressed as a linear flow, in reality, the knowledge gained during a later phase of the life cycle or changes in system requirements or operational environments may dictate revisiting an earlier phase. For example, changes in the threat environment during the operation/maintenance phrase may dictate the need to initiate a new or revised system capability. 46 The Risk Management Framework (RMF), described in NIST Special Publication 800-37, provides a structured process that integrates risk management activities into the system development life cycle. The RMF operates primarily at Tier 3 but also interacts with Tier 1 and Tier 2 (e.g., providing feedback from authorization decisions to the risk executive [function], disseminating updated risk information to authorizing officials, common control providers, and information system owners).

CHAPTER 2 PAGE 23

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

considers how trust can be measured. The importance of organizational governance, culture, and transparency47 are also considered with regard to trust and its affect on risk management.

Trust is a belief that an entity will behave in a predictable manner in specified circumstances. The entity may be a person, process, object or any combination of such components. The entity can be of any size from a single hardware component or software module, to a piece of equipment identified by make and model, to a site or location, to an organization, to a nation-state. Trust, while inherently a subjective determination, can be based on objective evidence and subjective elements. The objective grounds for trust can include for example, the results of information technology product testing and evaluation. Subjective belief, level of comfort, and experience may supplement (or even replace) objective evidence, or substitute for such evidence when it is unavailable. Trust is usually relative to a specific circumstance or situation (e.g., the amount of money involved in a transaction, the sensitivity or criticality of information, or whether safety is an issue with human lives at stake). Trust is generally not transitive (e.g., you trust a friend but not necessarily a friend of a friend). Finally, trust is generally earned, based on experience or measurement. However, in certain organizations, trust may be mandated by policy (see Appendix G, mandated trust model).

Trustworthiness is an attribute of a person or organization that provides confidence to others of the qualifications, capabilities, and reliability of that entity to perform specific tasks and fulfill assigned responsibilities. Trustworthiness is also a characteristic of information technology products and systems (see Section 2.6.2 on trustworthiness of information systems). The attribute of trustworthiness, whether applied to people, processes, or technologies, can be measured, at least in relative terms if not quantitatively.48 The determination of trustworthiness plays a key role in establishing trust relationships among persons and organizations. The trust relationships are key factors in risk decisions made by senior leaders/executives.

2.6.1 Establishing Trust Among Organizations Parties enter into trust relationships based on mission and business needs.49 Trust among parties typically exists along a continuum with varying degrees of trust achieved based on a number of factors. Organizations can still share information and obtain information technology services even if their trust relationship falls short of complete trust. The degree of trust required for organizations to establish partnerships can vary widely based on many factors including the organizations involved and the specifics of the situation (e.g., the missions, goals, and objectives of the potential partners, the criticality/sensitivity of activities involved in the partnership, the risk tolerance of the organizations participating in the partnership, and the historical relationship among the participants). Finally, the degree of trust among entities is not a static quality but can vary over time as circumstances change.

47 Transparency is achieved by providing visibility into the risk management and information security activities carried out by organizations participating in partnerships (e.g., employing common security standards, specification language for security controls including common controls, assessment procedures, risk assessment methodologies; defining common artifacts and bodies of evidence used in making risk-related decisions). 48 Current state-of-the-practice for measuring trustworthiness can reliably differentiate between widely different levels of trustworthiness and is capable of producing a trustworthiness scale that is hierarchical between similar instances of measuring activities (e.g., the results from ISO/IEC 15408 [Common Criteria] evaluations). 49 Trust relationships can be: (i) formally established, for example, by documenting the trust-related information in contracts, service-level agreements, statements of work, memoranda of agreement/understanding, or interconnection security agreements; (ii) scalable and inter-organizational or intra-organizational in nature; and/or (iii) represented by simple (bilateral) relationships between two partners or more complex many-to many relationships among many diverse partners.

CHAPTER 2 PAGE 24

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Organizations are becoming increasingly reliant on information system services50 and information provided by external organizations as well as partnerships to accomplish missions and business functions. This reliance results in the need for trust relationships among organizations.51 In many cases, trust relationships with external organizations, while generating greater productivity and cost efficiencies, can also bring greater risk to organizations. This risk is addressed by the risk management strategies established by organizations that take into account the strategic goals and objectives of organizations.

Effectively addressing the risk associated with the growing dependence on external service providers and partnerships with domestic and international public and private sector participants necessitates that organizations:

• Define the types of services/information to be provided to organizations or the types of information to be shared/exchanged in any proposed partnering arrangements;

• Establish the degree of control or influence organizations have over the external organizations participating in such partnering arrangements;

• Describe how the services/information are to be protected in accordance with the information security requirements of organizations;

• Obtain the relevant information from external organizations to determine trustworthiness and to support and maintain trust (e.g., visibility into business practices and risk/information security decisions to understand risk tolerance);

• Appropriately balance mission/business-based requirements to support information sharing while considering the risk of working with competing or hostile entities and the risk that other organizations, while neither competing or hostile, may be a path through which such entities attack;

• Determine if the ongoing risk to organizational operations and assets, individuals, other organizations, or the Nation resulting from the continuing use of the services/information or the participation in the partnership, is at an acceptable level; and

• Recognize that decisions to establish trust relationships are expressions of acceptable risk.

The degree of trust that an organization places in external organizations can vary widely, ranging from those who are highly trusted (e.g., business partners in a joint venture that share a common business model and common goals) to those who are less trusted and may represent greater sources of risk (e.g., business partners in one endeavor who are also competitors or adversaries). The specifics of establishing and maintaining trust can differ from organization to organization based on mission/business requirements, the participants involved in the trust relationship, the criticality/sensitivity of the information being shared or the types of services being rendered, the history between the organizations, and the overall risk to the organizations participating in the relationship. Appendix G provides several trust models that organizations can use when dealing with external organizations.

In many situations, the trust established between organizations may not allow a full spectrum of information sharing or a complete provision of services. When an organization determines that

50 External information system services are services that are implemented outside of the system’s traditional authorization boundary (i.e., services that are used by, but not a part of, the organizational information system). 51 External providers or mission/business partners can be public or private sector entities, domestic or international.

CHAPTER 2 PAGE 25

________________________________________________________________________________________________

                                                                       

                   

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

the trustworthiness of another organization does not permit the complete sharing of information or use of external services, the organization can: (i) mitigate risk, transfer risk, or share risk by employing one or more compensating controls; (ii) accept a greater degree of risk; or (iii) avoid risk by performing missions/business functions with reduced levels of functionality or possibly no functionality at all.

Explicit understanding and acceptance of the risk to an organization’s operations and assets, individuals, other organizations, and the Nation by senior leaders/executives (reflecting the organization’s risk tolerance) are made in accordance with the organization’s risk management strategy and a prerequisite for establishing trust relationships among organizations.

2.6.2 Trustworthiness of Information Systems The concept of trustworthiness can also be applied to information systems and the information technology products and services that compose those systems. Trustworthiness expresses the degree to which information systems (including the information technology products from which the systems are built) can be expected to preserve the confidentiality, integrity, and availability of the information being processed, stored, or transmitted by the systems across the full range of threats. Trustworthy information systems are systems that have been determined to have the level of trustworthiness necessary to operate within defined levels of risk despite the environmental disruptions, human errors, and purposeful attacks that are expected to occur in their environments of operation. Two factors affecting the trustworthiness of information systems are:

• Security functionality (i.e., the security features/functions employed within the system); and

• Security assurance (i.e., the grounds for confidence that the security functionality is effective in its application).52

Security functionality can be obtained by employing within organizational information systems and their environments of operation, a combination of management, operational, and technical security controls from NIST Special Publication 800-53.53 The development and implementation of needed security controls is guided by and informed by the enterprise architecture established by organizations.

Security assurance is a critical aspect in determining the trustworthiness of information systems. Assurance is the measure of confidence that the security features, practices, procedures, and architecture of an information system accurately mediates and enforces the security policy.54 Assurance is obtained by: (i) the actions taken by developers and implementers55 with regard to the design, development, implementation, and operation of the security functionality (i.e., security controls); and (ii) the actions taken by assessors to determine the extent to which the functionality is implemented correctly, operating as intended, and producing the desired outcome

52 Assurance also represents the grounds for confidence that the intended functionality of an information system is correct, always invoked (when needed), and resistant to bypass or tampering. 53 The employment of appropriate security controls for information systems and environments of operation is guided by the first three steps in the Risk Management Framework (i.e., categorization, selection, and implementation). 54 A security policy is set of criteria for the provision of security services. 55 In this context, a developer/implementer is an individual or group of individuals responsible for the design, development, implementation, or operation of security controls for an information system or supporting infrastructure.

CHAPTER 2 PAGE 26

________________________________________________________________________________________________

                                                   

                                   

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

with respect to meeting the security requirements for information systems and their environments of operation.56 Developers and implementers can increase the assurance in security functionality by employing well-defined security policies and policy models, structured and rigorous hardware and software development techniques, and sound system/security engineering principles.

Assurance for information technology products and systems is commonly based on the assessments conducted (and associated assessment evidence produced) during the initiation, acquisition/development, implementation, and operations/maintenance phases of the system development life cycle. For example, developmental evidence may include the techniques and methods used to design and develop security functionality. Operational evidence may include flaw reporting and remediation, the results of security incident reporting, and the results of ongoing security control monitoring. Independent assessments by qualified assessors may include analyses of the evidence as well as testing, inspections, and audits of the implementation of the selected security functionality.57

The concepts of assurance and trustworthiness are closely related. Assurance contributes to the trustworthiness determination relative to an information technology product or an information system. Developers/implementers of information technology products or systems may provide assurance evidence by generating appropriate artifacts (e.g., the results of independent testing and evaluation, design documentation, high-level or low-level specifications, source code analysis). Organizations using information technology products or systems may perform, or rely on others to perform, some form of assessment on the products or systems. Organizations may also have direct experience with the product or system, or may receive information about the performance of the product or system from third parties. Organizations typically evaluate all of the available assurance evidence, often applying different weighting factors as appropriate, to determine the trustworthiness of the product or system relative to the circumstances.

Information technology products and systems exhibiting a higher degree of trustworthiness (i.e., products/systems having appropriate functionality and assurance) are expected to exhibit a lower rate of latent design and implementation flaws and a higher degree of penetration resistance against a range of threats including sophisticated cyber attacks, natural disasters, accidents, and intentional/unintentional errors. The susceptibility of missions/business functions of organizations to known threats, the environments of operation where information systems are deployed, and the maximum acceptable level of risk to organizational operations and assets, individuals, other organizations, or the Nation, guide the degree of trustworthiness needed.

Trustworthiness is a key factor in the selection and wise use of information technology products used in organizational information systems. Insufficient attention to trustworthiness of information technology products and systems can adversely affect an organization’s capability to successfully carry out its assigned missions/business functions.

56 For other than national security systems, organizations meet minimum assurance requirements specified in NIST Special Publication 800-53, Appendix E. 57 NIST Special Publication 800-53A provides guidance on assessing security controls in federal information systems.

CHAPTER 2 PAGE 27

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

2.7 ORGANIZATIONAL CULTURE Organizational culture refers to the values, beliefs, and norms that influence the behaviors and actions of the senior leaders/executives and individual members of organizations. Culture describes the way things are done in organizations and can explain why certain things occur. There is a direct relationship between organizational culture and how organizations respond to uncertainties and the potential for near-term benefits to be the source for longer-term losses. The organization’s culture informs and even, to perhaps a large degree, defines that organization’s risk management strategy. At a minimum, when an expressed risk management strategy is not consistent with that organization’s culture, then it is likely that the strategy will be difficult if not impossible to implement. Recognizing and addressing the significant influence culture has on risk-related decisions of senior leaders/executives within organizations can therefore, be key to achieving effective management of risk.

Recognizing the impact from organizational culture on the implementation of an organization- wide risk management program is important as this can reflect a major organizational change. This change must be effectively managed and understanding the culture of an organization plays an important part in achieving such organization-wide change. Implementing an effective risk management program may well represent a significant organization-wide change aligning the people, processes, and culture within the organization with the new or revised organizational goals and objectives, the risk management strategy, and communication mechanisms for sharing risk-related information among entities. To effectively manage such change, organizations include cultural considerations as a fundamental component in their strategic-level thinking and decision-making processes (e.g., developing the risk management strategy). If the senior leaders/executives understand the importance of culture, they have a better chance of achieving the organization’s strategic goals and objectives by successfully managing risk.

Culture also impacts the degree of risk being incurred. Culture is reflected in an organization’s willingness to adopt new and leading edge information technologies. For example, organizations that are engaged in research and development activities may be more likely to push technological boundaries. Such organizations are more prone to be early adopters of new technologies and therefore, more likely to view the new technologies from the standpoint of the potential benefits achieved versus potential harm from use. In contrast, organizations that are engaged in security- related activities may be more conservative by nature and less likely to push technological boundaries—being more suspicious of the new technologies, especially if provided by some entity with which the organization lacks familiarity and trust. These types of organizations are also less likely to be early adopters of new technologies and would be more inclined to look at the potential harm caused by the adoption of the new technologies. Another example is that some organizations have a history of developing proprietary software applications and services, or procuring software applications and services solely for their use. These organizations may be reluctant to use externally-provided software applications and services and this reluctance may result in lower risk being incurred. Other organizations may, on the other hand, seek to maximize advantages achieved by modern net-centric architectures (e.g., service-oriented architectures, cloud computing), where hardware, software, and services are typically provided by external organizations. Since organizations typically do not have direct control over assessment, auditing, and oversight activities of external providers, a greater risk might be incurred.

In addition to the cultural impacts on organizational risk management perspectives, there can also be cultural issues between organizations. Where two or more organizations are operating together toward a common purpose, there is a possibility that cultural differences in each of the respective organizations may result in different risk management strategies, propensity to incur risk, and

CHAPTER 2 PAGE 28

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

willingness to accept risk.58 For example, assume two organizations are working together to create a common security service intended to address the advanced persistent threat. The culture of one of the organizations may result in a focus on preventing unauthorized disclosure of information, while the nature of the other organization may result in an emphasis on mission continuity. The differences in focus and emphasis resulting from organizational culture can generate different priorities and expectations regarding what security services to procure, because the organizations perceive the nature of the threat differently. Such culture-related disconnects do not occur solely between organizations but can also occur within organizations, where different organizational components (e.g., information technology components, operational components) have different values and perhaps risk tolerances. An example of an internal disconnect can be observed in a hospital that emphasizes different cultures between protecting the personal privacy of patients and the availability of medical information to medical professionals for treatment purposes.

Culture both shapes and is shaped by the people within organizations. Cultural influences and impacts can be felt across all three tiers in the multitiered risk management approach. Senior leaders/executives both directly and indirectly in Tier 1 governance structures set the stage for how organizations respond to various approaches to managing risk. Senior leaders/executives establish the risk tolerance for organizations both formally (e.g., through publication of strategy and guidance documents) and informally (e.g., through actions that get rewarded and penalized, the degree of consistency in actions, and the degree of accountability enforced). The direction set by senior leaders/executives and the understanding of existing organizational values and priorities are major factors determining how risk is managed within organizations.

2.8 RELATIONSHIP AMONG KEY RISK CONCEPTS As indicated by the discussions above, there are a variety of risk-related concepts (e.g., risk tolerance, trust, and culture), all of which have an impact on risk management. The concepts do not operate in a vacuum; rather, there is often a strong interplay among the concepts (e.g., an organization’s culture along with its governance structures and processes, often influences the pace of change and the implementation of its risk management strategy). For this reason, the risk executive (function) and other parties involved in organizational risk-based decisions, need to have an awareness and appreciation for all of the concepts. Several examples of the relationships among the risk-related concepts are provided below. The list of relationships is not exhaustive and serves only to illustrate how combining risk-related concepts can produce unintended consequences, both positive and negative in scope.

2.8.1 Governance, Risk Tolerance, and Trust As part of implementing the organization’s risk management strategy at Tier 1, the risk executive (function) establishes practices for sharing risk-related information with external entities. With regard to the demonstration of due diligence for managing risk, organizations that are less risk tolerant are likely to require more supporting evidence than organizations that are more risk tolerant. Such organizations may only trust (and hence partner with) organizations with which they have had a long and successful relationship (see direct historical trust model in Appendix G). The amount of centralization59 within an organization may be reflective of the organizational risk tolerance and/or its willingness to trust partnering organizations. Some organizations select a

58 A similar situation can exist between subordinate elements of an organization when these elements are afforded a fair amount of autonomy and operational authority. 59 Additional information on governance models can be found in Appendix F.

CHAPTER 2 PAGE 29

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

decentralized governance structure for reasons such as widely diverging mission/business areas or need for increased separation between mission/business lines due to sensitivity of the work. The reasons for decentralization may reflect and likely will influence risk tolerance. For instance, if there are no partnering organizations meeting the established trust qualifications, less risk-tolerant organizations may require significantly more supporting evidence of due diligence (e.g., access to risk assessments, security plans, security assessment reports, risk acceptance decisions) than is typically required in such situations (see validated trust model in Appendix G).

2.8.2 Trust and Culture There is also potential interplay between the concepts of risk, trust, and culture. Changes in mission/business requirements (e.g., a new mission or business requirement to interconnect information systems for the purpose of sharing information) may require a greater acceptance of risk than is typical for that organization. In the short term, additional measures may be needed to establish and/or build trust (e.g., increase transparency between interconnecting organizations). Such measures facilitate building trust and evolving organizational beliefs and norms over the longer term. Interaction between trust and culture can also be observed when there are gaps and overlaps in responsibility among an organization’s components that may impact the ability for proposed actions (especially new actions) to be carried out quickly. For example, many organizations with decentralized governance structures may be slower to embrace change unless there has been an extensive effort to expand coordination and improve trust among organizational components. Assume that some organizations are directed by higher authorities (see mandated trust model in Appendix G) to share information more freely with peer organizations. If the organizations have a history and culture of tightly controlling information, they may be reluctance to share information with outside entities, even though directed to do so. In such situations, organizations may require that partnering organizations provide concrete evidence of the steps taken to protect the information designated for sharing prior to release.

2.8.3 Investment Strategy and Risk Tolerance Investment strategies and organizational risk tolerance also have linkages. Organizations may recognize that there is a need to address advanced persistent threats where adversaries have achieved some degree of penetration of and foothold within organizational information systems and the environments in which those systems operate. The strategic investments that are required to address these types of threats may, in part, be influenced by the risk tolerance of organizations. Less risk-tolerant organizations may focus investments on information technologies that prevent adversaries from gaining further access within organizations and/or limiting the damage done to the organizations even if at the expense of achieving some of the many mission/business benefits automation can provide. More risk-tolerant organizations may focus investments on information technologies that provide greater mission/business benefits even if these benefits are achieved at the expense of adversaries gaining some advantage or benefit from compromising the information systems and supporting infrastructure.

2.8.4 Culture and Risk Tolerance A major part of managing risk within organizations is identifying what the organizational risk tolerance is for a particular type of loss. Risk tolerance can be described as a combination of the cultural willingness to accept certain types of loss within organizations and the subjective risk- related actions of senior leaders/executives. Risk-based decisions within organizations often reflect the blending of the risk tolerance of senior leaders/executives and the risk tolerance that is embedded within the culture of organizations. In establishing organizational risk tolerance, the values, beliefs, and norms of organizations are examined in order to understand why risk trade-

CHAPTER 2 PAGE 30

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

offs are made. For some organizations, in particular those organizations that deal with critical and/or sensitive information, personally identifiable information, or classified information, the emphasis is often on preventing unauthorized disclosure. In contrast, in those organizations driven by a combination of organizational culture and the nature of their missions and business functions, the emphasis is on maintaining the availability of information systems to achieve an ongoing operational capability. As part of establishing organizational risk tolerance, a risk assessment identifies the kinds and levels of risk to which organizations may be exposed. This assessment considers both the likelihood and impact of undesired events (see Chapter Three, the risk management process).

CHAPTER 2 PAGE 31

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

CHAPTER THREE

THE PROCESS APPLYING RISK MANAGEMENT CONCEPTS ACROSS AN ORGANIZATION

T his chapter describes a process for managing information security risk including: (i) a general overview of the risk management process; (ii) how organizations establish the context for risk-based decisions; (iii) how organizations assess risk considering threats, vulnerabilities, likelihood, and consequences/impact; (iv) how organizations respond to risk once determined; and (v) how organizations monitor risk over time with changing mission/business needs, operating environments, and supporting information systems . The risk management process, introduced in Chapter Two, is described in this chapter along with its applicability across the three tiers of risk management. Each of the steps in the risk management process (i.e., risk framing, risk assessment, risk response, and risk monitoring) is described in a structured manner focusing on the inputs or preconditions necessary to initiate the step, the specific activities that compose the step, and the outputs or post conditions resulting from the step.60 The effect of the risk concepts described in Chapter Two (e.g., risk tolerance, trust, and culture) are also discussed in the context of the risk management process and its multitiered application. Figure 4 illustrates the risk management process as applied across the tiers—organization, mission/business process, and information system. The bidirectional arrows in the figure indicate that the information and communication flows among the risk management components as well as the execution order of the components, may be flexible and respond to the dynamic nature of the risk management process as it is applied across all three tiers.

TIER 3 - INFORMATION SYSTEMS TIER 2 - MISSION / BUSINESS PROCESSES

TIER 1 - ORGANIZATION

FRAME

ASSESS

RESPONDMONITOR

FIGURE 4: RISK MANAGEMENT PROCESS APPLIED ACROSS THE TIERS

60 Additional guidance on selected steps in the risk management process (e.g., risk assessment, risk monitoring) can be found in other NIST Special Publications listed in Appendix A.

CHAPTER 3 PAGE 32

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

The steps in the risk management process are not inherently sequential in nature. The steps are performed in different ways, depending on the particular tier where the step is applied and on prior activities related to each of the steps. What is consistent is that the outputs or post conditions from a particular risk management step directly impact one or more of the other risk management steps in the risk management process. Organizations have significant flexibility in how the risk management steps are performed (e.g., sequence, degree of rigor, formality, and thoroughness of application) and in how the results of each step are captured and shared—both internally and externally. Ultimately, the objective of applying the risk management process and associated risk- related concepts is to develop a better understanding of information security risk in the context of the broader actions and decisions of organizations and in particular, with respect to organizational operations and assets, individuals, other organizations, and Nation.

3.1 FRAMING RISK Risk framing establishes the context and provides a common perspective on how organizations manage risk. Risk framing, as its principal output, produces a risk management strategy that addresses how organizations intend to assess risk, respond to risk, and monitor risk. The risk management strategy makes explicit the specific assumptions, constraints, risk tolerances, and priorities/trade-offs used within organizations for making investment and operational decisions. The risk management strategy also includes any strategic-level decisions and considerations on how risk to organizational operations and assets, individuals, other organizations, and the Nation, is to be managed by senior leaders/executives.

At Tier 1, senior leaders/executives, in consultation and collaboration with the risk executive (function), define the organizational risk frame including the types of risk decisions (e.g., risk responses) supported, how and under what conditions risk is assessed to support those risk decisions, and how risk is monitored (e.g., to what level of detail, in what form, and with what frequency). At Tier 2, mission/business owners apply their understanding of the organizational risk frame to address concerns specific to the organization’s missions/business functions (e.g., additional assumptions, constraints, priorities, and trade-offs). At Tier 3, program managers, information system owners, and common control providers apply their understanding of the organizational risk frame based on how decision makers at Tiers 1 and 2 choose to manage risk.

The Risk Management Framework61 is the primary means for addressing risk at Tier 3. The RMF addresses concerns specific to the design, development, implementation, operation, and disposal of organizational information systems and the environments in which those systems operate. The risk frame can be adapted at Tier 3 based on the current phase of the system development life cycle, which further constrains potential risk responses. Initially, organizational risk frames might not be explicit or might not be defined in terms that correspond to the risk management tiers. In the absence of explicit risk frames (describing assumptions, constraints, risk tolerance, and priorities/trade-offs), mission/business owners can have divergent perspectives on risk or how to manage it. This impedes a common understanding at Tier 1 of how information security risk contributes to organizational risk, and at Tier 2, of how risk accepted for one mission or business function potentially affects risk with respect to other missions/business functions. Differences in risk tolerance and the underlying assumptions, constraints, and priorities/trade-offs are grounded in operational and/or architectural considerations and should be understood and accepted by senior leaders/executives within their respective organizations.

61 The Risk Management Framework (RMF) which operates primarily at Tier 3 is described in NIST Special Publication 800-37.

CHAPTER 3 PAGE 33

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

STEP 1: RISK FRAMING

Inputs and Preconditions Risk framing is the set of assumptions, constraints, risk tolerances, and priorities/trade-offs that shape an organization’s approach for managing risk. Risk framing is informed by the organizational governance structure, financial posture, legal/regulatory environment, investment strategy, culture, and trust relationships established within and among organizations. Inputs to the risk framing step include, for example, laws, policies, directives, regulations, contractual relationships, and financial limitations which impose constraints on potential risk decisions by organizations. Other inputs to risk framing can include, for example, specific information from organizations to make explicit: (i) the identification of trust relationships and trust models (see Appendix G) that derive from existing memoranda of understanding or agreement (MOUs or MOAs); and (ii) the identification of the governance structures and processes that indicate the extent of or limits on decision-making authority for risk decisions that can be delegated to mission or business owners. The key precondition for risk framing is senior leadership commitment to defining an explicit risk management strategy and holding mission/business owners responsible and accountable for implementing the strategy.

The guidance produced by the risk framing step, and the underlying assumptions, constraints, risk tolerances, and priorities/trade-offs used to develop that guidance, may be inappropriate to one or more organizational missions or business functions. In addition, the risk environment has the potential to change over time. Thus, the risk management process allows for feedback to the risk framing step from the other steps in the process, as follows:

• Risk assessment: Information generated during the risk assessment may influence the original assumptions, change the constraints regarding appropriate risk responses, identify additional tradeoffs, or shift priorities. For example, the characterization of adversaries (including representative tactics, techniques, and procedures), or sources of vulnerability information may not be consistent with how some organizations conduct their missions/business functions; a source of threat/vulnerability information that is useful for one mission/business function could, in fact, be useful for others; or organizational guidance on assessing risk under uncertainty may be too onerous, or insufficiently defined, to be useful for one or more mission/business functions.

• Risk response: Information uncovered during the development of alternative courses of action could reveal that risk framing has removed or failed to uncover some potentially high-payoff alternatives from consideration. This situation may challenge organizations to revisit original assumptions or investigate ways to change established constraints.

• Risk monitoring: Security control monitoring by organizations could indicate that a class of controls, or a specific implementation of a control, is relatively ineffective, given investments in people, processes, or technology. This situation could lead to changes in assumptions about which types of risk responses are preferred by organizations. Monitoring of the operational environment could reveal changes in the threat landscape (e.g., changes in the tactics, techniques, and procedures observed across all organizational information systems; increasing frequency and/or intensity of attacks against specific missions/business functions) that cause organizations to revisit original threat assumptions and/or to seek different sources of threat information. Significant advances in defensive or proactive operational and technical solutions could generate the need to revisit the investment strategy identified during the framing step. Monitoring of legal/regulatory environments could also influence changes in assumptions or constraints. Also, monitoring of risk being incurred might result in the need to reconsider the organizational risk tolerance if the existing statement of risk tolerance does not appear to match the operational realities.

Activities RISK ASSUMPTIONS

TASK 1-1: Identify assumptions that affect how risk is assessed, responded to, and monitored within the organization.

Supplemental Guidance: Organizations that identify, characterize, and provide representative examples of threat sources, vulnerabilities, consequences/impacts, and likelihood determinations promote a common terminology and frame of reference for comparing and addressing risks across disparate mission/business areas. Organizations can also select appropriate risk assessment methodologies, depending on organizational governance, culture, and how divergent the missions/business functions are within the respective organizations. For example, organizations with highly centralized governance structures might elect to use a single risk assessment methodology. Organizations with hybrid governance structures might select multiple risk assessment methodologies for Tier 2, and an additional risk assessment methodology for Tier 1 that assimilates and harmonizes the findings, results, and observations of the Tier 2 risk assessments. Alternatively, when autonomy and diversity are central to the organizational culture, organizations could define requirements for the degree of rigor and the form of results, leaving the choice of specific risk assessment methodologies to mission/business owners.

CHAPTER 3 PAGE 34

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Threat Sources Threat sources cause events having undesirable consequences or adverse impacts on organizational operations and assets, individuals, other organizations, and the Nation. Threat sources include: (i) hostile cyber/physical attacks; (ii) human errors of omission or commission; or (iii) natural and man-made disasters. For threats due to hostile cyber attacks or physical attacks, organizations provide a succinct characterization of the types of tactics, techniques, and procedures employed by adversaries that are to be addressed by safeguards and countermeasures (i.e., security controls) deployed at Tier 1 (organization level), at Tier 2 (mission/business process level), and at Tier 3 (information system level)—making explicit the types of threat-sources that are to be addressed as well as making explicit those not being addressed by the safeguards/countermeasures. Adversaries can be characterized in terms of threat levels (based on capabilities, intentions, and targeting) or with additional detail. Organizations make explicit any assumptions about threat source targeting, intentions, and capabilities. Next, organizations identify a set of representative threat events. This set of threat events provides guidance on the level of detail with which the events are described. Organizations also identify conditions for when to consider threat events in risk assessments. For example, organizations can restrict risk assessments to those threat events that have actually been observed (either internally or externally by partners or peer organizations) or alternatively, specify that threat events described by credible researchers can also be considered. Finally, organizations identify the sources of threat information found to be credible and useful (e.g., sector Information Sharing and Analysis Centers [ISACs]). Trust relationships determine from which partners, suppliers, and customers, threat information is obtained as well as the expectations placed on those partners, suppliers and customers in subsequent risk management process steps. By establishing common starting points for identifying threat sources at Tier 1, organizations provide a basis for aggregating and consolidating the results of risk assessments at Tier 2 (including risk assessments conducted for coalitions of missions and business areas or for common control providers) into an overall assessment of risk to the organization as a whole. At Tier 2, mission/business owners may identify additional sources of threat information specific to organizational missions or business functions. These sources are typically based on: (i) a particular business or critical infrastructure sector (e.g., sector ISAC); (ii) operating environments specific to the missions or lines of business (e.g., maritime, airspace); and (iii) external dependencies (e.g., GPS or satellite communications). The characterization of threat sources are refined for the missions/business functions established by organizations—with the results being that some threat sources might not be of concern, while others could be described in greater detail. At Tier 3, program managers, information system owners, and common control providers consider the phase in the system development life cycle to determine the level of detail with which threats can be considered. Greater threat specificity tends to be available later in the life cycle..

Vulnerabilities Organizations identify approaches used to characterize vulnerabilities, consistent with the characterization of threat sources and events. Vulnerabilities can be associated with exploitable weakness or deficiencies in: (i) the hardware, software, or firmware components that compose organizational information systems (or the security controls employed within or inherited by those systems; (ii) mission/business processes and enterprise architectures (including embedded information security architectures) implemented by organizations; or (iii) organizational governance structures or processes. Vulnerabilities can also be associated with the susceptibility of organizations to adverse impacts, consequences, or harm from external sources (e.g., physical destruction of non-owned infrastructure such as electric power grids). Organizations provide guidance regarding how to consider dependencies on external organizations as vulnerabilities in the risk assessments conducted. The guidance can be informed by the types of trust relationships established by organizations with external providers. Organizations identify the degree of specificity with which vulnerabilities are described (e.g., general terms, Common Vulnerability Enumeration [CVE] identifiers, identification of weak/deficient security controls), giving some representative examples corresponding to representative threats. Organizational governance structures and processes determine how vulnerability information is shared across organizations. Organizations may also identify sources of vulnerability information found to be credible and useful. At Tier 2, mission/business owners may choose to identify additional sources of vulnerability information (e.g., a sector ISAC for information about vulnerabilities specific to that sector). At Tier 3, program managers, information system owners, and common control providers consider the phase in the system development life cycle—and in particular, the technologies included in the system – to determine the level of detail with which vulnerabilities can be considered. Organizations make explicit any assumptions about the degree of organizational or information system vulnerability to specific threat sources (by name or by type).

Consequences and Impact

Organizations provide guidance on how to assess impacts to organizational operations (i.e., mission, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation (e.g., using FIPS 199, CNSS Instruction 1253, or a more granular approach). Organizations can experience the consequences/impact of adverse events at the information system level (e.g., failing to perform as required), at the mission/business process level (e.g., failing to fully meet mission/business objectives), and at the organizational level (e.g., failing to comply with legal or regulatory requirements, damaging reputation or relationships, or undermining long-term viability). Organizations determine at Tier 1, which consequences and types of impact are to be considered at Tier 2, the mission/business

CHAPTER 3 PAGE 35

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

process level. An adverse event can have multiple consequences and different types of impact, at different levels, and in different time frames. For example, the exposure of sensitive information (e.g., personally identifiable information) by a particular mission/business area (e.g., human resources) can have organization-wide consequences and adverse impact with regard to reputation damage; the information system consequence/impact for multiple systems of an attacker more easily overcoming identification and authentication security controls; and the mission/business process consequence/impact (for one or more mission/business areas) of an attacker falsifying information on which future decisions are based. To ensure consistency, organizations determine at Tier 1, how consequences/impacts experienced in different time frames are to be assessed. At Tier 2, mission/business owners may amplify organizational guidance, as appropriate. The types of consequences and impact considered in risk determinations are identified to provide a basis for determining, aggregating, and/or consolidating risk results and to facilitate risk communication. Organizations also provide guidance to Tier 2 and Tier 3 with regard to the extent that risk assessments are to consider the risk to other organizations and the Nation. Organization make explicit any assumptions about the degree of impact/consequences related to specific threat sources (by name or by type) or through specific vulnerabilities (individually or by type).

Likelihood Organizations can employ a variety of approaches for determining the likelihood of threat events. Some organizations treat the likelihood that a threat event will occur and the likelihood that, if it occurs, it will result in adverse effects as separate factors, while other organizations assess threat likelihood as a combination of these factors. In addition, some organizations prefer quantitative risk assessments while other organizations, particularly when the assessment involves a high degree of uncertainty, prefer qualitative risk assessments. Likelihood determinations can be based on either threat assumptions or actual threat data (e.g., historical data on cyber attacks, historical data on earthquakes, or specific information on adversary capabilities, intentions, and targeting). When specific and credible threat data is available (e.g., types of cyber attacks, cyber attack trends, frequencies of attacks), organizations can use the empirical data and statistical analyses to determine more specific probabilities of threat events occurring. Organizations select a method consistent with organizational culture and risk tolerance. Organizations can also make explicit assumptions concerning the likelihood that a threat event will result in adverse effects as follows: (i) worst case (i.e., attack will be successful unless strong, objective reasons to presume otherwise); (ii) best case (i.e., attack will not be successful unless specific, credible information to the contrary); or (iii) something in between best and worst cases (e.g., the most probable case). Organizations document any overarching assumptions. Organizations can use empirical data and statistical analyses to help inform any of the approaches used to determine the likelihood of threat events occurring. Organizations select a method consistent with organizational culture, understanding of the operational environment, and risk tolerance.

RISK CONSTRAINTS

TASK 1-2: Identify constraints on the conduct of risk assessment, risk response, and risk monitoring activities within the organization. Supplemental Guidance: The execution of the risk management process can be constrained in various ways, some of which are direct and obvious, while others are indirect. Financial limitations can constrain the set of risk management activities directly (e.g., by limiting the total resources available for investments in risk assessments or in safeguards or countermeasures) or indirectly (e.g., by eliminating activities which, while involving relatively small investments in risk response, entail curtailing or discarding investments in legacy information systems or information technology). Organizations might also discover that the need to continue to depend on legacy information systems may constrain the risk management options available to the organization. Constraints can also include legal, regulatory, and/or contractual requirements. Such constraints can be reflected in organizational policies (e.g., restrictions on outsourcing, restrictions on and/or requirements for information to be gathered as part of risk monitoring). Organizational culture can impose indirect constraints on governance changes (e.g., precluding a shift from decentralized to hybrid governance structures) and which security controls are considered by organizations as potential common controls. In particular, organizational attitudes toward information technology risk that, for example, favor extensive automation and early adoption of new technologies can constrain the degree of risk avoidance and perhaps risk mitigation that can be achieved. Any cultural constraints that limit senior leader/executive (e.g., chief information officer) visibility into organizational information systems that are beyond their formal authority (e.g. mission-related systems) may impede overall understanding of the complexity of information systems environment and the related risks to the organization. At Tier 2, mission/business owners interpret constraints in light of organizational missions/business functions. Some regulatory constraints may not apply to particular missions/business functions (e.g., regulations that apply to international operations, when mission/business areas are restricted to the United States). Alternately, additional requirements may apply (e.g., mission/business processes performed jointly with another organization, which imposes contractual constraints). At Tier 3, information system owners, common control providers, and/or program managers interpret the organization- wide and mission/business function-specific constraints with respect to their systems and environments of operation (e.g., requirements to provide specific security controls are satisfied through common controls).

CHAPTER 3 PAGE 36

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

RISK TOLERANCE

TASK 1-3: Identify the level of risk tolerance for the organization.

Supplemental Guidance: Risk tolerance is the level of risk that organizations are willing to accept in pursuit of strategic goals and objectives. Organizations define information security-related risk tolerance organization-wide considering all missions/business functions. Organizations can use a variety of techniques for identifying information security risk tolerance (e.g., by establishing zones in a likelihood-impact trade space or by using a set of representative scenarios). Organizations also define tolerance for other types of organizational and operational risks (e.g., financial, risk, safety risk, compliance risk, or reputation risk). At Tier 2, mission/business owners may have different risk tolerances from the organization as a whole. The risk executive (function) provides organizations with ways to resolve such differences in risk tolerances at Tier 2. The level of residual risk accepted by authorizing officials for information systems or inherited common controls is within the organizational risk tolerance, and not the individual risk tolerances of those authorizing officials. In addition, organizations provide to Tier 2 and Tier 3, guidance on evaluating risk for specific mission/business processes or information systems and a focus on near-term mission/business effectiveness with the longer-term, strategic focus of the organizational risk tolerance. See Section 2.3.3 for additional information on risk tolerance.

PRIORITIES AND TRADE-OFFS

TASK 1-4: Identify priorities and trade-offs considered by the organization in managing risk.

Supplemental Guidance: Risk is experienced at different levels, in different forms, and in different time frames. At Tier 1, organizations make trade-offs among and establish priorities for responding to such risks. Organizations tend to have multiple priorities that at times conflict, which generates potential risk. Approaches employed by organizations for managing portfolios of risks reflect organizational culture, risk tolerance, as well as risk-related assumptions and constraints. These approaches are typically embodied in the strategic plans, policies, and roadmaps of organizations which may indicate preferences for different forms of risk response. For example, organizations may be willing to accept short-term risk of slightly degraded operations to achieve long-term reduction in information security risk. However, this trade-off could be unacceptable for one particularly critical mission/business function (e.g., real-time requirements in many industrial/process control systems). For that high-priority area, a different approach to improving security may be required including the application of compensating security controls.

Outputs and Post Conditions The output of the risk framing step is the risk management strategy that identifies how organizations intend to assess, respond to, and monitor risk over time. The framing step also produces a set of organizational policies, procedures, standards, guidance, and resources covering the following topics: (i) scope of the organizational risk management process (e.g., organizational entities covered; mission/business functions affected; how risk management activities are applied within the risk management tiers); (ii) risk assessment guidance including, for example, the characterization of threat sources, sources of threat information, representative threat events (in particular, adversary tactics, techniques, and procedures), when to consider and how to evaluate threats, sources of vulnerability information, risk assessment methodologies to be used, and risk assumptions; (iii) risk response guidance including, for example, risk tolerances, risk response concepts to be employed, opportunity costs, trade-offs, consequences of responses, hierarchy of authorities, and priorities; (iv) risk monitoring guidance, including, for example, guidance on analysis of monitored risk factors to determine changes in risk, and monitoring frequency, methods, and reporting; (v) other and risk constraints on executing risk management activities; and (vi) organizational priorities and trade-offs. Outputs from the risk framing step serve as inputs to the risk assessment, risk response, and risk monitoring steps.

3.2 ASSESSING RISK Risk assessment identifies, prioritizes, and estimates risk to organizational operations (i.e., mission, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation, resulting from the operation and use of information systems.62 Risk assessments use the results of threat and vulnerability assessments to identify and evaluate risk in terms of likelihood of occurrence and potential adverse impact (i.e., magnitude of harm) to organizations, assets, and individuals. Risk assessments can be conducted at any of the risk management tiers

62 Draft NIST Special Publication 800-30, Revision 1, provides guidance on conducting risk assessments (including incremental or differential risk assessments) across all three tiers in the multitiered risk management approach.

CHAPTER 3 PAGE 37

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

with different objectives and utility of the information produced. For example, risk assessments conducted at Tier 1 or Tier 2 focus on organizational operations, assets, and individuals—whether comprehensive across mission/business lines or only on those assessments that are cross-cutting to the particular mission/business line. Organization-wide assessments of risk can be based solely on the assumptions, constraints, risk tolerances, priorities, and trade-offs established in the risk framing step (derived primarily from Tier 1 activities) or can be based on risk assessments conducted across multiple mission/business lines (derived primarily from Tier 2 activities). Risk assessments conducted at one tier can be used to refine/enhance threat, vulnerability, likelihood, and impact information used in assessments conducted in other tiers. The degree that information from risk assessments can be reused is shaped by the similarity of missions/business functions and the degree of autonomy that organizational entities or subcomponents have with respect to parent organizations. Organizations that are decentralized can expect to conduct more risk assessment activities at Tier 2 and, as a result, may have a greater need to communicate within Tier 2 to identify cross-cutting threats and vulnerabilities. Decentralized organizations can still benefit from Tier 1 risk assessments and, in particular, the identification of an initial set of threat and vulnerability sources. Organization-wide risk assessments provide some initial prioritization of risks for decision makers to consider when entering the risk response step.

Organizations benefit significantly from conducting risk assessments as part of an organization- wide risk management process. However, once risk assessments are complete, it is prudent for organizations to invest some time in keeping the assessments current. Maintaining currency of risk assessments requires support from the risk monitoring step (e.g., observing changes in organizational information systems and environments of operation or analyzing monitoring results to maintain awareness of the risk). Keeping risk assessments up to date provides many potential benefits such as timely, relevant information that enables senior leaders/executives to perform near real-time risk management. Maintaining risk assessments also reduces future assessment costs and supports ongoing risk monitoring efforts. Organizations may determine that conducting comprehensive risk assessments as a way of maintaining current risk assessments do not provide sufficient value. In such situations, organizations consider conducting incremental and/or differential risk assessments. Incremental risk assessments consider only new information (e.g., the effects of using a new information system on mission/business risk), whereas differential risk assessments consider how changes affect the overall risk determination. Incremental or differential risk assessments are useful if organizations require a more targeted review of risk, seek an expanded understanding of risk, or desire an expanded understanding of the risk in relation to missions/business functions.

STEP 2: RISK ASSESSMENT

Inputs and Preconditions Inputs to the risk assessment step from the risk framing step include, for example: (i) acceptable risk assessment methodologies; (ii) the breadth and depth of analysis employed during risk assessments; (iii) the level of granularity required for describing threats; (iv) whether/how to assess external service providers; and (v) whether/how to aggregate risk assessment results from different organizational entities or mission/business functions to the organization as a whole. Organizational expectations regarding risk assessment methodologies, techniques, and/or procedures are shaped heavily by governance structures, risk tolerance, culture, trust, and life cycle processes. Prior to conducting risk assessments, organizations understand the fundamental reasons for conducting the assessments and what constitutes adequate depth and breadth for the assessments. Risk assumptions, risk constraints, risk tolerance, and priorities/trade- offs defined during the risk framing step shape how organizations use risk assessments—for example, localized applications of the risk assessments within each of the risk management tiers (i.e., governance, mission/business process, information systems) or global applications of the risk assessments across the entire organization. Risk assessments can be conducted by organizations even when some of the inputs from the risk framing step have not been received or preconditions established. However, in those situations, the quality of the risk assessment results may be affected. In addition to the risk framing step, the risk assessment step can receive inputs from the risk monitoring step,

CHAPTER 3 PAGE 38

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

especially during mission operations and the operations/maintenance phase of the system development life cycle (e.g., when organizations discover new threats or vulnerabilities that require an immediate reassessment of risk). The risk assessment step can also receive inputs from the risk response step (e.g., when organizations are considering the risk of employing new technology-based solutions as alternatives for risk reduction measures). As courses of action are developed in the risk response step, a differential risk assessment may be needed to evaluate differences that each course of action makes in the overall risk determination.

Activities THREAT AND VULNERABILITY IDENTIFICATION

TASK 2-1: Identify threats to and vulnerabilities in organizational information systems and the environments in which the systems operate.

Supplemental Guidance: Threat identification requires an examination of threat sources and events. For examining threat sources and events, organizations identify threat capabilities, intentions, and targeting information from all available sources. Organizations can leverage a number of sources for threat information at strategic or tactical levels. Threat information generated at any tier can be used to inform or refine the risk-related activities in any other tier. For example, specific threats (i.e., tactics, techniques, and procedures) identified during Tier 1 threat assessments may directly affect mission/business process and architectural design decisions at Tier 2. Specific threat information generated at Tiers 2 and 3 can be used by organizations to refine threat information generated during initial threat assessments carried out at Tier 1.

Vulnerability identification occurs at all tiers. Vulnerabilities related to organizational governance (e.g., inconsistent decisions about the relative priorities of mission/business processes, selection of incompatible implementations of security controls) as well as vulnerabilities related to external dependencies (e.g., electrical power, supply chain, telecommunications), are most effectively identified at Tier 1. However, most vulnerability identification occurs at Tiers 2 and 3. At Tier 2, process and architecture-related vulnerabilities (e.g., exploitable weaknesses or deficiencies in mission/business processes, enterprise /information security architectures including embedded information security architectures) are more likely to be identified. At Tier 3, information system vulnerabilities are the primary focus. These vulnerabilities are commonly found in the hardware, software, and firmware components of information systems or in the environments in which the systems operate. Other areas of potential vulnerabilities include vulnerabilities associated with the definition, application/implementation, and monitoring of processes, procedures and services related to management, operational, and technical aspects of information security. Vulnerabilities associated with architectural design and mission/business processes can have a greater impact on the ability of organizations to successfully carry out missions and business functions due to the potential impact across multiple information systems and mission environments. The refined vulnerability assessments conducted at Tiers 2 and 3 are shared with organizational personnel responsible for assessing risks more strategically. Vulnerability assessments conducted at Tier 2 and Tier 3 have the opportunity to evaluate additional related variables such as location, proximity to other high risk assets (physical or logical), and resource considerations related to operational environments. Information specific to operational environments allows for more useful and actionable assessment results. Vulnerability identification can be accomplished at a per-individual weakness/deficiency level or at a root-cause level. When selecting between approaches, organizations consider whether the overall objective is identifying each specific instance or symptom of a problem or understanding the underlying root causes of problems. Understanding specific exploitable weaknesses or deficiencies is helpful when problems are first identified or when quick fixes are required. This specific understanding also provides organizations with necessary sources of information for eventually diagnosing potential root causes of problems, especially those problems that are systemic in nature.

Organizations with more established enterprise architectures (including embedded information security architectures) and mature life cycle processes have outputs that can be used to inform risk assessment processes. Risk assumptions, constraints, tolerances, priorities, and trade-offs used for developing enterprise architectures and embedded information security architectures can be useful sources of information for initial risk assessment activities. Risk assessments conducted to support the development of segment or solution architectures may also serve as information sources for the identification of threats and vulnerabilities. Another factor influencing threat and vulnerability identification is organizational culture. Organizations that promote free and open communications and non-retribution for sharing adverse information tend to foster greater openness from individuals working within those organizations. Frequently, organizational personnel operating at Tiers 2 and 3 have valuable information and can make meaningful contributions in the area of threat and vulnerability identification. The culture of organizations influences the willingness of personnel to communicate potential threat and vulnerability information, which ultimately affects the quality and quantity of the threats/vulnerabilities identified.

CHAPTER 3 PAGE 39

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

RISK DETERMINATION

TASK 2-2: Determine the risk to organizational operations and assets, individuals, other organizations, and the Nation if identified threats exploit identified vulnerabilities.

Supplemental Guidance: Organizations determine risk by considering the likelihood that known threats exploit known vulnerabilities and the resulting consequences or adverse impacts (i.e., magnitude of harm) if such exploitations occur. Organizations use threat and vulnerability information together with likelihood and consequences/impact information to determine risk either qualitatively or quantitatively. Organizations can employ a variety of approaches to determine the likelihood of threats exploiting vulnerabilities. Likelihood determinations can be based on either threat assumptions or actual threat information (e.g., historical data on cyber attacks, historical data on earthquakes, or specific information on adversary capabilities, intentions, and targeting). When specific and credible threat information is available (e.g., types of cyber attacks, cyber attack trends, frequencies of attacks), organizations can use empirical data and statistical analyses to determine more specific probabilities of threats occurring. Assessment of likelihood can also be influenced by whether vulnerability identification occurred at the individual weakness or deficiency level or at the root-cause level. The relative ease/difficulty of vulnerability exploitation, the sophistication of adversaries, and the nature of operational environments all influence the likelihood that threats exploit vulnerabilities. Organizations can characterize adverse impacts by security objective (e.g., loss of confidentiality, integrity, or availability). However, to maximize usefulness, adverse impact is expressed in or translated into terms of organizational missions, business functions, and stakeholders.

Risk Determination and Uncertainty Risk determinations require analysis of threat, vulnerability, likelihood, and impact-related information. Organizations also need to examine mission/business vulnerabilities and threats where safeguards and/or countermeasures do not exist. The nature of the inputs provided to this step (e.g., general, specific, strategic, tactical) directly affects the type of outputs or risk determinations made. The reliability and accuracy of risk determinations are dependent on the currency, accuracy, completeness, and integrity of information collected to support the risk assessment process. In addition, the components of risk assessment results that affect reliability and accuracy of risk determinations also affect the amount of uncertainty associated with those risk determinations and subsequent determinations. Organizations also consider additional insights related to the anticipated time frames associated with particular risks. Time horizons associated with potential threats can shape future risk responses (e.g., risk may not be a concern if the time horizon for the risk is in the distant future).

Organizational guidance for determining risk under uncertainty indicates how combinations of likelihood and impact are combined into a determination of the risk level or risk score/rating. Organizations need to understand the type and amount of uncertainty surrounding risk decisions so that risk determinations can be understood. During the risk framing step, organizations may have provided guidance on how to analyze risk and how to determine risk when a high degree of uncertainty exists. Uncertainty is particularly a concern when the risk assessment considers advanced persistent threats, for which analysis of interacting vulnerabilities may be needed, the common body of knowledge is sparse, and past behavior may not be predictive.

While threat and vulnerability determinations apply frequently to missions and business functions, the specific requirements associated with the missions/business functions, including the environments of operation, may lead to different assessment results. Different missions, business functions, and environments of operation can lead to differences in the applicability of specific threat information considered and the likelihood of threats causing potential harm. Understanding the threat component of the risk assessment requires insight into the particular threats facing specific missions or business functions. Such awareness of threats includes understanding the capability, intent, and targeting of particular adversaries. The risk tolerance of organizations and underlying beliefs associated with how the risk tolerance is formed (including the culture within organizations) may shape the perception of impact and likelihood in the context of identified threats and vulnerabilities.

Even with the establishment of explicit criteria, risk assessments are influenced by organizational culture and the personal experiences and accumulated knowledge of the individuals conducting the assessments. As a result, assessors of risk can reach different conclusions from the same information. This diversity of perspective can enrich the risk assessment process and provide decision makers with a greater array of information and potentially fewer biases. However, such diversity may also lead to risk assessments that are inconsistent. Organizationally-defined and applied processes provide the means to identify inconsistent practices and include processes to identify and resolve such inconsistencies.

Outputs and Post Conditions The output of the risk assessment step is a determination of risk to organizational operations (i.e., mission, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation. Depending on the approach that organizations take, either the overall risk to the organization or the inputs used to determine risk may be

CHAPTER 3 PAGE 40

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

communicated to the decision makers responsible for risk response. In certain situations, there are recurring cycles between the risk assessment step and the risk response step until particular objectives are achieved. Based on the course of action selected during the risk response step, some residual risk may remain. Under certain circumstances, the level of residual risk could trigger a reassessment of risk. This reassessment is typically incremental (assessing only the new information) and differential (assessing how the new information changes the overall risk determination).

The aggregation of risk assessment results from all three tiers drives the management of portfolios of risks undertaken by organizations. Identified risks common to more than one mission/business function within organizations may also be the source for future assessment activities at Tier 1, such as root-cause analysis. Gaining a better understanding of the reasons why certain risks are more common or frequent assists decision makers in selecting risk responses that address underlying (or root-cause) problems instead of solely focusing on the surface issues surrounding the existence of the risks. The results of risk assessments can also shape future design and development decisions related to enterprise architecture (including embedded information security architecture), and organizational information systems. The extent to which missions/business functions are vulnerable to a set of identified threats and the relative ease with which those vulnerabilities can be exploited, contribute to the risk-related information provided to senior leaders/executives.

Outputs from the risk assessment step can be useful inputs to the risk framing and risk monitoring steps. For example, risk determinations can result in revisiting the organizational risk tolerance established during the risk framing step. Organizations can also choose to use information from the risk assessment step to inform the risk monitoring step. For example, risk assessments can include recommendations to monitor specific elements of risk (e.g., threat sources) so that if certain thresholds are crossed, previous risk assessment results can be reviewed and updated, as appropriate. Particular thresholds established as part of risk monitoring programs can also serve as the basis for reassessments of risk. If organizations establish criteria as a part of the risk framing step for when risk assessment results do not warrant risk responses, then assessment results could be fed directly to the risk monitoring step as a source of input.

3.3 RESPONDING TO RISK Risk response identifies, evaluates, decides on, and implements appropriate courses of action to accept, avoid, mitigate, share, or transfer risk to organizational operations and assets, individuals, other organizations, and the Nation, resulting from the operation and use of information systems. Identifying and analyzing alternative courses of action63 typically occurs at Tier 1 or Tier 2. This is due to the fact that alternative courses of action (i.e., potential risk responses) are evaluated in terms of anticipated organization-wide impacts and the ability of organizations to continue to successfully carry out organizational missions and business functions. Decisions to employ risk response measures organization-wide are typically made at Tier 1, although the decisions are informed by risk-related information from the lower tiers. At Tier 2, alternative courses of action are evaluated in terms of anticipated impacts on organizational missions/business functions, the associated mission/business processes supporting the missions/business functions, and resource requirements. At Tier 3, alternative courses of action tend to be evaluated in terms of the system development life cycle or the maximum amount of time available for implementing the selected course(s) of action. The breadth of potential risk responses is a major factor for whether the activity is carried out at Tier 1, Tier 2, or Tier 3. Risk decisions are influenced by organizational risk tolerance developed as part of risk framing activities at Tier 1. Organizations can implement risk decisions at any of the risk management tiers with different objectives and utility of information produced.

STEP 3: RISK RESPONSE

Inputs and Preconditions Inputs from the risk assessment and risk framing steps include: (i) identification of threat sources and threat events; (ii) identification of vulnerabilities that are subject to exploitation; (iii) estimates of potential consequences and/or impact if

63 A course of action is a time-phased or situation-dependent combination of risk response measures. A risk response measure is a specific action taken to respond to an identified risk. Risk response measures can be separately managed and can include, for example, the implementation of security controls to mitigate risk, promulgation of security policies to avoid risk or to accept risk in specific circumstances, and organizational agreements to share or transfer risk.

CHAPTER 3 PAGE 41

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

threats exploit vulnerabilities; (iv) likelihood estimates that threats exploit vulnerabilities; (v) a determination of risk to organizational operations (i.e., mission, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation; (vi) risk response guidance from the organizational risk management strategy (see Appendix H); and (vii) the general organizational directions and guidance on appropriate responses to risk. In addition to the risk assessment and risk framing steps, the risk response step can receive inputs from the risk monitoring step (e.g., when organizations experience a breach or compromise to their information systems or environments of operation that require an immediate response to address the incident and reduce additional risk that results from the event). The risk response step can also receive inputs from the risk framing step (e.g., when organizations are required to deploy new safeguards and countermeasures in their information systems based on security requirements in new legislation or OMB policies). The risk framing step also directly shapes the resource constraints associated with selecting an appropriate course of action. Additional preconditions established at the risk framing step may include: (i) constraints based on architecture and previous investments; (ii) organizational preferences and tolerances; (iii) the expected effectiveness at mitigating risk (including how effectiveness is measured and monitored); and (iv) the time horizon for the risk (e.g., current risk, projected risk—that is, a risk expected to arise in the future based on the results of threat assessments or a planned changes in missions/business functions, enterprise architecture (including information security architecture), or aspects of legal or regulatory compliance).

Activities RISK RESPONSE IDENTIFICATION

TASK 3-1: Identify alternative courses of action to respond to risk determined during the risk assessment.

Supplemental Guidance: Organizations can respond to risk in a variety of ways. These include: (i) risk acceptance; (ii) risk avoidance; (iii) risk mitigation; (iv) risk sharing; (v) risk transfer; or (vi) a combination of the above. A course of action is a time-phased or situation-dependent combination of risk response measures. For example, in an emergency situation, organizations might accept the risk associated with unfiltered connection to an external communications provider for a limited time; then avoid risk by cutting the connection; mitigate risk in the near-term by applying security controls to search for malware or evidence of unauthorized access to information that occurred during the period of unfiltered connection; and finally mitigate risk long-term by applying controls to handle such connections more securely.

Risk Acceptance Risk acceptance is the appropriate risk response when the identified risk is within the organizational risk tolerance. Organizations can accept risk deemed to be low, moderate, or high depending on particular situations or conditions. For example, organizations with data centers residing in the northeastern portion of the United States may opt to accept the risk of earthquakes based on known likelihood of earthquakes and data center vulnerability to damage by earthquakes. Organizations accept the fact that earthquakes are possible, but given the infrequency of major earthquakes in that region of the country, believe it is not cost-effective to address such risk—that is, the organizations have determined that risk associated with earthquakes is low. Conversely, organizations may accept substantially greater risk (in the moderate/high range) due to compelling mission, business, or operational needs. For example, federal agencies may decide to share very sensitive information with first responders who do not typically have access to such information due to time-sensitive needs to stop pending terrorist attacks, even though the information is not itself perishable with regard to risk through loss of confidentiality. Organizations typically make determinations regarding the general level of acceptable risk and the types of acceptable risk with consideration of organizational priorities and trade-offs between: (i) near-term mission/business needs and potential for longer-term mission/business impacts; and (ii) organizational interests and the potential impacts on individuals, other organizations, and the Nation.

Risk Avoidance Risk avoidance may be the appropriate risk response when the identified risk exceeds the organizational risk tolerance. Organizations may conduct certain types of activities or employ certain types of information technologies that result in risk that is unacceptable. In such situations, risk avoidance involves taking specific actions to eliminate the activities or technologies that are the basis for the risk or to revise or reposition these activities or technologies in the organizational mission/business processes to avoid the potential for unacceptable risk. For example, organizations planning to employ networked connections between two domains, may determine through risk assessments that there is unacceptable risk in establishing such connections. Organizations may also determine that implementing effective safeguards and countermeasures (e.g., cross-domain solutions) is not practical in the given circumstances. Thus, the organizations decide to avoid the risk by eliminating the electronic or networked connections and employing an “air gap” with a manual connection processes (e.g., data transfers by secondary storage devices).

Risk Mitigation Risk mitigation, or risk reduction, is the appropriate risk response for that portion of risk that cannot be accepted, avoided, shared, or transferred. The alternatives to mitigate risk depend on: (i) the risk management tier and the scope

CHAPTER 3 PAGE 42

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

of risk response decisions assigned or delegated to organizational officials at that tier (defined by the organizational governance structures); and (ii) the organizational risk management strategy and associated risk response strategies. The means used by organizations to mitigate risk can involve a combination of risk response measures across the three tiers. For example, risk mitigation can include common security controls at Tier 1, process re-engineering at Tier 2, and/or new or enhanced management, operational, or technical safeguards or countermeasures (or some combination of all three) at Tier 3. Another example of a potential risk requiring mitigation can be illustrated when adversaries gain access to mobile devices (e.g., laptop computers or personal digital assistants) while users are traveling. Possible risk mitigation measures include, for example, organizational policies prohibiting transport of mobile devices to certain areas of the world or procedures for users to obtain a clean mobile device that is never allowed to connect to the organizational networks.

Risk Sharing or Transfer Risk sharing or risk transfer is the appropriate risk response when organizations desire and have the means to shift risk liability and responsibility to other organizations. Risk transfer shifts the entire risk responsibility or liability from one organization to another organization (e.g., using insurance to transfer risk from particular organizations to insurance companies). Risk sharing shifts a portion of risk responsibility or liability to other organizations (usually organizations that are more qualified to address the risk). It is important to note that risk transfer reduces neither the likelihood of harmful events occurring nor the consequences in terms of harm to organizational operations and assets, individuals, other organizations, or the Nation. Risk sharing may be a sharing of liability or a sharing of responsibility for other, adequate risk responses such as mitigation. Therefore, the concept of risk transfer is less applicable in the public sector (e.g., federal, state, local governments) than the private sector, as liability of organizations is generally established by legislation or policy. As such, self-initiated transfers of risk by public sector organizations (as typified by purchasing insurance) are generally not possible. Risk sharing often occurs when organizations determine that addressing risk requires expertise or resources that are better provided by other organizations. For example, an identified risk might be the physical penetration of perimeters and kinetic attacks by terrorist groups. The organization decides to partner with another organization sharing the physical facility to take joint responsibility for addressing risk from kinetic attacks.

EVALUATION OF ALTERNATIVES

TASK 3-2: Evaluate alternative courses of action for responding to risk. Supplemental Guidance: The evaluation of alternative courses of action can include: (i) the expected effectiveness in achieving desired risk response (and how effectiveness is measured and monitored); and (ii) anticipated feasibility of implementation, including, for example, mission/business impact, political, legal, social, financial, technical, and economic considerations. Economic considerations include costs throughout the expected period of time during which the course of action is followed (e.g., cost of procurement, integration into organizational processes at Tier 1 and/or Tier 2, information systems at Tier 3, training, and maintenance). During the evaluation of alternative courses of action, trade-offs can be made explicit between near-term gains in mission/business effectiveness or efficiency and long-term risk of mission/business harm due to compromise of information or information systems that are providing this near- term benefit. For example, organizations concerned about the potential for mobile devices (e.g., laptop computers) being compromised while employees are on travel can evaluate several courses of action including: (i) providing users traveling to high-risk areas with clean laptops; (ii) removing hard drives from laptops and operate from CDs or DVDs; or (iii) having laptops go through a detailed assessment before being allowed to connect to organizational networks. The first option is highly effective as returning laptops are never connected to organizational networks. While the second option ensures that hard drives cannot be corrupted, it is not quite as effective in that it is still possible that hardware devices (e.g., motherboards) could have been compromised. The effectiveness of the third option is limited by the ability of organizations to detect potential insertion of malware into the hardware, firmware, or software. As such, it is the least effective of the three options. From a cost perspective, the first option is potentially the most expensive, depending upon the number of travelers (hence number of travel laptops) required. The second and third options are considerably less expensive. From a mission and operational perspective, the third option is the best alternative as users have access to standard laptop configurations including all applications and supporting data needed to perform tasks supporting missions and business functions. Such applications and data would not be available if the first or second option is selected. Ultimately, the evaluation of courses of action is made based on operational requirements, including information security requirements, needed for near and long term mission/business success. Budgetary constraints, consistency with investment management strategies, civil liberties, and privacy protection, are some of the important elements organizations consider when selecting appropriate courses of action. In those instances where organizations only identify a single course of action, then the evaluation is focused on whether the course of action is adequate. If the course of action is deemed inadequate, then organizations need to refine the identified course of action to address the inadequacies or develop another course of action (see Task 3-1).

In summary, a risk verses risk-response trade-off is conducted for each course of action to provide the information necessary for: (i) selecting between the courses of action; and (ii) evaluating the courses of action in terms of response effectiveness, costs, mission/business impact, and any other factors deemed relevant to organizations. Part of risk

CHAPTER 3 PAGE 43

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

versus risk-response trade-off considers the issue of competing resources. From an organizational perspective, this means organizations consider whether the cost (e.g., money, personnel, time) for implementing a given course of action has the potential to adversely impact other missions or business functions, and if so, to what extent. This is necessary because organizations have finite resources to employ and many competing missions/business functions across many organizational elements. Therefore, organizations assess the overall value of alternative courses of action with regard to the missions/business functions and the potential risk to each organizational element. Organizations may determine that irrespective of a particular mission/business function and the validity of the associated risk, there are more important missions/business functions that face more significant risks, and hence have a better claim on the limited resources.

RISK RESPONSE DECISION

TASK 3-3: Decide on the appropriate course of action for responding to risk.

Supplemental Guidance: Decisions on the most appropriate course of action include some form of prioritization. Some risks may be of greater concern than other risks. In that case, more resources may need to be directed at addressing higher-priority risks than at other lower-priority risks. This does not necessarily mean that the lower-priority risks would not be addressed. Rather, it could mean that fewer resources might be directed at the lower-priority risks (at least initially), or that the lower-priority risks would be addressed at a later time. A key part of the risk decision process is the recognition that regardless of the decision, there still remains a degree of residual risk that must be addressed. Organizations determine acceptable degrees of residual risk based on organizational risk tolerance and the specific risk tolerances of particular decision makers. Impacting the decision process are some of the more intangible risk-related concepts (e.g., risk tolerance, trust, and culture). The specific beliefs and approaches that organizations embrace with respect to these risk-related concepts affect the course of action selected by decision-makers.

RISK RESPONSE IMPLEMENTATION

TASK 3-4: Implement the course of action selected to respond to risk.

Supplemental Guidance: Once a course of action is selected, organizations implement the associated risk response. Given the size and complexity of some organizations, the actual implementation of risk response measures may be challenging. Some risk response measures are tactical in nature (e.g., applying patches to identified vulnerabilities in organizational information systems) and may be implemented rather quickly. Other risk response measures may be more strategic in nature and reflect solutions that take much longer to implement. Therefore, organizations apply, and tailor as appropriate to a specific risk response course of action, the risk response implementation considerations in the risk response strategies (part of the risk management strategy developed during the risk framing step). See Appendix H, Risk Response Strategies.

Outputs and Post Conditions The output of the risk response step is the implementation of the selected courses of action with consideration for: (i) individuals or organizational elements responsible for the selected risk response measures and specifications of effectiveness criteria (i.e., articulation of indicators and thresholds against which the effectiveness of risk response measures can be judged); (ii) dependencies of each selected risk response measure on other risk response measures; (iii) dependencies of selected risk response measures on other factors (e.g., the implementation of other planned information technology measures); (iv) timeline for implementation of risk response measures; (v) plans for monitoring the effectiveness of risk response measures; (vi) identification of risk monitoring triggers; and (vii) interim risk response measures selected for implementation, if appropriate. There are also ongoing communications and sharing of risk-related information with individuals or organizational elements impacted by the risk responses (including potential actions that may need to be taken by the individuals or organizational elements).

In addition to the risk monitoring step, outputs from the risk response step can be useful inputs to the risk framing and risk assessment steps. For example, it is possible that the analysis occurring during the evaluation of alternative courses of action may call into question some aspects of the risk response strategy that is part of the risk management strategy generated during the risk framing step. In such instances, organizations use that information to inform the risk framing step with appropriate actions taken to revisit the risk management strategy and its associated risk response strategy. Organizations might also determine during the evaluation of alternative courses of action for risk response, that some aspects of the risk assessments are incomplete or incorrect. This information can be used to inform the risk assessment step possibly resulting in further analysis or reassessments of risk.

CHAPTER 3 PAGE 44

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

3.4 MONITORING RISK Risk monitoring provides organizations with the means to: (i) verify compliance;64 (ii) determine the ongoing effectiveness of risk response measures; and (iii) identify risk-impacting changes to organizational information systems and environments of operation. Analyzing monitoring results gives organizations the capability to maintain awareness of the risk being incurred, highlight the need to revisit other steps in the risk management process, and initiate process improvement activities as needed.65 Organizations employ risk monitoring tools, techniques, and procedures to increase risk awareness, helping senior leaders/executives develop a better understanding of the ongoing risk to organizational operations and assets, individuals, other organizations, and the Nation. Organizations can implement risk monitoring at any of the risk management tiers with different objectives and utility of information produced. For example, Tier 1 monitoring activities might include ongoing threat assessments and how changes in the threat space may affect Tier 2 and Tier 3 activities, including enterprise architectures (with embedded information security architectures) and organizational information systems. Tier 2 monitoring activities might include, for example, analyses of new or current technologies either in use or considered for future use by organizations to identify exploitable weaknesses and/or deficiencies in those technologies that may affect mission/business success. Tier 3 monitoring activities focus on information systems and might include, for example, automated monitoring of standard configuration settings for information technology products, vulnerability scanning, and ongoing assessments of security controls. In addition to deciding on appropriate monitoring activities across the risk management tiers, organizations also decide how monitoring is to be conducted (e.g., automated or manual approaches) and the frequency of monitoring activities based on, for example, the frequency with which deployed security controls change, critical items on plans of action and milestones, and risk tolerance.

STEP 4: RISK MONITORING

Inputs and Preconditions Inputs to this step include implementation strategies for selected courses of action for risk responses and the actual implementation of selected courses of action. In addition to the risk response step, the risk monitoring step can receive inputs from the risk framing step (e.g., when organizations become aware of an advanced persistent threat reflecting a change in threat assumptions, this may result in a change in the frequency of follow on monitoring activities). The risk framing step also directly shapes the resource constraints associated with establishing and implementing an organization-wide monitoring strategy. In some instances, outputs from the risk assessment step may be useful inputs to the risk monitoring step. For example, risk assessment threshold conditions (e.g., likelihood of threats exploiting vulnerabilities) could be input to the risk monitoring step. In turn, organizations could monitor to determine if such threshold conditions are met. If threshold conditions are met, such information could be used in the risk assessment step, where it could serve as the basis for an incremental, differential risk assessment or an overall reassessment of risk to the organization.

Activities RISK MONITORING STRATEGY

TASK 4-1: Develop a risk monitoring strategy for the organization that includes the purpose, type, and frequency of monitoring activities.

64 Compliance verification ensures that organizations have implemented required risk response measures and that information security requirements derived from and traceable to organizational missions/business functions, federal legislation, directives, regulations, policies, and standards/guidelines are satisfied. 65 Draft NIST Special Publication 800-137 provides guidance on monitoring organizational information systems and environments of operation.

CHAPTER 3 PAGE 45

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Supplemental Guidance: Organizations implement risk monitoring programs: (i) to verify that required risk response measures are implemented and that information security requirements derived from and traceable to organizational missions/business functions, federal legislation, directives, regulations, policies, and standards/guidelines, are satisfied (compliance monitoring); (ii) to determine the ongoing effectiveness of risk response measures after the measures have been implemented (effectiveness monitoring); and (iii) to identify changes to organizational information systems and the environments in which the systems operate that may affect risk (change monitoring) including changes in the feasibility of the ongoing implementation of risk response measures). Determining the purpose of risk monitoring programs directly impacts the means used by organizations to conduct the monitoring activities and where monitoring occurs (i.e., at which risk management tiers). Organizations also determine the type of monitoring to be employed, including approaches that rely on automation or approaches that rely on procedural/manual activities with human intervention. Finally, organizations determine how often monitoring activities are conducted, balancing value gained from frequent monitoring with potential for operational disruptions due for example, to interruption of mission/business processes, reduction in operational bandwidth during monitoring, and shift of resources from operations to monitoring. Monitoring strategies developed at Tier 1 influence and provide direction for similar strategies developed at Tier 2 and Tier 3 including the monitoring activities associated with the Risk Management Framework at the information system level.

Monitoring Compliance Compliance monitoring is employed to ensure that organizations are implementing needed risk response measures. This includes ensuring that the risk response measures selected and implemented by organizations in response to risk determinations produced from risk assessments are implemented correctly and operating as intended. Failure to implement the risk response measures selected by organizations can result in the organizations continuing to be subject to the identified risk. Compliance monitoring also includes ensuring that risk response measures required by federal mandates (e.g., legislation, directives, policies, regulations, standards) or organizational mandates (e.g., local policies, procedures, mission/business requirements) are implemented. Compliance monitoring is the easiest type of monitoring to perform because there are typically a finite set of risk response measures employed by organizations usually in the form of security controls. Such measures are typically well-defined and articulated as an output from the risk response step. The more challenging part of compliance monitoring is evaluating whether the risk response measures are implemented correctly (and in some instances continuously). Compliance monitoring also includes, as feasible, analysis as to why compliance failed. The reason for compliance failure can range from individuals failing to do their jobs correctly to the risk response measure not functioning as intended. If monitoring indicates a failure in compliance, then the response step of the risk management process is revisited. A key element of the feedback to the response step is the finding from compliance monitoring indicating the reason for the compliance failure. In some instances, compliance failures can be fixed by simply re-implementing the same risk response measures with little or no change. But in other instances, compliance failures are more complicated (e.g., the selected risk response measures are too difficult to implement or the measures did not function as expected). In such instances, it may be necessary for organizations to return to the evaluation and decision portions of the risk response step to develop different risk response measures.

Monitoring Effectiveness Effectiveness monitoring is employed by organizations to determine if implemented risk response measures have actually been effective in reducing identified risk to the desired level. Although effectiveness monitoring is different than compliance monitoring, failure to achieve desired levels of effectiveness may be an indication that risk response measures have been implemented incorrectly or are not operating as intended. Determining the effectiveness of risk response measures is generally more challenging than determining whether the measures have been implemented correctly and are operating as intended (i.e., meeting identified compliance requirements). Risk response measures implemented correctly and operating as intended do not guarantee an effective reduction of risk. This is primarily due to: (i) the complexity of operating environments which may generate unintended consequences; (ii) subsequent changes in levels of risk or associated risk factors (e.g., threats, vulnerabilities, impact, or likelihood); (iii) inappropriate or incomplete criteria established as an output of the risk response step; and (iv) changes in information systems and environments of operation after implementation of risk response measures. This is especially true when organizations try to determine if more strategic outcomes have been achieved and for more dynamic operating environments. For example, if the desired outcome for organizations is to be less susceptible to advanced persistent threats, this may be challenging to measure since these types of threats are, by definition, very difficult to detect. Even when organizations are able to establish effectiveness criteria, it is often difficult to obtain criteria that are quantifiable. Therefore, it may become a matter of subjective judgment as to whether the implemented risk response measures are ultimately effective. Moreover, even if quantifiable effectiveness criteria are provided, it may be difficult to determine if the information provided satisfies the criteria. If organizations determine that risk response measures are not effective, then it may be necessary to return to the risk response step. Generally, for effectiveness failures, organizations cannot simply return to the implementation portion of the risk response step. Therefore, depending on the reason for the lack of effectiveness, organizations revisit all portions of the risk response step (i.e., development, evaluation, decision, and implementation) and potentially the risk assessment step. These activities may result in organizations developing and implementing entirely new risk responses.

CHAPTER 3 PAGE 46

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Monitoring Changes In addition to compliance monitoring and effectiveness monitoring, organizations monitor changes to organizational information systems and the environments in which those systems operate. Monitoring changes to information systems and environments of operation is not linked directly to previous risk response measures but it is nonetheless important to detect changes that may affect the risk to organizational operations and assets, individuals, other organizations, and the Nation. Generally, such monitoring detects changes in conditions that may undermine risk assumptions (articulated in the risk framing step). • Information System: Changes can occur in organizational information systems (including hardware, software, and

firmware) that can introduce new risk or change existing risk. For example, updates to operating system software can eliminate security capabilities that existed in earlier versions, thus introducing new vulnerabilities into organizational information systems. Another example is the discovery of new system vulnerabilities that fall outside of the scope of the tools and processes available to address such vulnerabilities (e.g., vulnerabilities for which there are no established mitigations).

• Environments of Operation: The environments in which information systems operate can also change in ways that introduce new risk or change existing risk. Environmental and operational considerations include, but are not limited to, missions/business functions, threats, vulnerabilities, mission/business processes, facilities, policies, legislation, and technologies. For example, new legislation or regulations could be passed that impose additional requirements on organizations. This change might affect the risk assumptions established by organizations. Another example is a change in the threat environment that reports new tactics, techniques, procedures, or increases in the technical capabilities of adversaries. Organizations might experience reductions in available resources (e.g., personnel or funding), which in turn results in changing priorities. Organizations might also experience changes in the ownership of third-party suppliers which could affect supply chain risk. Mission changes may require that organizations revisit underlying risk assumptions. For example, an organization whose mission is to collect threat information on possible domestic terrorist attacks and share such information with appropriate federal law enforcement and intelligence agencies may have its scope changed so that the organization is responsible for also sharing some of the information with local first responders. Such a change could affect assumptions regarding the security resources such users may have at their disposal. Changes in technology may also affect the underlying risk assumptions established by organizations. Unlike other types of change, technology changes may be totally independent of organizations, but still affect the risk organizations must address. For example, improvements in computing power may undermine assumptions regarding what constitutes sufficiently strong means of authentication (e.g., number of authentication factors) or cryptographic mechanism.

Automated Versus Manual Monitoring Broadly speaking, organizations can conduct monitoring either by automated or manual methods. Where automated monitoring is feasible, it should be employed because it is generally faster, more efficient, and more cost-effective than manual monitoring. Automated monitoring is also less prone to human error. However, not all monitoring can take advantage of automation. Monitoring conducted at Tier 3 generally lends itself to automation where activities being monitored are information technology-based. Such activities can usually be detected, tracked, and monitored through the installation of appropriate software, hardware and/or firmware. To ensure that automated processes, procedures, and/or mechanisms supporting monitoring activities are providing the information needed, such processes, procedures, and mechanisms should be appropriately validated, updated and monitored. Compliance monitoring can be supported by automation when the risk mitigation measures being validated are information technology-based (e.g., installation of firewalls or testing of configuration settings on desktop computers). Such automated validation can often check whether risk mitigation measures are installed and whether the installations are correct. Similarly, effectiveness monitoring may also be supported by automation. If the threshold conditions for determining the effectiveness of risk response measures are predetermined, then automation can support such effectiveness monitoring. While automation can be a supporting capability for Tiers 1 and 2, generally automation does not provide substantive insight for non- information technology-based activities which are more prevalent at those higher tiers. Activities that are not as likely to benefit from automation include, for example, the use of multiple suppliers within the supply chain, evolving environments of operation, or evaluating the promise of emerging technical capabilities in support of missions/business functions. Where automated monitoring is not available, organizations employ manual monitoring and/or analysis.

Frequency of Monitoring The frequency of risk monitoring (whether automated or manual) is driven by organizational missions/business functions and the ability of organizations to use the monitoring results to facilitate greater situational awareness. An increased level of situational awareness of the security state of organizational information systems and environments of operation helps organizations develop a better understanding of risk. Monitoring frequency is also driven by other factors, for example: (i) the anticipated frequency of changes in organizational information systems and operating environments; (ii) the potential impact of risk if not properly addressed through appropriate response measures; and (iii) the degree to which the threat space is changing. The frequency of monitoring can also be affected by the type of monitoring conducted (i.e., automated versus procedural approaches). Depending on the frequency of monitoring

CHAPTER 3 PAGE 47

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

required by organizations, in most situations, monitoring is most efficient and cost-effective when automation is employed. Monitoring can provide significant benefits, especially in situations where such monitoring limits the opportunities for adversaries to gain a foothold within organizations (either through information systems or the environments in which those systems operate). When manual monitoring is employed by organizations, it is generally not efficient to perform the monitoring with the frequency that automation allows. In some instances, infrequent monitoring is not a major issue. For example, missions/business functions, facilities, legislation, policies, and technologies tend to change on a more gradual basis and as such, do not lend themselves to frequent monitoring. Instead, these types of changes are better suited to condition/event-based monitoring (e.g., if missions and/or business functions change, then monitoring of such changes is appropriate to determine if the changes have any impact on risk).

RISK MONITORING

TASK 4-2: Monitor organizational information systems and environments of operation on an ongoing basis to verify compliance, determine effectiveness of risk response measures, and identify changes.

Supplemental Guidance: Once organizations complete the development of their monitoring strategies, the strategies are implemented organization-wide. Because there are so many diverse aspects of monitoring, not all aspects of monitoring may be performed, or they may be performed at different times. The particular aspects of monitoring that are performed are dictated largely by the assumptions, constraints, risk tolerance, and priorities/trade-offs established by organizations during the risk framing step. For example, while organizations might desire to conduct all forms of monitoring (i.e., compliance, effectiveness, and change), the constraints imposed upon the organizations may allow only compliance monitoring that can be readily automated at Tier 3. If multiple aspects of monitoring can be supported, the output from the risk framing step helps organizations to determine the degree of emphasis and level of effort to place on the various monitoring activities.

As noted above, not all monitoring activities are conducted at the same tiers, for the same purpose, at the same time, or using the same techniques. However, it is important that organizations attempt to coordinate the various monitoring activities. Coordination of monitoring activities facilitates the sharing of risk-related information that may be useful for organizations in providing early warning, developing trend information, or allocating risk response measures in a timely and efficient manner. If monitoring is not coordinated, then the benefit of monitoring may be reduced, and could undermine the overall effort to identify and address risk. As feasible, organizations implement the various monitoring activities in a manner that maximizes the overall goal of monitoring, looking beyond the limited goals of particular monitoring activities. Risk monitoring results are applied in performing incremental risk assessments to maintain awareness of the risk being incurred, to highlight changes in risk, and to indicate the need to revisit other steps in the risk management process, as appropriate.

Outputs and Post Conditions The output from the risk monitoring step is the information generated by: (i) verifying that required risk response measures are implemented and that information security requirements derived from and traceable to organizational missions/business functions, federal legislation, directives, regulations, policies, and standards/guidelines, are satisfied; (ii) determining the ongoing effectiveness of risk response measures; and (iii) identifying changes to organizational information systems and environments of operation. Outputs from the risk monitoring step can be useful inputs to the risk framing, risk assessment, and risk response steps. For example, compliance monitoring results may require that organizations revisit the implementation portion of the risk response step, while effectiveness monitoring results may require that organizations revisit the entire risk response step. The results of monitoring for changes to information systems and environments of operation may require organizations to revisit the risk assessment step. The results of the risk monitoring step can also serve the risk framing step (e.g., when organizations discover new threats or vulnerabilities that affect changes in organizational risk assumptions, risk tolerance, and/or priorities/trade-offs).

CHAPTER 3 PAGE 48

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX A

REFERENCES LAWS, POLICIES, DIRECTIVES, INSTRUCTIONS, STANDARDS, AND GUIDELINES

LEGISLATION

1. E-Government Act [includes FISMA] (P.L. 107-347), December 2002.

2. Federal Information Security Management Act (P.L. 107-347, Title III), December 2002.

POLICIES, DIRECTIVES, INSTRUCTIONS

1. Committee on National Security Systems (CNSS) Instruction 4009, National Information Assurance (IA) Glossary, April 2010.

2. Committee on National Security Systems (CNSS) Instruction 1253, Security Categorization and Control Selection for National Security Systems, October 2009.

3. Office of Management and Budget, Circular A-130, Appendix III, Transmittal Memorandum #4, Management of Federal Information Resources, November 2000.

STANDARDS

1. National Institute of Standards and Technology Federal Information Processing Standards Publication 199, Standards for Security Categorization of Federal Information and Information Systems, February 2004.

2. National Institute of Standards and Technology Federal Information Processing Standards Publication 200, Minimum Security Requirements for Federal Information and Information Systems, March 2006.

3. ISO/IEC 15408:2005, Common Criteria for Information Technology Security Evaluation, 2005.

GUIDELINES

1. National Institute of Standards and Technology Special Publication 800-18, Revision 1, Guide for Developing Security Plans for Federal Information Systems, February 2006.

2. National Institute of Standards and Technology Special Publication 800-30, Revision 1, Guide for Conducting Risk Assessments, (Projected Publication Spring 2011).

3. National Institute of Standards and Technology Special Publication 800-37, Revision 1, Guide for Applying the Risk Management Framework to Federal Information Systems: A Security Life Cycle Approach, February 2010.

4. National Institute of Standards and Technology Special Publication 800-53, Revision 3, Recommended Security Controls for Federal Information Systems and Organizations, August 2009.

5. National Institute of Standards and Technology Special Publication 800-53A, Revision 1, Guide for Assessing the Security Controls in Federal Information Systems and Organizations: Building Effective Security Assessment Plans, June 2010.

6. National Institute of Standards and Technology Special Publication 800-59, Guideline for Identifying an Information System as a National Security System, August 2003.

APPENDIX A PAGE A-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

7. National Institute of Standards and Technology Special Publication 800-60, Revision 1, Guide for Mapping Types of Information and Information Systems to Security Categories, August 2008.

8. National Institute of Standards and Technology Special Publication 800-70, Revision 1, National Checklist Program for IT Products--Guidelines for Checklist Users and Developers, September 2009.

9. National Institute of Standards and Technology Special Publication 800-137, Initial Public Draft, Information Security Continuous Monitoring for Federal Information Systems and Organizations, December 2010.

APPENDIX A PAGE A-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX B

GLOSSARY COMMON TERMS AND DEFINITIONS

This appendix provides definitions for security terminology used within Special Publication 800-39. The terms in the glossary are consistent with the terms used in the suite of FISMA-related security standards and guidelines developed by NIST. Unless otherwise stated, all terms used in this publication are also consistent with the definitions contained in the CNSS Instruction 4009, National Information Assurance (IA) Glossary.

Adequate Security Security commensurate with the risk and magnitude of [OMB Circular A-130, Appendix III] harm resulting from the loss, misuse, or unauthorized

access to or modification of information.

Advanced Persistent Threat An adversary that possesses sophisticated levels of expertise and significant resources which allow it to create opportunities to achieve its objectives by using multiple attack vectors (e.g., cyber, physical, and deception). These objectives typically include establishing and extending footholds within the information technology infrastructure of the targeted organizations for purposes of exfiltrating information, undermining or impeding critical aspects of a mission, program, or organization; or positioning itself to carry out these objectives in the future. The advanced persistent threat: (i) pursues its objectives repeatedly over an extended period of time; (ii) adapts to defenders’ efforts to resist it; and (iii) is determined to maintain the level of interaction needed to execute its objectives.

Agency See Executive Agency.

Assessment See Security Control Assessment.

Assessor See Security Control Assessor.

Assurance Measure of confidence that the security features, [CNSSI 4009] practices, procedures, and architecture of an information

system accurately mediates and enforces the security policy.

[NIST SP 800-53] Grounds for confidence that the set of intended security controls in an information system are effective in their application.

Assurance Case A structured set of arguments and a body of evidence [Software Engineering Institute, showing that an information system satisfies specific Carnegie Mellon University] claims with respect to a given quality attribute.

Authentication Verifying the identity of a user, process, or device, often [FIPS 200] as a prerequisite to allowing access to resources in an

information system.

APPENDIX B PAGE B-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Authenticity

Authorization (to operate)

Authorization Boundary [NIST SP 800-37]

Authorizing Official [CNSSI 4009]

Availability [44 U.S.C., Sec. 3542]

Chief Information Officer [PL 104-106, Sec. 5125(b)]

Chief Information Security Officer

Classified National Security Information [CNSSI 4009]

The property of being genuine and being able to be verified and trusted; confidence in the validity of a transmission, a message, or message originator. See Authentication.

The official management decision given by a senior organizational official to authorize operation of an information system and to explicitly accept the risk to organizational operations (including mission, functions, image, or reputation), organizational assets, individuals, other organizations, and the Nation based on the implementation of an agreed-upon set of security controls.

All components of an information system to be authorized for operation by an authorizing official and excludes separately authorized systems, to which the information system is connected.

Senior (federal) official or executive with the authority to formally assume responsibility for operating an information system at an acceptable level of risk to organizational operations (including mission, functions, image, or reputation), organizational assets, individuals, other organizations, and the Nation.

Ensuring timely and reliable access to and use of information.

Agency official responsible for: (i) Providing advice and other assistance to the head of the executive agency and other senior management personnel of the agency to ensure that information technology is acquired and information resources are managed in a manner that is consistent with laws, Executive Orders, directives, policies, regulations, and priorities established by the head of the agency; (ii) Developing, maintaining, and facilitating the implementation of a sound and integrated information technology architecture for the agency; and (iii) Promoting the effective and efficient design and operation of all major information resources management processes for the agency, including improvements to work processes of the agency.

See Senior Agency Information Security Officer. Information that has been determined pursuant to Executive Order 13526 or any predecessor order to require protection against unauthorized disclosure and is marked to indicate its classified status when in documentary form.

APPENDIX B PAGE B-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Common Control [NIST SP 800-37]

Common Control Provider [NIST SP 800-37]

Compensating Security Control [CNSSI 4009]

Confidentiality [44 U.S.C., Sec. 3542]

Course of Action (Risk Response)

Cyber Attack [CNSSI 4009]

Cyber Security [CNSSI 4009]

Cyberspace [CNSSI 4009]

Defense-in-Breadth [CNSSI 4009]

Defense-in-Depth [CNSSI 4009]

A security control that is inherited by one or more organizational information systems. See Security Control Inheritance.

An organizational official responsible for the development, implementation, assessment, and monitoring of common controls (i.e., security controls inherited by information systems).

A management, operational, and/or technical control (i.e., safeguard or countermeasure) employed by an organization in lieu of a recommended security control in the low, moderate, or high baselines that provides equivalent or comparable protection for an information system.

Preserving authorized restrictions on information access and disclosure, including means for protecting personal privacy and proprietary information.

A time-phased or situation-dependent combination of risk response measures.

An attack, via cyberspace, targeting an enterprise’s use of cyberspace for the purpose of disrupting, disabling, destroying, or maliciously controlling a computing environment/infrastructure; or destroying the integrity of the data or stealing controlled information.

The ability to protect or defend the use of cyberspace from cyber attacks.

A global domain within the information environment consisting of the interdependent network of information systems infrastructures including the Internet, telecommunications networks, computer systems, and embedded processors and controllers.

A planned, systematic set of multidisciplinary activities that seek to identify, manage, and reduce risk of exploitable vulnerabilities at every stage of the system, network, or subcomponent life cycle (system, network, or product design and development; manufacturing; packaging; assembly; system integration; distribution; operations; maintenance; and retirement).

Information security strategy integrating people, technology, and operations capabilities to establish variable barriers across multiple layers and missions of the organization.

APPENDIX B PAGE B-3

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Enterprise [CNSSI 4009]

Enterprise Architecture [CNSSI 4009]

Environment of Operation [NIST SP 800-37]

Executive Agency [41 U.S.C., Sec. 403]

Federal Agency

Federal Information System [40 U.S.C., Sec. 11331]

Hybrid Security Control [NIST SP 800-53]

Individuals

Industrial Control System

Information [CNSSI 4009]

[FIPS 199]

An organization with a defined mission/goal and a defined boundary, using information systems to execute that mission, and with responsibility for managing its own risks and performance. An enterprise may consist of all or some of the following business aspects: acquisition, program management, financial management (e.g., budgets), human resources, security, and information systems, information and mission management. See Organization.

The description of an enterprise’s entire set of information systems: how they are configured, how they are integrated, how they interface to the external environment at the enterprise’s boundary, how they are operated to support the enterprise mission, and how they contribute to the enterprise’s overall security posture.

The physical surroundings in which an information system processes, stores, and transmits information.

An executive department specified in 5 U.S.C., Sec. 101; a military department specified in 5 U.S.C., Sec. 102; an independent establishment as defined in 5 U.S.C., Sec. 104(1); and a wholly owned Government corporation fully subject to the provisions of 31 U.S.C., Chapter 91.

See Executive Agency.

An information system used or operated by an executive agency, by a contractor of an executive agency, or by another organization on behalf of an executive agency.

A security control that is implemented in an information system in part as a common control and in part as a system-specific control. See Common Control and System-Specific Security Control.

An assessment object that includes people applying specifications, mechanisms, or activities.

An information system used to control industrial processes such as manufacturing, product handling, production, and distribution. Industrial control systems include supervisory control and data acquisition systems used to control geographically dispersed assets, as well as distributed control systems and smaller control systems using programmable logic controllers to control localized processes.

Any communication or representation of knowledge such as facts, data, or opinions in any medium or form, including textual, numerical, graphic, cartographic, narrative, or audiovisual. An instance of an information type.

APPENDIX B PAGE B-4

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Information Owner Official with statutory or operational authority for [CNSSI 4009] specified information and responsibility for establishing

the controls for its generation, classification, collection, processing, dissemination, and disposal. See Information Steward.

Information Resources Information and related resources, such as personnel, [44 U.S.C., Sec. 3502] equipment, funds, and information technology.

Information Security The protection of information and information systems [44 U.S.C., Sec. 3542] from unauthorized access, use, disclosure, disruption,

modification, or destruction in order to provide confidentiality, integrity, and availability.

Information Security Architecture An embedded, integral part of the enterprise architecture that describes the structure and behavior for an enterprise’s security processes, information security systems, personnel and organizational sub-units, showing their alignment with the enterprise’s mission and strategic plans.

Information Security Program Plan Formal document that provides an overview of the [NIST SP 800-53] security requirements for an organization-wide

information security program and describes the program management controls and common controls in place or planned for meeting those requirements.

Information Steward An agency official with statutory or operational authority [CNSSI 4009] for specified information and responsibility for

establishing the controls for its generation, collection, processing, dissemination, and disposal.

Information System A discrete set of information resources organized for the [44 U.S.C., Sec. 3502] collection, processing, maintenance, use, sharing,

dissemination, or disposition of information.

Information System Boundary See Authorization Boundary.

Information System Owner Official responsible for the overall procurement, (or Program Manager) development, integration, modification, or operation and

maintenance of an information system.

Information System Resilience The ability of an information system to continue to: (i) operate under adverse conditions or stress, even if in a degraded or debilitated state, while maintaining essential operational capabilities; and (ii) recover to an effective operational posture in a time frame consistent with mission needs.

Information System Individual assigned responsibility by the senior agency Security Officer information security officer, authorizing official,

management official, or information system owner for maintaining the appropriate operational security posture for an information system or program.

APPENDIX B PAGE B-5

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Information Security Risk

Information System-Related Security Risks

Information Technology [40 U.S.C., Sec. 1401]

Information Type [FIPS 199]

Integrity [44 U.S.C., Sec. 3542]

Management Controls [FIPS 200]

The risk to organizational operations (including mission, functions, image, reputation), organizational assets, individuals, other organizations, and the Nation due to the potential for unauthorized access, use, disclosure, disruption, modification, or destruction of information and/or information systems.

Risks that arise through the loss of confidentiality, integrity, or availability of information or information systems and consider impacts to the organization (including assets, mission, functions, image, or reputation), individuals, other organizations, and the Nation. See Risk.

Any equipment or interconnected system or subsystem of equipment that is used in the automatic acquisition, storage, manipulation, management, movement, control, display, switching, interchange, transmission, or reception of data or information by the executive agency. For purposes of the preceding sentence, equipment is used by an executive agency if the equipment is used by the executive agency directly or is used by a contractor under a contract with the executive agency which: (i) requires the use of such equipment; or (ii) requires the use, to a significant extent, of such equipment in the performance of a service or the furnishing of a product. The term information technology includes computers, ancillary equipment, software, firmware, and similar procedures, services (including support services), and related resources.

A specific category of information (e.g., privacy, medical, proprietary, financial, investigative, contractor sensitive, security management) defined by an organization or in some instances, by a specific law, Executive Order, directive, policy, or regulation.

Guarding against improper information modification or destruction, and includes ensuring information non- repudiation and authenticity.

The security controls (i.e., safeguards or countermeasures) for an information system that focus on the management of risk and the management of information system security.

APPENDIX B PAGE B-6

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

National Security System [44 U.S.C., Sec. 3542]

Operational Controls [FIPS 200]

Organization [FIPS 200, Adapted]

Plan of Action and Milestones [OMB Memorandum 02-01]

Reciprocity

Resilience

Risk [CNSSI 4009]

Any information system (including any telecommunications system) used or operated by an agency or by a contractor of an agency, or other organization on behalf of an agency (i) the function, operation, or use of which involves intelligence activities; involves cryptologic activities related to national security; involves command and control of military forces; involves equipment that is an integral part of a weapon or weapons system; or is critical to the direct fulfillment of military or intelligence missions (excluding a system that is to be used for routine administrative and business applications, for example, payroll, finance, logistics, and personnel management applications); or (ii) is protected at all times by procedures established for information that have been specifically authorized under criteria established by an Executive Order or an Act of Congress to be kept classified in the interest of national defense or foreign policy.

The security controls (i.e., safeguards or countermeasures) for an information system that are primarily implemented and executed by people (as opposed to systems).

An entity of any size, complexity, or positioning within an organizational structure (e.g., a federal agency or, as appropriate, any of its operational elements). See Enterprise.

A document that identifies tasks needing to be accomplished. It details resources required to accomplish the elements of the plan, any milestones in meeting the tasks, and scheduled completion dates for the milestones.

Mutual agreement among participating organizations to accept each other’s security assessments in order to reuse information system resources and/or to accept each other’s assessed security posture in order to share information.

See Information System Resilience.

A measure of the extent to which an entity is threatened by a potential circumstance or event, and typically a function of: (i) the adverse impacts that would arise if the circumstance or event occurs; and (ii) the likelihood of occurrence. [Note: Information system-related security risks are those risks that arise from the loss of confidentiality, integrity, or availability of information or information systems and reflect the potential adverse impacts to organizational operations (including mission, functions, image, or reputation), organizational assets, individuals, other organizations, and the Nation.]

APPENDIX B PAGE B-7

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Risk Assessment The process of identifying risks to organizational operations (including mission, functions, image, reputation), organizational assets, individuals, other organizations, and the Nation, resulting from the operation of an information system. Part of risk management, incorporates threat and vulnerability analyses, and considers mitigations provided by security controls planned or in place. Synonymous with risk analysis.

Risk Executive (Function) [CNSSI 4009]

An individual or group within an organization that helps to ensure that: (i) security risk-related considerations for individual information systems, to include the authorization decisions for those systems, are viewed from an organization-wide perspective with regard to the overall strategic goals and objectives of the organization in carrying out its missions and business functions; and (ii) managing risk from individual information systems is consistent across the organization, reflects organizational risk tolerance, and is considered along with other organizational risks affecting mission/business success.

Risk Management [CNSSI 4009, adapted]

The program and supporting processes to manage information security risk to organizational operations (including mission, functions, image, reputation), organizational assets, individuals, other organizations, and the Nation, and includes: (i) establishing the context for risk-related activities; (ii) assessing risk; (iii) responding to risk once determined; and (iv) monitoring risk over time.

Risk Mitigation [CNSSI 4009]

Prioritizing, evaluating, and implementing the appropriate risk-reducing controls/countermeasures recommended from the risk management process.

Risk Monitoring Maintaining ongoing awareness of an organization’s risk environment, risk management program, and associated activities to support risk decisions.

Risk Response Accepting, avoiding, mitigating, sharing, or transferring risk to organizational operations (i.e., mission, functions, image, or reputation), organizational assets, individuals, other organizations, or the Nation.

Risk Response Measure A specific action taken to respond to an identified risk.

Root Cause Analysis A principle-based, systems approach for the identification of underlying causes associated with a particular set of risks.

Security Authorization (to Operate)

See Authorization (to operate).

APPENDIX B PAGE B-8

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Security Categorization

Security Control Assessment [CNSSI 4009, Adapted]

Security Control Assessor

Security Control Baseline [CNSSI 4009]

Security Control Enhancements

Security Control Inheritance [CNSSI 4009]

Security Controls [FIPS 199, CNSSI 4009]

Security Impact Analysis [NIST SP 800-37]

Security Objective [FIPS 199]

Security Plan [NIST SP 800-18]

Security Policy [CNSSI 4009]

The process of determining the security category for information or an information system. Security categorization methodologies are described in CNSS Instruction 1253 for national security systems and in FIPS 199 for other than national security systems.

The testing and/or evaluation of the management, operational, and technical security controls to determine the extent to which the controls are implemented correctly, operating as intended, and producing the desired outcome with respect to meeting the security requirements for an information system or organization.

The individual, group, or organization responsible for conducting a security control assessment.

The set of minimum security controls defined for a low- impact, moderate-impact, or high-impact information system.

Statements of security capability to: (i) build in additional, but related, functionality to a basic control; and/or (ii) increase the strength of a basic control.

A situation in which an information system or application receives protection from security controls (or portions of security controls) that are developed, implemented, assessed, authorized, and monitored by entities other than those responsible for the system or application; entities either internal or external to the organization where the system or application resides. See Common Control.

The management, operational, and technical controls (i.e., safeguards or countermeasures) prescribed for an information system to protect the confidentiality, integrity, and availability of the system and its information.

The analysis conducted by an organizational official to determine the extent to which changes to the information system have affected the security state of the system.

Confidentiality, integrity, or availability.

Formal document that provides an overview of the security requirements for an information system or an information security program and describes the security controls in place or planned for meeting those requirements. See System Security Plan or Information Security Program Plan.

A set of criteria for the provision of security services.

APPENDIX B PAGE B-9

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Security Requirements [FIPS 200]

Senior Agency Information Security Officer [44 U.S.C., Sec. 3544]

Senior Information Security Officer

Subsystem

Supplementation (Security Controls)

System

System Security Plan [NIST SP 800-18]

System-Specific Security Control [NIST SP 800-37]

Tailoring [NIST SP 800-53, CNSSI 4009]

Tailored Security Control Baseline

Requirements levied on an information system that are derived from applicable laws, Executive Orders, directives, policies, standards, instructions, regulations, procedures, or organizational mission/business case needs to ensure the confidentiality, integrity, and availability of the information being processed, stored, or transmitted.

Official responsible for carrying out the Chief Information Officer responsibilities under FISMA and serving as the Chief Information Officer’s primary liaison to the agency’s authorizing officials, information system owners, and information system security officers. [Note: Organizations subordinate to federal agencies may use the term Senior Information Security Officer or Chief Information Security Officer to denote individuals filling positions with similar responsibilities to Senior Agency Information Security Officers.]

See Senior Agency Information Security Officer.

A major subdivision or component of an information system consisting of information, information technology, and personnel that performs one or more specific functions.

The process of adding security controls or control enhancements to a security control baseline from NIST Special Publication 800-53 or CNSS Instruction 1253 in order to adequately meet the organization’s risk management needs.

See Information System.

Formal document that provides an overview of the security requirements for an information system and describes the security controls in place or planned for meeting those requirements.

A security control for an information system that has not been designated as a common control or the portion of a hybrid control that is to be implemented within an information system.

The process by which a security control baseline is modified based on: (i) the application of scoping guidance; (ii) the specification of compensating security controls, if needed; and (iii) the specification of organization-defined parameters in the security controls via explicit assignment and selection statements.

A set of security controls resulting from the application of tailoring guidance to the security control baseline. See Tailoring.

APPENDIX B PAGE B-10

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Technical Controls [FIPS 200]

Threat [CNSSI 4009]

Threat Assessment [CNSSI 4009]

Threat Source [CNSSI 4009]

Trustworthiness [CNSSI 4009]

Vulnerability [CNSSI 4009]

Vulnerability Assessment [CNSSI 4009]

Security controls (i.e., safeguards or countermeasures) for an information system that are primarily implemented and executed by the information system through mechanisms contained in the hardware, software, or firmware components of the system.

Any circumstance or event with the potential to adversely impact organizational operations (including mission, functions, image, or reputation), organizational assets, individuals, other organizations, or the Nation through an information system via unauthorized access, destruction, disclosure, modification of information, and/or denial of service.

Process of formally evaluating the degree of threat to an information system or enterprise and describing the nature of the threat.

The intent and method targeted at the intentional exploitation of a vulnerability or a situation and method that may accidentally exploit a vulnerability. The attribute of a person or enterprise that provides confidence to others of the qualifications, capabilities, and reliability of that entity to perform specific tasks and fulfill assigned responsibilities. Weakness in an information system, system security procedures, internal controls, or implementation that could be exploited by a threat source.

Systematic examination of an information system or product to determine the adequacy of security measures, identify security deficiencies, provide data from which to predict the effectiveness of proposed security measures, and confirm the adequacy of such measures after implementation.

APPENDIX B PAGE B-11

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX C

ACRONYMS COMMON ABBREVIATIONS

APT Advanced Persistent Threat

CIO Chief Information Officer

CNSS Committee on National Security Systems

COTS Commercial Off-The-Shelf

DoD Department of Defense

FIPS Federal Information Processing Standards

FISMA Federal Information Security Management Act

IA Information Assurance

ICS Industrial Control System

IEC International Electrotechnical Commission

ISO International Organization for Standardization

NIST National Institute of Standards and Technology

NSA National Security Agency

ODNI Office of the Director of National Intelligence

OMB Office of Management and Budget

POAM Plan of Action and Milestones

RMF Risk Management Framework

SCAP Security Content Automation Protocol

SP Special Publication

U.S.C. United States Code

APPENDIX C PAGE C-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX D

ROLES AND RESPONSIBILITIES KEY PARTICIPANTS IN THE RISK MANAGEMENT PROCESS

The following sections describe the roles and responsibilities66 of key participants involved in an organization’s risk management process.67 Recognizing that organizations have widely varying missions and organizational structures, there may be differences in naming conventions for risk management-related roles and how specific responsibilities are allocated among organizational personnel (e.g., multiple individuals filling a single role or one individual filling multiple roles).68 However, the basic functions remain the same. The application of the risk management process across the three risk management tiers described in this publication is flexible, allowing organizations to effectively accomplish the intent of the specific tasks within their respective organizational structures to best manage risk.

D.1 HEAD OF AGENCY (CHIEF EXECUTIVE OFFICER) The head of agency (or chief executive officer) is the highest-level senior official or executive within an organization with the overall responsibility to provide information security protections commensurate with the risk and magnitude of harm (i.e., impact) to organizational operations and assets, individuals, other organizations, and the Nation resulting from unauthorized access, use, disclosure, disruption, modification, or destruction of: (i) information collected or maintained by or on behalf of the agency; and (ii) information systems used or operated by an agency or by a contractor of an agency or other organization on behalf of an agency. Agency heads are also responsible for ensuring that: (i) information security management processes are integrated with strategic and operational planning processes; (ii) senior officials within the organization provide information security for the information and information systems that support the operations and assets under their control; and (iii) the organization has trained personnel sufficient to assist in complying with the information security requirements in related legislation, policies, directives, instructions, standards, and guidelines. Through the development and implementation of strong policies, the head of agency establishes the organizational commitment to information security and the actions required to effectively manage risk and protect the missions/business functions being carried out by the organization. The head of agency establishes appropriate accountability for information security and provides active support and oversight of monitoring and improvement for the information security program. Senior leadership commitment to information security establishes a level of due diligence within the organization that promotes a climate for mission and business success.

D.2 RISK EXECUTIVE (FUNCTION) The risk executive (function) is an individual or group within an organization that provides a more comprehensive, organization-wide approach to risk management. The risk executive (function) serves as the common risk management resource for senior leaders/executives, mission/business

66 The roles and responsibilities described in this appendix are consistent with the roles and responsibilities associated with the Risk Management Framework in NIST Special Publication 800-37. 67 Organizations may define other roles (e.g., facilities manager, human resources manager, systems administrator) to support the risk management process. 68 Caution is exercised when one individual fills multiples roles in the risk management process to ensure that the individual retains an appropriate level of independence and remains free from conflicts of interest.

APPENDIX D PAGE D-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

owners, chief information officers, chief information security officers, information system owners, common control providers, enterprise architects, information security architects, information systems/security engineers, information system security managers/officers, and any other stakeholders having a vested interest in the mission/business success of organizations. The risk executive (function) coordinates with senior leaders/executives to:

• Establish risk management roles and responsibilities;

• Develop and implement an organization-wide risk management strategy that guides and informs organizational risk decisions (including how risk is framed, assessed, responded to, and monitored over time);

• Manage threat and vulnerability information with regard to organizational information systems and the environments in which the systems operate;

• Establish organization-wide forums to consider all types and sources of risk (including aggregated risk);

• Determine organizational risk based on the aggregated risk from the operation and use of information systems and the respective environments of operation;

• Provide oversight for the risk management activities carried out by organizations to ensure consistent and effective risk-based decisions;

• Develop a greater understanding of risk with regard to the strategic view of organizations and their integrated operations;

• Establish effective vehicles and serve as a focal point for communicating and sharing risk- related information among key stakeholders internally and externally to organizations;

• Specify the degree of autonomy for subordinate organizations permitted by parent organizations with regard to framing, assessing, responding to, and monitoring risk;

• Promote cooperation and collaboration among authorizing officials to include security authorization actions requiring shared responsibility (e.g., joint/leveraged authorizations);

• Ensure that security authorization decisions consider all factors necessary for mission and business success; and

• Ensure shared responsibility for supporting organizational missions and business functions using external providers receives the needed visibility and is elevated to appropriate decision- making authorities.

The risk executive (function) presumes neither a specific organizational structure nor formal responsibility assigned to any one individual or group within the organization. Heads of agencies or organizations may choose to retain the risk executive (function) or to delegate the function. The risk executive (function) requires a mix of skills, expertise, and perspectives to understand the strategic goals and objectives of organizations, organizational missions/business functions, technical possibilities and constraints, and key mandates and guidance that shape organizational operations. To provide this needed mixture, the risk executive (function) can be filled by a single individual or office (supported by an expert staff) or by a designated group (e.g., a risk board, executive steering committee, executive leadership council). The risk executive (function) fits into the organizational governance structure in such a way as to facilitate efficiency and effectiveness.

APPENDIX D PAGE D-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

D.3 CHIEF INFORMATION OFFICER The chief information officer69 is an organizational official responsible for: (i) designating a senior information security officer; (ii) developing and maintaining information security policies, procedures, and control techniques to address all applicable requirements; (iii) overseeing personnel with significant responsibilities for information security and ensuring that the personnel are adequately trained; (iv) assisting senior organizational officials concerning their security responsibilities; and (v) in coordination with other senior officials, reporting annually to the head of the federal agency on the overall effectiveness of the organization’s information security program, including progress of remedial actions. The chief information officer, with the support of the risk executive (function) and the senior information security officer, works closely with authorizing officials and their designated representatives to help ensure that:

• An organization-wide information security program is effectively implemented resulting in adequate security for all organizational information systems and environments of operation for those systems;

• Information security considerations are integrated into programming/planning/budgeting cycles, enterprise architectures, and acquisition/system development life cycles;

• Information systems are covered by approved security plans and are authorized to operate;

• Information security-related activities required across the organization are accomplished in an efficient, cost-effective, and timely manner; and

• There is centralized reporting of appropriate information security-related activities.

The chief information officer and authorizing officials also determine, based on organizational priorities, the appropriate allocation of resources dedicated to the protection of the information systems supporting the organization's missions and business functions. For selected information systems, the chief information officer may be designated as an authorizing official or a co- authorizing official with other senior organizational officials. The role of chief information officer has inherent U.S. Government authority and is assigned to government personnel only.

D.4 INFORMATION OWNER/STEWARD The information owner/steward is an organizational official with statutory, management, or operational authority for specified information and the responsibility for establishing the policies and procedures governing its generation, collection, processing, dissemination, and disposal.70 In information-sharing environments, the information owner/steward is responsible for establishing the rules for appropriate use and protection of the subject information (e.g., rules of behavior) and retains that responsibility when the information is shared with or provided to other organizations. The owner/steward of the information processed, stored, or transmitted by an information system

69 When an organization has not designated a formal chief information officer position, FISMA requires the associated responsibilities to be handled by a comparable organizational official. 70 Federal information is an asset of the Nation, not of a particular federal agency or its subordinate organizations. In that spirit, many federal agencies are developing policies, procedures, processes, and training needed to end the practice of information ownership and implement the practice of information stewardship. Information stewardship is the careful and responsible management of federal information belonging to the Nation as a whole, regardless of the entity or source that may have originated, created, or compiled the information. Information stewards provide maximum access to federal information to elements of the federal government and its customers, balanced by the obligation to protect the information in accordance with the provisions of FISMA and any associated security-related federal policies, directives, regulations, standards, and guidance.

APPENDIX D PAGE D-3

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

may or may not be the same as the system owner. A single information system may contain information from multiple information owners/stewards. Information owners/stewards provide input to information system owners regarding the security requirements and security controls for the systems where the information is processed, stored, or transmitted.

D.5 SENIOR INFORMATION SECURITY OFFICER The senior information security officer is an organizational official responsible for: (i) carrying out the chief information officer security responsibilities under FISMA; and (ii) serving as the primary liaison for the chief information officer to the organization’s authorizing officials, information system owners, common control providers, and information system security officers. The senior information security officer: (i) possesses professional qualifications, including training and experience, required to administer the information security program functions; (ii) maintains information security duties as a primary responsibility; and (iii) heads an office with the mission and resources to assist the organization in achieving more secure information and information systems in accordance with the requirements in FISMA. The senior information security officer (or supporting staff members) may also serve as authorizing official designated representatives or security control assessors. The role of senior information security officer has inherent U.S. Government authority and is assigned to government personnel only.

D.6 AUTHORIZING OFFICIAL The authorizing official is a senior official or executive with the authority to formally assume responsibility for operating an information system at an acceptable level of risk to organizational operations and assets, individuals, other organizations, and the Nation.71 Authorizing officials typically have budgetary oversight for an information system or are responsible for the mission and/or business operations supported by the system. Through the security authorization process, authorizing officials are accountable for the security risks associated with information system operations. Accordingly, authorizing officials are in management positions with a level of authority commensurate with understanding and accepting such information system-related security risks. Authorizing officials also approve security plans, memorandums of agreement or understanding, and plans of action and milestones and determine whether significant changes in the information systems or environments of operation require reauthorization. Authorizing officials can deny authorization to operate an information system or if the system is operational, halt operations, if unacceptable risks exist. Authorizing officials coordinate their activities with the risk executive (function), chief information officer, senior information security officer, common control providers, information system owners, information system security officers, security control assessors, and other interested parties during the security authorization process. With the increasing complexity of mission/business processes, partnership arrangements, and the use of external/shared services, it is possible that a particular information system may involve multiple authorizing officials. If so, agreements are established among the authorizing officials and documented in the security plan. Authorizing officials are responsible for ensuring that all activities and functions associated with security authorization that are delegated to authorizing official designated representatives are carried out. The role of authorizing official has inherent U.S. Government authority and is assigned to government personnel only.

71 The responsibility of authorizing officials described in FIPS 200, was extended in NIST Special Publication 800-53 to include risks to other organizations and the Nation.

APPENDIX D PAGE D-4

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

D.7 AUTHORIZING OFFICIAL DESIGNATED REPRESENTATIVE The authorizing official designated representative is an organizational official that acts on behalf of an authorizing official to coordinate and conduct the required day-to-day activities associated with the security authorization process. Authorizing official designated representatives can be empowered by authorizing officials to make certain decisions with regard to the planning and resourcing of the security authorization process, approval of the security plan, approval and monitoring the implementation of plans of action and milestones, and the assessment and/or determination of risk. The designated representative may also be called upon to prepare the final authorization package, obtain the authorizing official’s signature on the authorization decision document, and transmit the authorization package to appropriate organizational officials. The only activity that cannot be delegated to the designated representative by the authorizing official is the authorization decision and signing of the associated authorization decision document (i.e., the acceptance of risk to organizational operations and assets, individuals, other organizations, and the Nation).

D.8 COMMON CONTROL PROVIDER The common control provider is an individual, group, or organization responsible for the development, implementation, assessment, and monitoring of common controls (i.e., security controls inherited by information systems).72 Common control providers are responsible for: (i) documenting the organization-identified common controls in a security plan (or equivalent document prescribed by the organization); (ii) ensuring that required assessments of common controls are carried out by qualified assessors with an appropriate level of independence defined by the organization; (iii) documenting assessment findings in a security assessment report; and (iv) producing a plan of action and milestones for all controls having weaknesses or deficiencies. Security plans, security assessment reports, and plans of action and milestones for common controls (or a summary of such information) is made available to information system owners inheriting those controls after the information is reviewed and approved by the senior official or executive with oversight responsibility for those controls.

D.9 INFORMATION SYSTEM OWNER The information system owner is an organizational official responsible for the procurement, development, integration, modification, operation, maintenance, and disposal of an information system.73 The information system owner is responsible for addressing the operational interests of the user community (i.e., individuals who depend upon the information system to satisfy mission, business, or operational requirements) and for ensuring compliance with information security requirements. In coordination with the information system security officer, the information system owner is responsible for the development and maintenance of the security plan and ensures that the system is deployed and operated in accordance with the agreed-upon security controls. In coordination with the information owner/steward, the information system owner is

72 Organizations can have multiple common control providers depending on how information security responsibilities are allocated organization-wide. Common control providers may also be information system owners when the common controls are resident within an information system. 73 The information system owner serves as the focal point for the information system. In that capacity, the information system owner serves both as an owner and as the central point of contact between the authorization process and the owners of components of the system including, for example: (i) applications, networking, servers, or workstations; (ii) owners/stewards of information processed, stored, or transmitted by the system; and (iii) owners of the missions and business functions supported by the system. Some organizations may refer to information system owners as program managers or business/asset owners.

APPENDIX D PAGE D-5

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

also responsible for deciding who has access to the system (and with what types of privileges or access rights)74 and ensures that system users and support personnel receive the requisite security training (e.g., instruction in rules of behavior). Based on guidance from the authorizing official, the information system owner informs appropriate organizational officials of the need to conduct the security authorization, ensures that the necessary resources are available for the effort, and provides the required information system access, information, and documentation to the security control assessor. The information system owner receives the security assessment results from the security control assessor. After taking appropriate steps to reduce or eliminate vulnerabilities, the information system owner assembles the authorization package and submits the package to the authorizing official or the authorizing official designated representative for adjudication.75

D.10 INFORMATION SYSTEM SECURITY OFFICER The information system security officer76 is an individual responsible for ensuring that the appropriate operational security posture is maintained for an information system and as such, works in close collaboration with the information system owner. The information system security officer also serves as a principal advisor on all matters, technical and otherwise, involving the security of an information system. The information system security officer has the detailed knowledge and expertise required to manage the security aspects of an information system and, in many organizations, is assigned responsibility for the day-to-day security operations of a system. This responsibility may also include, but is not limited to, physical and environmental protection, personnel security, incident handling, and security training and awareness. The information system security officer may be called upon to assist in the development of the security policies and procedures and to ensure compliance with those policies and procedures. In close coordination with the information system owner, the information system security officer often plays an active role in the monitoring of a system and its environment of operation to include developing and updating the security plan, managing and controlling changes to the system, and assessing the security impact of those changes.

D.11 INFORMATION SECURITY ARCHITECT The information security architect is an individual, group, or organization responsible for ensuring that the information security requirements necessary to protect the organizational missions/business functions are adequately addressed in all aspects of enterprise architecture including reference models, segment and solution architectures, and the resulting information systems supporting those missions and business processes. The information security architect serves as the liaison between the enterprise architect and the information system security engineer and also coordinates with information system owners, common control providers, and information system security officers on the allocation of security controls as system-specific, hybrid, or common controls. In addition, information security architects, in close coordination with information system security officers, advise authorizing officials, chief information officers,

74 The responsibility for deciding who has access to specific information within an information system (and with what types of privileges or access rights) may reside with the information owner/steward. 75 Depending on how the organization has organized its security authorization activities, the authorizing official may choose to designate an individual other than the information system owner to compile and assemble the information for the security authorization package. In this situation, the designated individual must coordinate the compilation and assembly activities with the information system owner. 76 Organizations may also define an information system security manager or information security manager role with similar responsibilities as an information system security officer or with oversight responsibilities for an information security program. In these situations, information system security officers may, at the discretion of the organization, report directly to information system security managers or information security managers.

APPENDIX D PAGE D-6

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

senior information security officers, and the risk executive (function), on a range of security- related issues including, for example, establishing information system boundaries, assessing the severity of weaknesses and deficiencies in the information system, plans of action and milestones, risk mitigation approaches, security alerts, and potential adverse effects of vulnerabilities.

D.12 INFORMATION SYSTEM SECURITY ENGINEER The information system security engineer is an individual, group, or organization responsible for conducting information system security engineering activities. Information system security engineering is a process that captures and refines information security requirements and ensures that the requirements are effectively integrated into information technology component products and information systems through purposeful security architecting, design, development, and configuration. Information system security engineers are an integral part of the development team (e.g., integrated project team) designing and developing organizational information systems or upgrading legacy systems. Information system security engineers employ best practices when implementing security controls within an information system including software engineering methodologies, system/security engineering principles, secure design, secure architecture, and secure coding techniques. System security engineers coordinate their security-related activities with information security architects, senior information security officers, information system owners, common control providers, and information system security officers.

D.13 SECURITY CONTROL ASSESSOR The security control assessor is an individual, group, or organization responsible for conducting a comprehensive assessment of the management, operational, and technical security controls employed within or inherited by an information system to determine the overall effectiveness of the controls (i.e., the extent to which the controls are implemented correctly, operating as intended, and producing the desired outcome with respect to meeting the security requirements for the system). Security control assessors also provide an assessment of the severity of weaknesses or deficiencies discovered in the information system and its environment of operation and recommend corrective actions to address identified vulnerabilities. In addition to the above responsibilities, security control assessors prepare the final security assessment report containing the results and findings from the assessment. Prior to initiating the security control assessment, an assessor conducts an assessment of the security plan to help ensure that the plan provides a set of security controls for the information system that meet the stated security requirements.

The required level of assessor independence is determined by the specific conditions of the security control assessment. For example, when the assessment is conducted in support of an authorization decision or ongoing authorization, the authorizing official makes an explicit determination of the degree of independence required in accordance with federal policies, directives, standards, and guidelines. Assessor independence is an important factor in: (i) preserving the impartial and unbiased nature of the assessment process; (ii) determining the credibility of the security assessment results; and (iii) ensuring that the authorizing official receives the most objective information possible in order to make an informed, risk-based, authorization decision. The information system owner and common control provider rely on the security expertise and the technical judgment of the assessor to: (i) assess the security controls employed within and inherited by the information system using assessment procedures specified in the security assessment plan; and (ii) provide specific recommendations on how to correct weaknesses or deficiencies in the controls and address identified vulnerabilities.

APPENDIX D PAGE D-7

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX E

RISK MANAGEMENT PROCESS TASKS SUMMARY OF TASKS FOR STEPS IN THE RISK MANAGEMENT PROCESS

TASK TASK DESCRIPTION

Step 1: Risk Framing

TASK 1-1 RISK ASSUMPTIONS

Identify assumptions that affect how risk is assessed, responded to, and monitored within the organization.

TASK 1-2 RISK CONSTRAINTS

Identify constraints on the conduct of risk assessment, risk response, and risk monitoring activities within the organization.

TASK 1-3 RISK TOLERANCE

Identify the level of risk tolerance for the organization.

TASK 1-4 PRIORITIES AND TRADE-OFFS

Identify priorities and trade-offs considered by the organization in managing risk.

Step 2: Risk Assessment

TASK 2-1 THREAT AND VULNERABILITY IDENTIFICATION

Identify threats to and vulnerabilities in organizational information systems and the environments in which the systems operate.

TASK 2-2 RISK DETERMINATION

Determine the risk to organizational operations and assets, individuals, other organizations, and the Nation if identified threats exploit identified vulnerabilities.

Step 3: Risk Response

TASK 3-1 RISK RESPONSE IDENTIFICATION

Identify alternative courses of action to respond to risk determined during the risk assessment.

TASK 3-2 EVALUATION OF ALTERNATIVES

Evaluate alternative courses of action for responding to risk.

TASK 3-3 RISK RESPONSE DECISION

Decide on the appropriate course of action for responding to risk.

TASK 3-4 RISK RESPONSE IMPLEMENTATION

Implement the course of action selected to respond to risk.

Step 4: Risk Monitoring

TASK 4-1 RISK MONITORING STRATEGY

Develop a risk monitoring strategy for the organization that includes the purpose, type, and frequency of monitoring activities.

TASK 4-2 RISK MONITORING

Monitor organizational information systems and environments of operation on an ongoing basis to verify compliance, determine effectiveness of risk response measures, and identify changes.

APPENDIX E PAGE E-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX F

GOVERNANCE MODELS APPROACHES TO INFORMATION SECURITY GOVERNANCE

Three approaches to information security governance can be used to meet organizational needs: (i) a centralized approach; (ii) a decentralized approach; or (iii) a hybrid approach. The authority, responsibility, and decision-making power related to information security and risk management differ in each governance approach. The appropriate governance structure for an organization varies based on many factors (e.g., mission/business needs; culture and size of the organization; geographic distribution of organizational operations, assets, and individuals; and risk tolerance). The information security governance structure is aligned with other governance structures (e.g., information technology governance) to ensure compatibility with the established management practices within the organization and to increase its overall effectiveness.

Centralized Governance In centralized governance structures, the authority, responsibility, and decision-making power are vested solely within central bodies. These centralized bodies establish the appropriate policies, procedures, and processes for ensuring organization-wide involvement in the development and implementation of risk management and information security strategies, risk, and information security decisions, and the creation inter-organizational and intra-organizational communication mechanisms. A centralized approach to governance requires strong, well-informed central leadership and provides consistency throughout the organization. Centralized governance structures also provide less autonomy for subordinate organizations that are part of the parent organization.

Decentralized Governance In decentralized information security governance structures, the authority, responsibility, and decision-making power are vested in and delegated to individual subordinate organizations within the parent organization (e.g., bureaus/components within an executive department of the federal government or business units within a corporation). Subordinate organizations establish their own policies, procedures, and processes for ensuring (sub) organization-wide involvement in the development and implementation of risk management and information security strategies, risk and information security decisions, and the creation of mechanisms to communicate within the organization. A decentralized approach to information security governance accommodates subordinate organizations with divergent mission/business needs and operating environments at the cost of consistency throughout the organization as a whole. The effectiveness of this approach is greatly increased by the sharing of risk-related information among subordinate organizations so that no subordinate organization is able to transfer risk to another without the latter’s informed consent. It is also important to share risk-related information with parent organizations as the risk decisions by subordinate organizations may have an effect on the organization as a whole.

Hybrid Governance In hybrid information security governance structures, the authority, responsibility, and decision- making power are distributed between a central body and individual subordinate organizations. The central body establishes the policies, procedures, and processes for ensuring organization- wide involvement in the portion of the risk management and information security strategies and decisions affecting the entire organization (e.g., decisions related to shared infrastructure or

APPENDIX F PAGE F-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

common security services). Subordinate organizations, in a similar manner, establish appropriate policies, procedures, and processes for ensuring their involvement in the portion of the risk management and information security strategies and decisions that are specific to their mission/business needs and environments of operation. A hybrid approach to governance requires strong, well-informed leadership for the organization as a whole and for subordinate organizations, and provides consistency throughout the organization for those aspects of risk and information security that affect the entire organization.

APPENDIX F PAGE F-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX G

TRUST MODELS APPROACHES TO ESTABLISHING TRUST RELATIONSIPS

The following trust models describe ways in which organizations can obtain the levels of trust needed to form partnerships, collaborate with other organizations, share information, or receive information system/security services. No single trust model is inherently better than any other model. Rather, each model provides organizations with certain advantages and disadvantages based on their circumstances (e.g., governance structure, risk tolerance, and criticality/sensitivity of organizational missions and business processes).

Validated Trust In the validated trust model, one organization obtains a body of evidence regarding the actions of another organization (e.g., the organization’s information security policies, activities, and risk- related decisions) and uses that evidence to establish a level of trust with the other organization. An example of validated trust is where one organization develops an application or information system and provides evidence (e.g., security plan, assessment results) to a second organization that supports the claims by the first organization that the application/system meets certain security requirements and/or addresses the appropriate security controls in NIST Special Publication 800- 53. Validated trust may not be sufficient—that is, the evidence offered by the first organization to the second organization may not fully satisfy the second organization’s trust requirements or trust expectations. The more evidence provided between organizations as well as the quality of such evidence, the greater the degree of trust that can be achieved. Trust is linked to the degree of transparency between the two organizations with regard to risk and information security-related activities and decisions.

Direct Historical Trust In the direct historical trust model, the track record exhibited by an organization in the past, particularly in its risk and information security-related activities and decisions, can contribute to and help establish a level of trust with other organizations. While validated trust models assume that an organization provides the required level of evidence needed to establish trust, obtaining such evidence may not always be possible. In such instances, trust may be based on other deciding factors, including the organization’s historical relationship with the other organization or its recent experience in working with the other organization. For example, if one organization has worked with a second organization for years doing some activity and has not had any negative experiences, the first organization may be willing to trust the second organization in working on another activity, even though the organizations do not share any common experience for that particular activity. Direct historical trust tends to build up over time with the more positive experiences contributing to increased levels of trust between organizations. Conversely, negative experiences may cause trust levels to decrease among organizations.

Mediated Trust In the mediated trust model, an organization establishes a level of trust with another organization based on assurances provided by some mutually trusted third party. There are several types of mediated trust models that can be employed. For example, two organizations attempting to establish a trust relationship may not have a direct trust history between the two organizations, but do have a trust relationship with a third organization. The third party that is trusted by both

APPENDIX G PAGE G-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

organizations, brokers the trust relationship between the two organizations, thus helping to establish the required level of trust. Another type of mediated trust involves the concept of transitivity of trust. In this example, one organization establishes a trust relationship with a second organization. Independent of the first trust relationship, the second organization establishes a trust relationship with a third organization. Since the first organization trusts the second organization and the second organization trusts the third organization, a trust relationship is now established between the first and third organizations (illustrating the concept of transitive trust among organizations).77

Mandated Trust In the mandated trust model, an organization establishes a level of trust with another organization based on a specific mandate issued by a third party in a position of authority.78 This mandate can be established by the respective authority through Executive Orders, directives, regulations, or policies (e.g., a memorandum from an agency head directing that all subordinate organizations accept the results of security assessments conducted by any subordinate organization within the agency). Mandated trust can also be established when some organizational entity is decreed to be the authoritative source for the provision of information resources including information technology products, systems, or services. For example, an organization may be given the responsibility and the authority to issue Public Key Infrastructure (PKI) certificates for a group of organizations.

Hybrid Trust In general, the trust models described above are not mutually exclusive. Each of the trust models may be used independently as a stand-alone model or in conjunction with another model. Several trust models may be used at times within the organization (e.g., at various phases in the system development life cycle). Also, since organizations are often large and diverse, it is possible that subordinate organizations within a parent organization might independently employ different trust models in establishing trust relationships with potential partnering organizations (including subordinate organizations). The organizational governance structure may establish the specific terms and conditions for how the various trust models are employed in a complementary manner within the organization.

Suitability of Various Trust Models The trust models can be employed at various tiers in the risk management approach described in this publication. None of the trust models is inherently better or worse than the others. However, some models may be better suited to some situations than others. For example, the validated trust model, because it requires evidence of a technical nature (e.g., tests completed successfully), is probably best suited for application at Tier 3. In contrast, the direct historical trust model, with a significant emphasis on past experiences, is more suited for application at Tiers 1 or 2. The mediated and mandated trust models are typically more oriented toward governance and consequently are best suited for application at Tier 1. However, some implementations of the mandated trust model, for example, being required to trust the source of a PKI certificate, are more oriented toward Tier 3. Similarly, although the mediated trust model is primarily oriented toward Tier 1, there can be implementations of it that are more information system-, or Tier 3-

77 In the mediated trust model, the first organization typically has no insight into the nature of the trust relationship between the second and third organizations. 78 The authoritative organization explicitly accepts the risks to be incurred by all organizations covered by the mandate and is accountable for the risk-related decisions imposed by the organization.

APPENDIX G PAGE G-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

oriented. An example of this application might be the use of authentication services that validate the authenticity or identity of an information system component or service.

The nature of a particular information technology service can also impact the suitability and the applicability of the various trust models. The validated trust model is the more traditional model for validating the trust of an information technology product, system, or service. However, this trust model works best in situations where there is a degree of control between parties (e.g., a contract between the government and an external service provider) or where there is sufficient time to obtain and validate the evidence needed to establish a trust relationship. Validated trust is a suboptimal model for situations where the two parties are peers and/or where the trust decisions regarding shared/supplied services must occur quickly due to the very dynamic and rapid nature of the service being requested/provided (e.g., service-oriented architectures).

APPENDIX G PAGE G-3

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

APPENDIX H

RISK RESPONSE STRATEGIES FROM BOUNDARY PROTECTION TO AGILE DEFENSES

Organizations develop risk management strategies as part of the risk framing step in the risk management process described in Chapter Three. The risk management strategies address how organizations intend to assess risk, respond to risk, and monitor risk— making explicit and transparent the risk perceptions that organizations routinely use in making both investment and operational decisions. As part of organizational risk management strategies, organizations also develop risk response strategies. The practical realities facing organizations today make risk response strategies essential—the realities of needing the mission/business effectiveness offered by information technology, the lack of trustworthiness in the technologies available, and the growing awareness by adversaries of the potential to achieve their objectives to cause harm by compromising organizational information systems and the environments in which those systems operate. Senior leaders/executives in modern organizations are faced with an almost intractable dilemma—that is, the information technologies needed for mission/business success may be the same technologies through which adversaries cause mission/business failure. The risk response strategies developed and implemented by organizations provide these senior leaders/executives (i.e., decision makers within organizations) with practical, pragmatic paths for dealing with this dilemma. Clearly defined and articulated risk response strategies help to ensure that senior leaders/executives take ownership of organizational risk responses and are ultimately responsible and accountable for risk decisions—understanding, acknowledging, and explicitly accepting the resulting mission/business risk.

As described in Chapter Two, there are five basic types of responses to risk: (i) accept; (ii) avoid; (iii) mitigate; (iv) share; and (v) transfer.79 While each type of response can have an associated strategy, there should be an overall strategy for selecting from among the basic response types. This overall risk response strategy and a strategy for each type of response are discussed below. In addition, specific risk mitigation strategies are presented, including a description of how such strategies can be implemented within organizations.

H.1 OVERALL RISK RESPONSE STRATEGIES Risk response strategies specify: (i) individuals or organizational subcomponents that are responsible for the selected risk response measures and specifications of effectiveness criteria (i.e., articulation of indicators and thresholds against which the effectiveness of risk response measures can be judged); (ii) dependencies of the selected risk response measures on other risk response measures; (iii) dependencies of selected risk response measures on other factors (e.g., implementation of other planned information technology measures); (iv) implementation timeline for risk responses; (v) plans for monitoring the effectiveness of the risk response measures; (vi) identification of risk monitoring triggers; and (vii) interim risk response measures selected for implementation, if appropriate. Risk response implementation strategies may include interim measures that organizations choose to implement. An overall risk response strategy provides an organizational approach to selecting between the basic risk responses for a given risk situation. A decision to accept risk must be consistent with the stated organizational tolerance for risk. Yet

79 There is overlap between the basic risk responses. For example, a shared risk is one that is being accepted by each party in the sharing arrangement, and avoiding risk can be thought of as mitigating risk to zero. Nonetheless, with this understanding of overlap, there is value in addressing each of the five types of risk responses separately.

APPENDIX H PAGE H-1

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

there is still need for a well-defined, established organizational path for selecting one or a combination of the risk responses of acceptance, avoidance, mitigation, sharing, or transfer. Organizations are often placed in situations where there is greater risk than the designated senior leaders/executives desire to accept. Some risk acceptance will likely be necessary. It might be possible to avoid risk or to share or transfer risk, and some risk mitigation is probably feasible. Avoiding risk may require selective reengineering of organizational mission/business processes and forgoing some of the benefits being accrued by the use of information technology organization-wide, perhaps even what organizations perceive as necessary benefits. Mitigating risk requires expenditure of limited resources and may quickly become cost-ineffective due to the pragmatic realities of the degree of mitigation that can actually be achieved. Lastly, risk sharing and transfer have ramifications as well, some of which if not unacceptable, may be undesirable. The risk response strategies of organizations empower senior leaders/executives to make risk- based decisions compliant with the goals, objectives, and broader organizational perspectives.

H.2 RISK ACCEPTANCE STRATEGIES Organizational risk acceptance strategies are essential companions to organizational statements of risk tolerance. The objective of establishing an organizational risk tolerance is to state in clear and unambiguous terms, a limit for risk—that is, how far organizations are willing to go with regard to accepting risk to organizational operations (including missions, functions, image, and reputation), organizational assets, individuals, other organizations, and the Nation. Real-world operations, however, are seldom so simple as to make such risk tolerance statements the end- statement for risk acceptance decisions. Organizational risk acceptance strategies place the acceptance of risk into a framework of organizational perspectives on dealing with the practical realities of operating with risk and provide the guidance necessary to ensure that the extent of the risk being accepted in specific situations is compliant with organizational direction.

H.3 RISK AVOIDANCE STRATEGIES Of all the risk response strategies, organizational risk avoidance strategies may be the key to achieving adequate risk response. The pragmatic realities of the trustworthiness of information technologies available for use within common resource constraints, make wise use of those technologies arguably a significant, if not the most significant risk response. Wise use of the information technologies that compose organizational information systems is fundamentally a form of risk avoidance—that is, organizations modify how information technologies are used to change the nature of the risk being incurred (i.e., avoid the risk). Yet such approaches can be in great tension with organizational desires and in some cases, the mandate to fully automate mission/business processes. Organizations proactively address this dilemma so that: (i) senior leaders/executives (and other organizational officials making risk-based decisions) are held accountable for only that which is within their ability to affect; and (ii) decision makers can make the difficult risk decisions that may, in fact, be in the best interests of organizations.

H.4 RISK SHARING AND TRANSFER STRATEGIES Organizational risk sharing strategies and risk transfer strategies are key elements in enabling risk decisions for specific organizational missions/business functions at Tier 2 or organizational information systems at Tier 3. Risk sharing and transfer strategies both consider and take full advantage of a lessening of risk by sharing/transferring the potential impact across other internal organizational elements or with other external organizations—making the case that some other entities are, in fact, wholly (transfer) or partly (share) responsible and accountable for risk. For risk sharing or risk transfer to be effective risk responses, the impact on the local environment (e.g., mission/business processes or information systems) must be addressed by the sharing or

APPENDIX H PAGE H-2

________________________________________________________________________________________________

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

transfer (i.e., the focus must be on mission/business success, not assigning blame). In addition, risk sharing and risk transfer activities must be carried out in accordance with intra- and inter- organizational dynamics and realities (e.g., organizational culture, governance, risk tolerance). This explains why risk sharing/transfer strategies are particularly important for the sharing and/or transfer to be a viable risk response option.

H.5 RISK MITIGATION STRATEGIES Organizational risk mitigation strategies reflect an organizational perspective on what mitigations are to be employed and where the mitigations are to be applied, to reduce information security risks to organizational operations and assets, individuals, other organizations, and the Nation. Risk mitigation strategies are the primary link between organizational risk management programs and information security programs—with the former covering all aspects of managing risk and the latter being primarily a part of the risk response component of the risk management process. Effective risk mitigation strategies consider the general placement and allocation of mitigations, the degree of intended mitigation, and cover mitigations at Tier 1 (e.g., common controls), at Tier 2 (e.g., enterprise architecture including embedded information security architecture, and risk- aware mission/business processes), and at Tier 3 (security controls in individual information systems). Organizational risk mitigation strategies reflect the following:

• Mission/business processes are designed with regard to information protection needs and information security requirements;80

• Enterprise architectures (including embedded information security architectures) are designed with consideration for realistically achievable risk mitigations;

• Risk mitigation measures are implemented within organizational information systems and environments of operation by safeguards/countermeasure (i.e., security controls) consistent with information security architectures; and

• Information security programs, processes, and safeguards/countermeasures are highly flexible and agile with regard to implementation, recognizing the diversity in organizational missions and business functions and the dynamic environments in which the organizations operate.81

Organizations develop risk mitigation strategies based on strategic goals and objectives, mission and business requirements, and organizational priorities. The strategies provide the basis for making risk-based decisions on the information security solutions associated with and applied to information systems within the organization. Risk mitigation strategies are necessary to ensure that organizations are adequately protected against the growing threats to information processed, stored, and transmitted by organizational information systems. The nature of the threats and the dynamic environments in which organizations operate, demand flexible and scalable defenses as well as solutions that can be tailored to meet rapidly changing conditions. These conditions include, for example, the emergence of new threats and vulnerabilities, the development of new technologies, changes in missions/business requirements, and/or changes to environments of operation. Effective risk mitigation strategies support the goals and objectives of organizations and established mission/business priorities, are tightly coupled to enterprise architectures and information security architectures, and can operate throughout the system development life cycle.

80 In addition to mission/business-driven information protection needs, information security requirements are obtained from a variety of sources (e.g., federal legislation, policies, directives, regulations, and standards). 81 Dynamic environments of operation are characterized, for example, by ongoing changes in people, processes, technologies, physical infrastructure, and threats.

APPENDIX H PAGE H-3

________________________________________________________________________________________________

                                                 

                                     

Special Publication 800-39 Managing Information Security Risk Organization, Mission, and Information System View

Traditional risk mitigation strategies with regard to threats from cyber attacks at first relied almost exclusively on monolithic boundary protection. These strategies assumed adversaries were outside of some established defensive perimeter, and the objective of organizations was to repel the attack. The primary focus of static boundary protection was penetration resistance of the information technology products and information systems employed by the organization as well as any additional safeguards and countermeasures implemented in the environments in which the products and systems operated. Recognition that information system boundaries were permeable or porous led to defense-in-depth as part of the mitigation strategy, relying on detection and response mechanisms to address the threats within the protection perimeter. In today’s world characterized by advanced persistent threats,82 a more comprehensive risk mitigation strategy is needed—a strategy that combines traditional boundary protection with agile defense.

Agile defense assumes that a small percentage of threats from purposeful cyber attacks will be successful by compromising organizational information systems through the supply chain83 by defeating the initial safeguards and countermeasures (i.e., security controls) implemented by organizations, or by exploiting previously unidentified vulnerabilities for which protections are not in place. In this scenario, adversaries are operating inside the defensive perimeters established by organizations and may have substantial or complete control of organizational information systems. Agile defense employs the concept of information system resilience—that is, the ability of systems to operate while under attack, even in a degraded or debilitated state, and to rapidly recover operational capabilities for essential functions after a successful attack. The concept of information system resilience can also be applied to the other classes of threats including threats from environmental disruptions and/or human errors of omission/commission. The most effective risk mitigation strategies employ a combination of boundary protection and agile defenses depending on the characteristics of the threat.84 This dual protection strategy illustrates two important information security concepts known as defense-in-depth85 and defense-in-breadth.86

Information has value and must be protected. Information systems (including people, processes, and technologies) are the primary vehicles employed to process, store, and transmit such information— allowing organizations to carry out their missions in a variety of environments of operation and to ultimately be successful.

82 An advanced persistent threat is an adversary that possesses sophisticated levels of expertise and significant resources which allow it to create opportunities to achieve its objectives by using multiple attack vectors (e.g., cyber, physical, and deception). These objectives typically include establishing/extending footholds within the information technology infrastructure of the targeted organizations for purposes of exfiltrating information, undermining or impeding critical aspects of a mission, program, or organization; or positioning itself to carry out these objectives in the future. The advanced persistent threat: (i) pursues its objectives repeatedly over an extended period of time; (ii) adapts to defenders’ efforts to resist it; and (iii) is determined to maintain the level of interaction needed to execute its objectives. 83 Draft NIST Interagency Report 7622 provides guidance on managing supply chain risk. 84 Threat characteristics include capabilities, intentions, and targeting information. 85 Defense-in-depth is an information security strategy integrating people, technology, and operations capabilities to establish variable barriers across multiple layers and missions of the organization. 86 Defense-in-breadth is a planned, systematic set of multidisciplinary activities that seek to identify, manage, and reduce risk of exploitable vulnerabilities at every stage of the system, network, or subcomponent life cycle (system, network, or product design and development; manufacturing; packaging; assembly; system integration; distribution; operations; maintenance; and retirement).

APPENDIX H PAGE H-4

professor-reading-week.txt

Class, Below are some examples of security incidents that have occurred. You may choose any security incident that has occurred and impacted the user's data. Dr. Holbert https://cyware.com/category/breaches-and-incidents-news https://www.cnet.com/how-to/equifax-words-with-friends-beyond-every-major-security-breach-and-data-hack-update/ https://www.cnbc.com/2019/07/30/five-of-the-biggest-data-breaches-ever.html https://www.trendmicro.com/vinfo/us/security/special-report/data-breach/latest-incidents https://www.csoonline.com/article/2130877/the-biggest-data-breaches-of-the-21st-century.html https://www.washingtonpost.com/technology/2019/09/26/doordash-data-breach-affects-million-users/ ClassRoom Recording Recording https://us-lti.bbcollab.com/recording/987c494c7d8a4214bb5810e050c7f74d https://us-lti.bbcollab.com/recording/6745854096784bfba5705abb9652b8d4