SWI-Prolog coding

TheArtofProlog.pdf

Home >Computer Science homework help >SWI-Prolog coding

The Art of Prolog

Leon Sterling Ehud Shapiro with a foreword by David H. D. Warren

The Art of Prolog Advanced Programming Techniques Second Edition

The MIT Press Cambridge, Massachusetts London, England

All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from the publisher.

This book was composed and typeset by Paul C. Anagnostopoulos and Joe Snowden using ZzTEX. The typeface is Lucida Bright and Lucida New Math created by Charles Bigelow and Kris Holmes specifically for scientific and electronic publishing. The Lucida letterforms have the large x-heights and open interiors that aid legibility in modern printing technology, but also echo some of the rhythms and calligraphic details of lively Renaissance handwrit- ing. Developed in the 1980s and 1990s, the extensive Lucida typeface family includes a wide variety of mathematical and technical symbols designed to harmonize with the text faces.

This book was printed and bound in the United States of America.

Library of Congress Cataloging-in-Publication Data Sterling, Leon

The art of Prolog : advanced programming techniques / Leon Sterling, Ehud Shapiro ; with a foreword by David H. D. Warren.

p. cm. - (MIT Press series in logic programming) Includes bibliographical references and index. ISBN 978-O-262-19338-2 (hardcover: alk. paper), 978-O-262-69163-5 (paperback) 1. Prolog (Computer program language) I. Shapiro, Ehud Y.

II. Title. III. Series. QA76.73.P76S74 1994 OOS.13'3dc2O 93-49494

lo CIP

To Ruth, Miriam, Micha!, Dan ya, and Sara

Contents

Figures xiii

Programs xvii

Series Foreword xxv

Foreword xxvii

Preface xxxi

Preface to First Edition

I Logic Programs 9

Introduction i

Basic Constructs 11 Li Facts 11 1.2 Queries 12

1.3 The Logical Variable, Substitutions, and Instances 13

1.4 Existential Queries 14

1.5 UniversalFacts 15

1.6 Conjunctive Queries and Shared Variables 16

1.7 Rules 18

viii Contents

1.8 A Simple Abstract Interpreter 22 1.9 The Meaning of a Logic Program 25

1.10 Summary 27

2 Database Programming 29 2.1 Simple Databases 29

2.2 Structured Data and Data Abstraction 35 2.3 Recursive Rules 39

2.4 Logic Programs and the Relational Database Model 42

2.5 Background 44

3 Recursive Programming 45 3.1 Arithmetic 45

3.2 Lists 56

3.3 Composing Recursive Programs 65 3.4 Binary Trees 72

3.5 Manipulating Symbolic Expressions 78 3.6 Background 84

4 The Computation Model of Logic Programs 87 4.1 Unification 87

4.2 An Abstract Interpreter for Logic Programs 91 4.3 Background 98

S Theory of Logic Programs 101 5.1 Semantics 101

5.2 Program Correctness 105

5.3 Complexity 108

5.4 Search Trees 110

5.5 Negation in Logic Programming 113

5.6 Background 115

ix Contents

II The Prolog Language 117

6 Pure Prolog 119

6.1 The Execution Model of Prolog 119

6.2 Comparison to Conventional Programming Languages 124

6.3 Background 127

7 Programming in Pure Prolog 129 7.1 Rule Order 129

7.2 Termination 131

7.3 Goal Order 133

7.4 Redundant Solutions 136

7.5 Recursive Programming in Pure Prolog 139

7.6 Background 147

8 Arithmetic 149 8.1 System Predicates for Arithmetic 149

8.2 Arithmetic Logic Programs Revisited 152

8.3 Transforming Recursion into Iteration 154 8.4 Background 162

9 Structure Inspection 163 9.1 Type Predicates 163 9.2 Accessing Compound Terms 167

9.3 Background 174

10 Meta-Logical Predicates 175

10.1 Meta-Logical Type Predicates 176

10.2 Comparing Nonground Terms 180

10.3 Variables as Objects 182

10.4 The Meta-Variable Facility 185

10.5 Background 186

x Contents

11 Cuts and Negation 189 11.1 Green Cuts: Expressing Determinism 189 11.2 Tail Recursion Optimization 195

11.3 Negation 198

11.4 Red Cuts: Omitting Explicit Conditions 202

11.5 Default Rules 206

11.6 Cuts for Efficiency 208

11.7 Background 212

12 Extra-Logical Predicates 215

12.1 Input/Output 215 12.2 Program Access and Manipulation 219

12.3 Memo-Functions 221

12.4 Interactive Programs 223

12.5 Failure-Driven Loops 229

12.6 Background 231

13 Program Development 233 13.1 Programming Style and Layout 233

13.2 Reflections on Program Development 235

13.3 Systematizing Program Construction 238 13.4 Background 244

HI Advanced Prolog Programming Techniques 247

14 Nondeterministic Programming 249 14.1 Generate-and-Test 249

14.2 Don't-Care and Don't-Know Nondeterminism 263

14.3 Artificial Intelligence Classics: ANALOGY, ELIZA, and McSAM 270

14.4 Background 280

15 Incomplete Data Structures 283 15.1 Difference-Lists 283

xi Contents

15.2 Difference-Structures 291

15.3 Dictionaries 293

15.4 Queues 297

15.5 Background 300

16 Second-Order Programming 301

16.1 All-Solutions Predicates 301

16.2 Applications of Set Predicates 305

16.3 Other Second-Order Predicates 314

16.4 Background 317

17 Interpreters 319 17.1 Interpreters for Finite State Machines 319

17.2 Meta-Interpreters 323

17.3 Enhanced Meta-Interpreters for Debugging 331

17.4 An Explanation Shell for Rule-Based Systems 341

17.5 Background 354

18 Program Transfoutiation 357 18.1 Unfold/Fold Transformations 357

18.2 Partial Reduction 360 18.3 Code Walking 366

18.4 Background 373

19 Logic Grammars 375 19.1 Definite Clause Grammars 375

19.2 A Grammar Interpreter 380

19.3 Application to Natural Language Understanding 382

19.4 Background 388

20 Search Techniques 389

20.1 Searching State-Space Graphs 389

20.2 Searching Game Trees 401

20.3 Background 407

xii Contents

IV Applications 409

21 Game-Playing Programs 411

21.1 Mastermind 411

21.2 Nim 415

21.3 Kalah 420

21.4 Background 423

22 A Credit Evaluation Expert System 429

22.1 Developing the System 429

22.2 Background 438

23 An Equation Solver 439 23.1 An Overview of Equation Solving 439 23.2 Factorization 448

23.3 Isolation 449

23.4 Polynomial 452

23.5 Homogenization 454 23.6 Background 457

24 A Compiler 439 24.1 Overview of the Compiler 459

24.2 The Parser 466 24.3 The Code Generator 470

24.4 The Assembler 475

24.5 Background 478

A Operators 479

References 483

Index 497

Figure s

1.1 An abstract interpreter to answer ground queries with respect to logic programs 22

1.2 Tracing the interpreter 23 1.3 A simple proof tree 25 2.1 Defining inequality 31

2.2 A logical circuit 32

2.3 Still-life objects 34

2.4 A simple graph 41

3.1 Proof trees establishing completeness of programs 47

3.2 Equivalent forms of lists 57 3.3 Proof tree verifying a list 58 3.4 Proof tree for appending two lists 61 3.5 Proof trees for reversing a list 63 3.6 Comparing trees for isomorphism 74 3.7 A binary tree and a heap that preserves the tree's shape 77 4.1 A unification algorithm 90

4.2 An abstract interpreter for logic programs 93

4.3 Tracing the appending of two lists 94

4.4 Different traces of the same solution 95 4.5 Solving the Towers of Hanoi 97

4.6 A nonterminating computation 97

xiv Figures

5.1 A nonterminating computation 107 5.2 Two search trees 111

5.3 Search tree with multiple success nodes 112 5.4 Search tree with an infinite branch 113

6.1 Tracing a simple Prolog computation 121 6.2 Multiple solutions for splitting a list 122

6.3 Tracing a quicksort computation 123 7.1 A nonterminating computation 132 7.2 Variant search trees 139

7.3 Tracing a reverse computation 146 8.1 Computing factorials iteratively 155 9.1 Basic system type predicates 164

9.2 Tracing the substitute predicate 171 11.1 Theeffectofcut 191 13.1 Template for a specification 243

14.1 A solution to the 4 queens problem 253 14.2 A map requiring four colors 255 14.3 Directed graphs 265 14.4 Initial and final states of a blocks world problem 267 14.5 A geometric analogy problem 271

14.6 Sample conversation with ELIZA 273

14.7 AstoryfilledinbyMcSAM 276 14.8 Three analogy problems 279

15.1 Concatenating difference-lists 285

15.2 Tracing a computation using difference-lists 287

15.3 Unnormalized and normalized sums 292 16.1 Power of Prolog for various searching tasks 307

16.2 The problem of Lee routing for VLSI circuits 308

16.3 Input and output for keyword in context (KWIC) problem 312

16.4 Second-order predicates 315

17.1 A simple automaton 321

xv Figures

17.2 Tracing the meta-interpreter 325 17.3 Fragment of a table of builtin predicates 327

17.4 Explaining a computation 351

18.1 A context-free grammar for the language a*b*c* 371

20.1 The water jugs problem 393 20.2 A simple game tree 405

21.1 A starting position for Nim 415

21.2 Computing nim-sums 419

21.3 Board positions for Kalah 421

23.1 Test equations 440

23.2 Position of subterms in terms 449 24.1 A PL program for computing factorials 460

24.2 Target language instructions 460

24.3 Assembly code version of a factorial program 461

24.4 The stages of compilation 461

24.5 Output from parsing 470

24.6 The generated code 475 24.7 The compiled object code 477

Programs

1.1 A biblical family database 12

1.2 Biblical family relationships 23

2.1 Defining family relationships 31

2.2 A circuit for a logical and-gate 33

2.3 The circuit database with names 36

2.4 Course rules 37

2.5 The ancestor relationship 39 2.6 A directed graph 41 2.7 The transitive closure of the edge relation 41 3.1 Defining the natural numbers 46 3.2 The less than or equal relation 48 3.3 Addition 49

3.4 Multiplication as repeated addition 51

3.5 Exponentiation as repeated multiplication 51

3.6 Computing factorials 52 3.7 The minimum of two numbers 52

3.8a A nonrecursive definition of modulus 53

3.8b A recursive definition of modulus 53

3.9 Ackermann's function 54 3.10 The Euclidean algorithm 54

3.11 Defining a list 57

xviii Programs

3.12

3.13

3.14

3.15

3.16

3.17

3.18

3.19

3.20

3.21

3.22

3.23

3.24

3.25

3.26

3.27

3.28

3.29

3.30

3.31

3.32

5.1

7.1

7.2

7.3

7.4

7.5

7.6

7.7

7.8

7.9

Membership of a list 58

Prefixes and suffixes of a list 59

Determining sublists of lists 60

Appending two lists 60

Reversing a list 62

Determining the length of a list 64

Deleting all occurrences of an element from a list 67

Selecting an element from a list 67

Permutation sort 69 Insertion sort 70 Quicksort 70

Defining binary trees 73

Testing tree membership 73 Determining when trees are isomorphic 74

Substituting for a term in a tree 75 Traversals ofabinary tree 76 Adjusting a binary tree to satisfy the heap property Recognizing polynomials 79

Derivative rules 80

Towers of Hanoi 82

Satisfiability of Boolean formulae 83 Yet another family example 102

Yet another family example 130

Merging ordered lists 138

Checking for list membership 139

Selecting the first occurrence of an element from a list Nonmembership of a list 141

140

Testing for a subset 142

Translating word for word 143

Removing duplicates from a list 145

xix Programs

710 Reversing with no duplicates 146 8.1 Computing the greatest common divisor of two integers 152 8.2 Computing the factorial of a number 153

8.3 An iterative factorial 155

8.4 Another iterative factorial 156

8.5 Generating a range of integers 157

8.6a Summing a list of integers 157

8.6b Iterative version of summing a list of integers using an accumu- lator 157

8.7a Computing inner products of vectors 158 8.7b Computing inner products of vectors iteratively 158

8.8 Computing the area of polygons 159

8.9 Finding the maximum of a list of integers 160 8.10 Checking the length of a list 160 811 Finding the length of a list 161 8.12 Generating a list of integers in a given range 161 9.la Flattening a list with double recursion 165 9,lb Flattening a list using a stack 166 9.2 Finding subterms of a term 168 9.3 A program for substituting in a term 170 9.4 Subtermdefinedusinguniv 172 9,5a Constructing a list corresponding to a term 173 9.5b Constructing a term corresponding to a list 174 10.1 Multiple uses for plus 176

10.2 A multipurpose length program 177 10.3 A more efficient version of grandparent 178 10.4 Testing if a term is ground 178

10.5 Unification algorithm 180

10.6 Unification with the occurs check 181

10.7 Occurs in 182

10.8 Numbering the variables in a term 185

xx Programs

10.9 Logical disjunction 186

11.1 Merging ordered lists 190 11.2 Merging with cuts 192

11.3 ininimuinwith cuts 193

11.4 Recognizing polynomials 193

11.5 Interchange sort 195 11.6 Negation as failure 198 11.7 Testing if terms are variants 200

11.8 Implementing 201

11.9a Deleting elements from a list 204

i 1.9b Deleting elements from a list 204 11.10 If-then-else statement 205

11.1 la Determining welfare payments 207

ll.11b Determining welfare payments 207 12.1 Writing a list of terms 216 12.2 Reading in a list of words 217

12.3 Towers of Hanoi using a memo-function 222 12.4 Basic interactive loop 223 12.5 A line editor 224

12.6 An interactive shell 226 12.7 Logging a session 228 12.8 Basic interactive repeat loop 230 12.9 Consulting a file 230

13.1 Finding the union of two lists 241

13.2 Finding the intersection of two lists 241

13.3 Finding the union and intersection of two lists 241 14.1 Finding parts of speech in a sentence 251 14.2 Naive generate-and-test program solving N queens 253

14.3 Placing one queen at a time 255 14.4 Map colormg 256

14.5 Test data for map coloring 257

xxi Programs

14.6 A puzzle solver 259

14.7 A description of a puzzle 260 14.8 Connectivity in a finite DAG 265

14.9 Finding a path by depth-first search 266 14.10 Connectivity in a graph 266 14.11 A depth-first planner 268

14.12 Testing the depth-first planner 269 14.13 A program solving geometric analogies 272

14.14 Testing ANALOGY 273

14.15 ELIZA 275

14.16 McSAM 277

14.17 Testing McSAM 278

15.1 Concatenating difference-lists 285 15.2 Flattening a list of lists using difference-lists 286

15.3 Reverse with difference-lists 288

15.4 Quicksort using difference-lists 289

15.5 A solution to the Dutch flag problem 290 15.6 Dutch flag with difference-lists 291

15.7 Normalizing plus expressions 292 15.8 Dictionary lookup from a list of tuples 294 15.9 Dictionary lookup in a binary tree 295 15.10 Meltingaterm 296 15.11 Aqueueprocess 297 15.12 Flattening a list using a queue 298 16.1 Sample data 302

16.2 Applying set predicates 303

16.3 Implementing an all-solutions predicate using difference-lists, assert, and retract 304

16.4 Testing connectivity breadth-first in a DAG 306

16.5 Testing connectivity breadth-first in a graph 307

16.6 Lee routing 310

16.7 Producing a keyword in context (KWIC) index 313

xxii Programs

16.8 Second-order predicates in Prolog 316

17.1 An interpreter for a nondeterministic finite automaton (NDFA) 320

17.2 An NDFA that accepts the language (ab)* 321

17.3 An interpreter for a nondetermimstic pushdown automaton (NPDA) 322

17.4 An NPDA for palindromes over a finite alphabet 322

17.5 A meta-interpreter for pure Prolog 324

17.6 A meta-interpreter for pure Prolog in continuation style 326

17.7 AtracerforProlog 328 17.8 A meta-interpreter for building a proof tree 329

17.9 A meta-interpreter for reasoning with uncertainty 330 17.10 Reasoning with uncertainty with threshold cutoff 331

17.11 A meta-interpreter detecting a stack overflow 333 17.12 A nonterminating insertion sort 334 17.13 An incorrect and incomplete insertion sort 335

17.14 Bottom-up diagnosis of a false solution 336 17.15 Top-down diagnosis of a false solution 338 17.16 Diagnosing missing solution 340 17.17 Oven placement rule-based system 342 17.18 A skeleton two-level rule interpreter 343 17.19 An interactive rule interpreter 345 17.20 A two-level rule interpreter carrying rules 347

17.21 A two-level rule interpreter with proof trees 348

17.22 Explaining aproof 350 17.23 An explanation shell 352

18.1 A program accepting palindromes 359

18.2 A meta-interpreter for determining a residue 361 18.3 A simple partial reduction system 362 18.4 Specializing an NPDA 363

18.5 Specializing a rule interpreter 364

18.6 Composing two enhancements of a skeleton 368

xxiii Programs

18.7 Testing program composition 370 18.8 A Prolog program parsing the language a*b*c* 371

18.9 Translating grammar rules to Prolog clauses 372

19.1 Enhancing the language a*b*c* 377

19.2 Recognizing the language dt1cN 377 19.3 Parsing the declarative part of a Pascal block 378 19.4 A definite clause grammar (DCG) interpreter 381

19.5 A DCG interpreter that counts words 382

19.6 A DCG context-free grammar 383

19.7 A DCG computing a parse tree 384

19.8 A DCG with subject/object number agreement 385

19.9 A DCG for recognizing numbers 387

20.1 A depth-first state-transition framework for problem solving 390

20.2 Solving the wolf, goat, and cabbage problem 392

20.3 Solving the water jugs problem 394 20.4 Hill climbing framework for problem solving 397 20.5 Test data 398 20.6 Best-first framework for problem solving 399 20.7 Concise best-first framework for problem solving 400 20.8 Framework for playing games 402

20.9 Choosing the best move 403

20.10 Choosing the best move with the minimax algorithm 406 20.11 Choosing a move using minimax with alpha-beta pruning 407 21.1 Playing mastermind 413 21.2 A program for playing a winning game of Nim 417

21.3 A complete program for playing Kalah 424

22.1 A credit evaluation system 432 22.2 Test data for the credit evaluation system 437

23.1 A program for solving equations 442

24.1 A compiler from PL to machine language 462

24.2 Test data 465

Series Foreword

The logic programming approach to computing investigates the use of logic as a programming language and explores computational models based on controlled deduction.

The field of logic programming has seen a tremendous growth in the last several years, both in depth and in scope. This growth is reflected in the number of articles, journals, theses, books, workshops, and confer- ences devoted to the subject. The MIT Press series in logic programming was created to accommodate this development and to nurture it. lt is dedicated to the publication of high-quality textbooks, monographs, col- lections, and proceedings in logic programming.

Ehud Shapiro The Weizmann Institute of Science Rehovot, Israel

Foreword

Programming in Prolog opens the mind to a new way of looking at com- puting. There is a change of perspective which every Prolog programmer experiences when first getting to know the language.

I shall never forget my first Prolog program. The time was early 1974. I had learned about the abstract idea of logic programming from Bob Kowaiski at dinburgh, although the name "logic programming" had not yet been coined. The main idea was that deduction could be viewed as a form of computation, and that a declarative statement of the form

P if Q and R and S.

could also be interpreted procedurally as

To solve P, solve Q and R and S.

Now I had been invited to Marseilles. Here, Alain Colmerauer and his col- leagues had devised the language Prolog based on the logic programming concept. Somehow, this realization of the concept seemed to me, at first sight, too simpleminded. However, Gerard Battani and Henri Meloni had implemented a Prolog interpreter in Fortran (their first major exercise in programming, incidentally). Why not give Prolog a try?

I sat at a clattering teletype connected down an ordinary telephone line to an IBM machine far away in Grenoble. I typed in some rules defining how plans could be constructed as sequences of actions. There was one important rule, modeled on the SRI planner Strips, which described how a plan could be elaborated by adding an action at the end. Another rule, necessary for completeness, described how to elaborate a plan by insert- ing an action in the middle of the plan. As an example for the planner to

xxviii Foreword

work on, I typed in facts about some simple actions in a "blocks world" and an initial state of this world. I entered a description of a goal state to be achieved. Prolog spat back at me:

meaning it couldn't find a solution. Could it be that a solution was not deducible from the axioms I had supplied? Ah, yes, I had forgotten to enter some crucial facts. I tried again. Prolog was quiet for a long time and then responded:

DEBORDEMENT DE PILE

Stack overflow! I had run into a loop. Now a loop was conceivable since the space of potential plans to be considered was infinite. However, I had taken advantage of Prolog's procedural semantics to organize the axioms so that shorter plans ought to be generated first. Could something else be wrong? After a lot of head scratching, I finally realized that I had mistyped the names of some variables. I corrected the mistakes, and tried again.

Lo and behold, Prolog responded almost instantly with a correct plan to achieve the goal state. Magic! Declaratively correct axioms had assured a correct result. Deduction was being harnessed before my very eyes to produce effective computation. Declarative programming was truly programming on a higher plane! I had dimly seen the advantages in theory. Now Prolog had made them vividly real in practice. Never had I experienced such ease in getting a complex program coded and running.

Of course, I had taken care to formulate the axioms and organize them in such a way that Prolog could use them effectively. I had a general idea of how the axioms would be used. Nevertheless it was a surprise to see how the axioms got used in practice on particular examples. It was a delightful experience over the next few days to explore how Prolog actually created these plans, to correct one or two more bugs in my facts and rules, and to further refine the program.

Since that time, Prolog systems have improved significantly in terms of debugging environments, speed, and general robustness. The techniques of using Prolog have been more fully explored and are now better un- derstood. And logic programming has blossomed, not least because of its adoption by the Japanese as the central focus of the Fifth Generation project.

xxix Foreword

After more than a decade of growth of interest in Prolog, it is a great pleasure to see the appearance of this book. Hitherto, knowledge of how to use Prolog for serious programming has largely been communicated by word of mouth. This textbook sets down and explains for the first time in an accessible form the deeper principles and techniques of Prolog programming

The book is excellent for not only conveying what Prolog is but also ex- plaining how it should be used. The key to understanding how to use Prolog is to properly understand the relationship between Prolog and logic programming. This book takes great care to elucidate the relation- ship.

Above all, the book conveys the excitement of using Prologthe thrill of declarative programming As the authors put it, "Declarative program- ming clears the mind" Declarative programming enables one to concen- trate on the essentials of a problem without gettmg bogged down in too much operational detail. Programming should be an intellectually rewarding activity. Prolog helps to make it so. Prolog is indeed, as the authors contend, a tool for thinking.

David H. D. Warren Manchester, England, September 1986

Preface

Seven years have passed since the first edition of The Art of Prolog was published. In that time, the perception of Prolog has changed markedly. While not as widely used as the language C, Prolog is no longer regarded as an exotic language. An abundance of books on Prolog have appeared. Prolog is now accepted by many as interesting and useful for certain applications. Articles on Prolog regularly appear in popular magazines. Prolog and logic programming are part of most computer science and engineering programs, although perhaps in a minor role in an artificial intelligence or programming languages class. The first conference on Practical Applications of Prolog was held in London in April 1992. A standard for the language is likely to be in place in 1994. A future for Prolog among the programming languages of the world seems assured.

In preparing for a second edition, we had to address the question of how much to change. I decided to listen to a request not to make the new edition into a new book. This second edition is much like the first, al- though a number of changes are to be expected in a second edition. The typography of the book has been improved: Program code is now in a dis- tinctive font rather than in italics. Figures such as proof trees and search trees are drawn more consistently. We have taken the opportunity to be more precise with language usage and to remove minor inconsistencies with hyphenation of words and similar details. All known typographi- cal errors have been fixed. The background sections at the end of most chapters have been updated to take into account recent, important re- search results. The list of references has been expanded considerably. Extra, more advanced exercises, which have been used successfully in my Prolog classes, have been added.

xxxii Preface

Let us take an overview of the specific changes to each part in turn. Part IV, Applications, is unchanged apart from minor corrections and tidying. Part I, Logic Programs, is essentially unchanged. New programs have been added to Chapter 3 on tree manipulation, including heapifying a binary tree. Extra exercises are also present.

Part II, The Prolog Langauge, is primarily affected by the imminence of a Prolog standard. We have removed all references to Wisdom Prolog in the text in preparation for Standard Prolog. It has proved impossible to guarantee that this book is consistent with the standard. Reaching a stan- dard has been a long, difficult process for the members of the committee. Certain predicates come into favor and then disappear, making it difficult for the authors of a text to know what to write. Furthermore, some of the proposed I/O predicates are not available in current Prologs, so it is im- possible to run all the code! Most of the difficulties in reaching a Prolog standard agreeable to all interested parties have been with builtin or sys- tem predicates. This book raises some of the issues involved in adding builtins to Prolog but largely avoids the concerns by using pure Prolog as much as possible. We tend not to give detailed explanations of the con- troversial nonlogical behaviors of some of the system predicates, and we certainly do not use odd features in our code.

Part III, Advanced Programming Tecimiques, is the most altered in this second edition, which perhaps should be expected. A new chapter has been added on program transformation, and many of the other chapters have been reordered. The chapters on Interpreters and Logic Grammars have extensive additions.

Many people provided us feedback on the first edition, almost all of it very positive. I thank you all. Three people deserve special thanks for taking the trouble to provide long lists of suggestions for improve- ments and to point out embarrassingly long lists of typos in the first edition: Norbert Fuchs, Harald Søndergaard, and Stanley Selkow. The following deserve mention for pointing out mistakes and typos in the various printings of the first edition or making constructive comments about the book that led to improvements in later printings of the first edition and for this second edition. The list is long, my memory some- times short, so please forgive me if I forget to mention anyone. Thanks to Ham Assiryani, Tim Boemker, Jim Brand, Bill Braun, Pu Chen, Yves Deville, George Ernst, Claudia Günther, Ann Halbran, Sundar Iyengar, Gary Kacmarcik, Mansoor Khan, Sundeep Kumar, Arun Lakhotia, Jean-

xxxiii Preface

Louis Lassez, Charlie Linville, Per Ljung, David Maier, Fred Mailey, Martin Marshall, Andre Mesarovic, Dan Oldham, Scott Pierce, Lynn Pierce, David Pedder, S. S. Ramakrishnan, Chet Ramey, Marty Silverstein, Bill Sloan, Ron Taylor, Rodney Topor, R. J. Wengert, Ted Wright, and Nan Yang. For the former students of CMPS41Ì, I hope the extra marks were sufficient re- ward.

Thanks to Sarah Fliegelmann and Venkatesh Srinivasan for help with entering changes to the second edition and TeXing numerous drafts. Thanks to Phil Gannon and Zoë Sterling for helpful discussions about the figures, and to Joe Geiles for drawing the new figures. For proofreading the second edition, thanks to Kathy Kovacic, David Schwartz, Ashish Jam, and Venkatesh Srinivasan. Finally, a warm thanks to my editor, Terry Ehling, who has always been very helpful and very responsive to queries.

Needless to say, the support of my family and friends is the most important and most appreciated.

Leon Sterling Cleveland, January 1993

Preface to First Edition

The origins of this book lie in graduate student courses aimed at teach- ing advanced Prolog programming A wealth of techniques has emerged in the fifteen years since the inception of Prolog as a programming lan- guage. Our intention in this book has been to make accessible the pro- grammmg techniques that kindled our own excitement, imagination, and involvement in this area.

The book fills a general need. Prolog, and more generally logic pro- gramming, has received wide publicity in recent years. Currently avail- able books and accounts, however, typically describe only the basics. All but the simplest examples of the use of Prolog have remained essentially inaccessible to people outside the Prolog community.

We emphasize throughout the book the distinction between logic pro- gramming and Prolog programming Logic programs can be understood and studied, using two abstract, machine-independent concepts: truth and logical deduction. One can ask whether an axiom in a program is true, under some interpretation of the program symbols; or whether a logical statement is a consequence of the program. These questions can be answered independently of any concrete execution mechanism.

On the contrary, Prolog is a programming language, borrowing its basic constructs from logic. Prolog programs have precise operational mean- ing: they are instructions for execution on a computera Prolog ma- chine. Prolog programs in good style can almost always be read as log- ical statements, thus inheriting some of the abstract properties of logic programs. Most important, the result of a computation of such a Pro- log program is a logical consequence of the axioms in it. Effective Prolog

xxxvi Preface to First Edition

progranmiing requires an understanding of the theory of logic program- ming.

The book consists of four parts: logic programming, the Prolog lan- guage, advanced techniques, and applications. The first part is a self- contained introduction to logic programming. It consists of five chapters. The first chapter introduces the basic constructs of logic programs. Our account differs from other introductions to logic programming by ex- plaining the basics in terms of logical deduction. Other accounts explain the basics from the background of resolution from which logic program- ming originated. We have found the former to be a more effective means of teaching the material, which students find intuitive and easy to under- stand.

The second and third chapters of Part I introduce the two basic styles of logic programming: database programming and recursive program- ming. The fourth chapter discusses the computation model of logic pro- gramming, introducing unification, while the fifth chapter presents some theoretical results without proofs. In developing this part to enable the clear explanation of advanced techniques, we have introduced new con- cepts and reorganized others, in particular, in the discussion of types and termination. Other issues such as complexity and correctness are concepts whose consequences have not yet been fully developed in the logic programming research community.

The second part is an introduction to Prolog. It consists of Chapters 6 through 13. Chapter 6 discusses the computation model of Prolog in contrast to logic programming, and gives a comparison between Prolog and conventional programming languages such as Pascal. Chapter 7 dis- cusses the differences between composing Prolog programs and logic programs. Examples are given of basic programming techniques.

The next five chapters introduce system-provided predicates that are essential to make Prolog a practical programming language. We clas- sify Prolog system predicates into four categories: those concerned with efficient arithmetic, structure inspection, meta-logical predicates that discuss the state of the computation, and extra-logical predicates that achieve side effects outside the computation model of logic pro- gramming. One chapter is devoted to the most notorious of Prolog extra-logical predicates, the cut. Basic techniques using these system predicates are explained. The final chapter of the section gives assorted pragmatic programming tips.

xxxvii Preface to First Edition

The main part of the book is Part III. We describe advanced Prolog programming techniques that have evolved in the Prolog programming community, illustrating each with small yet powerful example programs. The examples typify the applications for which the technique is useful. The six chapters cover nondeterministic programming, incomplete data structures, parsing with DCGs, second-order programming, search tech-. niques, and the use of meta-interpreters.

The final part consists of four chapters that show how the material in the rest of the book can be combined to build application programs. A common request of Prolog newcomers is to see larger applications. They understand how to write elegant short programs but have difficulty in building a major program. The applications covered are game-playing programs, a prototype expert system for evaluating requests for credit, a symbolic equation solver, and a compiler.

During the development of the book, it has been necessary to reorga- nize the foundations and basic examples existing in the folklore of the logic programming community. Our structure constitutes a novel frame- work for the teaching of Prolog.

Material from this book has been used successfully for several courses on logic programming and Prolog: in Israel, the United States, and Scot- land. The material more than suffices for a one-semester course to first- year graduate students or advanced undergraduates. There is consider- able scope for instructors to particularize a course to suit a special area of interest.

A recommended division of the book for a 13-week course to senior un- dergraduates or first-year graduates is as follows: 4 weeks on logic pro- gramming, encouraging students to develop a declarative style of writing programs, 4 weeks on basic Prolog programming, 3 weeks on advanced techniques, and 2 weeks on applications. The advanced techniques should include some discussion of nondeterminism, incomplete data structures, basic second-order predicates, and basic meta-interpreters. Other sections can be covered instead of applications. Application areas that can be stressed are search techniques in artificial intellígence, build- ing expert systems, writing compilers and parsers, symbol manipulation, and natural language proces sing.

There is considerable flexibility in the order of presentation. The ma- terial from Part I should be covered first. The material in Parts III and IV can be interspersed with the material in Part lito show the student how

xxxviii Preface to First Edition

larger Prolog programs using more advanced techniques are composed in the same style as smaller examples.

Our assessment of students has usually been 50 percent by homework assignments throughout the course, and 50 percent by project. Our expe- rience has been that students are capable of a significant prograniming task for their project. Examples of projects are prototype expert systems, assemblers, game-playing programs, partial evaluators, and implementa- tions of graph theory algorithms.

For the student who is studying the material on her own, we strongly advise reading through the more abstract material iii Part I. A good Pro- log progranirning style develops from thinking declaratively about the logic of a situation. The theory in Chapter 5, however, can be skipped until a later reading.

The exercises in the book range from very easy and well defined to difficult and open-ended. Most of them are suitable for homework exer- cises. Some of the more open-ended exercises were submitted as course projects.

The code in this book is essentially in Edinburgh Prolog. The course has been given where students used several different variants of Edinburgh Prolog, and no problems were encountered. All the examples run on Wisdom Prolog, which is discussed in the appendixes.

We acknowledge and thank the people who contributed directly to the book. We also thank, collectively and anonymously, all those who indi- rectly contributed by influencing our programming styles in Prolog. Im- provements were suggested by Lawrence Byrd, Oded Maler, Jack Minker, Richard O'Keefe, Fernando Pereira, and several anonymous referees.

We appreciate the contribution of the students who sat through courses as material from the book was being debugged. The first author acknowledges students at the University of Edinburgh, the Weizmann Institute of Science, Tel Aviv University, and Case Western Reserve Uni- versity. The second author taught courses at the Weizmanri Institute and Hebrew University of Jerusalem, and in industry.

We are grateful to many people for assisting in the technical aspects of producing a book. We especially thank Sarah Fliegelmann, who pro- duced the various drafts and camera-ready copy, above and beyond the call of duty. This book might not have appeared without her tremendous efforts. Arvind Bansal prepared the index and helped with the references. Yehuda Barbut drew most of the figures. Max Goldberg and Shmuel Safra

xxxix Preface to First Edition

prepared the appendix. The publishers, MIT Press, were helpful and sup- portive.

Finally, we acknowledge the support of family and friends, without which nothing would get done.

Leon Sterling 1986

Introduction

The inception of logic is tied with that of scientific thinking. Logic pro- vides a precise language for the explicit expression of one's goals, knowl- edge, and assumptions. Logic provides the foundation for deducing consequences from premises; for studying the truth or falsity of state- ments given the truth or falsity of other statements; for establishing the consistency of one's claims; and for verifying the validity of one's argu- ments.

Computers are relatively new in our intellectual history. Similar to logic, they are the object of scientific study and a powerful tool for the advancement of scientific endeavor. Like logic, computers require a precise and explicit statement of one's goals and assumptions. Un- like logic, which has developed with the power of human thinking as the only external consideration, the development of computers has been gov- erned from the start by severe technological and engineering constraints. Although computers were intended for use by humans, the difficul- ties in constructing them were so dominant that the language for expressing problems to the computer and instructing it how to solve them was designed from the perspective of the engineering of the com- puter alone.

Almost all modern computers are based on the early concepts of von Neumann and his colleagues, which emerged during the 1940s. The von Neumann machine is characterized by a large uniform store of memory cells and a processing unit with some local cells, called registers. The processing unit can load data from memory to registers, perform arith- metic or logical operations on registers, and store values of registers back into memory. A program for a von Neumann machine consists of

2 Introduction

a sequence of instructions to perform such operations, and an additional set of control instructions, which can affect the next instruction to be executed, possibly depending on the content of some register.

As the problems of building computers were gradually understood and solved, the problems of using them mounted. The bottleneck ceased to be the inability of the computer to perform the human's instructions but rather the inability of the human to instruct, or program, the computer. A search for programming languages convenient for humans to use be- gan. Starting from the language understood directly by the computer, the machine language, better notations and formalisms were developed. The main outcome of these efforts was languages that were easier for humans to express themselves in but that still mapped rather directly to the underlying machine language. Although increasingly abstract, the languages in the mainstream of development, starting from assembly language through Fortran, Algol, Pascal, and Ada, all carried the mark of the underlying machinethe von Neumann architecture.

To the uninitiated intelligent person who is not familiar with the en- gineering constraints that led to its design, the von Neumann machine seems an arbitrary, even bizarre, device. Thinking in terms of its con- strained set of operations is a nontrivial problem, which sometimes stretches the adaptiveness of the human mind to its limits.

These characteristic aspects of programming von Neumann computers led to a separation of work: there were those who thought how to solve the problem, and designed the methods for its solution, and there were the coders, who performed the mundane and tedious task of translating the instructions of the designers to instructions a computer can use.

Both logic and programming require the explicit expression of one's knowledge and methods in an acceptable formalism. The task of making one's knowledge explicit is tedious. However, formalizing one's knowl- edge in logic is often an intellectually rewarding activity and usually reflects back on or adds insight to the problem under consideration. In contrast, formalizing one's problem and method of solution using the von Neumann instruction set rarely has these beneficial effects.

We believe that programming can be, and should be, an intellectu- ally rewarding activity; that a good programming language is a powerful conceptual toola tool for organizing, expressing, experimenting with, and even communicating one's thoughts; that treating programming as

Introduction

"coding," the last, mundane, intellectually trivial, time-consuming, and tedious phase of solving a problem using a computer system, is perhaps at the very root of what has been known as the "software crisis."

Rather, we think that programming can be, and should be, part of the problem-solving process itself; that thoughts should be organized as programs, so that consequences of a complex set of assumptions can be investigated by "running" the assumptions; that a conceptual solution to a problem should be developed hand-in-hand with a working program that demonstrates it and exposes its different aspects. Suggestions in this direction have been made under the title "rapid prototyping."

To achieve this goal in its fullestto become true mates of the human thinking processcomputers have still a long way to go. However, we find it both appropriate and gratifying from a historical perspective that logic, a companion to the human thinking process since the early days of human intellectual history, has been discovered as a suitable stepping- stone in this long journey.

Although logic has been used as a tool for designing computers and for reasoning about computers and computer programs since almost their beginning, the use of logic directly as a prograniming language, termed logic programming, is quite recent.

Logic programming, as well as its sister approach, functional program- ming, departs radically from the mainstream of computer languages. Rather then being derived, by a series of abstractions and reorganiza- tions, from the von Neumann machine model and instruction set, it is derived from an abstract model, which has no direct relation to or de- pendence on to one machine model or another. It is based on the belief that instead of the human learning to think in terms of the operations of a computer that which some scientists and engineers at some point in history happened to find easy and cost-effective to build, the com- puter should perform instructions that are easy for humans to provide. In its ultimate and purest form, logic programming suggests that even explicit instructions for operation not be given but rather that the knowl- edge about the problem and assumptions sufficient to solve it be stated explicitly, as logical axioms. Such a set of axioms constitutes an alterna- tive to the conventional program. The program can be executed by pro- viding it with a problem, formalized as a logical statement to be proved, called a goal statement. The execution is an attempt to solve the prob-

4 Introduction

lem, that is, to prove the goal statement, given the assumptions in the logic program.

A distinguishing aspect of the logic used in logic programming is that a goal statement typically is existentially quantified: it states that there exist some individuals with some property. An example of a goal state- ment is, "there exists a list X such that sorting the list 13, 1,21 gives X." The mechanism used to prove the goal statement is constructive. If suc- cessful, it provides the identity of the unknown individuals mentioned in the goal statement, which constitutes the output of the computation. In the preceding example, assuming that the logic program contains appro- priate axioms defining the sort relation, the output of the computation wouldbeX= [1,2,3].

These ideas can be summarized in the following two metaphorical equations:

program = set of axioms.

computation = constructive proof of a goal statement from the program.

The ideas behind these equations can be traced back as far as intuition- istic mathematics and proof theory of the early twentieth century. They are related to Hilbert's program, to base the entire body of mathemati- cal knowledge on logical foundations and to provide mechanical proofs for its theories, starting from the axioms of logic and set theory alone. It is interesting to note that the failure of this program, from which en- sued the incompleteness and undecidability results of Gödel and Turing, marks the beginning of the modern age of computers.

The first use of this approach in practical computing is a sequel to Robinson's unification algorithm and resolution principle, published in 1965. Several hesitant attempts were made to use this principle as a basis of a computation mechanism, but they did not gain any momentum. The beginning of logic programming can be attributed to Kowalski and Colmerauer. Kowalski formulated the procedural interpretation of Horn clause logic. He showed that an axiom

A if B1 and B2 and... and B

can be read and executed as a procedure of a recursive programming language, where A is the procedure head and the B are its body. In

Introduction

addition to the declarative reading of the clause, A is true if the B are true, it can be read as follows: To solve (execute) A, solve (execute) B1 and B2 and. . . and B. In this reading, the proof procedure of Horn clause logic is the interpreter of the language, and the unification algorithm, which is at the heart of the resolution proof procedure, performs the basic data manipulation operations of variable assignment, parameter passing, data selection, and data construction.

At the same time, in the early 1970s, Colmerauer and his group at the University of Marseilles-Aix developed a specialized theorem prover, written in Fortran, which they used to implement natural language pro- cessing systems. The theorem prover, called Prolog (for Programmation en Logique), embodied Kowaiski's procedural interpretation. Later, van Emden and Kowalski developed a formal semantics for the language of logic programs, showing that its operational, model-theoretic, and fix- point semantics are the same.

In spite of all the theoretical work and the exciting ideas, the logic pro- gramming approach seemed unrealistic. At the time of its inception, re- searchers in the United States began to recognize the failure of the "next- generation Al languages," such as Micro-Planner and Conniver, which de- veloped as a substitute for Lisp. The main claim against these languages was that they were hopelessly inefficient, and very difficult to control. Given their bitter experience with logic-based high-level languages, it is no great surprise that U.S. artificial intelligence scientists, when hearing about Prolog, thought that the Europeans were over-excited over what they, the Americans, had already suggested, tried, and discovered not to work.

In that atmosphere the Prolog-lo compiler was almost an imaginary being. Developed in the mid to late 1970s by David H. D. Warren and his colleagues, this efficient implementation of Prolog dispelled all the myths about the impracticality of logic programming. That compiler, still one of the finest implementations of Prolog around, delivered on pure list-processing programs a performance comparable to the best Lisp sys- tems available at the time. Furthermore, the compiler itself was written almost entirely in Prolog, suggesting that classic programming tasks, not just sophisticated AI applications, could benefit from the power of logic programming.

6 Introduction

The impact of this implementation cannot be overemphasized. Without it, the accumulated experience that has led to this book would not have existed.

In spite of the promise of the ideas, and the practicality of their im- plementation, most of the Western computer science and AI research community was ignorant, openly hostile, or, at best, indifferent to logic programming. By 1980 the number of researchers actively engaged in logic programming were only a few dozen in the United States and about one hundred around the world.

No doubt, logic programming would have remained a fringe activity in computer science for quite a while longer hadit not been for the an- nouncement of the Japanese Fifth Generation Project, which took place in October 1981. Although the research program the Japanese presented was rather baggy, faithful to their tradition of achieving consensus at almost any cost, the important role of logic programming in the next generation of computer systems was made clear.

Since that time the Prolog language has undergone a rapid transition from adolescence to maturity. There are numerous commercially avail- able Prolog implementations on most computers. A large number of Pro- log programming books are directed to different audiences and empha- size different aspects of the language. And the language itself has more or less stabilized, having a de facto standard, the Edinburgh Prolog fam- ily.

The maturity of the language means that it is no longer a concept for scientists yet to shape and define but rather a given object, with vices and virtues. lt is time to recognize that, on the one hand, Prolog falls short of the high goals of logic programming but, on the other hand, is a powerful, productive, and practical programming formalism. Given the standard life cycle of computer programming languages, the next few years will reveal whether these properties show their merit only in the classroom or prove useful also in the field, where people pay money to solve problems they care about.

What are the current active subjects of research in logic programming and Prolog? Answers to this question can be found in the regular sci- entific journals and conferences of the field; the Logic Programming Journal, the Journal of New Generation Computing, the International Conference on Logic Programming, and the IEEE Symposium on Logic

7 Introduction

Programming as well as in the general computer science journals and conf erences.

Clearly, one of the dominant areas of interest is the relation between logic programming, Prolog, and parallelism. The promise of parallel com- puters, combined with the parallelism that seems to be available in the logic programming model, have led to numerous attempts, still ongoing, to execute Prolog in parallel and to devise novel concurrent program- ming languages based on the logic programming computation model. This, however, is a subject for another book.

-- &J 2è - L

Leonardo Da Vinci. Old Man thinking. Pen and ink (slightly enlarged). About 1510. Windsor Castle, Royal Library.

I Logic Programs

A logic program is a set of axioms, or rules, defining relations between objects. A computation of a logic program is a deduction of conse- quences of the program. A program defines a set of consequences, which is its meaning The art of logic programming is constructing concise and elegant programs that have the desired meaning.

Li Facts

The simplest kind of statement is called a fact. Facts are a means of stating that a relation holds between objects. An example is

father(abrahani,isaac).

This fact says that Abraham is the father of Isaac, or that the relation f a- ther holds between the individuals named abraham and isaac. Another name for a relation is a predicate. Names of individuals are known as atoms. Similarly, plus(2,3,5) expresses the relation that 2 plus 3 is 5. The familiar plus relation can be realized via a set of facts that defines the addition table. An initial segment of the table is

plus(O,O,O). plus(O,1,1). plus(O,2,2). plus(O,3,3).

Basic Constructs

The basic constructs of logic programming, terms and statements, are inherited from logic. There are three basic statements: facts, rules, and queries. There is a single data structure: the logical term.

plus(1,O,1). plus(1,1,2). plus(1,2,3). plus(1,3,4).

A sufficiently large segment of this table, which happens to be also a legal logic program, will be assumed as the definition of the plus relation throughout this chapter.

The syntactic conventions used throughout the book are introduced as needed. The first is the case convention. It is significant that the names

12 Chapter 1

father(terach,abraham). male(terach).

father (terach,nachor). male (abraham).

father (terach,haran). male (nachor).

f ather(abraham,isaac). male (haran).

father (haran,lot). male(isaac).

father (haran,milcah). male (lot).

father(haran,yiscah).

female(sarah).

mother(sarah,isaac). female (milcah).

female(yiscah).

Program 1.1 A biblical family database

of both predicates and atoms in facts begin with a lowercase letter rather than an uppercase letter.

A finite set of facts constitutes a program. This is the simplest form of logic program. A set of facts is also a description of a situation. This insight is the basis of database programming, to be discussed in the next chapter. An example database of family relationships from the Bible is given as Program 1.1. The predicates father, mother, male, and female express the obvious relationships.

1.2 Queries

The second form of statement in a logic program is a query. Queries are a means of retrieving information from a logic program. A query asks whether a certain relation holds between objects. For example, the query father (abraham, isaac)? asks whether the father relationship holds between abraham and isaac. Given the facts of Program 1.1, the answer to this query is yes.

Syntactically, queries and facts look the same, but they can be distin- guished by the context. When there is a possibility of confusion, a term!- nating period will indicate a fact, while a terminating question mark will indicate a query. We call the entity without the period or question mark a goal. A fact P. states that the goal P is true. A query P? asks whether the goal P is true. A simple query consists of a single goal.

Answering a query with respect to a program is determining whether the query is a logical consequence of the program. We define logical

13 Basic Constructs

consequence incrementally through this chapter. Logical consequences are obtained by applying deduction rules. The simplest rule of deduction is identity: from P deduce P. A query is a logical consequence of an identical fact.

Operationally, answermg simple queries using a program containing facts like Program 1.1 is straightforward. Search for a fact in the program that implies the query. If a fact identical to the query is found, the answer is yes.

The answer no is given if a fact identical to the query is not found, because the fact is not a logical consequence of the program. This answer does not reflect on the truth of the query; it merely says that we failed to prove the query from the program. Both the queries f emale(abrahazn)? and plus(1 ,1,2)? will be answered no with respect to Program 1.1.

1.3 The Logical Variable, Substitutions, and Instances

A logical variable stands for an unspecified individual and is used ac- cordingly. Consider its use in queries. Suppose we want to know of whom abraham is the father. One way is to ask a series of queries, father(abraham,lot)?, father(abraham,milcah)?, ..., father

(abraham,isaac)?,. . . until an answer yes is given. A variable allows a better way of expressing the query as f ather(abraham,X)?, to which the answer is X=isaac. Used in this way, variables are a means of sum- marizing many queries. A query containing a variable asks whether there is a value for the variable that makes the query a logical consequence of the program, as explained later.

Variables in logic programs behave differently from variables in con- ventional programming languages. They stand for an unspecified but sin- gle entity rather than for a store location in memory.

Having introduced variables, we can define a term, the single data structure ¡ii logic programs. The definition is inductive. Constants and variables are terms. Also compound terms, or structures, are terms. A compound term comprises a functor (called the principal functor of the term) and a sequence of one or more arguments, which are terms. A functor is characterized by its name, which is an atom, and its arity, or number of arguments. Syntactically, compound terms have

14 Chapter 1

the form f(t1,t2,. . .,t), where the functor has name f and is of arity n, and the t are the arguments. Examples of compound terms include s(0), hot(milk), name(john,doe), list(a,list(b,nil)), foo(X), and tree(tree(nil,3,nil) ,5,R).

Queries, goals, and more generally terms where variables do not occur are called ground. Where variables do occur, they are called nonground. For example, f oo (a, b) is ground, whereas bar (X) is nonground.

Definition A substitution is a finite set (possibly empty) of pairs of the form X = t, where X is a variable and t is a term, and X X1 for every i j, and X does not occur in t1, for any i and j. .

An example of a substitution consisting of a single pair is {X=isaac}. Substitutions can be applied to terms. The result of applying a substi- tution G to a term A, denoted by AO, is the term obtained by replacing every occurrence of X by t in A, for every pair X = t in O.

The result of applying {X=isaac} to the term f ather(abraham,X) is the term father(abrahani,isaac).

Definition A is an instance of B if there is a substitution O such that A = BO. .

The goal f ather(abraham,isaac) is an instance of father (abraham, X) by this definition. Similarly, mother(sarah, isaac) is an instance of mother(X,Y) under the substitution {X=sarah,Y=isaac}.

1.4 Existential Queries

Logically speaking, variables in queries are existentially quantified, which means, intuitively, that the query father (abraham, X)? reads: "Does there exist an X such that abraham is the father of X?" More generally, a query p(T1,T2,. . .,T)?, which contains the variables X1,X2,. . .,Xk reads: "Are there X1,X2,. . .,Xk such that p(T1,T2,. . .,T)?" For convenience, exis- tential quantification is usually omitted.

The next deduction rule we introduce is generalization. An existential query P is a logical consequence of an instance of it, PO, for any substi- tution 6. The fact f ather(abraham, isaac) implies that there exists an X such that f ather(abraham,X) is true, namely, X=isaac.

15 Basic Constructs

Operationally, to answer a nonground query using a program of facts, search for a fact that is an instance of the query. If found, the answer, or solution, is that instance. A solution is represented in this chapter by the substitution that, if applied to the query, results in the solution. The answer is no if there is no suitable fact in the program.

In general, an existential query may have several solutions. Program 1.1 shows that Haran is the father of three children. Thus the query father (haran,X)? has the solutions (X=lot}, {X=milcah}, {X=yiscah}. Another query with multiple solutions is plus (X, Y, 4)? for finding num- bers that add up to 4. Solutions are, for example, {X=O, Y=4} and {X=1, Y=3}. Note that the different variables X and Y correspond to (possibly) different objects.

An interesting variant of the last query is plus(X,X,4)?, which insists that the two numbers that add up to 4 be the same. It has a unique answer {X=2}.

LS Universal Facts

Variables are also useful in facts. Suppose that all the biblical characters like pomegranates. Instead of including in the program an appropriate fact for every individual,

likes(abraham,pomegranates).

likes(sarah,pomegranates).

a fact likes (X,pomegranates) can say it all. Used in this way, variables are a means of summarizing many facts. The fact tirnes(O,X,O) summa- rizes all the facts stating that o times some number is O.

Variables in facts are implicitly universally quantified, which means, intuitively, that the fact likes(X,pomegranates) states that for all X, X likes pomegranates. In general, a fact p(T1,. . .,T) reads that for all X1,. . .,Xk, where the X1 are variables occurring in the fact, p(T1,. . is true. Logically, from a universally quantified fact one can deduce any instance of it. For example, from likes(X,poniegranates), deduce likes (abraham, pomegranates).

16 Chapter 1

This is the third deduction rule, called instantiation. From a universally quantified statement P, deduce an instance of it, PO, for any substitution 0.

As for queries, two unspecified objects, denoted by variables, can be constrained to be the same by using the same variable name. The fact plus (0, X, X) expresses that O is a left identity for addition. It reads that for all values of X, O plus X is X. A similar use occurs when translating the English statement "Everybody likes himself" to likes (X, X).

Answering a ground query with a universally quantified fact is straight- forward. Search for a fact for which the query is an instance. For example, the answer to plus(0,2,2)? is yes, based on the fact plus(0,X,X). An- swering a nonground query using a nonground fact involves a new defi- nition: a common instance of two terms.

Definition C is a common instance of A and B if it is an instance of A and an instance of B, in other words, if there are substitutions 0 and 02 such that C=A01 is syntactically identical to BO2.

For example, the goals plus(0,3,Y) and plus(0,X,X) have a com- mon instance plus (0,3,3). When the substitution {Y=3} is applied to plus (0,3, Y) and the substitution {X=3} is applied to plus (0, X, X), both yield plus(0,3,3).

In general, to answer a query using a fact, search for a common in- stance of the query and fact. The answer is the common instance, if one exists. Otherwise the answer is no.

Answering an existential query with a universal fact using a common instance involves two logical deductions. The instance is deduced from the fact by the rule of instantiation, and the query is deduced from the instance by the rule of generalization.

1.6 Conjunctive Queries and Shared Variables

An important extension to the queries discussed so far is conjunctive queries. Conjunctive queries are a conjunction of goals posed as a query, for example, f ather(terach,X) ,father(X,Y)? or in general, Q,. . Simple queries are a special case of conjunctive queries when there is a

17 Basic Constructs

single goal. Logically, it asks whether a conjunction is deducible from the program. We use '," throughout to denote logical and. Do not confuse the comma that separates the arguments in a goal with commas used to separate goals, denoting conjunction.

In the simplest conjunctive queries all the goals are ground, for exam- ple, f ather(abraham,isaac) ,male(lot)?. The answer to this query us- ing Program 1.1 is clearly yes because both goals in the query are facts in the program. In general, the query Q,. .,Q,?, where each Q is a ground goal, is answered yes with respect to a program P if each Q is implied by P. Hence ground conjunctive queries are not very interesting.

Conjunctive queries are interesting when there are one or more shared variables, variables that occur in two different goals of the query. An ex- ample is the query f ather(haran,X) ,male(X)?. The scope of a variable in a conjunctive query, as in a simple query, is the whole conjunction. Thus the query p(X),q(X)? reads: "Is there an X such that both p(X) and

Shared variables are used as a means of constraining a simple query by restricting the range of a variable. We have already seen an example with the query plus (X , X, 4)?, where the solution of numbers adding up to 4 was restricted to the numbers being the same. Consider the query f ather(harari,X) ,male(X)?. Here solutions to the query f a- ther(haran,X)? are restricted to children that are male. Program 1.1 shows there is only one solution, {X1ot}. Alternatively, this query can be viewed as restricting solutions to the query male (X)? to individuals who have Haran for a father.

A slightly different use of a shared variable can be seen in the query father (terach,X) ,father(X,Y)?. On the one hand, it restricts the sons of terach to those who are themselves fathers. On the other hand, it con- siders individuals Y, whose fathers are sons of terach. There are several solutions, for example, (Xabrahain, Y=isaac J and {X=haran, Y=lot J.

A conjunctive query is a logical consequence of a program P if all the goals in the conjunction are consequences of P, where shared variables are instantiated to the same values in different goals. A sufficient condi- tion is that there be a ground instance of the query that is a consequence of P. This instance then deduces the conjuncts in the query via general- ization.

The restriction to ground instances is unnecessary and will be lifted in Chapter 4 when we discuss the computation model of logic programs.

18 Chapter 1

We employ this restriction in the meantime to simplify the discussion in the coming sections.

Operationally, to solve the conjunctive query A1,A2 ,...,A? using a pro- gram P, find a substitution O such that A10 and.. . and AO are ground instances of facts in P. The same substitution applied to all the goals en- sures that instances of variables are common throughout the query. For example, consider the query f ather(haran,X) ,male(X)? with respect to Program 1.1. Applying the substitution {X=lot} to the query gives the ground instance father (haran,lot) ,male (lot)?, which is a conse- quence of the program.

1.7 Rules

Interesting conjunctive queries are defining relationships in their own right. The query father (haran,X) ,male(X)? is asking for a son of Ha- ran. The query father(terach,X) ,father(X,Y)? is asking about grand- children of Terach. This brings us to the third and most important state- ment in logic programming, a rule, which enables us to define new rela- tionships in terms of existing relationships.

Rules are statements of the form:

A - B1,B2,. .

where n O. The goal A is the head of the rule, and the conjunction of goals B1,. . .,B is the body of the rule. Rules, facts, and queries are also called Horn clauses, or clauses for short. Note that a fact is just a special case of a rule when n = O. Facts are also called unit clauses. We also have a special name for clauses with one goal in the body, namely, when n = 1. Such a clause is called an iterative clause. As for facts, variables appearing in rules are universally quantified, and their scope is the whole rule.

A rule expressing the son relationship is

son(X,Y) - father(Y,X), male(X).

Similarly one can define a rule for the daughter relationship:

daughter(X,Y) - father(Y,X), female(X).

19 Basic Constructs

A rule for the grandfather relationship is

grandfather(X,Y) father(X,Z), father(Z,Y).

Rules can be viewed in two ways. First, they are a means of ex- pressing new or complex queries in terms of simple queries. A query son(X,haran)? to the program that contains the preceding rule for son is translated to the query f ather(haran,X) ,male(X)? according to the rule, and solved as before. A new query about the son relationship has been built from simple queries involving father and male relationships. Interpreting rules in this way is their procedural reading. The procedural reading for the grandfather rule is: "To answer a query is X the grand- father of Y?, answer the conjunctive query Is X the father of Z and Z the father of Y?."

The second view of rules comes from interpreting the rule as a logical axiom. The backward arrow is used to denote logical implication. The son rule reads: "X is a son of Y if Y is the father of X and X is male." In this view, rules are a means of defining new or complex relationships using other, simpler relationships. The predicate son has been defined in terms of the predicates father and male. The associated reading of the rule is known as the declarative reading. The declarative reading of the grandfather rule is: "For all X, Y, and Z, X is the grandfather of Y if X is the father of Z and Z is the father of Y."

Although formally all variables in a clause are universally quantified, we will sometimes refer to variables that occur in the body of the clause, but not in its head, as if they are existentially quantified inside the body. For example, the grandfather rule can be read: "For all X and Y, X is the grandfather of Y if there exists a Z such that X is the father of Z and Z is the father of Y." The formal justification of this verbal transformation will not be given, and we treat it just as a convenience. Whenever it is a source of confusion, the reader can resort back to the formal reading of a clause, in which all variables are universally quantified from the outside.

To incorporate rules into our framework of logical deduction, we need the law of modus ponens. Modus ponens states that from B and A B we can deduce A,

Definition The law of universal modus ponens says that from the rule

R = (A - B1,B2,. .

20 Chapter 1

and the facts

B. B.

A' can be deduced if

A' - B'1,B,. .

is an instance of R.

Universal modus ponens includes identity and instantiation as special cases.

We are now in a position to give a complete definition of the concept of a logic program and of its associated concept of logical consequence.

Definition A logic program is a finite set of rules.

Definition An existentially quantified goal G is a logical consequence of a program P if there is a clause in P with a ground instance A - B1,. . . ,B, n O such that B1.....B are logical consequences of P, and A is an instance of G. .

Note that the goal G is a logical consequence of a program P if and only if G can be deduced from P by a finite number of applications of the rule of universal modus pollens.

Consider the query son (S, haran)? with respect to Program 1.1 aug- mented by the rule for son. The substitution {X=lot , Y=haran} applied to the rule gives the instance son(lot,haran) f ather(haran,lot), male(lot). Both the goals in the body of this rule are facts in Pro- gram 1.1. Thus universal modus ponens implies the query with answer {S=lot}.

Operationally, answering queries reflects the definition of logical con- sequence. Guess a ground instance of a goal, and a ground instance of a rule, and recursively answer the conjunctive query corresponding to the body of that rule. To solve a goal A with program P, choose a rule A1 B1,B2,. . .,B in P, and guess substitution O such that A = A10, and

21 Basic Constructs

BO is ground for i í n. Then recursively solve each BLO. This pro- cedure can involve arbitrarily long chains of reasoning. lt is difficult in general to guess the correct ground instance and to choose the right rule. We show in Chapter 4 how the guessing of an instance can be removed.

The rule given for son is correct but is an incomplete specification of the relationship. For example, we cannot conclude that Isaac is the son of Sarah. What is missing is that a child can be the son of a mother as well as the son of a father. A new rule expressing this relationship can be added, namely,

son(X,Y) - niother(Y,X), male(X).

To define the relationship grandparent correctly would take four rules to include both cases of father and mother:

grandparent (X, Y)

grandparent (X ,Y)

grandparent (X, Y)

grandparent (X ,Y)

- father(X,Z)

- mother(X,Z)

father(Z,

mother (Z

father(Z,

mother (Z

Y).

There is a better, more compact, way of expressing these rules. We need to define the auxiliary relationship parent as being a father or a mother. Part of the art of logic programming is deciding on what intermediate predicates to define to achieve a complete, elegant axiomatization of a relationship. The rules defining parent are straightforward, capturing the definition of a parent being a father or a mother. Logic programs can incorporate alternative definitions, or more technically disjunction, by having alternative rules, as for parent:

parent(X,Y) - f ather(X,Y).

parent(X,Y) mother(X,Y).

Rules for son and grandparent are now, respectively,

son(X,Y) - parent(Y,X), male(X).

grandparent(X,Y) - parent(X,Z), parent(Z,Y).

A collection of rules with the same predicate in the head, such as the pair of parent rules, is called a procedure. We shall see later that under the operational interpretation of these rules by Prolog, such a collection of rules is indeed the analogue of procedures or subroutines in conventional programming languages,

22 Chapter 1

1.8 A Simple Abstract Interpreter

An operational procedure for answering queries has been informally de- scribed and progressively developed in the previous sections. In this section, the details are fleshed out into an abstract interpreter for logic programs. In keeping with the restriction of universal modus ponens to ground goals, the interpreter only answers ground queries.

The abstract interpreter performs yes/no computations. It takes as input a program and a goal, and answers yes if the goal is a logi- cal consequence of the program and no otherwise. The interpreter is given in Figure 1.1. Note that the interpreter may fail to terminate if the goal is not deducible from the program, in which case no answer is given.

The current, usually conjunctive, goal at any stage of the computation is called the resolvent. A trace of the interpreter is the sequence of resol- vents produced during the computation. Figure 1.2 is a trace of answer- ing the query son(1ot,harn)? with respect to Program 1.2, a subset of the facts of Program 1.1 together with rules defining son and daughter. For clarity, Figure 1.2 also explicitly states the choice of goal and clause made at each iteration of the abstract interpreter.

Each iteration of the while loop of the abstract interpreter corresponds to a single application of modus ponens. This is called a reduction.

Input:

Output:

Algorithm

A ground goal G and a program P

yes if G is a logical consequence of P, no otherwise

Initialize the resolvent to G. while the resolvent is not empty do

choose a goal A from the resolvent choose a ground instance of a clause A' B1.....B,, from P

such that A and A' are identical (if no such goal and clause exist, exit the while loop)

replace A by B1.....B,, in the resolvent If the resolvent is empty, then output yes, else output no.

Figure 1.1 An abstract interpreter to answer ground queries with respect to logic programs

23 Basic Constructs

Input: sori(lot,haran)? and Program L2 Resolvent is son(lot ,haran) Resolvent is not empty

choose son(lot,haran) (the only choice) choose son(lot,haran) father(haran,lot), male(lot) new resolvent is father(haran,lot), male(lot)

Resolvent is not empty choose father (hamo ,lot) choose father (haran,lot). new resolvent is inale(lot)

Resolvent is not empty choose male(lot) choose male(lot) new resolvent is empty

Output: yes

Figure 1.2 Tracing the interpreter

father(abraham,isaac) male(isaac)

father(haran,lot). male(lot).

father (haran,mïlcah). female (milcah).

father(haran,yiscah) female(yiscah).

son(X,Y) - father(Y,X), male(X).

daughter(X,Y) father(Y,X), female(X).

Program 1.2 Biblical family relationships

Definition A reduction of a goal G by a program P is the replacement of G by the body of an instance of a clause in P, whose head is identical to the chosen goal.

A reduction is the basic computational step in logic programming. The goal replaced in a reduction is reduced, and the new goals are derived. In this chapter, we restrict ourselves to ground reductions, where the goal and the instance of the clause are ground. Later, in Chapter 4, we consider more general reductions where unification is used to choose the instance of the clause and make the goal to be reduced and the head of the clause identical.

24 Chapter 1

The trace in Figure 1.2 contains three reductions. The first reduces the goal son(lot,haran) and produces two derived goals, f ather(haran, lot) and male (lot). The second reduction is of father (har,lot) and produces no derived goals. The third reduction also produces no derived goals in reducing male (lot).

There are two unspecified choices in the interpreter in Figure 1.1. The first is the goal to reduce from the resolvent. The second choice is the clause (and an appropriate ground instance) to reduce the goal. These two choices have very different natures.

The selection of the goal to be reduced is arbitrary. In any given resol- vent, all the goals must be reduced. It can be shown that the order of reductions is immaterial for answering the query.

In contrast, the choice of the clause and a suitable instance is criti- cal. In general, there are several choices of a clause, and infinitely many ground instances. The choice is made nondeterministically. The concept of nondeterministic choice is used in the definition of many computa- tion models, e.g., finite automata and Turing machines, and has proven to be a powerful theoretic concept. A nondeterrninistic choice is an un- specified choice from a number of alternatives, which is supposed to be made in a "clairvoyant" way. If only some of the alternatives lead to a successful computation, then one of them is chosen. Formally, the con- cept is defined as follows. A computation that contains nondeterministic choices succeeds if there is a sequence of choices that leads to success. Of course, no real machine can directly implement this definition. How- ever, it can be approximated in a useful way, as done in Prolog. This is explained in Chapter 6.

The interpreter given in Figure 1.1 can be extended to answer non- ground existential queries by an initial additional step. Guess a ground instance of the query. This is identical to the step in the interpreter of guessing ground instances of the rules. It is difficult in general to guess the correct ground instance, since that means knowing the result of the computation before performing it.

A new concept is needed to lift the restriction to ground instances and remove the burden of guessing them. In Chapter 4, we show how the guess of ground instances can be eliminated, and we introduce the com- putational model of logic programs more fully. Until then it is assumed that the correct choices can be made.

25 Basic Constructs

Figure 1.3 A simple proof tree

A trace of a query implicitly contains a proof that the query follows from the program. A more convenient representation of the proof is with a proof tree. A proof tree consists of nodes and edges that represent the goals reduced during the computation. The root of the proof tree for a simple query is the query itself. The nodes of the tree are goals that are reduced during the computation. There is a directed edge from a node to each node corresponding to a derived goal of the reduced goal. The proof tree for a conjunctive query is just the collection of proof trees for the individual goals in the conjunction. Figure 1.3 gives a proof tree for the program trace in Figure 1.2.

An important measure provided by proof trees is the number of nodes in the tree. lt indicates how many reduction steps are performed in a computation. This measure is used as a basis of comparison between different programs in Chapter 3.

1.9 The Meaning of a Logic Program

How can we know if a logic program says what we wanted it to say? If it is correct, or incorrect? In order to answer such questions, we have to define what is the meaning of a logic program. Once defined, we can examine if the program means what we have intended it to mean.

Definition The meaning of a logic program P, M(P), is the set of ground goals deducible from P. u

From this definition it follows that the meaning of a logic program composed just of ground facts, such as Program 1.1, is the program it- self. In other words, for simple programs, the program "means just what

26 Chapter 1

it says." Consider Program 1.1 augmented with the two rules defining the parent relationship. What is its meaning? It contains, in addition to the facts about fathers and mothers, mentioned explicitly in the pro- gram, all goals of the form parent(X,Y) for every pair X and Y such that father (X, Y) or mother (X, Y) is in the program. This example shows that the meaning of a program contains explicitly whatever the program states implicitly.

Assuming that we define the intended meaning of a program also to be a set of ground goals, we can ask what is the relation between the actual and the intended meanings of a program. We can check whether everything the program says is correct, or whether the program says everything we wanted it to say.

Informally, we say that a program is correct with respect to some intended meaning M if the meaning of P, M(P), is a subset of M. That is, a correct program does not say things that were not intended. A program is complete with respect to M if M is a subset of M(P). That is, a complete program says everything that is intended. It follows that a program P is correct and complete with respect to an intended meaning M if M = M(P).

Throughout the book, when meaningful predicate and constant names are used, the intended meaning of the program is assumed to be the one intuitively implied by the choice of names.

For example, the program for the son relationship containing only the first axiom that uses father is incomplete with respect to the in- tuitively understood intended meaning of son, since it cannot deduce son(isaac,sarah). If we add to Program 1.1 the rule

son(X,Y) - rnother(X,Y), male(Y).

it would make the program incorrect with respect to the intended mean- mg, since it deduces son(sarah,isaac).

The notions of correctness and completeness of a logic program are studied further in Chapter 5.

Although the notion of truth is not defined fully here, we will say that a ground goal is true with respect to an intended meaning if it is a member of it, and false otherwise. We will say it is simply true if it is a member of the intended meaning implied by the names of the predicate and constant symbols appearing in the program.

27 Basic Constructs

1.10 Summary

We conclude this section with a summary of the constructs and concepts introduced, filling in the remaining necessary definitions.

The basic structure in logic programs is a term. A term is a constant, a variable, or a compound term. Constants denote particular individuals such as integers and atoms, while variables denote a single but unspec- ified individual. The symbol for an atom can be any sequence of char- acters, which is quoted if there is possibility of confusion with other symbols (such as variables or integers). Symbols for variables are distin- guished by beginning with ari uppercase letter.

A compound term comprises a functor (called the principal functor of the term) and a sequence of one or more terms called arguments. A functor is characterized by its name, which is an atom, and its arity or number of arguments. Constants are considered functors of arity O. Syn- tactically, compound terms have the form f(t1,t2.....t) where the functor has name f and is of arity n, and the t are the arguments. A functor f of arity n is denoted f/n. Functors with the same name but different arities are distinct. Terms are ground if they contain no variables; other- wise they are nonground. Goals are atoms or compound terms, and are generally nonground.

A substitution is a finite set (possibly empty) of pairs of the form X = t, where X is a variable and t is a term, with no variable on the left-hand side of a pair appearing on the right-hand side of another pair, and no two pairs having the same variable as left-hand side. For any substitution O {X1 = ti,X2 t2,... ,X = t} and term s, the term sO denotes the result of simultaneously replacing in s each occurrence of the variable X by t, i j <n; the term sO is called an instance of s. More will be said on this restriction on substitutions in the background to Chapter 4.

A logic program is a finite set of clauses. A clause or rule is a univer- sally quantified logical sentence of the form

A - B1,B2,. . .,Bk. k O,

where A and the B are goals. Such a sentence is read declaratively: "A is implied by the conjunction of the B1," and is interpreted procedurally "To answer query A, answer the conjunctive query B1,B2,. . .,Bk." A is called the clause's head and the conjunction of the B the clause's body. If k = O,

28 Chapter 1

the clause is known as a fact or unit clause and written A., meaning A is true under the declarative reading, and goal A is satisfied under the procedural interpretation. If k = 1, the clause is known as an iterative clause.

A query is a conjunction of the form

A1,...,A? n>O,

where the A are goals. Variables in a query are understood to be existen- tially quantified.

A computation of a logic program P finds an instance of a given query logically deducible from P. A goal G is deducible from a program P if there is an instance A of G where A B1,.. .,B, n O, is a ground instance of a clause in P, and the B are deducible from P. Deduction of a goal from an identical fact is a special case.

The meaning of a program P is inductively defined using logical de- duction. The set of ground instances of facts in P are in the meaning A ground goal G is in the meaning if there is a ground instance G B1,. . of a rule in P such that B1,. . .,B are in the meaning. The meaning consists of the ground instances that are deducible from the program.

An intended meaning M of a program is also a set of ground unit goals. A program P is correct with respect to an intended meaning M if M(P) is a subset of M. It is complete with respect to M if M is a subset of M(P). Clearly, it is correct and complete with respect to its intended meaning, which is the desired situation, if M = M(P).

A ground goal is true with respect to an intended meaning if it is a member of it, and false otherwise.

Logical deduction is defined syntactically here, and hence also the meaning of logic programs. In Chapter 5, alternative ways of describing the meaning of logic programs are presented, and their equivalence with the current definition is discussed.

2 Database Programming

2.1 Simple Databases

We begin by revising Program 1.1, the biblical database, and its aug- mentation with rules expressing family relationships. The database itself had four basic predicates, f ather/2, mother/2, male/i, and f e- male/i. We adopt a convention from database theory and give for each relation a relation scheme that specifies the role that each po- sition in the relation (or argument in the goal) is intended to repre- sent. Relation schemes for the four predicates here are, respectively, f ather(Father,Chilcl), mother(Mother,Child), male(Person), and female (Person). The mnemonic names are intended to speak for them- selves.

Variables are given mnemonic names in rules, but usually X or Y when discussing queries. Multiword names are handled differently for vari- ables and predicates. Each new word ¡n a variable starts with an upper- case letter, for example, NieceOrNephew, while words are delimited by

There are two basic styles of using logic programs: defining a logical database, and manipulating data structures. This chapter discusses data- base programming. A logic database contains a set of facts and rules. We show how a set of facts can define relations, as in relational data- bases. We show how rules can define complex relational queries, as in relational algebra. A logic program composed of a set of facts and rutes of a rather restricted format can express the functionalities associated with relational databases.

30 Chapter 2

underscores for predicate and function names, for example, schedule_ conf lict.

New relations are built from these basic relationships by defining suit- able rules. Appropriate relation schemes for the relationships introduced in the previous chapter are son(Son,Parent), daughter(Daughter, Parent), parent (Parent ,Child), and grandparent(Grandparent, Grandchild). From the logical viewpoint, it is unimportant which re- lationships are defined by facts and which by rules. For example, if the available database consisted of parent, male and female facts, the rules defining son and grandparent are still correct. New rules must be writ- ten for the relationships no longer defined by facts, namely, father and mother. Suitable rules are

father (Dad, Child) - parent(Dad, Child), male(Dad).

mother (Mum, Child) - parent(Mum, Child), f emale(Mum).

Interesting rules can be obtained by making relationships explicit that are present in the database only implicitly. For example, since we know the father and mother of a child, we know which couples produced off- spring, or to use a Biblical term, procreated. This is not given explicitly in the database, but a simple rule can be written recovering the information. The relation scheme is procreated(Man,Woman).

procreated(Man,Woman) -

father (Man,Child), mother(Woman,Child).

This reads: "Man and Woman procreated if there is a Child such that Man is the father of Child and Woman is the mother of Child."

Another example of information that can be recovered from the simple information present is sibling relationships - brothers and sisters. We give a rule for brother (Brother,Sibling).

brother (Brother ,Sib) -

parent (Parent ,Brother), parent(Parent ,Sib), male(Brother).

This reads: "Brother is the brother of Sib if Parent is a parent of both Brother and Sib, and Brother is male."

There is a problem with this definition of brother. The query brother (X,X)? is satisfied for any male child X, which is not our understanding of the brother relationship.

In order to preclude such cases from the meaning of the program,

31 Database Programming

abraham isaac.

abraham milcah.

isaac lot.

liaran lot.

lot milcah.

Figure 2.1 Defining inequality

uncle (Uncle Person)

brother(Uncle,Parent), parent(Parent,Person).

sibling(Sibl,Sib2) -

parent(Parent,Sibl), parent(Parent,Sib2), Sibi Sib2.

cousin(Cousini ,Cousin2) -

parent (Parenti ,Cousini),

parent (Parent2 ,Cousin2),

sibling(Parentl,Parent2).

Program 2.1 Defining family relationships

we introduce a predicate (Termi , Term2). It is convenient to write this predicate as an infix operator. Thus Termi Term2 is true if Terni and Term2 are different. For the present it is restricted to constant terms. lt can be defined, in principle, by a table X Y for every two different individuals X and Y in the domain of interest. Figure 2.1 gives part of the appropriate table for Program 1.1.

The new brother rule is

brother (Brother,Sib) parent (Parent ,Brother), parent (Parent ,Sib), male (Brother) Brother Sib.

The more relationships that are present, the easier it is to define com- plicated relationships. Program 2.1 defines the relationships uncle(Uncle,Person), sibling(Sibl,Sib2), and cousin(Cousinl, Cousin2). The definition of uncle in Program 2.1 does not define the husband of a sister of a parent to be an uncle. This may or may not be the intended meaning. In general, different cultures define these family relationships differently. In any case, the logic makes clear exactly what the programmer means by these family relationships.

abraham haran.

abraham yiscah.

isaac milcah.

haran milcah.

lot yiscah.

abraham lot.

isaac liaran.

isaac yiscah.

haran 4 yiscah.

mïlcah yiscah.

32 Chapter 2

Another relationship implicit in the family database is whether a woman is a mother. This is determined by using the mother/2 relation- ship. The new relation scheme is mother (Woman), defined by the rule

mother(Woman) - mother(Woman,Child).

This reads: "Woman is a mother if she is the mother of some Child." Note that we have used the same predicate name, mother, to describe two different mother relationships. The mother predicate takes a different number of arguments, i.e., has a different arity, in the two cases. In general, the same predicate name denotes a different relation when it has a different arity.

We change examples, lest the example of family relationships become incestuous, and consider describing simple logical circuits. A circuit can be viewed from two perspectives. The first is the topological layout of the physical components usually described in the circuit diagram. The second is the interaction of functional units. Both views are easily ac- commodated in a logic program. The circuit diagram is represented by a collection of facts, while rules describe the functional components.

Program 2.2 is a database giving a simplified view of the logical and- gate drawn in Figure 2.2. The facts are the connections of the particular resistors and transistors comprising the circuit. The relation scheme for resistors is resistor(Endl,End2) and for transistors transis- tor (Gate ,Source, Drain).

Power

Figure 2.2 A logical circuit

n3 o

n5 o

33 Database Programming

rosistor(power,nl).

resistor(power,n2).

transistor(n2,ground,nl).

transistor(n3,n4,ri2)

transistor(n5,ground,n4).

inverter(Input,Output) - Output is the inversion of Input.

inverter(Input ,Output) -

transistor(Input ,ground,Uutput),

resistor (power ,Output).

nand_gate(Inputl,Input2,Output) - Output is the logical nand of Inputi and Input2.

nand_gate(Inputl , Input2,Uutput)

transïstor(Inputl ,X,Output),

transistor(Input2 ,ground,X),

resistor (power ,Output).

and_gate(Inputl,Iriput2,Output) - Output is the logical and of Inputi and Iriput2.

and_gate(Inputl ,Input2,Output) -

nand_gate(Inputl ,Iriput2,X),

inverter (X ,Output).

Program 2.2 A circuit for a logical and-gate

The program demonstrates the style of commenting of logic programs we will follow throughout the book. Each interesting procedure is pre- ceded by a relation scheme for the procedure, shown in italic font, and by English text defining the relation. We recommend this style of comment- ing, which emphasizes the declarative reading of programs, for Prolog programs as well.

Particular configurations of resistors and transistors fulfill roles cap- tured via rules defining the functional components of the circuit. The circuit describes an and-gate, which takes two input signals and pro- duces as output the logical and of these signals. One way of building an and-gate, and how this circuit is composed, is to connect a nand-gate with an inverter. Relation schemes for these three components are and_ gate(Inputl,Input2,Output), nand_gate(Inputl,Input2,Output),

and inverter(Input ,Output).

34 Chapter 2

To appreciate Program 2.2, let us read the inverter rule. This states that an inverter is built up from a transistor with the source connected to the ground, and a resistor with one end connected to the power source. The gate of the transistor is the input to the inverter, while the free end of the resistor must be connected to the dram of the transistor, which forms the output of the inverter. Sharing of variables is used to insist on the common connection.

Consider the query and_gate(Inl , 1n2 ,Out)? to Program 2.2. It has the solution {Ini=n3, 1n2=n5 , Out=nl}. This solution confirms that the circuit described by the facts is an and-gate, and indicates the inputs and output.

2.1.1 Exercises for Section 2.1

Modify the rule for brother on page 21 to give a rule for sister, the rule for uncle in Program 2.1 to give a rule for niece, and the rule for sibling in Program 2.1 so that it only recognizes full siblings, i.e., those that have the same mother and father.

Using a predicate married_couple(Wife,Husband), define the rela- tionships mother_in_law, brother_in_law, and son_in_law.

Describe the layout of objects in Figiire 2.3 with facts using the predicates left_of(Objectl,Object2) and above(Objecti3Ob- ject2). Define predicates right_of (Objectl,Object2) and below (Obj ect 1. .Obj ect2) in terms of left_of and above, respectively.

Figure 2.3 Still-life objects

35 Database Programming

22 Structured Data and Data Abstraction

A limitation of Program 2.2 for describing the and-gate is the treatment of the circuit as a black box. There is no indication of the structure of the circuit in the answer to the and_gate query, even though the structure has been implicitly used in finding the answer. The rules tell us that the circuit represents an and-gate, but the structure of the and-gate is present only implicitly. We remedy this by adding an extra argument to each of the goals in the database. For uniformity, the extra argument becomes the first argument. The base facts simply acquire an identifier. Proceeding from left to right in the diagram of Figure 2.2, we label the resistors rl and r2, and the transistors ti, t2, and t3.

Names of the functional components should reflect their structure. An inverter is composed of a transistor and a resistor. To represent this, we need structured data. The technique is to use a compound term, inv(T,R), where T and R are the respective names of the inverter's com- ponent transistor and resistor. Analogously, the name of a nand-gate will be nand(T1,T2,R), where Ti, T2, and R name the two transistors and re- sistor that comprise a nand-gate. Finally, an and-gate can be named in terms of an inverter and a nand-gate. The modified code containing the names appears in Program 2.3.

The query and_gate(G,Inl,1n2,Out)? has solution {G=and(nand(t2, t3,r2) ,inv(ti,rl)) ,In1=n3,In2n5,Out=n1}. Ini, 1n2, and Out have their previous values. The complicated structure for G reflects accurately the functional composition of the and-gate.

Structuring data is important in progranmìing in general and in logic programming in particular. It is used to organize data in a meaningful way. Rules can be written more abstractly, ignoring irrelevant details. More modular programs can be achieved this way, because a change of data representation need not mean a change in the whole program, as shown by the following example.

Consider the following two ways of representing a fact about a lecture course on complexity given on Monday from 9 to 11 by David Harel in the Feinberg building, room A:

course(complexity,monday,9,li,david,harel,feinberg,a).

and

36 Chapter 2

resistor (R,Nodel,Node2) - R is a resistor between Nodel and Node2.

resistor(rl,power,nl).

resistor(r2,power,n2).

transistor ( T,Gate,Source,D rain) * T is a transistor whose gate is Gate, source is Source, and drain is Drain.

transistor(tl,n2,ground,ni).

transistor(t2,n3,n4,n2).

transistor(t3,n5,ground,n4).

inverter(I,Input,Output) - I is an inverter that inverts In put to Output.

inverter(inv(T,R) ,Input,Output) -

transistor(T, Input ,ground,Output),

resistor(R,power,Output).

nand_g ate (Nand,Inputl ,Input2, Output) - Nand is a gate forming the logical nand, Output, of Inputi and Input2.

nand_gate(nand(Tl ,T2 ,R) ,Inputl ,Input2,Output) -

transistor(T1 ,Inputl,X,Output),

transistor (T2 , Input2 ,ground, X),

resistor(R,power,Output).

and_gate(And,Inputl,Input2,Output) And is a gate forming the logical and, Output, of Inputi and Input2.

and_gate(and(N,I),Inputi,Input2,Output) -

nan&gate(N, Input i, Input2 ,X),

inverter(I ,X,Output).

Program 2.3 The circuit database with names

37 Database Programming

course(complexity,time(monday,9,11) ,lecturer(david,harel),

location(feinberg,a)). The first fact represents course as a relation between eight items - a course name, a day, a starting hour, a finishing hour, a lecturer's first name, a lecturer's surname, a building, and a room. The second fact makes course a relation between four items - a name, a time, a lecturer, and a location with further qualification. The time is composed of a day, a starting time, and a finishing time; lecturers have a first name and a surname; and locations are specified by a building and a room. The second fact reflects more elegantly the relations that hold.

The four-argument version of course enables more concise rules to be written by abstracting the details that are irrelevant to the query. Program 2.4 contains examples. The occupied rule assumes a predicate less than or equal, represented as a binary infix operator .

Rules not using the particular values of a structured argument need not "know" how the argument is structured. For example, the rules for duration and teaches represent time explicitly as time (Day »Start, Finish) because the Day or Start or Finish times of the course are de- sired. In contrast, the rule for lecturer does not. This leads to greater modularity, because the representation of time can be changed without affecting the rules that do not inspect it.

We offer no definitive advice on when to use structured data. Not using structured data allows a uniform representation where all the data are simple. The advantages of structured data are compactness of represen- tation, which more accurately reflects our perspective of a situation, and

lecturer (Lecturer ,Course)

course (Course,Time,Lecturer,Location)

duration(Courso ,Length) -

course(Course,time(Day,Start,Finish),Lecturer,Location),

plus (Start, Length ,Finish).

teaches (Lecturer ,Day) -

course(Course,time(Day,Start,Finish) ,Lecturer,Locatïon).

occupied(Room,Day,Time) -

course(Course,time(Day,Start,Finish),Lecturer,Room),

Start Time, Time Finish.

Program 2.4 Course rules

38 Chapter2

modularity. We can relate the discussion to conventional programming languages. Facts are the counterpart of tables, while structured data cor- respond to records with aggregate fields.

We believe that the appearance of a program is important, particularly when attempting difficult problems. A good structuring of data can make a difference when programming complex problems.

Some of the rules in Program 2.4 are recovering relations between two individuals, binai-y relations, from the single, more complicated one. All the course information could have been written in terms of binary relations as follows:

day (complexity, monday).

start_time (complexity, 9).

f inish_time(cornplexity, 11).

lecturer(complexity,harel).

building(complexity,feinberg).

room(complexity, a).

Rules would then be expressed differently, reverting to the previous style of making implicit connections explicit. For example,

teaches(Lecturer,Day) -

lecturer (Course ,Lecturer), day(Course ,Day).

2.2.1 Exercises for Section 2.2

Add rules defining the relations location(Course,Building), busy (Lecturer ,Time), and cannot_meet (Lecturerl ,Lecturer2). Test with your own course facts.

Possibly using relations from Exercise (i), define the relation sched- ule_conflict (Time ,Place ,Coursel ,Course2).

Write a program to check if a student has met the requirements for a college degree. Facts will be used to represent the courses that the student has taken and the grades obtained, and rules will be used to enforce the college requirements.

Design a small database for an application of your own choice. Use a single predicate to express the information, and invent suitable rules.

39 Database Programming

2.3 Recursive Rules

The rules described so far define new relationships in terms of existing ones. An interesting extension is recursive definitions of relationships that define relationships in terms of themselves. One way of viewing recursive rules is as generalization of a set of nonrecursive rules.

Consider a series of rules defining ancestors - grandparents, great- grandparents, etc:

grandparent (Ancestor, Descendant) -

parent (Ancestor,Person), parent(Person,Descendant).

greatgrandparent (Ancestor, Descendant) -

parent (Ancestor,Person), grandparent(Person,Descendant).

greatgreatgrandparent (Ancestor ,Descendant) -

parent (Ancestor,Person), greatgrandparent(Person,

Descendant).

A clear pattern can be seen, which can be expressed in a rule defining the relationship ancestor (Ancestor ,Descendant):

ancestor (Ancestor, Descendant) -

parent (Ancestor,Person), ancestor(Person,Descendant).

This rule is a generalization of the previous rules. A logic program for ancestor also requires a nonrecursive rule, the

choice of which affects the meaning of the program. If the fact ances- tor (X, X) is used, defining the ancestor relationship to be reflexive, peo- ple will be considered to be their own ancestors. This is not the intuitive meaning of ancestor. Program 2.5 is a logic program defining the ances- tor relationship, where parents are considered ancestors.

ancestor(Ancestor,Descendant) Ancestor is an ancestor of Descendant.

ancestor(Ancestor,Descendaxit) - parent (Ancestor , Descendant)

ancestor(Ancestor,Descendant) - parent(Ancestor,Person), ancestor(Person,Descendant).

Program 2.5 The ancestor relationship

40 Chapter 2

The ancestor relationship is the transitive closure of the parent re- lationship. In general, finìding the transitive closure of a relationship is easily done in a logic program by using a recursive rule.

Program 2.5 defining ancestor is an example of a linear recursive pro- gram. A program is linear recursive if there is only one recursive goal in the body of the recursive clause. The linearity can be easily seen from considering the complexity of proof trees solving ancestor queries. A proof tree establishing that two individuals are n generations apart given Program 2.5 and a collection of parent facts has 2 . n nodes.

There are many alternative ways of defining ancestors. The declarative content of the recursive rule in Program 2.5 is that Ancestor is an ances- tor of Descendant if Ancestor is a parent of an ancestor of Descendant. Another way of expressing the recursion is by observing that Ancestor would be an ancestor of Descendant if Ancestor is an ancestor of a par- ent of Descendant. The relevant rule is

ancestor(Ancestor,Descendant) -

ancestor(Ancestor,Person), parent(Person,Descendant).

Another version of defining ancestors is not linear recursive. A pro- gram identical in meaning to Program 2.5 but with two recursive goals in the recursive clause is

ancestor(Ancestor,Descendant) -

parent (Ancestor,Descendant).

ancestor(Ancestor,Descendant) -

ancestor(Ancestor,Person), ancestor(Person,Descendant).

Consider the problem of testing connectivity in a directed graph. A directed graph can be represented as a logic program by a collection of facts. A fact edge(Nodel,Node2) is present in the program if there is an edge from Nodel to Node2 in the graph. Figure 2.4 shows a graph; Program 2.6 is its description as a logic program.

Two nodes are connected if there is a series of edges that can be tra- versed to get from the first node to the second. That is, the relation con- nected(Nodel,Node2), which is true if Nodel and Node2 are connected, is the transitive closure of the edge relation. For example, a and e are connected in the graph in Figure 2.4, but b and f are not. Program 2.7 defines the relation. The meaning of the program is the set of goals con-

41 Database Programming

Figure 2.4 A simple graph

edge(a,b). edge(a,c), edge(b,d).

edge(c,d). edge(d,e). edge(f,g).

Program 2.6 A directed graph

connected(Nodel,Node2) - Nodel is connected to Node2 in the graph defined by the edge/2 relation.

cormected(Node,Node).

connected(Nodel,Node2) - edge(Nodel,Link),

Program 2.7 The transitive closure of the edge relation

nected(X,Y), where X and Y are connected. Note that connected is a transitive reflexive relation because of the choice of base fact.

2.3.1 Exercises for Section 2.3

(i) A stack of blocks can be described by a collection of facts on (Blockl,Block2), which is true if Blocki is on Block2. Define a predicate above(Blockl,Block2) that is true if Blocki is above Block2 in the stack. (Hint: above is the transitive closure of on.)

connected(Link,Node2)

42 Chapter 2

Add recursive rules for left_of and above from Exercise 2.1(iii) on p. 34. Define higher(Objectl,Object2),whichis true if Objecti is on a line higher than Object2 in Figure 2.3. For example, the bicycle is higher than the fish in the figure.

How many nodes are there in the proof tree for connected(a,e) using Programs 2.6 and 2.7? In general, using Program 2.6 and a collection of edge/2 facts, how many nodes are there in a proof tree establishing that two nodes are connected by a path containing n intermediate nodes?

2.4 Logic Programs and the Relational Database Model

Logic programs can be viewed as a powerful extension to the relational database model, the extra power coming from the ability to specify rules. Many of the concepts mtroduced have meaningful analogues in terms of databases. The converse is also true. The basic operations of the rela- tional algebra are easily expressed within logic programming.

Procedures composed solely of facts correspond to relations, the arity of the relation being the arity of the procedure. Five basic operations define the relational algebra: union, set difference, Cartesian product, projection, and selection. We show how each is translated into a logic program.

The union operation creates a relation of arity n from two relations r and s, both of arity n. The new relation, denoted here r_union_s, is the union of r and s. It is defined directly as a logic program by two rules:

r_union_s(X1, . . . ,X) - r(Xi, . . . r_union_s(X1, . . . ,X) - s(Xi, . . . ,X).

Set difference involves negation. We assume a predicate not. Intu- itively, a goal not G is true with respect to a program P if G is not a logical consequence of P. Negation in logic programs is discussed in Chapter 5, where limitations of the intuitive definition are indicated. The definition is correct, however, if we deal only with ground facts, as is the case with relational databases.

The definition of r_diff_s of arity n, where r and s are of arity n, is

43 Database Programming

r_diff_s(Xi, . . ,X) - r(Xi, . . . ,X,), not s(Xi, . . ,X,). Cartesian product can be defined in a single rule. If r is a relation of

arity m, and s is a relation of arity n, then r_x_s is a relation of arity m + n defined by

r_x_s(Xi, . . ,X,Xji, . . . ,Xm) r(Xi, . . . s(Xm+i, . . . ,Xmn).

Projection involves forming a new relation comprising'only some of the attributes of an existing relation. This is straightforward for any particular case. For example, the projection r13 selecting the first and third arguments of a relation r of arity 3 is

r13(Xi,X3) - r(Xì1X2,X3).

Selection is similarly straightforward for any particular case. Consider a relation consisting of tuples whose third components are greater than their second, and a relation where the first component is Smith or Jones. In both cases a relation r of arity 3 is used to illustrate. The first example creates a relation rl:

rl(X1,X2,X3) - r(Xj,X2,X3) ,X3 > X2.

The second example creates a relation r2, which requires a disjunctive relationship, smith_or_j ones:

r2(Xi,X2,X3) - r(Xj,X2,X3), smith_or_jones(Xj). smith_or_j ones (smith).

smith_or_j ones (jones).

Some of the derived operations of the relational algebra are more closely related to the constructs of logic programming We mention two, intersection and the natural join. If r and s are relations of arity n, the intersection, r_meet_s is also of arity n and is defined in a single rule.

r_meet_s(Xi, . . . ,X) - r(X1, . . . ,X), s(Xi, . . . ,X). A natural join is precisely a conjunctive query with shared variables.

44 Chapter 2

2.5 Background

Readers interested in pursuing the connection between logic program- ming and database theory are referred to the many papers that have been written on the subject. A good starting place is the review paper by Gallaire et al. (1984). There are earlier papers on logic and databases in Gallaire and Minker (1978). Another interesting book is about the imple- mentation of a database query language in Prolog (Li, 1984). Our discus- sion of relational databases follows Uliman (1982). Another good account of relational databases can be found in Maier (1983).

In the seven years between the appearance of the first edition and the second edition of this book, the database community has accepted logic programs as extensions of relational databases. The term used for a data- base extended with logical rules is logic database or deductive database. There is now a wealth of material about logic databases. The rewritten version of Ullman's text (1989) discusses logic databases and gives point- ers to the important literature.

Perhaps the major difference between logic databases as taught from a database perspective and the view presented here is the way of evalu- ating queries. Here we implicitly assume that the interpreter from Figure 4.2 will be used, a top-down approach. The database community prefers a bottom-up evaluation mechanism. Various bottom-up strategies for an- swering a query with respect to a logic database are given in Uliman (1989).

In general, an n-ary relation can be replaced by n + i binary relations, as shown by Kowalski (1979a). If one of the arguments forms a key for the relation, as does the course name in the example in Section 2.2, n binary relations suffice.

The addition of an extra argument to each predicate in the circuit, as discussed at the beginning of Section 2.2, is an example of an en- hancement of a logic program. The technique of developing programs by enhancement is of growing importance. More will be said about this in Chapter 13.

3.1 Arithmetic

The simplest recursive data type, natural numbers, arises from the foun- dations of mathematics. Arithmetic is based on the natural numbers. This section gives logic programs for performing arithmetic.

In fact, Prolog programs for performing arithmetic differ considerably from their logical counterparts, as we will see in later chapters. How- ever, it is useful to spend time discussing the logic programs. There are

Recursive Prograniniing

The programs of the previous chapter essentially retrieve information from, and manipulate, finite data structures. In general, mathematical power is gained by considering infinite or potentially infinite structures. Finite instances then follow as special cases. Logic programs harness this power by using recursive data types.

Logical terms can be classified into types. A type is a (possibly infinite) set of terms. Some types are conveniently defined by unary relations. A relation p/i defines the type p to be the set of X's such that p(X).

For example, the male/i and female/i predicates used previously de- fine the male and female types.

More complex types can be defined by recursive logic programs. Such types are called recursive types. Types defined by unary recursive pro- grams are called simple recursive types. A program defining a type is called a type definition.

In this chapter, we show logic programs defining relations over simple recursive types, such as integers, lists, and binary trees, and also pro- grams over more complex types, such as polynomials.

46 Chapter 3

natural_number(X) - X is a natural number.

natural_nuniber(0). natural_number(s(X)) - natural_number(X).

Program 3.1 Defining the natural numbers

two main reasons. First, the operations of arithmetic are usually thought of functionally rather than relationally. Presenting examples for such a familiar area emphasizes the change in thinking necessary for compos- ing logic programs. Second, it is more natural to discuss the underlying mathematical issues, such as correctness and completeness of programs.

The natural numbers are built from two constructs, the constant sym- bol O and the successor function s of arity 1. All the natural numbers are then recursively given as 0, s(0), s(s(0)), s(s(s(0))).....We adopt the convention that s(0) denotes the integer n, that is, n applications of the successor function to O.

As in Chapter 2, we give a relation scheme for each predicate, together with the intended meaning of the predicate. Recall that a program P is correct with respect to an intended meaning M if the meaning of P is a subset of M. It is complete if M is a subset of the meaning of P. It is correct and complete if its meaning is identical to M. Proving correctness establishes that everything deducible from the program is intended. Proving completeness establishes that everything intended is deducible from the program. Two correctness and completeness proofs are given in this section.

The simple type definition of natural numbers is neatly encapsulated in the logic program, shown as Program 3.1. The relation scheme used is natural_number (X), with intended meaning that X is a natural num- ber. The program consists of one unit clause and one iterative clause (a clause with a single goal in the body). Such a program is called minimal recursive.

Proposition Program 3.1 is correct and complete with respect to the set of goals natural_number(s'(0)), for i O.

Proof (1) Completeness. Let n be a natural number. We show that the goal natural_number (n) is deducible from the program by giving an explicit proof tree. Either n is O or of the form sa(0). The proof tree for the goal natural_nuniber(0) is trivial. The proof tree for the goal

47 Recursive Programming

uraI_number(s1 (0)

s(s(0),sm(0) ,sn*m(0))

s (0) ,sm(0) (O

js(s(0) ,sm(0),sm+l (0))

cpIus(0),sm(0),sm(0)

cturaJ_number(sm(O

Figure 3.1 Proof trees establishing completeness of programs

natural_number (s (. .s(0)...)) contains n reductions, using the rule in Program 3.1, to reach the fact riatural_number(0), as shown in the left half of Figure 3.1.

(2) Correctness. Suppose that natural_nuniber(X) is deducible from Program 3.1, in n deductions. We prove that natural_riuniber(X) is in the intended meaning of the program by induction on n. If n = O, then the goal must have been proved using a unit clause, which implies that X = O. If n > o, then the goal must be of the form natural_number(s(X')), since it is deducible from the program, and further, natural_nuxnber(X') is deducible in n - i deductions. By the induction hypothesis, X' is in the intended meaning of the program, i e X'=&c (0) for some k O. u

The natural numbers have a natural order. Program 3.2 is a logic pro- gram defining the relation less than or equal to according to the order. We denote the relation with a binary infix symbol, or operator, , accord- ing to mathematical usage. The goal O < X has predicate symbol < of arity 2, has arguments O and X, and is syntactically identical to '' (O,X).

48 Chapter 3

x Y - X and Y are natural numbers, such that X is less than or equal to Y.

O X - natural_number(X). s(X) < s(Y) - X Y. natural_number(X) - See Program 3.1

Program 3.2 The less than or equal relation

The relation scheme is N1 N2. The intended meaning of Program 3.2 is all ground facts X Y, where X and Y are natural numbers and X is less than or equal to Y. Exercise (ii) at the end of this section is to prove the correctness and completeness of Program 3.2.

The recursive definition of is not computationally efficient. The proof tree establishing that a particular N is less than a particular M has M + 2 nodes. We usually think of testing whether one number is less than another as a unit operation, independent of the size of the numbers. Indeed, Prolog does not define arithmetic according to the axioms pre- sented in this section but uses the underlying arithmetic capabilities of the computer directly.

Addition is a basic operation defining a relation between two natural numbers and their sum. In Section 1.1, a table of the plus relation was assumed for all relevant natural numbers. A recursive program captures the relation elegantly and more compactly, and is given as Program 3.3. The intended meaning of Program 3.3 is the set of facts plus (X,Y,Z), where X, Y, and Z are natural numbers and X+Y=Z.

Proposition Programs 3.1 and 3.3 constitute a correct and complete axiomatization of addition with respect to the standard intended meaning of plus/3.

Proof (1) Completeness. Let X, Y, and Z be natural numbers such that X+Y=Z. We give a proof tree for the goal plus (X,Y,Z). If X equals O, then Y equals Z. Since Program 3.1 is a complete axiomatization of the natural numbers, there is a proof tree for natural_number(Y), which is easily extended to a proof tree for plus (O, Y, Y). Otherwise, X equals 5n(Q) for some n. If Y equals m (0), then Z equals sm (0). The proof tree in the right half of Figure 3.1 establishes completeness.

49 Recursive Programming

plus(X,Y,Z) X, Y , and Z are natural numbers such that Z is the sum of X and Y.

plus(O,X,X) natural_number(X).

plus(s(X),Y,s(Z)) plus(X,Y,Z).

natural_number(X) - See Program 3.1

Program 3.3 Addition

(2) Correctness. Let plus(X,Y,Z) be in the meaning. A simple induc- tive argument on the size of X, similar to the one used in the previous proposition, establishes that X+Y=Z. u

Addition is usually considered to be a function of two arguments rather than a relation of arity 3. Generally, logic programs corresponding to functions of n arguments define relations of arity n + 1. Computing the value of a function is achieved by posing a query with n arguments instantiated and the argument place corresponding to the value of the function uninstantiated. The solution to the query is the value of the function with the given arguments. To make the analogy clearer, we give a functional definition of addition corresponding to the logic program:

o+X = X. s(X)+Y = s(X+Y).

One advantage that relational programs have over functional programs is the multiple uses that can be made of the program. For example, the query plus (s (0) ,s(0) ,s(s(0)))? means checking whether i + i = 2. (We feel free to use the more readable decimal notation when mentioning numbers.) As for , the program for plus is not efficient. The proof tree confirming that the sum of N and M is N + M has N + M + 2 nodes.

Posing the query plus(s(0) ,s(0) ,X)?, an example of the standard use, calculates the sum of 1 and 1. However, the program can just as eas- ily be used for subtraction by posing a query such as plus (s (0) , X, s (s (s(0))))?. The computed value of Xis the difference between 3 and 1, namely, 2. Similarly, asking a query with the first argument uninstanti- ated, and the second and third instantiated, also performs subtraction.

A more novel use exploits the possibility of a query having multiple so- lutions. Consider the query plus(X,Y,s(s(s(0))))?. It reads: "Do there

50 Chapter 3

exist numbers X and Y that add up to 3." In other words, find a partition of the number 3 into the sum of two numbers, X and Y. There are several solutions.

A query with multiple solutions becomes more interesting when the properties of the variables in the query are restricted. There are two forms of restriction: using extra conjuncts in the query, and instanti- ating variables in the query. We saw examples of this when querying a database. Exercise (ii) at the end of this section requires to define a pred- icate even(X), which is true if X is an even number. Assuming such a predicate, the query plus (X,Y,N) ,even(X) ,even(Y)? gives a partition of N into two even numbers. The second type of restriction is exemplified by the query plus(s(s(X)),s(s(Y)),N)?, which insists that each of the numbers adding up to N is strictly greater than 1.

Almost all logic programs have multiple uses. Consider Program 3.2 for , for example. The query s (0) s (s (0))? checks whether i is less than or equal to 2. The query X s(s(0))? finds numbers X less than or equal to 2. The query X Y? computes pairs of numbers less than or equal to each other.

Program 3.3 defining addition is not unique. For example, the logic program

plus(X,O,X) - natural_nuniber(X).

plus(X,s(Y),s(Z)) - plus(X,Y,Z).

has precisely the same meaning as Program 3.3 for plus. Two programs are to be expected because of the symmetry between the first two argu- ments. A proof of correctness and completeness given for Program 3.3 applies to this program by reversing the roles of the symmetric argu- ments.

The meaning of the program for plus would not change even if it consisted of the two programs combined. This composite program is un- desirable, however. There are several different proof trees for the same goal. It is important both for runtime efficiency and for textual concise- ness that axiomatizations of logic programs be minimal.

We define a type condition to be a call to the predicate defining the type. For natural numbers, a type condition is any goal of the form natural_number (X).

In practice, both Programs 3.2 and 3.3 are simplified by omitting the body of the base rule, natural_number (X). Without this test, facts such

51 Recursive Programming

times(X,Y,Z) X, Y, arid Z are natural numbers such that Z is the product of X and Y.

times(O,X,O)

times(s(X) ,Y,Z) es(X,Y,XY), plus(XY,Y,Z).

plus(X,Y,Z) See Program 3.3

Program 3.4 Multiplication as repeated addition

exp(N,X,Y) N, X, and Y are natural numbers such that Y equals X raised to the power N.

exp(s(X) 0,0).

exp(0,s(X) ,s(0))

exp(s(N),X,Y) .- exp(N,X,Z), times(Z,X,Y).

times(X,Y,Z) - See Program 3.4

Program 3.5 Exponentiation as repeated multiplication

as O a and plus(O,a,a), where a is an arbitrary constant, will be in the programs' meanings. Type conditions are necessary for correct programs. However, type conditions distract from the simplicity of the programs arid affect the size of the proof trees. Hence in the following we might omit explicit type conditions from the example programs, Pro- grams 3.4-3.7.

The basic programs shown are the building blocks for more compli- cated relations. A typical example is defining multiplication as repeated addition. Program 3.4 reflects this relation. The relation scheme is times(X,Y,Z), meaning X times Y equals Z.

Exponentiation is defined as repeated multiplication. Program 3.5 for exp(N,X,Y) expresses the relation that XN=Y. It is analogous to Pro- gram 3.4 for times(X,Y,Z), with exp and times replacing times and plus, respectively. The base cases for exponentiation are X0=1 for all pos- itive values of X, and 0N=0 for positive values of N.

A definition of the factorial function uses the definition of multiplica- tion.RecallthatN! =NN i .....2 i.Thepredicatefactorial(N,F) relates a number N to its factorial F. Program 3.6 is its axiomatization.

52 Chapter 3

factoria!(N,F) - F equals N factorial.

Íactorial(O,s(0)) factorial(s(N),F) - factorial(N,F1), times(s(N),F1,F). times(X,Y,Z) - SeeProgram3.4.

Program 3.6 Computing factorials

minimum(N1,N2,Min) - The minimum of the natural numbers Nl and N2 is Min.

minimum(N1,N2,N1) Nl 5 N2.

minimum(Nl,N2,N2) - N2 Nl.

Nl 52 - See Program 3.2

Program 3.7 The minimum of two numbers

Not all relations concerning natural numbers are defined recursively. Relations can also be defined in the style of programs in Chapter 2. An example is Program 3.7 determining the minimum of two numbers via the relation minimum (Nl , N2 , Mm).

Composing a program to determine the remainder after integer divi- sion reveals an interesting phenomenondifferent mathematical defini- tions of the same concept are translated into different logic programs. Programs 3 .8a and 3 .8b give two definitions of the relation mod (X ,Y, Z), which is true if Z is the value of X modulo Y, or in other words, Z is the re- mainder of X divided by Y. The programs assume a relation < as specified in Exercise (i) at the end of this section.

Program 3.8a illustrates the direct translation of a mathematical defi- nition, which is a logical statement, into a logic program. The program corresponds to an existential definition of the integer remainder: "Z is the value of X mod Y if Z is strictly less than Y, and there exists a num- ber Q such that X = Q . Y + Z. In general, mathematical definitions are easily translated to logic programs.

We can relate Program 3.8a to constructive mathematics. Although seemingly an existential definition, it is also constructive, because of the constructive nature of <, plus, and times. The number Q, for example, proposed in the definition will be explicitly computed by times iii any use of mod.

53 Recursive Programming

mod(X,}Z) - Z is the remainder of the integer division of X by Y.

mod(X,Y,Z) - Z < Y, times(Y,Q,QY), plus(QY,Z,X).

Program 3.8a A nonrecursive definition of modulus

mod(X,Y,Z) - Z is the remainder of the integer division of X by Y.

mod(X,Y,X) .- X < Y. mod(X,Y,Z) plus(X1,Y,X), mod(X1,Y,Z).

Program 3.8b A recursive definition of modulus

In contrast to Program 3.8a, Program 3.8b is defined recursively. It con- stitutes an algorithm for finding the integer remainder based on repeated subtraction. The first rule says that X mod Y is X if X is strictly less than Y. The second rule says that the value of X mod Y is the same as X - Y mod Y. The effect of any computation to determine the modulus is to re- peatedly subtract Y from X until it becomes less than Y and hence is the correct value.

The mathematical function X mod Y is not defined when Y is zero. Nei- ther Program 3.8a nor Program 3.8b has goal mod(X,O,Z) in its meaning for any values of X or Z. The test of < guarantees that.

The computational model gives a way of distinguishing between the two programs for mod. Given a particular X, Y, and Z satisfying mod, we can compare the sizes of their proof trees. In general, proof trees produced with Program 3.8b will be smaller than those produced with Program 3.8a. In that sense Program 3.8b is more efficient. We defer more rigorous discussions of efficiency till the discussions on lists, where the insights gained will carry over to Prolog programs.

Another example of translating a mathematical definition directly into a logic program is writing a program that defines Ackermann's function. Ackermann's function is the simplest example of a recursive function that is not primitive recursive. It is a function of two arguments, defined by three cases:

ackermann(O,N) = N + 1.

ackermann(M, O) = ackerrnann(M - 1, 1).

ackermann(M,N) = ackermann(M - 1,ackermann(M,N - 1)).

54 Chapter 3

ackermarìn(X,Y,A) A is the value of Ackermann's function for the natural numbers X and Y.

ackermarm (O ,N , s (N) )

ackermann(s(M) ,O,Val) - ackermann(M,s(0) ,Val).

ackermann(s(M) ,s(N) ,Val)

ackermann(s(M),N,Vall), ackermann(M,Vall,Val).

Program 3.9 Ackermann's function

gcd(X,Y,Z) - Z is the greatest common divisor of the natural numbers X and Y.

gcd(X,Y,Gcd) - mod(X,Y,Z), gcd(Y,Z,Gcd).

gcd(X,O,X) X > O.

Program 3.10 The Euclidean algorithm

Program 3.9 is a translation of the functional definition into a logic pro- gram. The predicate ackermann(M,N,A) denotes that A=ackermann(M,N). The third rule involves two calls to Ackermann's function, one to com- pute the value of the second argument.

The functional definition of Ackermann's function is clearer than the relational one given in Program 3.9. In general, functional notation is more readable for pure functional definitions, such as Ackermann's function and the factorial function (Program 3.6). Expressing constraints can also be awkward with relational logic programs. For example, Pro- gram 3.8a says less directly that X = Q . Y + Z.

The final example in this section is the Euclidean algorithm for finding the greatest common divisor of two natural numbers, recast as a logic program. Like Program 3.8b, it is a recursive program not based on the recursive structure of numbers. The relation scheme is gcd(X,Y,Z), with intended meaning that Z is the greatest common divisor (or gcd) of two natural numbers X and Y. It uses either of the two programs, 3.8a or 3.8b, for mod.

The first rule in Program 3.10 is the logical essence of the Euclidean algorithm. The gcd of X and Y is the same as the gcd of Y and X mod Y. A proof that Program 3.10 is correct depends on the correctness

55 Recursive Programming

of the above mathematical statement about greatest common divisors. The proof that the Euclidean algorithm is correct similarly rests on this result.

The second fact in Program 3.10 is the base fact. It must be specified that Xis greater than Oto preclude gcd(O,O,O) from being in the mean- ing. The gcd of O and O is not well defined.

3.1.1 Exercises for Section 3.1

Modify Program 3.2 for < to axiomatize the relations <, >, and . Discuss multiple uses of these programs.

Prove that Program 3.2 is a correct and complete axiomatization of <

Prove that a proof tree for the query s'(0) < tm(Q) using Pro- gram 3.2 has m + 2 nodes.

Define predicates even(X) and odd(X) for determining if a natural number is even or odd. (Hint: Modify Program 3.1 for natural_ nuniber.)

(y) Write a logic program defining the relation f ib(N,F) to determine the Nth Fibonacci number F.

The predicate times can be used for computing exact quotients with queries such as times(s(s(0)),X,s(s(s(s(0)))))? to find

the result of 4 divided by 2. The query times(s(s(0)),X,s(s(s (0))))? to find 3/2 has no solution. Many applications require the use of integer division that would calculate 3/2 to be 1. Write a program to compute integer quotients. (Hint: Use repeated subtrac- tion.)

Modify Program 3.10 for finding the gcd of two integers so that it performs repeated subtraction directly rather than use the mod function. (Hint: The program repeatedly subtracts the smaller num- ber from the larger number until the two numbers are equal.)

Rewrite the logic programs in Section 3.1 using a different represen- tation of natural numbers, namely as a sum of l's. For example, the modified version of Program 3.1 would be

56 Chapter 3

natural_number (1).

natural_number(1+X) - natural_number(X).

Note that + is used as a binary operator, and O is not defined to be a natural number.

3.2 Lists

The basic structure for arithmetic is the unary successor functor. Al- though complicated recursive functions such as Ackermann's function can be defined, the use of a unary recursive structure is limited. This sec- tion discusses the binary structure, the list.

The first argument of a list holds an element, and the second argument is recursively the rest of the list. Lists are sufficient for most computa- tions - attested to by the success of the programming language Lisp, which has lists as its basic compound data structure. Arbitrarily complex structures can be represented with lists, though it is more convenient to use different structures when appropriate.

For lists, as for numbers, a constant symbol is necessary to terminate recursion. This "empty list," referred to as nil, will be denoted here by the symbol [1. We also need a functor of arity 2. Historically, the usual functor for lists is "" (pronounced dot), which overloads the use of the period. It is convenient to define a separate, special syntax. The term

(X,Y) is denoted [XJY]. Its components have special names: X is called the head and Y is called the tail.

The term [XIY] corresponds to a cons pair in Lisp. The corresponding words for head and tail are, respectively, car and cdr.

Figure 3.2 illustrates the relation between lists written with different syntaxes. The first colunm writes lists with the dot functor, and is the way lists are considered as terms in logic programs. The second colunm gives the square bracket equivalent of the dot syntax. The third column is an improvement upon the syntax of the second colunm, essentially hiding the recursive structure of lists. In this syntax, lists are written as a sequence of elements enclosed in square brackets and separated by commas. The empty list used to terminate the recursive structure is suppressed. Note the use of "cons pair notation" in the third column when the list has a variable tail.

57 Recursive Programming

Formal object Cons pair syntax Element syntax

(a,] 1) Fai F]] Fa]

.(a,(b,F J)) [aIFbIF J]] Fa,b]

.(a,.(b,.(c,F 1))) ]aiFbI]c I] J]]] Fa,b,c]

(aX) Faix] Faix]

Fai]biXl] Fa,biX]

Figure 3.2 Equivalent forms of lists

!ist(Xs) Xs is a list.

list([ ]). list([XiXs]) - list(Xs).

Program 3.11 Defining a list

Terms built with the dot functor are more general than lists. Program 3.11 defines a list precisely. Declaratively it reads: "A list is either the empty list or a cons pair whose tail is a list." The program is analogous to Program 3.1 defining natural numbers, and is the simple type definition of lists.

Figure 3.3 gives a proof tree for the goal list ([a, b, cl). Implicit in the proof tree are ground instances of rules in Program 3.11, for example, list( [a,b , c]) - list( [b, c]). We specify the particular instance here explicitly, as instances of lists in cons pair notation can be confusing. [a, b, c] is an instance of [XIXs] under the substitution {X=a, Xs= [b, c]).

Because lists are richer data structures than numbers, a great variety of interesting relations can be specified with them. Perhaps the most basic operation with lists is determining whether a particular element is in a list. The predicate expressing this relation is member (Element,List). Program 3.12 is a recursive definition of member/2.

Declaratively, the reading of Program 3.12 is straightforward. X is an element of a list if it is the head of the list by the first clause, or if it is a member of the tail of the list by the second clause. The meaning of the program is the set of all ground instances member (X,Xs), where

58 Chapter 3

Figure 3.3 Proof tree verifying a list

member(Element,List) - Element is an element of the list List.

member(X, [XIXs]). member(X,[YIYs]) - member(X,Ys).

Program 3.12 Membership of a list

X is an element of Xs. We omit the type condition in the first clause. Alternatively, it would be written

rnember(X,[XIXs]) - list(Xs).

This program has many interesting applications, to be revealed throughout the book. Its basic uses are checking whether an element is in a list with a query such as member(b, [a,b,c])?, finding an ele- ment of a list with a query such as member (X, [a, b, c])?, and finding a list containing an element with a query such as member(b,X)?. This last query may seem strange, but there are programs that are based on this use of member.

We use the following conventions wherever possible when naming vari- ables in programs involving lists. If X is used to denote the head of a list, then Xs will denote its tail. More generally, plural variable names will denote lists of elements, and singular names will denote individual ele- ments. Numerical suffixes will denote variants of lists. Relation schemes will still contain nmemonic names.

59 Recursive Programming

prefix(Prefix,List) - Prefix is a prefix of List.

prefix([ ],Ys).

prefix([XIXs],[XJYs]) - prefix(Xs,Ys).

suffix(Suffix,List) Suffix is a suffix of List.

suffix(Xs,Xs)

suffix(Xs,[YIYs]) - suffix(Xs,Ys)

Program 3.13 Prefixes and suffixes of a list

Our next example is a predicate sublist(Sub,List) for determining whether Sub is a sublist of List. A sublist needs the elements to be consecutive: [b, c] is a sublist of [a, b, c , d], whereas [a, c] is not.

It is convenient to define two special cases of sublists to make the defi- nition of sublist easier. lt is good style when composing logic programs to define meaningful relations as auxiliary predicates. The two cases con- sidered are initial sublists, or prefixes, of a list, and terminal sublists, or suffixes, of a list. The programs are interesting in their own right.

The predicate prefix(Prefix,List) is true if Prefix is an initial sub- list of List, for example, prefix([a,b] , [a,b,c]) is true. The compan- ion predicate to prefix is suffix(Suffix,List), determining if Suffix is a terminal sublist of List. For example, suffix( [b, c] , [a,b, ci) is true. Both predicates are defined in Program 3.13. A type condition ex- pressing that the variables in the base facts are lists should be added to the base fact in each predicate to give the correct meaning.

An arbitrary sublist can be specified in terms of prefixes and suffixes: namely, as a suffix of a prefix, or as a prefix of a suffix. Program 3.14a expresses the logical rule that Xs is a sublist of Ys if there exists Ps such that Ps is a prefix of Ys and Xs is a suffix of Ps. Program 3.14b is the dual definition of a sublist as a prefix of a suffix.

The predicate prefix can also be used as the basis of a recursive definition of sublist. Thìs is given as Program 3.14c. The base rule reads that a prefix of a list is a sublist of a list. The recursive rule reads that the sublist of a tail of a list is a sublist of the list itself.

The predicate member can be viewed as a special case of sublist de- fined by the rule

member(X,Xs) - sublist([X],Xs).

60 Chapter 3

sublist (Sub,List) - Sub is a sublist of List.

Suffix of a prefix

sublist(Xs,Ys) - prefix(Ps,Ys), suffix(Xs,Ps).

Prefixofasuffix sublist(Xs,Ys) - prefix(Xs,Ss), suffix(Ss,Ys).

C: Recursive definition of a sublist sublist(Xs,Ys) - prefix(Xs,Ys).

sublist(Xs, [YIYs]) - sublist(Xs,Ys).

Prefix of a suffix, using append

sublist (Xs , AsXsBs) -

append(As,XsBs,AsXsBs), append(Xs,Bs,XsBs).

Suffix of a prefix, using append

sublist (Xs , AsXsBs) -

append(AsXs,Bs,AsXsBs), append(As,Xs,AsXs).

Program 3.14 Determining sublists of lists

append (Xs, Ys,XsYs) - XsYs is the result of concatenating the lists Xs and Ys.

append([ ],Ys,Ys).

append([XIXs],Ys,[XIZs]) - append(Xs,Ys,Zs).

Program 3.15 Appending two lists

The basic operation with lists is concatenating two lists to give a third list. This defines a relation, append(Xs,Ys,Zs), between two lists Xs, Ys and the result Zs of joining them together. The code for append, Pro- gram 3.15, is identical in structure to the basic program for combining two numbers, Program 3.3 for plus.

Figure 3.4 gives a proof tree for the goal append([a,b] , [c,d] , [a,b, c , d]). The tree structure suggests that its size is linear in the size of the first list. In general, if Xs is a list of n elements, the proof tree for append(Xs,Ys,Zs) has n + 1 nodes.

There are multiple uses for append similar to the multiple uses for plus. The basic use is to concatenate two lists by posing a query such

61 Recursive Programming

[a,b],[c,d

r cpend([b],[c,d],[b,c,d]

ppend([ ],[c,d],[c,d])

Ftgure 3.4 Proof tree for appending two lists

as append([a,b,c] , [d,e] ,Xs)? with answer Xs=[a,b,c,d,e]. A query such as append(Xs,[c,d],[a,b,c,d])? finds the difference Xs=[a,b] between the lists [c,d] and [a,b,c,d]. Unlike plus, append is not sym- metric in its first two arguments, and thus there are two distinct versions of finding the difference between two lists.

The analogous process to partitioning a number is splitting a list. The query append(As,Bs, [a,b,c,d])?, for example, asks for lists As andBs such that appending Bs to As gives the list [a,b,c,d]. Queries about splitting lists are made more interesting by partially specifying the na- ture of the split lists. The predicates member, sublist, prefix, and suf- fix, introduced previously, can all be defined in terms of append by viewing the process as splitting a list.

The most straightforward definitions are for prefix and suffix, which just specify which of the two split pieces are of interest:

prefix(Xs,Ys) - append(Xs,As,Ys).

suffix(Xs,Ys) - append(As,Xs,Ys).

Sublist can be written using two append goals. There are two distinct variants, given as Programs 3.14d and 314e. These two programs are obtained from Programs 3.14a and 3.14b, respectively, where prefix and suffix are replaced by append goals. Member can be defined using append, as follows:

member(X,Ys) - append(As, [XIXs] ,Ys).

This says that X is a member of Ys if Ys can be split into two lists where X is the head of the second list.

62 Chapter 3

reverse(List, Tsil) - Tsil is the result of reversing the list List.

Naive reverse

reverse(f 1,1 1). reverse([XIXs] ,Zs) reverse(Xs,Ys), append(Ys, [X] ,Zs).

Reverse-accumulate

reverse(Xs,Ys) - reverse(Xs,[ ],Ys).

reverse([XIXs],Acc,Ys) - reverse(Xs,[XIAcc],Ys).

reverse([ ],Ys,Ys).

Program 3.16 Reversing a list

A similar rule can be written to express the relation adj acent (X, Y, Zs) that two elements X and Y are adjacent in a list Zs:

adjacent(X,Y,Zs) - append(As, [X,YIYs] ,Zs).

Another relation easily expressed through append is determining the last element of a list. The desired pattern of the second argument to append, a list with one element, is built into the rule:

last(X,Xs) append(As,[X],Xs).

Repeated applications of append can be used to define a predicate reverse (Li st ,Tsil). The intended meaning of reverse is that Tsil is a list containing the elements in the list List in reverse order to how they appear m List. An example of a goal in the meaning of the program is reverse ([a, b, c] , [c , b, a]). The naive version, given as Program 3.1 6a, is the logical equivalent of the recursive formulation in any language: recursively reverse the tail of the list, and then add the first element at the back of the reversed tail.

There is an alternative way of defining reverse without calling append directly. We define an auxiliary predicate reverse (Xs,Ys,Zs), which is true if Zs is the result of appending Ys to the elements of Xs reversed. It is defined in Program 3.16b. The predicate reverse/3 is related to reverse/2 by the first clause in Program 3.16b.

Program 3.16b is more efficient than Program 3.16a. Consider Fig- ure 3.5, showing proof trees for the goal reverse( [a,b, cl , [c ,b , a]) us- ing both programs. In general, the size of the proof tree of Program 3.16a

63 Recursive Programming

reverse([b,c],[c,b])

reverse([ LE 1) append([ ],[c],[c]) append([ 1,[b],[b])

append([c,b],[a],[c,b,a])

append([b],[a] [ba])

append([ ],[a],[a])

cZTIerse([a,b,c],[c,b,aj

crse([a,b,c],[ ],[c,b,i

erse([c],[b,a],[c,b,a])

reverse([ ],[c,b,a],[c,b,a]

Figure 3.5 Proof trees for reversing a list

64 Chapter 3

!ength(Xs,N) - The list Xs has N elements.

length([ 1,0). length([XIXs],s(N)) - length(Xs,N).

Program 3.17 Determining the length of a list

is quadratic in the number of elements in the list to be reversed, while that of Program 3.16b is linear.

The insight in Program 3.16b is the use of a better data structure for representing the sequence of elements, which we discuss in more detail in Chapters 7 and 15.

The final program in this section, Program 3.17, expresses a rela- tion between numbers and lists, using the recursive structure of each. The predicate length(Xs,N) is true if Xs is a list of length N, that is, contains N elements, where N is a natural number. For example, length([a,b] ,s(s(0))), indicating that [a,b] has two elements, is in the program's meaning.

Let us consider the multiple uses of Program 3.17. The query length ([a,b] ,X)? computes the length, 2, of a list [a,b]. In this way, length is regarded as a function of a list, with the functional definition

length([ 1) = O length([XIXs]) = s(length(Xs)).

The querylength([a,b] ,s(s(0)))? checks whetherthelist [a,b] has length 2. The query length(Xs,s(s(0)))? generates a list of length 2 with variables for elements.

3.2.1 Exercises for Section 3.2

(i) A variant of Program 3.14 for sublist is defined by the following three rules:

subsequence([XIXs] , [XIYs]) - subsequence(Xs,Ys). subsequence(Xs, [Y lYs]) - subsequence(Xs,Ys). subsequence([ ] ,Ys).

Explain why this program has a different meaning from Pro- gram 3.14.

65 Recursive Programming

Write recursive programs for adjacent and last that have the same meaning as the predicates defined in the text in terms of append.

Write a program for double (List,ListList), where every element [ri List appears twice in ListList, e.g., double([1,2,3] [1,1,2, 2,3,3]) is true.

Compute the size of the proof tree as a function of the size of the input list for Programs 3.16a and 3.1Gb defining reverse.

(y) Define the relation suin(Listoflntegers,Sum), which holds if Sum is the sum of the ListOf Integers,

Using plus/3;

Without using any auxiliary predicate.

(Hint: Three axioms are enough.)

3.3 Composing Recursive Programs

No explanation has been given so far about how the example logic pro- grams have been composed. The composition of logic programs is a skill that can be learned by apprenticeship or osmosis, and most definitely by practice. For simple relations, the best axiomatizations have an aesthetic elegance that look obviously correct when written down. Through solv- ing the exercises, the reader may find, however, that there is a difference between recognizing and constructing elegant logic programs

This section gives more example programs involving lists. Their pre- sentation, however, places more emphasis on how the programs might be composed. Two principles are illustrated: how to blend procedural and declarative thinking, and how to develop a program top-down.

We have shown the dual reading of clauses: declarative and procedural. How do they interrelate when composing logic programs? Pragmatically, one thinks procedurally when programming. However, one thinks declar- atively when considering issues of truth and meaning. One way to blend them in logic programming is to compose procedurally and then niter- pret the result as a declarative statement. Construct a program with a

66 Chapter 3

given use in mind; then consider if the alternative uses make declarative sense. We apply this to a program for deleting elements from a list.

The first, and most important, step is to specify the intended meaning of the relation. Clearly, three arguments are involved when deleting ele- ments from a list: an element X to be deleted, a list Li that might have occurrences of X, and a list L2 with all occurrences of X deleted. An ap- propriate relation scheme is delete (Li , X, L2). The natural meaning is all ground instances where L2 is the list Li with all occurrences of X re- moved.

When composing the program, it is easiest to think of one specific use. Consider the query delete([a,b,c,b] ,b,X)?, a typical example of finding the result of deleting an element from a list. The answer here is X= [a, c]. The program will be recursive on the first argument. Let's don our procedural thinking caps.

We begin with the recursive part. The usual form of the recursive ar- gument for lists is [XXs]. There are two possibilities to consider, one where X is the element to be deleted, and one where it is not. In the first case, the result of recursively deleting X from Xs is the desired answer to the query. The appropriate rule is

delete([XIXs],X,Ys) - delete(Xs,X,Ys). Switching hats, the declarative reading of this rule is: "The deletion of

X from [XXs] is Ys if the deletion of X from Xs is Ys." The condition that the head of the list and the element to be deleted are the same is specified by the shared variable in the head of the rule.

The second case where the element to be deleted is different from X, the head of the list, is similar. The result required is a list whose head is X and whose tail is the result of recursively deleting the element. The rule is

delete([XIXs],Z,[XIYs]) - X Z, delete(Xs,Z,Ys).

The rule's declarative reading is: "The deletion of Z from [XIXs] is [XYs] if Z is different from X and the deletion of Z from Xs is Ys." In contrast to the previous rule, the condition that the head of the list and the element to be deleted are different is made explicit in the body of the rule.

The base case is straightforward. No elements can be deleted from the empty list, and the required result is also the empty list. This gives the

67 Recursive Programming

delete(List,X,HasNoXs) - The list HasNoXs is the result of removing all occurrences of X from the list List.

delete([XIXs],X,Ys) - delete(Xs,X,Ys).

delete([XIXs],Z,[XIYs]) XZ, delete(Xs,Z,Ys).

delete([ ] ,X, E 1).

Program 3.18 Deleting all occurrences of an element from a list

select (X,HasXs,OneLessXs) - The list OneLessXs is the result of removing one occurrence of X from the list HasXs.

select (X, [XIXs] ,Xs)

select(X,[YIYs],[YIZs]) select(X,Ys,Zs).

Program 3.19 Selecting an element from a list

fact delete([ ] ,X, E J). The complete program is collected together as Program 3.18.

Let us review the program we have written, and consider alternative formulations. Omitting the condition XZ from the second rule in Pro- gram 3.18 gives a variant of delete. This variant has a less natural mean- ing, since any number of occurrences of an element may be deleted. For example, delete([a,b,c,b],b,[a,c]), delete([a,b,c,b],b,[a,c, b]), delete([a,b,c,b],b,[a,b,c]), and delete([a,b,c,b],b,[a,b, c , b]) are all in the meaning of the variant.

Both Program 3.18 and the variant include in their meaning instances where the element to be deleted does not appear in either list, for ex- ample, delete([a] ,b, [a]) is true. There are applications where this is not desired. Program 3.19 defines select(X,L1,L2), a relation that has a different approach to elements not appearing in the list. The meaning of select (X,L1,L2) is all ground instances where L2 is the list Li where exactly one occurrence of X has been removed. The declarative reading of Program 3.19 is: "X is selected from [XXs] to give Xs; or X is selected from [YYs] to give [YZs] if X is selected from Ys to give Zs,"

A major thrust in programming has been the emphasis on a top-down design methodology, together with stepwise refinement. Loosely, the

68 Chapter 3

methodology is to state the general problem, break it down into subprob- lems, and then solve the pieces. A top-down programming style is one natural way for composing logic programs. Our description of programs throughout the book will be mostly top-down. The rest of this section de- scribes the composition of two programs for sorting a list: permutation sort and quicksort. Their top-down development is stressed.

A logical specification of sorting a list is finding an ordered permuta- tion of a list. This can be written down immediately as a logic program. The basic relation scheme is sort (Xs,Ys), where Ys is a list containing the elements in Xs sorted in ascending order:

sort(Xs,Ys) - permutation(Xs,Ys), ordered(Ys).

The top-level goal of sorting has been decomposed. We must now define permutation and ordered.

Testing whether a list is ordered ascendingly can be expressed in the two clauses that follow. The fact says that a list with a single element is necessarily ordered. The rule says that a list is ordered if the first element is less than or equal to the second, and if the rest of the list, beginning from the second element, is ordered:

ordered([X]).

ordered([X,YIYs]) - X Y, ordered([YJYs]).

A program for permutation is more delicate. One view of the process of permuting a list is selecting an element nondeterrniriistically to be the first element of the permuted list, then recursively permuting the rest of the list. We translate this view into a logic program for permutation, using Program 3.19 for select. The base fact says that the empty list is its own unique permutation:

permutation(Xs, [ZIZs]) - select(Z,Xs,Ys), permutation(Ys,Zs).

permutation([ ],[ 1).

Another procedural view of generating permutations of lists is recur- sively permuting the tail of the list and inserting the head in an arbitrary position. This view also can be encoded immediately. The base part is identical to the previous version:

permutation([XIXsI ,Zs) - permutation(Xs,Ys), insert(X,Ys,Zs).

permutation([ ],[ ]).

69 Recursive Programming

sort(Xs,Ys) - The list Ys is an ordered permutation of the list Xs.

sort(Xs,Ys) - permutation(Xs,Ys), ordered(Ys).

permutation(Xs, [ZIZs]) - select(Z,Xs,Ys), permutation(Ys,Zs).

perrnutation([ ],[ 1). ordered([ ]).

ordered([X])

ordered([X,YIYs]) X Y, ordered([YIYs]).

Program 3.20 Permutation sort

The predicate insert can be defined in terms of Program 3.19 for se- lect:

insert(X,Ys,Zs) '- select(X,Zs,Ys).

Both procedural versions of permutation have clear declarative read- ings.

The "naive" sorting program, which we call permutation sort, is col- lected together as Program 3.20. It is an example of the generate-and-test paradigm, discussed fully in Chapter 14. Note the addition of the extra base case for ordered so that the program behaves correctly for empty lists.

The problem of sorting lists is well studied. Permutation sort is not a good method for sorting lists in practice. Much better algorithms come from applying a "divide and conquer" strategy to the task of sorting. The insight is to sort a list by dividing it into two pieces, recursively sorting the pieces, and then joining the two pieces together to give the sorted list. The methods for dividing and joining the lists must be specified. There are two extreme positions. The first is to make the dividing hard, and the joining easy. This approach is taken by the quicksort algorithm The second position is making the joining hard, but the dividing easy. This is the approach of merge sort, which is posed as Exercise (y) at the end of this section, and insertion sort, shown in Program 3.21.

In insertion sort, one element (typically the first) is removed from the list. The rest of the list is sorted recursively; then the element is inserted, preserving the orderedness of the list.

The insight in quicksort is to divide the list by choosing an arbitrary element in it, and then to split the list into the elements smaller than the

70 Chapter 3

sort(Xs,Ys) - The list Ys is an ordered permutation of the list Xs.

sort([XIXs],Ys) - sort(Xs,Zs), insert(X,Zs,Ys).

sort([ ],[ ]).

insert(X,[ ],[X]).

insert(X,[YIYs],[YIZs]) - X> Y, insert(X,Ys,Zs).

insert(X, [YIYs] , [X,YIYs]) - X y.

Program 3.21 Insertion sort

quicksort (Xs, Ys) - The list Ys is an ordered permutation of the list Xs.

quicksort([XIXs] ,Ys) -

partition(Xs,X,Littles,Bigs),

quicksort(Littles,Ls),

quicksort(Bigs,Bs),

append(Ls, [X lBs] ,Ys).

quicksort([ ],[ 1). partition([XIXs] ,Y, [XILs] ,Bs) X Y, partition(Xs,Y,Ls,Bs).

partition([XIXs] ,Y,Ls, [XIBs]) - X > Y, partition(Xs,Y,Ls,Bs).

partition( E ] ,Y, E J , E i)

Program 3.22 Quicksort

chosen element and the elements larger than the chosen element. The sorted list is composed of the smaller elements, followed by the chosen element, and then the larger elements. The program we describe chooses the first element of the list as the basis of partition.

Program 3.22 defines the quicksort algorithm. The recursive rule for quicksort reads: "Ys is a sorted version of [XIXs] if Littles and Bigs are a result of partitioning Xs according to X; Ls and Bs are the result of sorting Littles and Bigs recursively; and Ys is the result of appending [XBs] to Ls."

Partitioning a list is straightforward, and is similar to the program for deleting elements. There are two cases to consider: when the current head of the list is smaller than the element being used for the parti- tioning, and when the head is larger than the partitioning element. The declarative reading of the first partition clause is: "Partitioning a list whose head is X and whose tail is Xs according to an element Y gives the

71 Recursive Programming

lists [XLitt1es] and Bigs if X is less than or equal to Y, and partitioning Xs according to Y gives the lists Littles and Bigs." The second clause for partition has a similar reading. The base case is that the empty list is partitioned into two empty lists.

3.3.1 Exercises for Section 3.3

Write a program for substitute(X,Y,Li,L2), where L2 is the result of substituting Y for all occurrences of X in Li, e.g., sub- stitute(a,x,[a,b,a,c],[x,b,x,c]) is true, whereas substi- tute (a, x, [a,b, a, cl , [a,b ,x, c] ) is false.

What is the meaning of the variant of select:

select (X, [XIXs] ,Xs). select(X, [YIYs] , [YIZs]) '- X Y, select(X,Ys,Zs).

Write a program for no_doubles(L1 ,L2), where L2 is the result of removing all duplicate elements from Li, e.g., no_doubles ([a, b, c, b], [a,c,b]) is true. (Hint: Use member.)

Write programs for evenpermutation(Xs ,Ys) and oddpermuta- tion(Xs,Ys) that find Ys, the even and odd permutations, respec- tively, of a list Xs. For example, even_permutation([i »2,3] » [2,3, i]) andoddpeimutation([i,2,3],[2,i,3]) aretrue.

(y) Write a program for merge sort.

(vi) Write a logic program for kthlargest (Xs , K) that implements the linear algorithm for finding the kth largest element K of a list Xs. The algorithm has the following steps:

Break the list into groups of five elements. Efficiently find the median of each of the groups, which can be done with a fixed number of comparisons. Recursively find the median of the medians. Partition the original list with respect to the median of medians. Recursively find the kth largest element in the appropriate smaller list.

72 Chapter 3

(vii) Write a program for the relation better_poker_hand(Handl, Hand2,Hand) that succeeds if Hand is the better poker hand be- tween Handi and Hand2. For those unfamiliar with this card game, here are some rules of poker necessary for answering this exercise:

The order of cards is 2, 3, 4, 5, 6, 7, 8, 9, 10, jack, queen, king, ace.

Each hand consists of five cards.

The rank of hands in ascending order is no pairs < one pair < two pairs < three of a kind < flush < straight < full house < four of a kind < straight flush.

Where two cards have the same rank, the higher denomination wins, for example, a pair of kings beats a pair of 7's.

(Hints: (1) Represent a poker hand by a list of terms of the form card(Suit,Value). For example a hand consisting of the 2 of clubs, the 5 of spades, the queen of hearts, the queen of dia- monds, and the 7 of spades would be represented by the list [card (clubs,2),card(spades,5),card(hearts,queen),card(diamonds,

queen),card(spades,7)1. (2)It maybe helpful to define relations such as has_f lush (Hand), which is true if all the cards in Hand are of the same suit; has_full_house (Hand), which is true if Hand has three cards with the same value but in different suits, and the other two cards have the same different value; and has_straight (Hand), which is true if Hand has cards with consecutive values. (3) The number of cases to consider is reduced if the hand is first sorted.)

3.4 Binary Trees

We next consider binary trees, another recursive data type. These struc- tures have an important place in many algorithms.

Binary trees are represented by the ternary functor tree (Element, Leí t,Right), where Element is the element at the node, and Left and Right are the left and right subtrees respectively. The empty tree is represented by the atom void. For example, the tree

73 Recursive Programming

b C

would be represented as

tree(a,tree(b,void,void),tree(c,void,void)).

Logic programs manipulating binary trees are similar to those manip- ulating lists. As with natural numbers and lists, we start with the type definition of binary trees. lt is given as Program 3.23. Note that the pro- gram is doubly recursive; that is, there are two goals in the body of the recursive rule with the same predicate as the head of the rule. This re- suits from the doubly recursive nature of binary trees and will be seen also in the rest of the programs of this section.

Let us write some tree-processing programs. Our first example tests whether an element appears in a tree. The relation scheme is tree_ member(Element,Tree). The relation is true if Element is one of the nodes in the tree. Program 3.24 contains the definition. The declarative reading of the program is: "X is a member of a tree if it is the element at the node (by the fact) or if it is a member of the left or right subtree (by the two recursive rules)."

The two branches of a binary tree are distinguishable, but for many ap- plications the distinction is not relevant. Consequently, a useful concept

binary_tree( Tree) - Tree is a binary tree.

binary_tree (void).

binary_tree (tree(Element ,Lef t ,Right)) -

binary_tree(Left), binarytree(Right).

Program 3.23 Defining binary trees

tree_member(Element,Tree) - Element is an element of the binary tree Tree.

tree_momber(X,tree(X,Left,Right)).

tree_member(X,tree(Y,Left ,Right)) - tree_rnember(X,Left).

tree_membor(X,tree(Y,Left ,Right)) '- tree_member(X ,Rïght).

Program 3.24 Testing tree membership

74 Chapter 3

a b

Figure 3.6 Comparing trees for isomorphism

isotree(Treel,Tree2) - Tree I and Tree2 are isomorphic binary trees.

isotree(void,void).

isotree(tree(X,Leftl,Rightl),tree(X,Left2,Right2))

isotree(Leftl,Left2), isotree(Rightl,Right2)

isotree(tree(X,Leftl,Rightl),tree(X,Left2,Right2))

isotree(Leftl ,Right2), isotree(Rightl ,Left2).

Program 3.25 Determining when trees are isomorphic

is isomorphism, which defines when unordered trees are essentially the same. Two binary trees Ti and T2 are isomorphic if T2 can be obtained by reordering the branches of the subtrees of Ti. Figure 3.6 shows three simple binary trees. The first two are isomorphic; the first and third are not.

Isomorphism is an equivalence relation with a simple recursive defini- tion. Two empty trees are isomorphic. Otherwise, two trees are isomor- phic if they have identical elements at the node and either both the left subtrees and the right subtrees are isomorphic; or the left subtree of one is isomorphic with the right subtree of the other and the two other sub- trees are isomorphic.

Program 3.25 defines a predicate isotree(Treel,Tree2), which is true if Tree i and Tree2 are isomorphic. The predicate is symmetric in its arguments.

Programs related to binary trees involve double recursion, one for each branch of the tree. The double recursion can be manifest in two ways. Programs can have two separate cases to consider, as in Program 3.24 for tree_member. In contrast, Program 3.12 testing membership of a list has only one recursive case. Alternatively, the body of the recursive clause has two recursive calls, as in each of the recursive rules for isotree in Program 3.25.

75 Recursive Programming

substitute (X, Y,TreeX,TreeY) - The binary tree Tree Y is the result of replacing all occurrences of X in the binary tree TreeX by Y.

substïtute(X,Y,void,void)

substitute(X,Y,tree(Node,Left,Right),tree(Nodel,Leftl,Rightl))

replace (X,Y,Node,Nodel)

substitute (X,Y,Left,Leftl)

substitute(X,Y,Rigbt,Rightl).

replace(X,Y,X,Y).

replace(X,Y,Z,Z) - X Z.

Program 3.26 Substituting for a term in a tree

The task in Exercise 3.3(i) is to write a program for substituting for el- ements in lists. An analogous program can be written for substituting elements in binary trees. The predicate substitute(X,Y,OldTree, NewTree) is true if NewTree is the result of replacing all occurrences of X by Y in OldTree. An axiomatization of substitute/4 is given as Program 3.26.

Many applications involving trees require access to the elements ap- pearing as nodes. Central is the idea of a tree traversal, which is a se- quence of the nodes of the tree in some predefined order. There are three possibilities for the linear order of traversal: preorder, where the value of the node is first, then the nodes in the left subtree, followed by the nodes in the right subtree; morder, where the left nodes come first followed by the node itself and then the right nodes; and postorder, where the node comes after the left arid right subtrees.

A definition of each of the three traversals is given in Program 3.27. The recursive structure is identical; the ordy difference between the pro- grams is the order in which the elements are composed by the various append goals.

The final example in this section shows interesting manipulation of trees. A binary tree satisfies the heap property if the value at each node is at least as large as the value at its children (if they exist). Heaps, a class of binary trees that satisfy the heap property, are a useful data structure and can be used to implement priority queues efficiently.

lt is possible to heapify any binary tree containing values for which an ordering exists. That is, the values in the tree are moved around so that

76 Chapter 3

preorder ( Tree,Pre) - Pre is a preorder traversal of the binary tree Tree.

preorder(tree(X,L,R) ,Xs)

preorder(L,Ls), preorder(R,Rs), append([XJLsJ,Rs,Xs).

preorder(void, E 1). morder ( Tree,In) -

In is an morder traversal of the binary tree Tree. inorder(tree(X,L,R) ,Xs) -

inorder(L,Ls), inorder(R,Rs), append(Ls,EXIRs],Xs).

inorder(void, E 1).

postorder ( Tree,Post) - Post is a postorder traversal of the binary tree Tree.

postorder(tree(X,L,R) ,Xs) -

postorder(L,Ls),

postorder(R,Rs),

append(Rs, [X] ,Rsl),

append(Ls,Rsl,Xs).

postorder(void, E 1).

Program 3.27 Traversals of a binary tree

the shape of the tree is preserved and the heap property is satisfied. An example tree and its heapified equivalent are shown in Figure 3.7.

An algorithm for heapifying the elements of a binary tree so that the heap property is satisfied is easily stated recursively. Heapify the left and right subtrees so that they both satisfy the heap property and then ad- just the element at the root appropriately. Program 3.28 embodies this algorithm. The relation heapify/2 lays out the doubly recursive pro- gram structure, and adjust(X,HeapL,HeapR,Heap) produces the final tree Heap satisfying the heap property from the root value X and the left and right subtrees HeapL and HeapR satisfying the heap property.

There are three cases for adj ust/4 depending on the values. If the root value is larger than the root values of the left and right subtrees, then the heap is tree(X,HeapL,HeapR). This is indicated in the first adjust clause in Program 3.28. The second clause handles the case where the root node in the left heap is larger than the root node and the root of the right heap. In that case, the adjustment proceeds recursively on the left heap. The third clause handles the symmetric case where the root node of the right heap is the largest. The code is simplified by relegating the concern whether the subtree is empty to the predicate greater/2.

77 Recursive Programming

Figure 3.7 A binary tree and a heap that preserves the tree's shape

heapify ( Tree,Heap) The elements of the complete binary tree Tree have been adjusted to form the binary tree Heap, which has the same shape as Tree and satisfies the heap property that the value of each parent node is greater than or equal to the values of its children.

heapify(void,void). heapify(tree(X,L,R) Heap)

heapify(L,HeapL), heapify(R,HeapR), adjust(X,HeapL,HeapR,Heap).

adj ust (X , HeapL , HeapR , tree (X , HeapL , HeapR) greater(X,HeapL), greater(X,HeapR).

adj ust(X,tree(X1,L,R),HeapR,tree(X1,HeapL,HeapR)) X < Xl, greater(X1,HeapR), adjust(X,L,R,HeapL).

adjust(X,HeapL,tree(Xl,L,R),tree(Xl,HeapL,HeapR)) -

X < Xl, greater(Xl,HeapL), adjust(X,L,R,HeapR).

greater(X,void).

greater(X,tree(X1,L,R)) - X Xl.

Program 3.28 Adjusting a binary tree to satisfy the heap property

3.4.1 Exercises for Section 3.4

(i) Define a program for subtree(S ,T), where S is a subtree of T.

(II) Define the relation sum_tree(Treeoflntegers,Sum), which holds if Sum is the sum of the integer elements in TreeOflntegers.

(IIi) Define the relation ordered(Treeof Integers), which holds if Tree is an ordered tree of integers, that is, for each node in the tree the elements in the left subtree are smaller than the element in

78 Chapter 3

the node, and the elements in the right subtree are larger than the element in the node. (Hint: Define two auxiliary relations, ordered_left(X,Tree) and ordered_right(X,Tree), which hold if both Tree is ordered and X is larger (respectively, smaller) than the largest (smallest) node of Tree.)

(iv) Define the relation tree_insert(X,Tree,Treel), which holds if Treel is an ordered tree resulting from inserting X into the ordered tree Tree. If X already occurs in Tree, then Tree and Tree i are iden- tical. (Hint: Four axioms suffice.)

(y) Write a logic program for the relation path(X,Tree,Path), where Path is the path from the root of the tree Tree to X.

3.5 Manipulating Symbolic Expressions

The logic programs illustrated so far in this chapter have manipulated natural numbers, lists, and binary trees. The programming style is ap- plicable more generally. This section gives four examples of recursive programming - a program for defining polynomials, a program for sym- bolic differentiation, a program for solving the Towers of Hanoi problem, and a program for testing the satisfiability of Boolean formulae.

The first example is a program for recognizing polynomials in some term X. Polynomials are defined inductively. X itself is a polynomial in X, as is any constant. Sums, differences, and products of polynomials in X are polynomials in X. So too are polynomials raised to the power of a natural number, and the quotient of a polynomial by a constant.

An example of a polynomial in the term x is x2 - 3x + 2. This follows from its being the sum of the polynomials, x2 - 3x and 2, where x2 - 3x is recognized recursively.

A logic program for recognizing polynomials is obtained by expressing the preceding informal rules in the correct form. Program 3.29 defines the relation polynomial (Expression,X), which is true if Expression is a polynomial in X. We give a declarative reading of two rules from the program.

The fact polynomial(X,X) says that a term X is a polynomial in itself. The rule

79 Recursive Programming

polynomial (Expression,X) - Expression is a polynomial ¡n X.

polynomial (X , X)

polynomial (Term, X) -

constant (Term)

polynomial (Termi +Term2 , X) -

polynomial (Terml,X), polynomial(Torm2,X).

polynomial (TermlTorm2 ,X)

polynomial(Terml,X), polynomial(Term2,X).

polynomial (Terml *Term2 , X)

polynomïal(Terml,X), polynomial(Term2,X).

polynomial (Terml/Term2 , X)

polynomial(Terml,X), constant(Term2).

polynomial(TermtN,X)

natural_number(N), polynomial(Term,X).

Program 3.29 Recognizing polynomials

polynomial(Terml+Terin2,X) - polyrioinial(Termi,X), polynomial(Term2,X).

says that the sum Terml+Term2 is a polynomial in X if both Tenni and Term2 are polynomials in X.

Other conventions used in Program 3.29 are the use of the unary pred- icate constant for recognizing constants, and the binary functor T to denote exponentiation. The term XTY denotes x".

The next example is a program for taking derivatives. The relation scheme is derivative(Expression,X,DifferentiatedExpression). The intended meaning of derivative is that DifferentiatedExpres- sion is the derivative of Expression with respect to X.

As for Program 3.29 for recognizing polynomials, a logic program for differentiation is just a collection of the relevant differentiation rules, written in the correct syntax. For example, the fact

derivative(X,X,s(0)).

expresses that the derivative of X with respect to itself is 1. The fact

derivative(sin(X) ,X,cos(X)).

80 Chapter 3

derivative (Expression,X,DifferentiatedEx pression) DifferentiatedEx pression is the derivative of Expression with respect to X.

derivative(X,X,s(0)).

derivative(XIs(N) ,X,s(N)*XIN)

derivative(sin(X) ,X,cos(X))

derivative(cos(X) ,X,-sin(X)).

derivative(elX,X,eX).

derivative(log(X),X,1/X).

derivat ive (F+G ,X ,DF+DG) -