Database Fundamentals- Content Analysis
Chap1 .pptx
Database Principles: Fundamentals of Design, Implementations and Management
CHAPTER 1
THE DATABASE APPROACH
In this chapter, you will learn:
The difference between data and information
What a database is, what the different types of databases are, and why they are valuable assets for decision making
The importance of database design
How modern databases evolved from file systems
About flaws in file system data management
What the database system’s main components are and how a database system differs from a file system
The main functions of a database management system (DBMS)
The role of Open Source Database Systems
The importance of Data Governance and Data Quality
2
Data vs. Information
Data:
Raw facts; building blocks of information
Unprocessed information
Information:
Data processed to reveal meaning
Accurate, relevant, and timely information is the key to good decision making
Good decision making is the key to survival in a global environment
3
Transforming Raw Data into Information Fig 1.1 p6 Initial survey screen
4
Transforming Raw Data into Information (cont..) Fig 1.1 Information in graphic format
5
Data Quality and Data Governance
Data Quality can be examined at a number of different levels including:
Accuracy: Is the data accurate and come from a verifiable source?
Relevance: Is the data relevant to the organisation?
Completeness: Is the required data being stored?
Timeliness: Is the data updated frequently in order to meet the business requirements?
Uniqueness: Is the data unique and there is no redundancy in the database?
Unambiguous: Is the meaning of the data clear.
6
Data Quality and Data Governance (cont…)
Data governance is the term used to describe a strategy or methodology defined by an organisation to safeguard data quality.
Each organisation produces its own data governance strategy which will involve the development of a series of policies and procedures for managing availability, usability, quality, integrity, and security of data within the organisation.
Introducing the Database and the DBMS
Database—shared, integrated computer structure that stores:
End user data (raw facts)
Metadata (data about data)
DBMS (database management system):
Collection of programs that manages database structure and controls access to data
Possible to share data among multiple applications or users
Makes data management more efficient and effective
The DBMS hides much of the database’s internal complexity from the application programs and users.
The application program might be written by a programmer using a programming language such as COBOL, Visual basic, C++, or Java or it might be created through a DBMS utility program.
8
Role and Advantages of the DBMS (cont.)
A DBMS provides advantages such as :
Improved data sharing. Users have better access to more and better-managed data
Better data integration. Promotes integrated view of organization’s operations
Minimised data inconsistency. Probability of data inconsistency is greatly reduced
Improved data access. Possible to produce quick answers to ad hoc queries
9
Role and Advantages of the DBMS (cont..)
10
Types of Databases
Single-user:
Supports only one user at a time
Desktop:
Single-user database running on a personal computer
Multi-user:
Supports multiple users at the same time
Workgroup:
Multi-user database that supports a small group of users or a single department
Enterprise:
Multi-user database that supports a large group of users or an entire organization
11
Types of Databases (cont..)
Can be classified by location:
Centralized:
Supports data located at a single site
Distributed:
Supports data distributed across several sites
Can be classified by use:
Transactional (or production):
Supports a company’s day-to-day operations
Data warehouse:
Stores data used to generate information required to make tactical or strategic decisions
Often used to store historical data
Structure is quite different
12
Why Database Design is Important
Database design refers to the activities that focus on the design of the database structure that will be used to store and manages end-user data.
Defines the database’s expected use
Different approach needed for different types of databases
Avoid redundant data
Poorly designed database generates errors leads to bad decisions can lead to failure of organization
13
Historical Roots: Files and Data Processing
Managing data with file systems is obsolete
Understanding file system characteristics makes database design easier to understand
Awareness of problems with file systems helps prevent similar problems in DBMS
Knowledge of file systems is helpful if you plan to convert an obsolete file system to a DBMS
14
Historical Roots: Files and Data Processing (cont..)
Manual File systems:
Collection of file folders kept in file cabinet
Organization within folders based on data’s expected use (ideally logically related)
System adequate for small amounts of data with few reporting requirements
Finding and using data in growing collections of file folders became time-consuming and cumbersome
15
Historical Roots: Files and Data Processing (cont..)
Computerised file systems:
Conversion from manual to computer system:
Could be technically complex, requiring hiring of data processing (DP) specialists
Resulted in numerous “home-grown” systems being created
Initially, computer files were similar in design to manual files (see Figure 1.3)
16
Historical Roots: Files and Data Processing cont..)
17
Historical Roots: Files and Data Processing (cont..)
Fig 1.3
18
Historical Roots: Files and Data Processing (cont..)
DP specialist wrote programs for reports:
Monthly summaries of types and amounts of insurance sold by agents
Monthly reports about which customers should be contacted for renewal
Reports that analyzed ratios of insurance types sold by agent
Customer contact letters summarizing coverage
Other departments requested databases be written for them
SALES database created for sales department
AGENT database created for personnel department (see Fig 1.4 next)
19
Historical Roots: Files and Data Processing (cont…)
20
Historical Roots: Files and Data Processing(cont..)
As number of databases increased, small file system evolved
Each file used its own application programs
Each file was owned by individual or department who commissioned its creation
21
Historical Roots: Files and Data Processing (cont)
22
Example of Early Database Design (cont…)
As system grew, demand for DP’s (Data Specialists) programming skills grew
Additional programmers hired
DP specialist evolved into DP manager, supervising a DP department
Primary activity of department (and DP manager) remained programming
23
Problems with File System Data Management
Every task requires extensive programming in a third-generation language (3GL)
Programmer must specify task and how it must be done
Modern databases use fourth-generation languages (4GL)
Allow users to specify what must be done without specifying how it is to be done
24
Problems with File System Data Management
Lengthy development times.
Difficulty in getting quick answers.
Complex System Administration
Lack of security and limited data sharing
Extensive Programming
25
Structural and Data Dependence
Structural dependence (SD)
A file systems exhibits SD; that is, access to a file depends on its structure
Data independence
Changes in the data storage characteristics without affecting the application program’s ability to access the data
The practical significance of data dependence is the difference between the:
Logical data format
How the human being views the data
And the Physical data format
How the computer “sees” the data
26
Field Definitions and Naming Conventions
Flexible record definition anticipates reporting requirements by breaking up fields into their component parts
27
Data Redundancy
Data redundancy results in data inconsistency
Different and conflicting versions of the same data appear in different places
Errors more likely to occur when complex entries are made in several different files and/or recur frequently in one or more files
A Data anomaly develops when required changes in the redundant data are not made successfully
Types of data anomalies:
Update anomalies
Occur when changes must be made to existing records
Insertion anomalies
Occur when entering new records
Deletion anomalies
Occur when deleting records
28
Database Systems
Problems inherent in file systems make using a database system very desirable
File system
Many separate and unrelated files
Database
Logically related data stored in a single logical data repository
29
Database Systems
30
The Database System Environment
Database system is composed of five main parts:
Hardware
Software
Operating system software
DBMS software
Application programs and utility software
People
Procedures
Data
31
The Database System Environment (cont…)
32
DBMS Functions
DBMS performs functions that guarantee integrity and consistency of data
Data dictionary management
defines data elements and their relationships
Data storage management
stores data and related data entry forms, report definitions, etc.
Data transformation and presentation
translates logical requests into commands to physically locate and retrieve the requested data
Security management
enforces user security and data privacy within database
33
DBMS Functions (cont…)
Multiuser access control
uses sophisticated algorithms to ensure multiple users can access the database concurrently without compromising the integrity of the database
Backup and recovery management
provides backup and data recovery procedures
Data integrity management
promotes and enforces integrity rules
Database access languages and application programming interfaces
provide data access through a query language
Database communication interfaces
allow database to accept end-user requests via multiple, different network environments
34
DBMS Functions (continued)
35
DBMS Functions (cont…)
36
Summary
Data are raw facts. Information is the result of processing data to reveal its meaning.
To implement and manage a database, use a DBMS.
Database design defines the database structure.
A well-designed database facilitates data management and generates accurate and valuable information.
A poorly designed database can lead to bad decision making, and bad decision making can lead to the failure of an organization.
Databases were preceded by file systems.
Limitations of file system data management:
requires extensive programming
system administration complex and difficult
making changes to existing structures is difficult
security features are likely to be inadequate
independent files tend to contain redundant data
DBMS’s were developed to address file systems’ inherent weaknesses
37
Types of Databases (cont..)
Open Source
Open Source software is that it is free to acquire and use the product itself.
However, there will be costs involved in the development and on-going support of the software.
LAMP is used to define the most the most popular open source software namely; Linux, Apache Web server, MySQL DBMS, and the Perl PHP/Python development languages.
38