Database Design

INeedHwHelp305
NewAssgnment3.pdf

1

The Need for Normalization

Christina Peacock

ISM 641 Database Design and Management

Professor Xu Ashton

10/09/23

2

The Need for Normalization

It is fundamental to guarantee that the conceptual model adheres to the principles of

normalization before translating the Entity-Relationship Diagram (ERD) into a physical database

using Structured Query Language (SQL). The normalization process is an essential step in database

design pointed toward arranging information to limit overt repetitiveness and keep up with data

integrity. In this context, I will explore the purpose of normalization and give portrayals of each of

the five normal forms (1NF, 2NF, 3NF, 4NF, and 5NF) while offering models for better

comprehension.

Purpose of the Normalization Process

Minimizing Data Redundancy

Normalization assumes a critical part in limiting data redundancy by organizing data

efficiently (Servant et al., 2018). Redundant data alludes to the pointless duplication of data within a

database. At the point when information is excess, it consumes extra room and builds the gamble of

irregularities. For example, in a denormalized database, where redundant data is common, on the off

chance that one occurrence of an information point is refreshed while others are not, it prompts data

inconsistencies.

Enhancing Data Integrity

Update Anomalies: In a denormalized database, where information is copied across numerous

records, refreshing one event of an information point might leave others conflicting. For instance, in

the event that the telephone number of a client is changed in one spot but not in another, it results in

an update anomaly.

Insertion Anomalies: In some cases, adding new data to a denormalized database can be

challenging, as it might require embedding information into different spots (Ouafiq, Saadane &

Chehri, 2022). This intricacy can prompt inclusion abnormalities, where data insertion becomes

error-prone.

3

Deletion Anomalies: Deletion anomalies occur when deleting data from a denormalized database

inadvertently removes related data that ought to be held. For example, if erasing a client's

organization likewise eliminates their contact data, it leads to a deletion anomaly.

Improving Query Performance

Reduced Data Volume: Normalization eliminates data redundancy and stores data in a smaller

structure. This decrease in information volume implies that less data must be scanned or transferred

during query execution, leading to faster query performance.

Optimized Indexing: Normalized databases often have distinct essential keys and records. These

lists are more modest and more proficient, making it easier for the database management system to

locate and retrieve specific data quickly.

Structured Joins: Normalization often involves breaking data into multiple related tables, which

empowers organized joins between tables (Kasica, Berret & Munzner, 2020) Effective joins are

fundamental for complex questions that include recovering information from different related

elements. Very much planned joins result in quicker query execution.

Simplified Query Logic: In normalized databases, the inquiry rationale will be more clear and

natural. This straightforwardness makes it simpler for engineers to write efficient queries, reducing

the chances of performance bottlenecks.

1NF - First Normal Form

Scenario Violation of 1NF Resolution (1NF)

Table:

EmployeePhoneNumbers

Attributes:

EmployeeID,

PhoneNumbers

Attributes: EmployeeID, PhoneNumber

Description: In this table, there

is a single attribute

To bring this table into 1NF, create a new row for

each phone number and associate them with a

unique identifier, in this case, EmployeeID.

Before 1NF (Example Row): EmployeeID PhoneNumbers

4

101 123-456-7890, 987-654-3210

101

101

2NF - Second Normal Form

Scenario Violation of 2NF Resolution (2NF)

Table:

OrderDetails

Attributes: OrderID, ProductID,

ProductName

Table: Products

Attributes: ProductID (Primary Key),

ProductName

Description: In this table, we have attributes To achieve 2NF, we create a new table

"OrderID," "ProductID," and called "Products" with "ProductID" as

"ProductName." A violation of 2NF the primary key. We then include

occurs because "ProductName"

depends

"ProductName" in the "Products" table

on "ProductID" but not "OrderID." with a reference to "ProductID" as a

foreign key. This separation of data

ensures that non-key attributes are fully

functionally dependent on the primary key,

adhering to the principles of the Second

Normal Form (2NF).

Before 2NF (Example

Rows): OrderID ProductID ProductName

101 501 Laptop

102 502 Smartphone

103 501 Laptop

104 503 Tablet

5

3NF - Third Normal Form

Scenario Violation of 3NF Resolution (3NF)

Table:

EmployeeDetails

Attributes: EmployeeID, Department,

Manager

Table: Departments

Attributes: Department (Primary Key),

Manager

Description: In this table, we have attributes To achieve 3NF, we create a new table

called

"EmployeeID," "Department," and "Departments" with "Department" as the

primary

"Manager." A violation of 3NF key. We then include "Manager" in the

occurs because "Manager" depends "Departments" table with a reference to

on "Department" but not "Department" as a foreign key. This

separation

"EmployeeID." of data resolves the transitive

dependency.

Before 3NF (Example Rows): EmployeeID Department

101 HR

102 IT

103 HR

104 IT

Normalization of the ERD

Table

Meets

1NF

Meets

2NF

Meets

3NF Issues & Resolution

Students Yes Yes Yes No issues found.

Courses Yes Yes Yes No issues found.

Grade_Report Yes Yes No (3NF

Issue)

"DepartmentName" transitive dependency.

Create a separate "Courses" table with

"DepartmentName."

6

Departments Yes Yes Yes No issues found.

In conclusion, while most of the tables in the ERD already meet 1NF, 2NF, and 3NF

criteria, the "Grade_Report" table requires a slight change to accomplish 3NF by separating

"DepartmentName" into a new "Courses" table. This adjustment guarantees that the ERD lines up

with the principles of database normalization, optimizing data organization and integrity.

7

References

Ouafiq, E. M., Saadane, R., & Chehri, A. (2022). Data management and integration of low power

consumption embedded devices IoT for transforming smart agriculture into actionable

knowledge. Agriculture, 12(3), 329.

Kasica, S., Berret, C., & Munzner, T. (2020). Table scraps: an actionable framework for multi-table

data wrangling from an artifact study of computational journalism. IEEE Transactions on

visualization and computer graphics, 27(2), 957-966.

Servant, N., Varoquaux, N., Heard, E., Barillot, E., & Vert, J. P. (2018). Effective normalization for

copy number variation in Hi-C data. Bmc Bioinformatics, 19(1), 1-16.