11. Building Information Systems
Download
Report
Transcript 11. Building Information Systems
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
File Organization Terms and Concepts
Bit: Smallest unit of data; binary digit (0,1)
Byte: Group of bits that represents a single
character
Field: Group of words or a complete number
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
File Organization Terms and Concepts
Record: Group of related fields
File: Group of records of same type
Database: Group of related files
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Data Hierarchy in a Computer System
Figure 7-1
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
File Organization Terms and Concepts
Entity: Person, place, thing, event about which
information is maintained
Attribute: Description of a particular entity
Key field: Identifier field used to retrieve,
update, sort a record
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Entitities and Attributes
Figure 7-2
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Traditional File Processing
Figure 7-3
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Problems with the Traditional File Environment
Data redundancy
Program-Data dependence
Lack of flexibility
Poor security
Lack of data-sharing and availability
DATA REDUNDANCY
The presence of duplicate data in multiple data
files
Different functions collect the same information
independently
May have different meanings in different parts of
the organisation
Data Redundancy
7
Data Redundancy
Staff_Branch relation has redundant data; the details of a
branch are repeated for every member of staff.
In contrast, the branch information appears only once for
each branch in the Branch relation and only the branch
number (Branch_No) is repeated in the Staff relation, to
represent where each member of staff is located.
8
Program Data Dependence
The tight relationship between data stored in
files and the specific programs required to
update and maintain those files
Every program must describe the nature
In traditional file environment any changes to
data requires a change in all programs that
access the data
A change in tax rates for example !!
Lack of Flexibility
Traditional File system can deliver routine scheduled
reports after a significant programming efforts
An ad hoc/ unanticipated request for information,
would require a lot of time
The information is somewhere in the system but too
expensive to locate/retrieve
Compiling the data could take weeks
Poor Security
There is little or no control and management of
data
Data could be disseminated all over the
organisation without control
Who is accessing the data and making changes?
Lack of Data-sharing
Lack of control over access
Hard to get hands on information
Different pieces of information in different files
and different physical locations
Since files in different locations can’t be related
hard to share or access in a timely manner
Impossible for information to flow freely
Database Technology
DATABASE:
A collection of data organised to serve many
applications efficiently by centralising the data and
minimising redundant data.
Historical context
Why develop DBMS at all?
Manage flood of data from Transaction
Processing Systems
Integrate data across organisation
“Data glare”
DBMS
A Database Management System (DBMS) is
general purpose software and hardware
facility to:
Create, delete, reorganize, and manipulate
data in a database
Store, retrieve, share, and maintain data in
a database
Maintain relationships between the
database components
THE DATABASE APPROACH TO DATA MANAGEMENT
Database Management System (DBMS)
• Creates and maintains databases
• Eliminates requirement for data definition
statements
• Acts as interface between application
programs and physical data files
DBMS Cont’d
Provide security and procedures relating to
privilege and access.
Authenticates the integrity of all the updates and
transactions that are carried out.
interface for the access, deletion and addition of
data and for redefining the relationships within the
database.
A DBMS is a collection of programs that manages the
database structure and controls access to the data
stored in the database.
DBMS
Relieves the programmer or end user from the
task of understanding where and how data are
actually stored
Seperates the logical view from the physical
view
Logical View- How data perceived by end
users or business specialists
Physical View- How data is actually organised
and structured on phsical storage
media
THE DATABASE APPROACH TO DATA MANAGEMENT
The Contemporary Database Environment
Figure 7-4
THE DATABASE APPROACH TO DATA MANAGEMENT
Types of Databases
• Relational DBMS
• Hierarchical and Network DBMS
• Object-Oriented Databases
THE DATABASE APPROACH TO DATA MANAGEMENT
Relational DBMS
• The most popular type of DBMS today for PCs
as well as for larger companies and mainframes
• Represents all data in DB as two-dimensional
tables called relations
• Similar to flat files but information in more than
one file can easily be extracted and combined
• Relates data across tables based on common
data element
• Examples: DB2, Oracle, MS SQL Server
THE DATABASE APPROACH TO DATA MANAGEMENT
Relational Data Model
Figure 7-6
THE DATABASE APPROACH TO DATA MANAGEMENT
Three Basic Operations in a Relational Database
• Select: Creates subset of rows that meet
specific criteria
• Join: Combines relational tables to provide
users with information
• Project: Enables users to create new tables
containing only relevant information
THE DATABASE APPROACH TO DATA MANAGEMENT
Three Basic Operations in a Relational Database
Figure 7-7
THE DATABASE APPROACH TO DATA MANAGEMENT
Hierarchical and Network DBMS
Hierarchical DBMS
• Organizes data in a tree-like structure
• Supports one-to-many parent-child
relationships
• Prevalent in large legacy systems
THE DATABASE APPROACH TO DATA MANAGEMENT
Hierarchical DBMS
Figure 7-8
Hierarchical
Disadvantages
Knowledge of physical level required
Does not support logical data independence and
does not support all physical data independence
operations
Not all problems are one-to-many types
Problems with multiple parent implementation
Problems with anomalies for parent deletion
Application development in 3GL time-consuming
Support programs are not part of the DBMS
“System created by programmers for
programmers!”
THE DATABASE APPROACH TO DATA MANAGEMENT
Network DBMS
• Depicts data logically as many-to-many
relationships
THE DATABASE APPROACH TO DATA MANAGEMENT
Network DBMS
THE DATABASE APPROACH TO DATA MANAGEMENT
Disadvantages
Outdated
Less flexible compared to RDBMS
Lack support for ad-hoc and English languagelike queries
Object-Oriented databases
Object-oriented DBMS: Stores data and
procedures as objects that can be retrieved and
shared automatically
Object-relational DBMS: Provides capabilities
of both object-oriented and relational DBMS
DBMS Disadvantages
DBMSs are complex;
Need for explicit backup and control;
Costs associated with development and
operation can be substantial;
Consolidation of an entire business’
information resources can create a high level
of vulnerability.
Designing Databases
Conceptual design: Abstract model of database
from a business perspective
Physical design: How data is actually stored on
direct access storage devices
CREATING A DATABASE ENVIRONMENT
Designing Databases
Entity-relationship diagram: Methodology for
documenting databases illustrating relationships
between database entities
Normalization: Process of creating small stable
data structures from complex groups of data
An Entity-Relationship Diagram
Figure 7-10
An Unnormalized Relation of ORDER
Figure 7-11
An Normalized Relation of ORDER
Figure 7-12
Distributing Databases
Centralized database
Used by single central processor or multiple
processors in client/server network
Distributing Databases
Distributed database
Stored in more than one physical location
Partitioned database
Duplicated database
Distributed Databases
Figure 7-13
DATABASE TRENDS
Data Warehousing and Datamining
Data warehouse
Supports reporting and query tools
Stores current and historical data
Consolidates data for management analysis and
decision making
DATABASE TRENDS
Components of a Data Warehouse
Figure 7-16
DATABASE TRENDS
Data Warehousing and Datamining
Datamining
Tools for analyzing large pools of data
Find hidden patterns and infer rules to predict
trends
DATABASE TRENDS
Benefits of Data Warehouses
Improved and easy accessibility to information
Ability to model and remodel the data
DATABASE TRENDS
Databases and the Web
Database server
Computer in a client/server environment runs a
DBMS to process SQL statements and perform
database management tasks
Application server
Software handling all application operations
DATABASE TRENDS
Linking Internal Databases to the Web
Figure 7-18