|
Index - Major Sections
Home
**InHCc HMIS**
Site Map
Health Economic and Reform
Benefits
Discussion
Data and Data
Analysis
Health Management
Product and Services
References
Team
_______________
Index - Same Level Subject
Index - Child Subjects
| |
Introduction
The Data Warehouse is the repository of information from the Clinical
System. Whereas the Clinical system is the on-line real time transactional
database, the data warehouse is the "read only" access to the historical
information.
Purpose
A data warehouse stores stable, verified data values.
- A transaction database helps people carry out activities,
and a data warehouse helps people make decisions. For example, a transaction
database might show which clinics have a medical specialist so that a client
can be referred. A data warehouse, on the other hand might show the
historical pattern of how many requests for a medical specialist took place
and in what areas so that a manager can decide to adjust staffing in the
future.
- A transaction database is volatile; its information
constantly changes as new clients are seen. A data warehouse is stable; its
information is updated at standard intervals--perhaps monthly, weekly, or
even hourly--and, in an ideal world, an update would add values for the new
time period only without changing values previously stored in the warehouse.
- A transaction database focuses on the details; the
physician making the referral doesn't want to know the average number of
visits per month to each medial specialist. A data warehouse focuses on
high-level aggregates: the manager updating staffing doesn't care which
specialist is not at the present moment occupied. The implication of this
difference is that the key values in a data warehouse must be numeric values
that can be summarized.
Characteristics
A Data Warehouse can be viewed as a Information Database. In additional to
the characteristics of the Information Database it also has the following
characteristics:
-
Organized
by subject
-
Data
is time-variant. It contains
historical data.
-
Data
is integrated from multiple data sources
-
Data
is no-volatile. The individual data is never updated or
deleted.
-
Data
is added in batches
-
Data
is optimized for read only queries and not for data entry.
-
May
requires
mapping of sources to the database structure.
Advantages
Organization
There are two basic concepts on how to organize a database
There are differences between the types of databases.
Reasons
to split of Operational data from Informational data (Innom, 1996)
-
The
data serving operational needs is physically different data from that
serving informational or analytic needs.
-
The
supporting technology for operational processing is fundamentally
different from the technology used to support informational or analytical
needs.
-
The
user community for operational data is different from the one served by
informational or analytical data.
-
The
processing characteristics for the operational environment and the
informational environment are fundamentally different.
| Operational |
Analytical |
| Individual transactions |
Masses amount of data |
| Frequent Updates |
Read Only |
| Quick response time |
Response time not as important |
| Normalized data |
Denormalized Data |
| Traditionally operational data |
Traditionally Research data |
A data warehouse is not as susceptible to system failures
as a transaction processing system. If the system crashes and data is lost,
the data warehouse can be reloaded from its sources.
Reasons to separate the operational from the data warehouse (Innom,
1996, page-27)
Relationship to the Relational Databases
A data warehouse almost never lets the user update or add
records to the database…therefore the main benefit of the normalization of
the database for the relational database does not apply to the data warehouse.
Flexibility comes at a price. Performs poorly because of
the necessary to create joins.
Relationship to Object-oriented databases
The elimination of the table-row-column structures of
relational data bases and substituted directly into the database engine, the
concepts of classes and subclasses (or types and subtypes), objects,
properties, and methods.
Relationship to Multidimensional Databases
Multidimensional databases throw out the idea of the
conventional relations database.
To understand multidimensional databases, we must first
look at the types of analytical analysis that they perform.
Relationship to Online Analytically Processing (OLAP)
OLTP applications typical access only a few tables during
a given short lived transaction that can either added data or extract data
from the data warehouse.
OLAP applications typical access many tables during a
given long lived transaction that only reads the data. |