| Previous | Table of Contents | Next | 
by Matt Larson
Businesses have often understood how important operational data is to the survival of the company. They spend millions collecting data during the course of business. Unfortunately, most businesses fail to properly utilize the data. Once the operational benefits have been gained, the data is forgotten. Due to the increasingly competitive marketplace, businesses are looking to their data to gain an edge over the competition.
A data warehouse is a central repository of data used for analysis by decision makers. The data is pulled from the operational data stores into the warehouse where it is massaged into a format that is easy to utilize. The decision makers receive a simplified business view of the company data that can easily be used for analysis.
Established 20 years ago, the fictitious Acme company has grown into a worldwide corporation that sells office supplies to over 30,000 businesses on four continents. Acme has always understood that current technology is a vital tool. In the last 10 years, Acme has amassed a computer network that consists of over 700 servers. Some of these servers are mainframes running on MVS, some are mid-range systems running on VMS and UNIX, and the newest servers are running on Windows NT. This open environment has been beneficial at the business unit level. Each business unit was allowed to purchase the best possible solution to its problem. However, it has become increasingly difficult to answer critical business questions that span more than one business unit.
The CEO recently contemplated expanding the sales force by 10%. In order to make an informed decision, the CEO asked about the relationship between the number of salespeople and profitability. There are several pieces of information needed to answer this question.
Data analysts ran into several problems while attempting to answer this simple question (see Figure 30.1).
 
Figure 30.1.  Operational data.
Eventually the data analysts gather the data they need. They compile a report and present it to the CEO. The report says that salespeople added to the sales force in the past year have not contributed to a higher profit. Instead, they appear to be taking sales away from existing salespeople. The CEO takes one look at it the report and asks about the profit potential of additional advertising. The data analysts, having spent two weeks on the sales force report, look at each other in amazement. There has to be a better way.
There isdata warehousing.
There are six steps involved in creating a data warehouse. The steps are usually performed in sequential order, although some may be performed together.
The data warehouse is going to pull data from all of the important systems in the company.
The first step in documenting the operational environment is to identify all of the systems. Some of them will be easy to spot.
Some of the systems will be harder to identify.
The second step in documenting the operational environment is to determine what data is stored in each system. This must be done at the column level. In other words, the columns for every important table must be clearly understood. Depending on the environment, this step may take months to finish. For example, Oracle Applications 10.7 contains thousand of tables with valuable data. All of these tables must be fully understood and documented. This will become even more difficult with older systems. It may be difficult to find an analyst who understands how to interpret the table structures.
| Previous | Table of Contents | Next |