Previous | Table of Contents | Next |
Service Dimension
The service dimension describes the services being sold. This table will have a relatively small number of rows, since fewer than 1,000 services are provided. It will usually be a table with a large number of columns. These columns detail everything that can be known about the service, such as the following:
Here are some questions that may be answered with the service dimension:
Salesperson Dimension
The salesperson dimension records information about the salesperson who achieved the sale. This dimension will often be used for spatial analysis. The columns in this dimension contain information about the salesperson that is easy to group, as shown in the following list. Notice that the name and address information of the salesperson is not recorded. That information is not relevant for high-level decisions.
The following questions are the type that may be answered with the salesperson dimension.
Customer Dimension
The customer dimension is often the most valuable dimension in this particular type of star schema. It can record information about the customer that will allow management to better understand the customer base. Some of the data included would be the following:
Some of the questions the customer dimension might answer are the following:
Using All the Dimensions
The true benefit of dimensions is that they can all be used together. The data from the fact table can be filtered using any of the dimension tables.
The following are examples of the kinds of questions that may be answered with multiple dimensions:
The star schema is a data warehousing model that provides a flexible, simple architecture for decision support systems. It is the most proven and accepted model in this area.
The data warehouse is typically designed to be updated on a periodic basis. The data warehouse design team must determine the periodic refresh interval for each group of data. Some tables will need to be updated nightly. The Sales table is a good candidate for a nightly refresh because it is constantly changing. Other tables, such as lookup tables, change much less frequently.
An example of this is the Shipping table. The Shipping table contains the shipper_id, name, address, phone number, and shipping code. The Shipping table has three rows (see Table 30.1).
ID | Name | Address | Phone | Shipping Code |
---|---|---|---|---|
1 | Federal Express | 101 Lincoln Ave. | 303-555-1112 | A788D6 |
2 | United Parcel Service | 121 10th St. | 303-555-4444 | 11118786657 |
3 | U.S. Post Office | 6556 Main Rd. | 303-555-8878 | U454IUYO |
Because the company rarely adds or changes shipping companies, this table can be updated much less frequently inside the data warehouse.
Some data brought into the data warehouse is refreshed based on a schedule outside of the data warehouse design teams control. Data derived from accounting cycles will be updated inside the data warehouse according to the accounting cycle dates. A company that completes all of its accounting for each month on the 15th of the following month will refresh the data on the 16th of the month. Other data will not be periodic in nature. Press releases are often random in nature and will need to be added as quickly as possible.
Its important for the data warehouse design team to remember that not all the data is current at a single point in time.
Previous | Table of Contents | Next |