In the last couple of years, I have worked on several data warehouse applications that have provided the inspiration for a new presentation that I am giving at the DOAG and UKOUG conferences this year.
The presentation and this series of related blogs have several objectives:
- Partly, it is an attempt to get some advice out in the hope that some of these poor design decisions are not made in the first place.
- I get to have a bit of a rant about how to design a data warehouse properly, and I will feel better at the end it.
- I get to attend two really good conferences where I will learn (or be reminded of) much.
Oracle Sales History (SH) Sample Schema
I cannot illustrate issues with actual examples from client systems, no matter how well anonymised they are. So, I have recreated some of the most egregious mistakes using the Oracle Sales History example schema, SH. Everything I am going to show you does have an origin in the real world! The SH sample schema is a simple and very typical star schema. It is well designed and implemented, and represents good practice. I encourage you to install and play with it for yourself.
My examples use a single fact table (SALES), that has five dimensions around it (CUSTOMERS, CHANNELS, PRODUCTS, PROMOTIONS and TIMES), although I will only use three. COUNTRIES is a dimension on the CUSTOMERS dimension, sometimes called a 'snowflake'.
- The schema described in the Oracle documentation.
- Installation scripts are available from Github.
- It is also available by default in the Autonomous Data Warehouse Cloud but in a slightly different form.
Common Mistakes in Data Warehouse Design and Build
As each blog post is published, I will add a link to them in this section.
No comments :
Post a Comment