Monday, January 27, 2014

Star Schema v/s OLAP Cube

In the first couple of weeks, we have recognized the importance of dimensional modeling in the design of a data warehouse. Dimensional model simplifies the database and allows the BI software to present the data to end-users in a meaningful and efficient way. Dimensional model can be implemented using Star schemas or OLAP cubes. 

Star Schema:

A star schema is the simplest form of dimensional modeling which consists of one or more fact tables in the center surrounded by dimension tables (providing a context to the facts). This structure resembles a star and hence the name Star Schema. These star schemas are implemented on a relational database management system.


OLAP Cube:

An OLAP (Online Analytical Processing) cube is multi-dimensional and consists of aggregated and pre-calculated summary data. As a result, an OLAP cube provides superior performance and enables the business users to slice and dice along the various dimensions, drill up and drill down through hierarchies.


Slice-and-Dice operation

By slicing across the product dimension in the cube, the product manager will be able to analyze the sale of a product while the store manager can observe the sales for a particular store by slicing across the location dimension.

A “slice” means a cut across the cube and “dice” means a small cube which is produced by the intersection of slices.


Drill-up and Drill-down operation

To analyze the data at different levels, the user is provided with three hierarchies:
  • Time Hierarchy
  • Product Hierarchy
  • Location Hierarchy
 

In this scenario, the manager will be able examine the data for different product types in various cities of a particular state for all the quarters in a particular year. This can be possible through the drill-up and drill-down functionality in multidimensional cubes.

Both star schemas and OLAP cubes offer their own benefits and shortcomings. Therefore, based on a good understanding of requirements, an efficient star schema or/and an OLAP cube can be designed to answer difficult business questions!

More to follow in my next blog. Stay tuned!

References:
  • The Data Warehouse Toolkit, Ralph Kimball and Margy Ross
  • Database Systems – Design, Implementation and Management, Peter Rob and Carlos Coronel
  • Wikipedia





3 comments:

  1. hi Pradeep, this is a fine article, keep it up! In your next blog, will you talk about the Pro's and Con's of the two structural choices in greater detail? Am looking forward to it!

    ReplyDelete
    Replies
    1. Hi SC, definitely! I'll keep you posted. This is also part of the coursework! :)

      Delete
  2. Hi Pradeep, nice article. I am not sure why you have not included details on snowflake schema, will you be taking that later on?

    ReplyDelete