BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI Work Integrated Learning Programme (DLPD)
BITS-Satyam Collaborative Programme: M.S. Software Engineering at Satyam Comp. Services Ltd., Hyderabad I SEMESTER 2007-2008 COURSE HANDOUT
Course No. Title Instructors
: SESL ZG514 : Data Warehousing : Prof. A. Govardhan
[email protected]
Course Description Introduction, Evolution of data warehousing; decision support systems; goals, benefit, and challenges of data warehousing; architecture; data warehouse information flows; software and hardware requirements; approaches to data warehouse design; creating and maintaining a data warehouse; Online Analytical Processing (OLAP) and multi-dimensional data, multi-dimensional modeling; view materialization; data marts; data warehouse metadata; data mining. Scope and Objectives Corporate decision makers require access to all the organization’s data, wherever it is located. To provide comprehensive analysis of the organization, its business, its requirements and any trends, require access to not only the current data in the database but also to historical data. To facilitate this type of analysis, data warehouses have been created to contain data drawn from several sources, maintained by different departments of the organization. This course will involve an indepth study of various concepts needed to design and develop a data warehouse. It also provides an introduction to data mining and end user access tools for a data warehouse. Prescribed Text Book(S) T1. T2.
Anahory S, & Dennis M, “Data Warehousing in the Real World”, Pearson Education, 1997 Kimball R, “The Data Warehouse Toolkit”, John Wiley, 2nd Ed., 2002.
Reference Books R1.
Kimball R, et. al., “The Data Warehouse Lifecycle Toolkit”, John Wiley, 1998.
R2.
R Ramakrishnan & J Gehrke, Database Management Systems, MGHISE, 3rd Ed., 2003.
Plan of Study Week Topic(s) No. 1 Introduction to Data Warehousing
Chapter Reference
T1: Ch 1 T2: Ch 1 2-3 Data Warehouse Architecture T1: Ch 3, 4 T2: Ch 1 4-6 Data Warehouse Design T1: Ch 5 T2: Ch 1 7 Accessing the Data Warehouse T1: Appendix B 8 Case Studies: Retail Sales and Inventory T2: Ch 2, 3 Syllabus for Mid-Semester Test (Closed Book): Topics in Week No. 1 to 8 9-11 Performance Enhancing Techniques • Partitioning Strategy T1: Ch 6 T1: Ch 7 • Aggregation 12 Data Marting T1: Ch 8 13 Metadata T1: Ch 9 14 Advanced Design Issues • Hardware Architecture T1: Ch. 11 • Security T1: Ch. 13 • Backup and Recovery T1: Ch. 14 • Capacity Planning T1: Ch. 17 15-16 Case Studies • Financial Services: Bank T2: Ch. 9, 10,12 • Telecom Companies • Education Syllabus for Comprehensive Exam (Closed Book): All topics given in Plan of Study
Evaluation Scheme: EC Component & Nature No
Duration
Weightage
Date and Time
Oct 06, 2007 (FN) 10am to 12 pm Dec 22, 2007 (FN) 9am to 12pm
1.
Mid-Semester Test(Closed Book)
2 hrs
30%
2.
Compre. Exam (Closed Book)
3 hrs
40%
3.
Surprise Quizzes -2 Nos.( Best of the Two is taken) Home Assignment – 2Nos. (Average is taken)
1 Hour
15%
4.
15%