FACULTY OF INFORMATICS
DEPARTMENT OF INFORMATION SCIENCE
INST 766: DATA MINING ASSIGNMENT-1 (DUE DATE: on or before 3rd November 2007, 4 PM)
1. Describe five functionalities of data mining and provide examples wherever possible.
2. Discuss various ways of classifying data mining systems.
3. What do you understand by concept hierarchies? Elaborate on concept hierarchy generation for categorical data.
4. Various organizations have developed their own KDD processes. Look up the following KDD processes on the Internet and report on them:
   a. Two Crows' KDD process
   b. CRISP-DM process
5. Pick a dataset rich in both nominal and numerical attributes. Delete some of the attribute values at random from the records, introduce outliers for some of the attributes, and then experiment with the following operations using WEKA and SPSS, or any of your favorite DM tools:
   a. Data cleaning operations
   b. Discretization by binning
   c. Normalization
   d. Sampling
   e. Data reduction
   (Note: This question is meant only to give you practice with DM tools. There is no need to report on it in the assignment.)
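For those who would rather script the preparation step of Question 5 than do it by hand, the following is a minimal sketch in Python (one possible "favorite DM tool", not a replacement for WEKA or SPSS). It uses only the standard library. The toy records, the attribute names `colour` and `income`, and all function names are hypothetical and chosen purely for illustration. The sketch touches operations (a) to (d) in their simplest forms (mean imputation, equal-width binning, min-max normalization, simple random sampling); data reduction is left out.

```python
import random

def delete_values_randomly(records, fraction, rng):
    """Return a copy of records with about `fraction` of all cells set to None."""
    damaged = [dict(r) for r in records]
    cells = [(i, key) for i, r in enumerate(damaged) for key in r]
    for i, key in rng.sample(cells, round(len(cells) * fraction)):
        damaged[i][key] = None
    return damaged

def inject_outliers(records, attribute, count, factor, rng):
    """Return a copy with `count` non-missing values of `attribute` multiplied by `factor`."""
    damaged = [dict(r) for r in records]
    candidates = [i for i, r in enumerate(damaged) if r[attribute] is not None]
    for i in rng.sample(candidates, min(count, len(candidates))):
        damaged[i][attribute] *= factor
    return damaged

def equal_width_bins(values, n_bins):
    """Discretization by binning: map each value to a bin index in 0..n_bins-1."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0  # avoid division by zero if all values are equal
    return [min(int((v - lo) / width), n_bins - 1) for v in values]

def min_max_normalize(values):
    """Normalization: rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0
    return [(v - lo) / span for v in values]

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical toy data: one nominal and one numerical attribute.
records = [{"colour": rng.choice(["red", "green", "blue"]),
            "income": rng.uniform(20.0, 80.0)} for _ in range(50)]

# Preparation: remove 10% of the cells, then turn three incomes into outliers.
damaged = delete_values_randomly(records, 0.10, rng)
damaged = inject_outliers(damaged, "income", 3, 100.0, rng)

# (a) Data cleaning: replace missing incomes with the mean of the observed ones.
observed = [r["income"] for r in damaged if r["income"] is not None]
mean_income = sum(observed) / len(observed)
cleaned = [r["income"] if r["income"] is not None else mean_income for r in damaged]

bins = equal_width_bins(cleaned, 5)   # (b) discretization by binning
scaled = min_max_normalize(cleaned)   # (c) normalization
sample = rng.sample(cleaned, 10)      # (d) simple random sampling without replacement
```

Inspecting `bins` after the outliers are injected shows how a few extreme values can push nearly all records into the lowest equal-width bin, which is one of the effects the exercise is meant to let you observe.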