(SEM VIII) THEORY EXAMINATION 2023-24 DATA WAREHOUSING & DATA MINING

B.Tech Data Structure 0 downloads
₹29.00

SECTION A

(Attempt all | 2 × 10 = 20 Marks)

 

a. Define Data Warehousing
A data warehouse is a subject-oriented, integrated, time-variant, and non-volatile collection of data used to support decision making.

 

b. Discuss Fact Constellation
Fact constellation is a schema with multiple fact tables sharing common dimension tables. It is also called a galaxy schema.

 

c. Explain Distributed DBMS implementation
Distributed DBMS stores data across multiple locations connected by a network, allowing data sharing, reliability, and parallel processing.

 

d. Define Warehousing Software
Warehousing software is used to extract, transform, load (ETL) data and manage storage, querying, and analysis of warehouse data.

 

e. Are all patterns interesting?
No. Only patterns that are useful, valid, novel, and understandable are considered interesting.

 

f. Binary symmetric vs asymmetric attributes
Symmetric attributes treat both values equally (e.g., gender).
Asymmetric attributes treat one value as more important (e.g., disease presence).

 

g. Mode of dataset & advantage
Dataset: 12, 13, 34, 32, 21, 29, 40, 11, 39, 23
All values occur once → No mode.
Advantage: Mode is not affected by extreme values.

 

h. Manhattan distance
Objects: (22, 2, 45, 10) and (20, 10, 26, 2)

∣22−20∣+∣2−10∣+∣45−26∣+∣10−2∣=2+8+19+8=37|22−20|+|2−10|+|45−26|+|10−2| = 2+8+19+8 = \boxed{37}∣22−20∣+∣2−10∣+∣45−26∣+∣10−2∣=2+8+19+8=37​

 

i. Temporal Mining
Temporal mining discovers patterns from time-related data, such as trends, sequences, and periodic patterns.

 

j. Data Visualization
Data visualization represents data using graphs, charts, plots, and dashboards to identify patterns and insights.

 

SECTION B

(Attempt any THREE | 10 × 3 = 30 Marks)

 

2(a) Knowledge Discovery Process & Snowflake Schema

Steps of Knowledge Discovery in Data (KDD):

Data cleaning                                                                    Data integration

Data selection                                                                   Data transformation

Data mining                                                                      Pattern evaluation

Knowledge presentation

 

Snowflake Schema:
It is an extension of star schema where dimension tables are normalized into multiple related tables.
Advantages: Reduced redundancy
Disadvantages: Complex queries and joins

 

2(b) Market Basket Analysis

Market Basket Analysis identifies relationships between items purchased together using association rules.
Example:
If customers buy bread, they also buy butter.
It uses support, confidence, and lift measures and is widely used in retail and e-commerce.

 

2(c) Box-and-Whisker Plot

Dataset is sorted, and quartiles (Q1, Q2, Q3) are calculated.
The box represents interquartile range, median is shown inside the box, and whiskers show minimum and maximum values.
It helps detect spread, skewness, and outliers.

 

2(d) K-Means Clustering (2 clusters)

Points: (2,4), (6,8), (1,2), (4,5), (3,5)

After iterations using Euclidean distance:

Cluster 1: (1,2), (2,4)                                                 Cluster 2: (3,5), (4,5), (6,8)

K-Means minimizes intra-cluster distance.

 

2(e) ROLAP, MOLAP & HOLAP

ROLAP: Uses relational databases, scalable, slower queries

MOLAP: Uses multidimensional cubes, fast queries, less scalable

HOLAP: Combines both ROLAP and MOLAP advantages

 

SECTION C

 

3(a) Mapping 2D table to multidimensional model

A 2D sales table (Product, Time, Sales) is mapped into a cube with dimensions Product, Time, Location and measure Sales.
This enables slice, dice, drill-down, and roll-up operations.

 

3(b) Data Characterization & Discrimination

Data Characterization: Summarizes general features of a class.
Data Discrimination: Compares features of two or more classes.
Used for descriptive data mining.

 

4(a) Min-Max vs Z-Score Normalization

Min-Max:

v′=v−minmax−minv' = \frac{v−min}{max−min}v′=max−minv−min​

Z-Score:

v′=v−meanstdv' = \frac{v−mean}{std}v′=stdv−mean​

Binary data uses 0/1 values, while nominal data represents categories like colors or names.

 

4(b) Data Mining Architecture

Components include:                                                  Data sources

Data warehouse                                                          Database server

Data mining engine                                                    Pattern evaluation module

User interface

 

5(a) Decision Tree-Based Classifiers

Decision trees classify data using if-then rules.
They use entropy and information gain to split nodes.
Advantages: Easy to understand, fast classification
Disadvantages: Overfitting

 

5(b) Bayesian Classification (Result)

Given tuple X = (youth, medium, yes, fair),                 Using Bayes theorem, the tuple is classified as:
buys_computer = YES

 

6(a) Types of Clustering Methods

Partitioning (K-Means)                                                Hierarchical

Density-based (DBSCAN)                                            Grid-based

Model-based

Partitioning clustering divides data into K clusters minimizing distance.

 

6(b) DBSCAN Algorithm

DBSCAN groups data based on density using parameters ε (epsilon) and MinPts.
It identifies clusters of arbitrary shape and handles noise effectively.

 

7(a) OLAP vs OLTP & Slice vs Dice

OLAP: Analytical, read-intensive, historical data
OLTP: Transactional, real-time updates

Slice: Fixes one dimension
Dice: Selects multiple dimensions

 

7(b) Spatial Data Mining

Spatial data includes geographical information (maps, satellite data).
Mining involves spatial clustering, association, and trend detection using GIS tools.

File Size
152.55 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Web Development Classes Near Noida Sector 103 – Complete Guide to Start Your Tech Career Noida
Geography Classes Near Sector 92 Gurugram – Build Strong Concepts, Map Skills & Exam Confidence Gurugram
Spoken English Classes Near By Malviya Nagar Build Confidence, Improve Fluency & Unlock Career Opportunities in 2026 Malviya Nagar, Delhi
Personal Fitness Training Near Sector 135 Greater Noida – Achieve Your Health and Fitness Goals with Expert Guidance Sector 135, Noida
Yoga Classes Near By Tilak Nagar Holistic Wellness, Stress Relief & Stronger Mind-Body Balance Tilak Nagar, Delhi
Guitar Classes Near Central Noida Sector 5 – Learn Guitar with Professional Trainers B Block Sector 5, Noida
Vedic Maths Classes Near Sector 99A Dwarka Expressway, Gurugram – Boost Speed, Accuracy & Mental Calculation Skills Sector 99A, Gurugram
Voice-Over Training Near Sector 139 Noida – Learn Professional Voice Acting & Recording Skills Noida
Physiotherapy Guidance (Certified Professionals Only) Near Sector 120 Noida – Expert Care for Pain Relief and Recovery Sector 120, Noida
Meditation Coaching Near By Nangloi – Find Inner Peace & Mental Clarity Nangloi, Delhi
Drum Lessons (Electronic Drums Preferred at Home) Near Sector 146 Noida – Learn Drumming with Professional Trainers Sector 146, Noida
Spoken English Classes Near By Kalkaji Improve Fluency, Build Confidence & Grow Career Opportunities in 2026 Kalkaji, Delhi
Hindi Classes Near Sector 89 Gurugram – Build Language Skills with Confidence and Clarity Sector 89, Gurugram
🇩🇪 German Language Classes Near Sector 116 Noida – Learn German with Professional Training Sector 116, Noida
History Classes Near By Dwarka Mor Build Strong Conceptual Understanding & Score High in Board Exams Dwarka Mor, Delhi
Singing / Vocal Training Near DLF Phase 2 – Professional Voice Training for All Age Groups DLF Phase 2, Gurugram
Diet & Nutrition Consultation Near Sector 125 Noida – Your Complete Guide to Healthy Living Sector 125, Noida
Spoken English Classes Near By Kirti Nagar Improve Fluency, Build Confidence & Unlock Career Opportunities in 2026 Kirti Nagar, Delhi
Personality Development Classes Near Sector 56 Gurugram – Build Confidence, Communication & Professional Success Sector 56, Gurugram
Singing & Guitar Classes Near Sector 106 Gurugram (Dwarka Expressway) – Discover Your Musical Talent Sector 106, Gurugram
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020