(SEM VII) THEORY EXAMINATION 2023-24 DATA WAREHOUSING AND DATA MINING

B.Tech Data Structure 0 downloads
₹29.00

SECTION A – Very Short Answer Type

(2 × 10 = 20 Marks)

a) Key steps of Data Mining

The key steps are:                                               Data cleaning

Data integration                                                  Data selection

Data transformation                                            Data mining

Pattern evaluation                                               Knowledge presentation

These steps convert raw data into useful knowledge.

 

b) Support and Confidence

Support: Frequency of an itemset appearing in the database.

Confidence: Probability that item Y is purchased when item X is purchased.

 

c) Data Warehouse Process

It involves data extraction from multiple sources, transformation to a consistent format, and loading into a central repository for analysis.

 

d) Warehousing Strategy

Warehousing strategy defines how data is collected, stored, and accessed, such as enterprise warehouse, data mart, or virtual warehouse.

 

e) Statement of Apriori Algorithm

All non-empty subsets of a frequent itemset must also be frequent.”
This property reduces the search space in association rule mining.

 

f) Drawbacks of K-Means Algorithm

Requires predefined number of clusters                       Sensitive to initial centroids

Fails with non-spherical clusters                                    Affected by noise and outliers

 

g) Classification

Classification assigns data objects to predefined classes based on labeled training data, using algorithms like decision trees and Naive Bayes.

 

h) Clustering

Clustering groups similar data objects without predefined labels, aiming to maximize similarity within clusters and minimize similarity between clusters.

 

i) Need for Data Mining

Data mining helps discover hidden patterns, trends, and relationships in large datasets to support decision-making.

 

j) Binning

Binning is a data smoothing technique that reduces noise by grouping values into intervals or bins.

 

SECTION B – Long Answer Type

(Attempt any three – 10 Marks each)

 

2(a) Knowledge Extraction Process

Knowledge extraction is the process of discovering meaningful patterns from large datasets.

Steps:

 

Data Selection – Choose relevant data

Data Preprocessing – Clean noisy and missing data

Transformation – Normalize and aggregate data

Data Mining – Apply algorithms (classification, clustering, association)

Pattern Evaluation – Identify interesting patterns

Knowledge Presentation – Visualize results using charts and reports

This process converts raw data into actionable knowledge.

 

2(b) OLAP Functions, Tools, and Servers

OLAP Functions:                                                        Roll-up

Drill-down                                                                 Slice

Dice                                                                           Pivot

OLAP Tools:                                                               MOLAP (Multidimensional OLAP)

ROLAP (Relational OLAP)                                          HOLAP (Hybrid OLAP)

OLAP Servers:

OLAP servers store and process multidimensional data, enabling fast analytical queries.

 

2(c) Database Schemas

Types:

Star Schema – Central fact table connected to dimension tables

Snowflake Schema – Normalized version of star schema

Fact Constellation Schema – Multiple fact tables sharing dimensions

Example:
Sales fact table linked to time, product, and location dimensions.

 

2(d) Statistical Measures in Classification

Key measures include:                                               Mean

Median                                                                      Variance

Standard Deviation                                                   Correlation

These measures summarize data distribution and improve classification accuracy in large databases.

 

2(e) Building a Data Warehouse

Steps include:                                                             Business analysis

Data source identification                                          ETL design

Data modeling                                                           Data loading

Testing and deployment

A data warehouse supports long-term decision making.

 

SECTION C – Descriptive Answer Type


3(a) Mapping Data Warehouse to Multiprocessor Architecture

Steps:

Partition data across processors                                  Assign fact and dimension tables

Enable parallel query processing                                 Synchronize data access

Optimize workload distribution                                   This improves performance and scalability.

 

3(b) Data Cubes with Example

A data cube represents data in multiple dimensions.

 

Example:
Sales analyzed by time, location, and product.
Each cell stores aggregated values like total sales.

 

4(a) Concept Hierarchy

Concept hierarchy organizes data from general to specific levels.

Example:
Location → Country → State → City                             It supports roll-up and drill-down operations.

 

4(b) Warehouse Management and Support Process

Includes:                                                                       Data refresh

Indexing                                                                       Backup and recovery

Query management                                                     Performance tuning

Ensures smooth warehouse operation.

 

5(a) Data Mining System Integration Approaches

ApproachDescription
No CouplingMining done outside database
Loose CouplingDatabase used for data storage
Semi-tight CouplingSome mining functions integrated

Semi-tight coupling provides better performance.

 

5(b) Data Consolidation Statement Justification

Yes, data consolidation is a data modeling activity because it integrates data from multiple sources into a unified schema for analysis.

 

6(a) Measures of Central Tendency

Mean – Average value                                       Median – Middle value

Mode – Most frequent value                             These summarize dataset characteristics.

 

6(b) Quartiles and Histograms

Quartiles divide data into four equal parts       Histograms graphically represent data distribution

Both help in understanding data spread.


7(a) Distance-Based vs Decision Tree Algorithms

AspectDistance-BasedDecision Tree
MethodSimilarity measuresRule-based
InterpretabilityLowHigh
Noise handlingWeakStrong

7(b) Web Mining

Types:                                                                         Web Content Mining – Extracts text and media

Web Structure Mining – Analyzes link structure     Web Usage Mining – Studies user behavior

Used in recommendation systems and search engines.

File Size
139.03 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Meditation Coaching Near Sector 124 Noida – A Complete Guide to Mental Peace and Mindfulness Noida
Japanese Language Classes Near Uttam Nagar – Learn Japanese for Global Opportunities Uttam Nagar, Delhi
Spanish Language Classes Near Sector 43 Gurugram – Learn Spanish with Expert Trainers Sector 43, Gurugram
Spoken English Classes Near Sector 119 Noida – Improve Your Communication Skills with Expert Training Sector 119, Noida
Meditation Coaching Near By Nangloi – Find Inner Peace & Mental Clarity Nangloi, Delhi
Accounts & Commerce Classes Near By Dwarka Mor Professional Coaching Dwarka Mor, Delhi
App Development Classes Near Uttam Nagar – Build Android & iOS Apps Uttam Nagar, Delhi
Public Speaking Training Near Sector 109 Noida – Improve Confidence and Communication Skills Noida
Spoken English Classes Near By Chhatarpur Improve Fluency, Build Confidence & Unlock Career Opportunities in 2026 Chhatarpur, Delhi
Yoga Classes Near Sector 136 Greater Noida – Improve Your Health, Flexibility and Mental Wellness Noida
Yoga Classes Near By Lajpat Nagar Build Strength, Reduce Stress & Achieve Holistic Wellness in 2026 Lajpat Nagar, Delhi
Guitar Classes Near By Saket Learn Guitar from Experts & Turn Your Passion into Skill in 2026 Saket, Delhi
🇫🇷 French Language Classes Near Sector 112 Noida – Learn French with Expert Trainers Noida
No Office Rent Business Setup Near Najafgarh Start & Grow Your Business Without Paying High Office Rent in 2026 Najafgarh, Delhi
Tally / Accounting Software Course Near Sector 64 Gurugram – Build a Strong Career in Accounts & Finance Sector 64, Gurugram
Spoken English Classes Near By Najafgarh Improve Fluency, Build Confidence & Speak English Naturally Najafgarh, Delhi
Prenatal Yoga Training Near Uppal Southend, Gurugram – A Calm & Healthy Pregnancy Journey Uppal Southend, Gurugram
Violin Classes Near DLF Phase 5 – Learn, Grow & Perform with Confidence DLF Phase V, Gurugram
Web Development Classes Near Uttam Nagar – Learn to Build Modern Websites Uttam Nagar, Delhi
Guitar Classes Near Central Noida Sector 5 – Learn Guitar with Professional Trainers B Block Sector 5, Noida
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020