(SEM VIII) THEORY EXAMINATION 2024-25 DATA WAREHOUSING & DATA MINING

B.Tech Data Structure 0 downloads
₹29.00

SECTION A – Short Answers (2 Marks Each) – Paragraph Style

 

a) Explain Discretization.

Discretization is a data preprocessing technique used in data mining where continuous numerical data is converted into a finite number of intervals or categories. This helps reduce data complexity and improves the performance of data mining algorithms by making patterns easier to identify and analyze.

 

b) Discuss issues to consider during data integration.

Data integration involves combining data from multiple sources into a unified dataset. During this process, issues such as schema conflicts, naming inconsistencies, data redundancy, and data value conflicts may arise. Resolving these issues is necessary to maintain data consistency and accuracy.

 

c) Explain the working of an Artificial Neuron.

An artificial neuron is a basic unit of a neural network that mimics the functioning of a biological neuron. It receives input signals, multiplies them by weights, sums them, and applies an activation function to produce an output. This process enables learning and pattern recognition.

 

d) Explain support and confidence in association rule mining.

Support measures how frequently an itemset appears in a dataset, while confidence indicates the reliability of an association rule. Together, they help determine the strength and usefulness of relationships discovered in transactional data.

 

e) Explain pivot operation in OLAP.

The pivot operation in OLAP allows users to rotate the data cube to view data from different perspectives. It helps in analyzing multidimensional data by rearranging rows and columns to gain better insights.

 

f) Define Information Gain.

Information Gain is a measure used in decision tree algorithms to determine the best attribute for splitting data. It calculates the reduction in entropy after a dataset is divided based on an attribute.

 

g) Define Data Warehouse.

A data warehouse is a centralized repository that stores integrated, historical, and subject-oriented data from multiple sources to support decision-making and analysis.

 

h) Explain Outlier Analysis.

Outlier analysis identifies data objects that significantly differ from the rest of the dataset. These unusual values may indicate errors, rare events, or important insights and must be carefully analyzed.

 

i) What do negative, positive, and zero correlation coefficients indicate?

A positive correlation coefficient indicates that variables increase together, a negative value shows that one variable increases while the other decreases, and a zero value means there is no linear relationship between variables.

 

j) Explain the binning method for dealing with noisy data.

Binning is a data smoothing technique where data is divided into bins and replaced with a representative value such as the mean or median. This helps reduce noise and improves data quality.

 

SECTION B – Descriptive Answers (10 Marks Each) – Paragraph Style

 

a) Explain three-tier data warehousing architecture.

The three-tier data warehousing architecture consists of the bottom tier, middle tier, and top tier. The bottom tier contains the data warehouse database where cleaned and integrated data is stored. The middle tier includes OLAP servers that process queries and perform analytical operations. The top tier consists of front-end tools used by users for reporting, analysis, and decision-making. This architecture improves scalability, performance, and data management efficiency.

 

b) Compare OLTP and OLAP systems.

OLTP systems are designed for routine transaction processing and handle large numbers of short, simple operations such as insert, update, and delete. OLAP systems, on the other hand, are used for analytical processing and complex queries involving large datasets. While OLTP focuses on operational efficiency, OLAP emphasizes data analysis and decision support.

 

c) Discuss snowflake and fact constellation schemas.

A snowflake schema is an extension of the star schema where dimension tables are normalized to reduce redundancy. A fact constellation schema consists of multiple fact tables sharing common dimension tables. These schemas support complex analytical queries and improve storage efficiency.

 

d) Explain Enterprise Warehouse and Data Mart.

An enterprise warehouse stores integrated data from the entire organization and supports strategic decision-making. A data mart is a subset of a data warehouse designed for a specific department or business function. Data marts are smaller, faster to implement, and easier to manage.

 

e) Explain non-linear separability using EX-OR functionality in neural networks.

The EX-OR problem demonstrates non-linear separability where data cannot be separated using a single straight line. Neural networks solve this problem using multiple layers and hidden neurons that learn complex decision boundaries, highlighting the power of multi-layer perceptrons.

 

SECTION C – Long Answer (10 Marks) – Paragraph Style

 

a) Explain Laplacian correction in Naïve Bayesian classifier with an example.

Laplacian correction is used in Naïve Bayesian classifiers to avoid zero probability values when a feature does not appear in a training class. It adds a small constant to frequency counts, ensuring that no probability becomes zero. This improves classification accuracy, especially when dealing with limited training data.

 

OR

 

b) Explain top-down and bottom-up approaches for hierarchical clustering.

The top-down approach, also known as divisive clustering, starts with all data points in a single cluster and recursively divides them. The bottom-up approach, called agglomerative clustering, begins with each data point as an individual cluster and gradually merges them. Both approaches build a hierarchy of clusters that help identify data structure.

File Size
136.62 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Dance Classes (Bollywood, Hip-Hop, Classical) Near Sohna Road – Learn, Perform & Shine Sohna Road, Gurugram
Home Tuition (All Subjects) Near Sector 88 Gurugram – Personalized Learning for Academic Excellence Sector 88, Gurugram
Music Production (Laptop-Based) Near DLF Golf Course Road – Create, Mix & Release Your Own Music DLF Road, Gurugram
Spoken English Classes Near By CR Park Improve Fluency, Boost Confidence & Unlock Better Opportunities in 2026 Chittaranjan Park, Delhi
App Development Classes Near Uttam Nagar – Build Android & iOS Apps Uttam Nagar, Delhi
IELTS Coaching Near Sector 57 Gurugram – Expert Training for High Band Scores Gurugram Sector 57, Gurugram
Drum Lessons Near Tilak Nagar – Learn Electronic Drums at Home with Confidence Tilak Nagar, Delhi
Social Science Classess Dwarka Mor, Delhi
Spanish Language Classes Near Uttam Nagar – Learn Spanish with Confidence Uttam Nagar, Delhi
Music Production (Laptop-Based) Classes Near Sector 142 Noida – Learn Professional Digital Music Creation Sector 142, Noida
Resume & Interview Coaching Near By Dwarka Mor Build a Professional Resume, Crack Interviews & Secure Your Dream Job Dwarka Mor, Delhi
Yoga Classes Near Hauz Khas Experience Holistic Wellness, Strength & Inner Balance in 2026 Hauz Khas, Delhi
Hindi Classes Near Sector 89 Gurugram – Build Language Skills with Confidence and Clarity Sector 89, Gurugram
Tally / Accounting Software Course Near Sector 64 Gurugram – Build a Strong Career in Accounts & Finance Sector 64, Gurugram
Keyboard / Piano Classes Near DLF Phase 3 – Learn Music with Professional Training DLF Phase 3, Gurugram
Yoga Classes Near Sector 105 Gurugram (Dwarka Expressway) – Transform Your Body & Mind Naturally Gurugram
Meditation Coaching Near Sector 126 Noida – A Complete Guide to Mental Wellness and Inner Peace Sector 126, Noida
Singing / Vocal Training Near Sector 18 Market Area Noida – Learn Music with Professional Vocal Trainers Noida Sector 18, Noida
Digital Marketing Classes Near Noida Sector 96 – Learn Modern Marketing Skills and Build a Successful Career Noida
Computer Basics Course Near By Dwarka Mor – Complete Beginner Training Program Delhi
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020