(SEM V) THEORY EXAMINATION 2024-25 DATA ANALYTICS

B.Tech Engineering 0 downloads
₹29.00

Subject Code: BCS052
Maximum Marks: 70
Time: 3 Hours
Paper ID: 310908

Question Paper Overview

SECTION A (2 × 7 = 14 Marks)

(Short Answer / Conceptual Questions)

a. Differentiate between Predictive and Prescriptive Data Analytics.
b. Define the term Data Lake, Database, and Data Warehouse.
c. Explain the concept of Outliers.
d. Describe the concept of Lasso Regression.
e. Differentiate between Stream Processing and Traditional Data Processing.
f. Write the two limitations of K-Means.
g. Discuss the various categories of clustering techniques.

SECTION B (Attempt any three × 7 = 21 Marks)

a. Explain the different categories of data analytics with examples.
b. Explore PCA (Principal Component Analysis).

Given data = {4, 8, 13, 7; 11, 4, 5, 14}.

Compute the principal components and reduce dimension from 2D to 1D.
c. Explain Market Basket Analysis.

Is it supervised or unsupervised?

How can a company use it to improve marketing strategies?
d. Differentiate between CLIQUE and ProCLUS clustering algorithms.
e. Differentiate between NoSQL and Relational Databases.

Identify when to use NoSQL instead of a Relational Database, with an example.

SECTION C (Attempt one part from each question × 7 = 35 Marks)

Q3

(a) Differentiate between Structured, Semi-Structured, and Unstructured Data.
OR
(b) Describe Big Data and its characteristics.

Q4

(a) Differentiate between Neural Network and Artificial Neural Network.
OR
(b) Given two fuzzy sets:

A = {(10, 0.2), (20, 0.4), (25, 0.7), (30, 0.9), (40, 1), (50, 0.4)}

B = {(10, 0.4), (20, 0.1), (25, 0.9), (30, 0.2), (40, 0.6), (50, 0.6)}

Apply Union, Intersection, Complement, Bold Union, and Bold Intersection operations.

Q5

(a) Apply the Flajolet-Martin Algorithm on the data stream:

S = 1, 3, 2, 1, 2, 3, 4, 3, 1, 2, 3, 1

Given: h(x) = (6x + 1) mod 5

Identify unique elements in the stream.
OR
(b) Discuss the concept of filtering in Data Stream Processing and explain Bloom Filtering in detail.

Q6

(a) Cluster the following eight points into three clusters using K-Means Algorithm:

A₁(2,10), A₂(2,5), A₃(8,4), A₄(5,8), A₅(7,5), A₆(6,4), A₇(1,2), A₈(4,9)

Initial centers: A₁(2,10), A₄(5,8), A₇(1,2)

Distance function:

  • P(a,b)=∣x2−x1∣+∣y2−y1∣P(a,b) = |x₂ - x₁| + |y₂ - y₁|P(a,b)=∣x2​−x1​∣+∣y2​−y1​∣

Find the final cluster centers.
OR
(b) A transaction database has 6 transactions with Support = 50%, Confidence = 60%:

TIDItems Bought
10Beer, Nuts, Diaper
20Beer, Coffee, Diaper
30Beer, Diaper, Eggs
40Nuts, Eggs, Milk
50Nuts, Coffee, Diaper, Eggs, Milk
60Beer, Nuts, Diaper

i) Use Apriori Algorithm to find frequent itemsets.
ii) Show all strong association rules (with support & confidence).

Q7

(a) Brief about the main components of MapReduce.
OR
(b) Draw and explain the architecture of HIVE with its features.

Key Topics for Revision

1. Categories of Data Analytics

TypeDescriptionExample
DescriptiveSummarizes past dataMonthly sales reports
DiagnosticExplains reasons behind trendsRoot cause analysis
PredictiveForecasts future trendsPredicting customer churn
PrescriptiveSuggests optimal actionsRecommending marketing offers

2. Data Storage Concepts

Database: Structured, transactional data (SQL). Data Warehouse: Historical, analytical storage (OLAP).

Data Lake: Raw, unstructured storage (Hadoop, AWS S3).

3. Outliers

Data points that deviate significantly from others.       Detected using:

Z-score,                                                                          IQR (Interquartile Range),

Visualization (Box Plot).

4. Lasso Regression

Regularized regression using L1 penalty.

Shrinks coefficients to zero → performs feature selection.

5. Stream Processing vs Traditional Processing

Stream ProcessingTraditional Processing
Real-time data flowBatch data
Frameworks: Apache Flink, KafkaHadoop, Spark
Example: IoT sensor dataDaily transaction logs

6. PCA (Principal Component Analysis)

Used for dimensionality reduction.

Steps:

Standardize data.                                                  Compute covariance matrix.

Calculate eigenvalues & eigenvectors.                 Project data onto principal components.

7. Market Basket Analysis

Unsupervised learning (association rule mining).

Uses Apriori Algorithm:               Finds frequent itemsets, e.g., “Beer → Diaper.”

Applications:                                  Retail recommendations, cross-selling, layout optimization.

8. Clustering

Partitioning methods: K-Means, K-Medoids.          Hierarchical methods: Agglomerative, Divisive.

Density-based: DBSCAN, OPTICS.                            Grid-based: CLIQUE, STING.

9. NoSQL vs Relational Database

FeatureRelationalNoSQL
SchemaFixedFlexible
ScalingVerticalHorizontal
Use CaseBankingSocial media, IoT
ExampleMySQLMongoDB, Cassandra

10. Big Data Characteristics (5Vs)

Volume: Massive data size.                                 Velocity: Fast data generation.

Variety: Structured, semi/unstructured.              Veracity: Data accuracy.

Value: Extracting useful insights.

11. Flajolet–Martin Algorithm

Estimates number of distinct elements in data streams using hash functions.

Efficient for large-scale streaming data.

12. Bloom Filtering

Probabilistic data structure for membership testing.

Space-efficient but allows false positives.

Used in caching, networking, and databases.

13. Apriori Algorithm

Step 1: Generate frequent itemsets using support.   Step 2: Generate strong association rules using confidence.

Example:

Support(A→B) = freq(A∪B) / total transactions         Confidence(A→B) = freq(A∪B) / freq(A)

14. K-Means Clustering

Iterative algorithm that partitions data into k clusters.

Limitations:

Sensitive to initial centroids.             Assumes spherical clusters.

15. MapReduce Components

Map Phase: Input split → key-value pairs.

Shuffle & Sort: Group similar keys.

Reduce Phase: Aggregate output.

16. HIVE Architecture

Built on top of Hadoop for data querying (SQL-like interface).

Components:

Driver: Compiles queries.

Metastore: Stores schema.

Execution Engine: Converts queries to MapReduce.

HiveQL: SQL-based query language.

File Size
147.89 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Baking Classes Near Sector 84 Gurugram – Learn Cake & Bakery Skills Professionally Sector 84, Gurugram
Stenography Classes Near Sector 93 Gurugram – Build Speed, Accuracy & Secure Government Career Opportunities Sector 93, Gurugram
Graphic Designing Classes Near Noida Sector 99 – Learn Creative Design and Build a Successful Career Noida
App Development Classes Near Noida Sector 100 – Learn Mobile App Development and Start Your Tech Career Sector 100, Noida
Piano Classes Near Tilak Nagar – Learn, Play & Master Music with Confidenc Tilak Nagar, Delhi
Fashion Designing Course Near Sector 81 Gurugram – Turn Your Creativity into a Successful Career Sector 81, Gurugram
Digital Marketing Course Near Sector 62 Gurugram – Master Online Growth & Build a High-Demand Career Sector 62, Gurugram
Maths Coaching Near By Dwarka Mor – Build Strong Concepts & Score Higher Dwarka Mor, Delhi
No Office Rent Business Setup Near By Uttam Nagar Start & Grow Your Business Without Paying High Office Rent in 2026 Uttam Nagar, Delhi
Public Speaking Training Near Sector 108 Noida – Build Confidence and Communication Skills Noida
Guitar Classes Near Central Noida Sector 1 – Learn Guitar with Expert Trainers Noida
Geography Coaching Classes Near By Dwarka Mor Build Strong Conceptual Understanding & Score High in Board Exams Dwarka Mor, Delhi
Spanish Language Classes Near Sector 113 Noida – Learn Spanish with Professional Training Noida
Guitar Classes Near Mehrauli – Professional Guitar Training in South Delhi Mehrauli, Delhi
Spoken English Classes Near By Saket Improve Fluency, Confidence & Career Opportunities with Expert Training in 2026 Saket, Delhi
Diet & Nutrition Consultation Near Malibu Town – Personalized Guidance for a Healthy Lifestyle Malibu Town, Gurugram
Diet & Nutrition Consultation Near Sector 125 Noida – Your Complete Guide to Healthy Living Sector 125, Noida
🇪🇸 Spanish Language Classes Near Golf Course Road – Learn Spanish for Global Communication Golf Course Road, Gurugram
Guitar Classes Near Chhatarpur – Professional Guitar Training in South Delhi Chhatarpur, Delhi
Guitar Classes Near DLF Phase 1 – Learn Guitar from Expert Teachers DLF Phase I, Gurugram
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020