(SEM VI) THEORY EXAMINATION 2021-22 DATA ANALYTICS

B.Tech General 0 downloads
₹29.01

DATA ANALYTICS (KIT601)

B.Tech Semester VI – Theory Examination (2021–22) 

 

Data Analytics is an interdisciplinary subject that focuses on the systematic analysis of data to extract meaningful patterns, trends, and insights for decision-making. In the modern digital world, massive volumes of data are generated every second through business transactions, social media, sensors, mobile devices, and online platforms. Raw data by itself has limited value, but when analyzed using appropriate statistical, computational, and machine-learning techniques, it becomes a powerful asset for organizations. Data analytics helps in improving efficiency, predicting future outcomes, optimizing processes, and supporting strategic decisions across domains such as business, healthcare, finance, manufacturing, and governance.
 

From the uploaded question paper, it is evident that the syllabus emphasizes data types, data streams, sampling, clustering, classification, decision trees, neural networks, data analytics life cycle, data stream algorithms, association rule mining, PCA, support vector machines, Hive architecture, and R programming. To score well, answers must be written in clear, explanatory paragraphs, showing understanding of concepts, algorithms, and applications rather than brief bullet points.


SECTION A – FUNDAMENTAL CONCEPTS OF DATA ANALYTICS

(Based on Section A, Page-1 of the paper) 

 

The need for data analytics arises from the rapid growth of data and the requirement to convert this data into actionable knowledge. Organizations rely on data analytics to understand customer behavior, improve operational efficiency, detect fraud, predict trends, and gain competitive advantage. Without analytics, large datasets remain underutilized and decision-making becomes intuition-based rather than evidence-based.
 

Data classification refers to categorizing data based on its structure and nature. Data may be structured, semi-structured, or unstructured, and this classification determines the choice of storage systems and analytical techniques.
 

A neural network is a computational model inspired by the structure of the human brain. It consists of interconnected neurons organized into input, hidden, and output layers, and it is widely used for pattern recognition, classification, and prediction tasks.
 

Multivariate analysis involves the examination of multiple variables simultaneously to understand relationships, dependencies, and patterns among them. It is commonly used in marketing analysis, finance, and scientific research.
 

The full form of RTAP is Real-Time Analytics Platform, which is used to analyze streaming data instantly to support time-critical decisions such as fraud detection, stock trading, and network monitoring.
 

The role of sampling in data streams is crucial because data streams are continuous and potentially infinite. Sampling techniques allow efficient approximation and analysis without storing the entire stream.
 

The limited pass algorithm is used in data stream processing where data can be scanned only a limited number of times. Such algorithms are essential for real-time analytics with memory constraints.
 

The principle behind hierarchical clustering is to build a hierarchy of clusters either by successively merging smaller clusters into larger ones or by splitting larger clusters into smaller ones, based on similarity measures.
 

In descriptive statistics, R functions such as mean, median, sd, summary, and var are commonly used to describe the central tendency and dispersion of data.
 

Popular data visualization tools help in graphical representation of data to make insights easily understandable by humans.
 

SECTION B – DATA ANALYTICS MODELS AND ALGORITHMS

(Based on Section B, Page-1) 

 

The process model and computation model for Big Data platforms describe how data is collected, stored, processed, and analyzed. The process model includes data acquisition, preprocessing, storage, analysis, and visualization, while the computation model focuses on distributed processing frameworks that enable scalable analytics.
 

Decision trees are supervised learning models used for classification and prediction. They are easy to interpret and visualize, and they work by recursively splitting data based on attribute values to reach a decision outcome.
 

The architecture of a data stream model includes data sources, stream processing engine, memory management, and query processing modules. This architecture enables continuous analysis of real-time data.
 

The K-means algorithm is a popular clustering technique that partitions data into K clusters by minimizing intra-cluster variance. It works iteratively by assigning data points to the nearest centroid and updating centroids until convergence. Its simplicity and efficiency make it suitable for large datasets.

The difference between NoSQL and RDBMS databases lies in schema flexibility, scalability, and consistency models. NoSQL databases support schema-less design and horizontal scaling, while RDBMS systems emphasize structured schema and strong consistency.


SECTION C – DATA ANALYTICS LIFE CYCLE AND TOOLS

(Based on Section C, Page-1) 

 

The data analytics life cycle consists of multiple phases including problem definition, data collection, data cleaning, exploratory analysis, model building, evaluation, and deployment. Each phase ensures that analytics results are accurate, relevant, and actionable.
 

Modern data analytics tools include platforms for data storage, processing, visualization, and modeling. These tools support large-scale analytics, machine learning, and real-time data processing.


ADVANCED ANALYTICS TECHNIQUES

(Based on Questions 4 & 5, Page-2) 

 

Support Vector Machines (SVM) and kernel methods are powerful supervised learning techniques used for classification and regression. Kernel methods allow SVMs to handle non-linear data by transforming it into higher-dimensional space.
 

Principal Component Analysis (PCA) is a dimensionality reduction technique that transforms correlated variables into a smaller set of uncorrelated principal components, preserving maximum variance.
 

Algorithms for counting distinct elements in a data stream are essential when memory is limited. These algorithms provide approximate counts efficiently.
 

The case study of stock market prediction demonstrates how historical price data, trends, and machine-learning models are used to forecast future market behavior, although predictions remain probabilistic due to market uncertainty.
 

DATA MINING, ASSOCIATION RULES & HIVE

(Based on Questions 6 & 7, Page-2) 

 

The difference between CLIQUE and ProCLUS clustering lies in how subspace clusters are discovered in high-dimensional data. These algorithms address the curse of dimensionality in clustering.
 

The Apriori algorithm is used for mining frequent itemsets and generating association rules. By applying minimum support and confidence thresholds, meaningful relationships among items are discovered.
 

The HIVE architecture enables SQL-like querying on large datasets stored in distributed file systems. It includes components such as query compiler, execution engine, and metastore.
 

Writing an R function demonstrates practical data analytics skills by enabling statistical computation and data processing programmatically.
 

HOW TO WRITE DATA ANALYTICS ANSWERS IN THE EXAM
 

In Data Analytics, never write answers in short bullet points. Always start with a clear explanation of the concept, followed by algorithmic understanding, working principles, and applications. Use correct terminology such as clustering, classification, data streams, PCA, Apriori, and Hive. Examiners give maximum weightage to conceptual clarity, analytical reasoning, and real-world relevance.

File Size
126.46 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Violin Classes Near DLF Phase 5 – Learn Classical & Modern Violin from Expert Teachers DLF Phase V, Gurugram
Data Analytics Training Near Noida Sector 94 – Learn Data Skills and Build a High-Demand Career Noida
Spoken English Classes Near Sector 119 Noida – Improve Your Communication Skills with Expert Training Sector 119, Noida
German Language Classes Near Sector 118 Noida – Learn German with Expert Trainers Noida
German Language Classes Near Central Park 2 – Learn German for Career, Study & Global Opportunities Central Park 2, Gurugram
SEO Training Near Sector 63 Gurugram – Master Search Engine Optimization & Build a High-Growth Career Sector 63, Gurugram
Data Analytics Classes Near Kirti Nagar – Build a Future-Ready Career in Data Kirti Nagar, Delhi
Baking Classes Near Sector 84 Gurugram – Learn Cake & Bakery Skills Professionally Sector 84, Gurugram
Guitar Classes Near By Defence Colony Learn Guitar with Expert Trainers & Turn Your Passion into a Lifelong Skill Defence Colony, Delhi
Meditation Coaching Near Sector 124 Noida – A Complete Guide to Mental Peace and Mindfulness Noida
Spoken English Classes Near By Green Park Build Fluency, Confidence & Professional Communication Skills in 2026 Green Park, Delhi
Yoga Classes (Home or Online) Near Sushant Lok Phase 2 – Improve Health, Flexibility & Peace of Mind Sushant Lok 2, Sector 57, Gurugram
Spoken English Classes Near Sector 117 Noida – Improve Fluency, Confidence and Communication Skills Noida
Violin Classes Near DLF Phase 5 – Learn, Grow & Perform with Confidence DLF Phase V, Gurugram
Spoken English Classes Near By CR Park Improve Fluency, Boost Confidence & Unlock Better Opportunities in 2026 Chittaranjan Park, Delhi
Resume & Interview Coaching Near Sector 102 Gurugram (Dwarka Expressway) – Build Confidence, Crack Interviews, Get Hired Sector 102, Gurugram
Home Tuition (All Subjects) Near Sector 88 Gurugram – Personalized Learning for Academic Excellence Sector 88, Gurugram
Career Counseling Near Sector 100 Dwarka Expressway, Gurugram – Guidance for a Clear & Confident Future Gurugram
Personal Fitness Training Near Palam Vihar – Transform Your Body with Expert Guidance Palam Vihar, Gurugram
Zumba Classes Near Sector 133 Greater Noida – Fun, Fitness and Energy in Every Step Noida
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020