(SEM VI) THEORY EXAMINATION 2022-23 BIG DATA AND ANALYTICS

B.Tech General 0 downloads
₹28.92

BIG DATA AND ANALYTICS – KDS-601

Section-wise Important Questions & Ready Answers


SECTION A

(Attempt all questions – 2 marks each)


(a) Different Kinds of Digital Data

Digital data can be classified as structured data, semi-structured data, and unstructured data. Structured data is organized in tabular form like databases, semi-structured data includes XML and JSON files, while unstructured data includes images, videos, audio files, emails, and social media content.


(b) Drivers of Big Data

The major drivers of Big Data include the rapid growth of social media, mobile devices, IoT sensors, cloud computing, digital transactions, and the need for real-time analytics. These factors generate massive 

volumes of diverse and fast-moving data.


(c) Importance of Hadoop Data Format

Hadoop data format is important because it enables efficient storage and processing of large datasets. Hadoop supports formats like Text, SequenceFile, Avro, and Parquet, which improve compression, performance, and compatibility with MapReduce and other ecosystem tools.


(d) Distributed File System

A distributed file system stores data across multiple machines while appearing as a single logical system to users. It provides scalability, fault tolerance, and high availability by distributing data blocks across nodes.


(e) Working of File System

A file system manages how data is stored, retrieved, and organized on storage devices. It handles file naming, access control, data allocation, and metadata management to ensure efficient and secure data access.


(f) Use of Data Replication

Data replication creates multiple copies of data across different nodes. It improves fault tolerance, data availability, and reliability by ensuring data remains accessible even if a node fails.


(g) Need of Scheduler in Hadoop

A scheduler is required in Hadoop to allocate cluster resources efficiently among multiple jobs. It ensures fairness, optimal resource utilization, and balanced workload execution across nodes.


(h) Data Types Used in MongoDB

MongoDB supports data types such as String, Integer, Boolean, Double, Array, Object, Date, ObjectId, and Binary data, enabling flexible schema-less storage.


(i) Applications of Big Data Using Pig

Pig is used for data cleansing, transformation, aggregation, and analysis of large datasets. It is widely applied in log analysis, ETL processes, customer behavior analysis, and recommendation systems.


(j) Data Processing Operators Used in Pig

Pig operators include LOAD, FILTER, GROUP, FOREACH, JOIN, ORDER, DISTINCT, UNION, and STORE, which help in performing complex data transformations easily.


SECTION B

(Attempt any three – 10 marks each)


2(a) Overcoming Challenges of Conventional Data Analysis Systems

Conventional systems fail due to limited scalability, high cost, and inability to process unstructured data. These challenges are overcome using distributed computing, parallel processing, cloud infrastructure, and Big Data frameworks like Hadoop and Spark, which enable scalable and cost-effective analytics.


2(b) Hadoop Ecosystem – Concept and Architecture

The Hadoop ecosystem consists of HDFS for storage, MapReduce for processing, YARN for resource management, and tools like Hive, Pig, HBase, Sqoop, Flume, and Oozie. Together, they support data ingestion, storage, processing, and analysis of large datasets.
(In exam, a neat labeled architecture diagram is expected.)


2(c) HDFS Monitoring and Maintenance Process

HDFS monitoring involves checking disk usage, node health, and block replication using tools like NameNode UI and logs. Maintenance includes balancing data, replacing failed nodes, repairing corrupted blocks, and ensuring optimal replication for reliability.


2(d) New Features in Hadoop 2.0

Hadoop 2.0 introduced YARN for better resource management, improved scalability, support for multiple processing models, enhanced fault tolerance, and better performance compared to Hadoop 1.x.


2(e) Apache Hive Installation and Architecture

Hive is installed on Hadoop to enable SQL-like querying using HiveQL. Its architecture includes user interface, driver, compiler, optimizer, execution engine, and metastore. Hive translates queries into MapReduce jobs for execution on HDFS.


SECTION C


3(a) Big Data Architecture and Characteristics

Big Data architecture includes data sources, data ingestion, storage layer, processing layer, analytics layer, and visualization. Its key characteristics are Volume, Velocity, Variety, Veracity, and Value, which define the nature and complexity of Big Data systems.


3(b) Big Data Security, Protection, and Auditing Features

Big Data security includes authentication, authorization, encryption, data masking, and auditing. Tools like Kerberos, Ranger, and Knox ensure secure access, data protection, and compliance monitoring.

File Size
36.28 KB
Uploader
SuGanta International
⭐ Elite Educators Network

Meet Our Exceptional Teachers

Discover passionate educators who inspire, motivate, and transform learning experiences with their expertise and dedication

KISHAN KUMAR DUBEY

KISHAN KUMAR DUBEY

Sant Ravidas Nagar Bhadohi, Uttar Pradesh , Babusarai Market , 221314
5 Years
Years
₹10000+
Monthly
₹201-300
Per Hour

This is Kishan Kumar Dubey. I have done my schooling from CBSE, graduation from CSJMU, post graduati...

Swethavyas bakka

Swethavyas bakka

Hyderabad, Telangana , 500044
10 Years
Years
₹10000+
Monthly
₹501-600
Per Hour

I have 10+ years of experience in teaching maths physics and chemistry for 10th 11th 12th and interm...

Vijaya Lakshmi

Vijaya Lakshmi

Hyderabad, Telangana , New Nallakunta , 500044
30+ Years
Years
₹9001-10000
Monthly
₹501-600
Per Hour

I am an experienced teacher ,worked with many reputed institutions Mount Carmel Convent , Chandrapu...

Shifna sherin F

Shifna sherin F

Gudalur, Tamilnadu , Gudalur , 643212
5 Years
Years
₹6001-7000
Monthly
₹401-500
Per Hour

Hi, I’m Shifna Sherin! I believe that every student has the potential to excel in Math with the righ...

Divyank Gautam

Divyank Gautam

Pune, Maharashtra , Kothrud , 411052
3 Years
Years
Not Specified
Monthly
Not Specified
Per Hour

An IIT graduate having 8 years of experience teaching Maths. Passionate to understand student proble...

Explore Tutors In Your Location

Discover expert tutors in popular areas across India

Spoken English Classes Near By Chhatarpur Improve Fluency, Build Confidence & Unlock Career Opportunities in 2026 Chhatarpur, Delhi
Voice-over Training Classes Near By Saket – Build a Powerful & Professional Voice Saket, Delhi
German Language Classes Near Golf Course Road – Learn German for Career & Study Abroad Golf Course Road, Gurugram
Home Tuition (All Subjects) Near Sector 88 Gurugram – Personalized Learning for Academic Excellence Sector 88, Gurugram
Guitar Classes Near South Extension – Professional Guitar Training in South Delhi South Extension, Delhi
Science Classes Near By Dwarka Mor – Build Strong Concepts in Physics, Chemistry & Biology Dwarka Mor, Delhi
Violin Classes Near by Gurugram – Learn, Perform & Master the Art of Strings Gurugram
Music Theory & Composition Near DLF Cyber City – Master the Language of Music DLF Cyber City, Gurugram
Spoken English Classes Near Central Park 1 – Improve Confidence and Communication Skills Central Park 2, Gurugram
Digital Marketing Classes Near Noida Sector 98 – Learn Modern Marketing Skills and Build a Successful Career Expressway, Sector 98, Noida, Noida
Graphic Designing Classes Near Noida Sector 99 – Learn Creative Design and Build a Successful Career Noida
Legal Documentation Assistance Near Sector 102A Gurugram (Dwarka Expressway) – Reliable, Professional & Hassle-Free Services Village Dhankot, Sector 102, Gurugram
Drum Lessons Near DLF Phase 4 – Learn Drumming with Electronic Drum Training at Home DLF Phase IV, Gurugram
Prenatal Yoga Training Near Vatika City – Safe & Healthy Pregnancy Wellness Vatika City, Gurugram
Spanish Language Classes Near Uttam Nagar – Learn Spanish with Confidence Uttam Nagar, Delhi
Fashion Designing Classes Near By Dwarka Mor – Turn Your Creativity into a Stylish Career Dwarka Mor, Delhi
Personality Development Classes Near Sector 56 Gurugram – Build Confidence, Communication & Professional Success Sector 56, Gurugram
Painting Classes Near By Dwarka Mor – Discover the Artist Within You Dwarka Mor, Delhi
Candle Making Classes Near Sector 83 Gurugram – Learn the Art of Handmade Candles Gurugram
App Development Classes Near Noida Sector 100 – Learn Mobile App Development and Start Your Tech Career Sector 100, Noida
⭐ Premium Institute Network

Discover Elite Educational Institutes

Connect with top-tier educational institutions offering world-class learning experiences, expert faculty, and innovative teaching methodologies

Réussi Academy of languages

sugandha mishra

Réussi Academy of languages
Madhya pradesh, Indore, G...

Details

Coaching Center
Private
Est. 2021-Present

Sugandha Mishra is the Founder Director of Réussi Academy of Languages, a premie...

IGS Institute

Pranav Shivhare

IGS Institute
Uttar Pradesh, Noida, Sec...

Details

Coaching Center
Private
Est. 2011-2020

Institute For Government Services

Krishna home tutor

Krishna Home tutor

Krishna home tutor
New Delhi, New Delhi, 110...

Details

School
Private
Est. 2001-2010

Krishna home tutor provide tutors for all subjects & classes since 2001

Edustunt Tuition Centre

Lakhwinder Singh

Edustunt Tuition Centre
Punjab, Hoshiarpur, 14453...

Details

Coaching Center
Private
Est. 2021-Present
Great success tuition & tutor

Ginni Sahdev

Great success tuition & tutor
Delhi, Delhi, Raja park,...

Details

Coaching Center
Private
Est. 2011-2020