SlideShare a Scribd company logo
BIG DATA
Prepared By :
Hritika Raj
CONTENT
1. Introduction
2. What is Big Data
3. Characteristic of Big Data
4. Storing , selecting and processing of Big Data
5. Why Big Data
6. How it is Different
7. Big Data sources
8. Tools used in Big Data
9. Application of Big Data
10. Risks of Big Data
11. Benefits of Big Data
12. How Big Data Impact on IT
13. Future of Big Data
Introduction
Big Data may well be the Next Big Thing in the IT world.
Big data burst upon the scene in the first decade of the
21st century.
The first organizations to embrace it were online and
startup firms. Firms like Google, eBay, LinkedIn, and
Facebook were built around big data from the beginning.
Like many new information technologies, big data can
bring about dramatic cost reductions, substantial
improvements in the time required to perform a computing
task, or new product and service offerings.
DATA WITH A
LOT OF
INFORMATION.
BIG DATA IS NOT ONLY ABOUT THE SIZE OF THE
DATA,
IT’S ABOUT THE VALUE WITHIN THE DATA.
‘Big Data’ is similar to ‘small data’, but bigger in size
But having data bigger it requires different approaches:
Techniques, tools and architecture
An aim to solve new problems or old problems in a better
way
Big Data generates value from the storage and processing of
very large quantities of digital information that cannot be
analyzed with traditional computing techniques.
WHAT IS BIG DATA?
Charateristics Of Big Data
VOLUME
•Over 90% of all the data in the world was created in the past 2 years.
•Every 2 days we create as much information as we did from the beginning
of time until 2003
.
•Every minute we send 204 million emails, generate 1,8 million Facebook
likes, send 278 thousand Tweets, and up-load 200,000 photos to
Facebook.
• Google alone processes on average over 40,000 search queries per
second, making it over 3.5 billion in a single day.
•The number of Bits of information stored in the digital universe is thought
to have exceeded the number of stars in the physical universe in 2007.
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Exabyte : Blankets west coast states
Zetabyte : Fills the pacific ocean
VELOCITY
 Clickstreams and ad impressions capture user behavior
at millions of events per second
 High-frequency stock trading algorithms reflect market
changes within microseconds
 Machine to machine processes exchange data between
billions of devices
 Infrastructure and sensors generate massive log data in
real-time
 Online gaming systems support millions of concurrent
users, each producing multiple inputs per second.
VARIETY
 Big Data isn't just numbers, dates, and strings.
Big Data is also geospatial data, 3D data, audio
and video, and unstructured text, including log
files and social media.
 Traditional database systems were designed to
address smaller volumes of structured data,
fewer updates or a predictable, consistent data
structure.
 Big Data analysis includes different types of data
VERACITY
 The quality of captured data, which can vary greatly.
Accurate analysis depends on the veracity of source data.
 The term often refers simply to the use of predictive
analytics or other certain advanced methods to extract value
from data, and seldom to a particular size of data set.
 Accuracy in big data may lead to more confident decision
making. And better decisions can mean greater operational
efficiency, cost reduction and reduced risk.
WHY BIG DATA
•FB generates 10TB daily
•Twitter generates 7TB of data
Daily
•It is expected that by 2020 the
amount of digital information in
existence will have grown from
3.2 zettabytes today to 40
zettabytes.
THE STRUCTURE OF BIG DATA
 Structured
• Most traditional data
sources
 Semi-structured
• Many sources of big
data
 Unstructured
• Video data, audio
data
14
BIG DATA SOURCES
Mobile Devices
Readers/Scanners
Science facilities
Microphones
Cameras
Social Media
Programs/ Software
Technologies
& Vendors
1. A/B testing
2. Crowdsourcing
3. Data fusion and integration
4. Genetic algorithms
5. Machine learning
6. Natural language
processing
7. Signal processing
8. Simulation
9. Time series analysis and
visualisation
Example Vendors
IBM – Netezza
EMC – Greenplum
Oracle – Exadata
Application Of Big Data analytics
Homeland
Security
Smarter
Healthcare
Multi-channel
sales
Telecom
Manufacturing
Traffic Control
Trading
Analytics
Search
Quality
RISKS OF BIG DATA
• Will be so overwhelmed
• Need the right people and solve the right
problems
• Costs escalate too fast
• Isn’t necessary to capture 100%
• Many sources of big data
is privacy
• self-regulation
• Legal regulation
18
POTENTIAL VALUE OF BIG DATA
 $300 billion potential
annual value to US health
care.
 $600 billion potential
annual consumer surplus
from using personal
location data.
 60% potential in retailers’
operating margins.
BENEFITS OF BIG DATA
Real-time big data isn’t just a process for storing petabytes or exabytes of
data in a data warehouse, It’s about the ability to make better decisions and
take meaningful actions at the right time.
Fast forward to the present and technologies like Hadoop give you the scale
and flexibility to store data before you know how you are going to process it.
Technologies such as MapReduce,Hive and Impala enable you to run queries
without changing the data structures underneath.
Our newest research finds that organizations are using big data to target
customer-centric outcomes, tap into internal data and build a better
information ecosystem.
Big Data is already an important part of the $64 billion database and data
analytics market
It offers commercial opportunities of a comparable scale to enterprise
software in the late 1980s
LEADING TECHNOLOGY VENDORS
Example Vendors
 IBM – Netezza
 EMC – Greenplum
 Oracle – Exadata
Commonality
• MPP architectures
• Commodity Hardware
• RDBMS based
• Full SQL compliance
FUTURE OF BIG DATA
 $15 billion on software firms only specializing in
data management and analytics.
 This industry on its own is worth more than
$100 billion and growing at almost 10% a year
which is roughly twice as fast as the software
business as a whole.
 In February 2012, the open source analyst firm
Wikibon released the first market forecast for
Big Data , listing $5.1B revenue in 2012 with
growth to $53.4B in 2017
 The McKinsey Global Institute estimates that
data volume is growing 40% per year, and will
grow 44x between 2009 and 2020.
MOST PEOPLE DON’T KNOW
WHAT TO DO WITH ALL THE DATA
THAT THEY ALREADY HAVE…
BIG DATA ISN’T BIG,
IF YOU KNOW HOW TO
USE IT.
THANK
YOU.

More Related Content

What's hot

Big data introduction
Big data introductionBig data introduction
Big data introduction
Chirag Ahuja
 
Big data ppt
Big data pptBig data ppt
Big data ppt
IDBI Bank Ltd.
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Big Data
Big DataBig Data
Big Data
Big DataBig Data
Big Data
Seminar Links
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
Vivek Gautam
 
BIG DATA-Seminar Report
BIG DATA-Seminar ReportBIG DATA-Seminar Report
BIG DATA-Seminar Report
josnapv
 
Big data
Big dataBig data
Big data
Nausheen Hasan
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
SlideTeam
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
Yaman Hajja, Ph.D.
 
Big data
Big dataBig data
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
Bernard Marr
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
Shatavisha Roy Chowdhury
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
Sivashankar Ganapathy
 
Big data analytics and innovation
Big data analytics and innovationBig data analytics and innovation
Big data analytics and innovation
Ahmed Fattah
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
Sadhana Singh
 
Data science
Data scienceData science
Data science
Ranjit Nambisan
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
NAGARAJAGIDDE
 
Data Science
Data ScienceData Science
Data Science
Amit Singh
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Deepika ParthaSarathy
 

What's hot (20)

Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
BIG DATA-Seminar Report
BIG DATA-Seminar ReportBIG DATA-Seminar Report
BIG DATA-Seminar Report
 
Big data
Big dataBig data
Big data
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Big data
Big dataBig data
Big data
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
 
Big data analytics and innovation
Big data analytics and innovationBig data analytics and innovation
Big data analytics and innovation
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
Data science
Data scienceData science
Data science
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
Data Science
Data ScienceData Science
Data Science
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 

Viewers also liked

Big data Ppt
Big data PptBig data Ppt
Big data Ppt
Prashant Navatre
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
Nasrin Hussain
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Shweta Sahu
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
VIKAS KATARE
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Yash Raj
 
Big-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesignersBig-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesigners
AryaTadbir Network Designers
 
Big data
Big dataBig data
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
Prashant Sharma
 
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
Desing Pathshala
 
Big data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosqlBig data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosql
Khanderao Kand
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
Rohit Dubey
 
Big data ppt
Big data pptBig data ppt
Big data ppt
Andrei Lyskov
 
Big Data vs Data Warehousing
Big Data vs Data WarehousingBig Data vs Data Warehousing
Big Data vs Data Warehousing
Thomas Kejser
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data Trends
IMC Institute
 
Big Data in Oil and Gas
Big Data in Oil and GasBig Data in Oil and Gas
Big Data in Oil and Gas
Bjorn Andersson
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
Julianna DeLua
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
Chicago Hadoop Users Group
 
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
Karthikeyan Rajamanickam
 
Big data processing with apache spark
Big data processing with apache sparkBig data processing with apache spark
Big data processing with apache spark
sarith divakar
 
The First Class Integration of Solr with Hadoop
The First Class Integration of Solr with HadoopThe First Class Integration of Solr with Hadoop
The First Class Integration of Solr with Hadoop
lucenerevolution
 

Viewers also liked (20)

Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesignersBig-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesigners
 
Big data
Big dataBig data
Big data
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
 
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
Hadoop Basics - Apache hadoop Bigdata training by Design Pathshala
 
Big data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosqlBig data hadoop ecosystem and nosql
Big data hadoop ecosystem and nosql
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data vs Data Warehousing
Big Data vs Data WarehousingBig Data vs Data Warehousing
Big Data vs Data Warehousing
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data Trends
 
Big Data in Oil and Gas
Big Data in Oil and GasBig Data in Oil and Gas
Big Data in Oil and Gas
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
 
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
 
Big data processing with apache spark
Big data processing with apache sparkBig data processing with apache spark
Big data processing with apache spark
 
The First Class Integration of Solr with Hadoop
The First Class Integration of Solr with HadoopThe First Class Integration of Solr with Hadoop
The First Class Integration of Solr with Hadoop
 

Similar to Big data PPT prepared by Hritika Raj (Shivalik college of engg.)

bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
KammetaJoshna
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
Guduru Lakshmi Kiranmai
 
big-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptxbig-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptx
VaishnavGhadge1
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
Vamshikrishna Goud
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
KARTIKEY TRIPATHI
 
Big data
Big dataBig data
Big data
Mahmudul Alam
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptx
kalai75
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big data
Vedanand Singh
 
big data
big databig data
big data
sai praneeth
 
Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01
nayanbhatia2
 
In memory big data management and processing
In memory big data management and processingIn memory big data management and processing
In memory big data management and processing
Pranav Gontalwar
 
Big_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptxBig_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptx
TanguturiAvinash
 
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docxBIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
tangyechloe
 
Our big data
Our big dataOur big data
Our big data
uthrarajan
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
berasrujana
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
dickonsondorris
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
ParasSundriyal2
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
Aswadmehar
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
Bohitesh Misra, PMP
 
130214 copy
130214   copy130214   copy
130214 copy
Arpit Arora
 

Similar to Big data PPT prepared by Hritika Raj (Shivalik college of engg.) (20)

bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
big-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptxbig-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptx
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
Big data
Big dataBig data
Big data
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptx
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big data
 
big data
big databig data
big data
 
Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01
 
In memory big data management and processing
In memory big data management and processingIn memory big data management and processing
In memory big data management and processing
 
Big_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptxBig_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptx
 
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docxBIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
BIGDATAPrepared ByMuhammad Abrar UddinIntrodu.docx
 
Our big data
Our big dataOur big data
Our big data
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
 
130214 copy
130214   copy130214   copy
130214 copy
 

Recently uploaded

Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
IJAEMSJORNAL
 
Net Zero Case Study: SRK House and SRK Empire
Net Zero Case Study: SRK House and SRK EmpireNet Zero Case Study: SRK House and SRK Empire
Net Zero Case Study: SRK House and SRK Empire
Global Network for Zero
 
LeetCode Database problems solved using PySpark.pdf
LeetCode Database problems solved using PySpark.pdfLeetCode Database problems solved using PySpark.pdf
LeetCode Database problems solved using PySpark.pdf
pavanaroshni1977
 
Lecture 3 Biomass energy...............ppt
Lecture 3 Biomass energy...............pptLecture 3 Biomass energy...............ppt
Lecture 3 Biomass energy...............ppt
RujanTimsina1
 
IS Code SP 23: Handbook on concrete mixes
IS Code SP 23: Handbook  on concrete mixesIS Code SP 23: Handbook  on concrete mixes
IS Code SP 23: Handbook on concrete mixes
Mani Krishna Sarkar
 
IWISS Catalog 2024
IWISS Catalog 2024IWISS Catalog 2024
IWISS Catalog 2024
Iwiss Tools Co.,Ltd
 
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
sanabts249
 
Biology for computer science BBOC407 vtu
Biology for computer science BBOC407 vtuBiology for computer science BBOC407 vtu
Biology for computer science BBOC407 vtu
santoshpatilrao33
 
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.docCCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
Dss
 
Press Tool and It's Primary Components.pdf
Press Tool and It's Primary Components.pdfPress Tool and It's Primary Components.pdf
Press Tool and It's Primary Components.pdf
Tool and Die Tech
 
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdfGUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
ProexportColombia1
 
How to Manage Internal Notes in Odoo 17 POS
How to Manage Internal Notes in Odoo 17 POSHow to Manage Internal Notes in Odoo 17 POS
How to Manage Internal Notes in Odoo 17 POS
Celine George
 
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
YanKing2
 
Development of Chatbot Using AI/ML Technologies
Development of  Chatbot Using AI/ML TechnologiesDevelopment of  Chatbot Using AI/ML Technologies
Development of Chatbot Using AI/ML Technologies
maisnampibarel
 
Social media management system project report.pdf
Social media management system project report.pdfSocial media management system project report.pdf
Social media management system project report.pdf
Kamal Acharya
 
Chlorine and Nitric Acid application, properties, impacts.pptx
Chlorine and Nitric Acid application, properties, impacts.pptxChlorine and Nitric Acid application, properties, impacts.pptx
Chlorine and Nitric Acid application, properties, impacts.pptx
yadavsuyash008
 
Exploring Deep Learning Models for Image Recognition: A Comparative Review
Exploring Deep Learning Models for Image Recognition: A Comparative ReviewExploring Deep Learning Models for Image Recognition: A Comparative Review
Exploring Deep Learning Models for Image Recognition: A Comparative Review
sipij
 
Germany Offshore Wind 010724 RE (1) 2 test.pptx
Germany Offshore Wind 010724 RE (1) 2 test.pptxGermany Offshore Wind 010724 RE (1) 2 test.pptx
Germany Offshore Wind 010724 RE (1) 2 test.pptx
rebecca841358
 
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model SafeRohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
binna singh$A17
 
Rotary Intersection in traffic engineering.pptx
Rotary Intersection in traffic engineering.pptxRotary Intersection in traffic engineering.pptx
Rotary Intersection in traffic engineering.pptx
surekha1287
 

Recently uploaded (20)

Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
Best Practices of Clothing Businesses in Talavera, Nueva Ecija, A Foundation ...
 
Net Zero Case Study: SRK House and SRK Empire
Net Zero Case Study: SRK House and SRK EmpireNet Zero Case Study: SRK House and SRK Empire
Net Zero Case Study: SRK House and SRK Empire
 
LeetCode Database problems solved using PySpark.pdf
LeetCode Database problems solved using PySpark.pdfLeetCode Database problems solved using PySpark.pdf
LeetCode Database problems solved using PySpark.pdf
 
Lecture 3 Biomass energy...............ppt
Lecture 3 Biomass energy...............pptLecture 3 Biomass energy...............ppt
Lecture 3 Biomass energy...............ppt
 
IS Code SP 23: Handbook on concrete mixes
IS Code SP 23: Handbook  on concrete mixesIS Code SP 23: Handbook  on concrete mixes
IS Code SP 23: Handbook on concrete mixes
 
IWISS Catalog 2024
IWISS Catalog 2024IWISS Catalog 2024
IWISS Catalog 2024
 
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
21CV61- Module 3 (CONSTRUCTION MANAGEMENT AND ENTREPRENEURSHIP.pptx
 
Biology for computer science BBOC407 vtu
Biology for computer science BBOC407 vtuBiology for computer science BBOC407 vtu
Biology for computer science BBOC407 vtu
 
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.docCCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
CCS367-STORAGE TECHNOLOGIES QUESTION BANK.doc
 
Press Tool and It's Primary Components.pdf
Press Tool and It's Primary Components.pdfPress Tool and It's Primary Components.pdf
Press Tool and It's Primary Components.pdf
 
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdfGUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
GUIA_LEGAL_CHAPTER-9_COLOMBIAN ELECTRICITY (1).pdf
 
How to Manage Internal Notes in Odoo 17 POS
How to Manage Internal Notes in Odoo 17 POSHow to Manage Internal Notes in Odoo 17 POS
How to Manage Internal Notes in Odoo 17 POS
 
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large...
 
Development of Chatbot Using AI/ML Technologies
Development of  Chatbot Using AI/ML TechnologiesDevelopment of  Chatbot Using AI/ML Technologies
Development of Chatbot Using AI/ML Technologies
 
Social media management system project report.pdf
Social media management system project report.pdfSocial media management system project report.pdf
Social media management system project report.pdf
 
Chlorine and Nitric Acid application, properties, impacts.pptx
Chlorine and Nitric Acid application, properties, impacts.pptxChlorine and Nitric Acid application, properties, impacts.pptx
Chlorine and Nitric Acid application, properties, impacts.pptx
 
Exploring Deep Learning Models for Image Recognition: A Comparative Review
Exploring Deep Learning Models for Image Recognition: A Comparative ReviewExploring Deep Learning Models for Image Recognition: A Comparative Review
Exploring Deep Learning Models for Image Recognition: A Comparative Review
 
Germany Offshore Wind 010724 RE (1) 2 test.pptx
Germany Offshore Wind 010724 RE (1) 2 test.pptxGermany Offshore Wind 010724 RE (1) 2 test.pptx
Germany Offshore Wind 010724 RE (1) 2 test.pptx
 
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model SafeRohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Rohini @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
 
Rotary Intersection in traffic engineering.pptx
Rotary Intersection in traffic engineering.pptxRotary Intersection in traffic engineering.pptx
Rotary Intersection in traffic engineering.pptx
 

Big data PPT prepared by Hritika Raj (Shivalik college of engg.)

  • 1. BIG DATA Prepared By : Hritika Raj
  • 2. CONTENT 1. Introduction 2. What is Big Data 3. Characteristic of Big Data 4. Storing , selecting and processing of Big Data 5. Why Big Data 6. How it is Different 7. Big Data sources 8. Tools used in Big Data 9. Application of Big Data 10. Risks of Big Data 11. Benefits of Big Data 12. How Big Data Impact on IT 13. Future of Big Data
  • 3. Introduction Big Data may well be the Next Big Thing in the IT world. Big data burst upon the scene in the first decade of the 21st century. The first organizations to embrace it were online and startup firms. Firms like Google, eBay, LinkedIn, and Facebook were built around big data from the beginning. Like many new information technologies, big data can bring about dramatic cost reductions, substantial improvements in the time required to perform a computing task, or new product and service offerings.
  • 4. DATA WITH A LOT OF INFORMATION.
  • 5. BIG DATA IS NOT ONLY ABOUT THE SIZE OF THE DATA, IT’S ABOUT THE VALUE WITHIN THE DATA.
  • 6. ‘Big Data’ is similar to ‘small data’, but bigger in size But having data bigger it requires different approaches: Techniques, tools and architecture An aim to solve new problems or old problems in a better way Big Data generates value from the storage and processing of very large quantities of digital information that cannot be analyzed with traditional computing techniques. WHAT IS BIG DATA?
  • 8. VOLUME •Over 90% of all the data in the world was created in the past 2 years. •Every 2 days we create as much information as we did from the beginning of time until 2003 . •Every minute we send 204 million emails, generate 1,8 million Facebook likes, send 278 thousand Tweets, and up-load 200,000 photos to Facebook. • Google alone processes on average over 40,000 search queries per second, making it over 3.5 billion in a single day. •The number of Bits of information stored in the digital universe is thought to have exceeded the number of stars in the physical universe in 2007.
  • 9. Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Exabyte : Blankets west coast states Zetabyte : Fills the pacific ocean
  • 10. VELOCITY  Clickstreams and ad impressions capture user behavior at millions of events per second  High-frequency stock trading algorithms reflect market changes within microseconds  Machine to machine processes exchange data between billions of devices  Infrastructure and sensors generate massive log data in real-time  Online gaming systems support millions of concurrent users, each producing multiple inputs per second.
  • 11. VARIETY  Big Data isn't just numbers, dates, and strings. Big Data is also geospatial data, 3D data, audio and video, and unstructured text, including log files and social media.  Traditional database systems were designed to address smaller volumes of structured data, fewer updates or a predictable, consistent data structure.  Big Data analysis includes different types of data
  • 12. VERACITY  The quality of captured data, which can vary greatly. Accurate analysis depends on the veracity of source data.  The term often refers simply to the use of predictive analytics or other certain advanced methods to extract value from data, and seldom to a particular size of data set.  Accuracy in big data may lead to more confident decision making. And better decisions can mean greater operational efficiency, cost reduction and reduced risk.
  • 13. WHY BIG DATA •FB generates 10TB daily •Twitter generates 7TB of data Daily •It is expected that by 2020 the amount of digital information in existence will have grown from 3.2 zettabytes today to 40 zettabytes.
  • 14. THE STRUCTURE OF BIG DATA  Structured • Most traditional data sources  Semi-structured • Many sources of big data  Unstructured • Video data, audio data 14
  • 15. BIG DATA SOURCES Mobile Devices Readers/Scanners Science facilities Microphones Cameras Social Media Programs/ Software
  • 16. Technologies & Vendors 1. A/B testing 2. Crowdsourcing 3. Data fusion and integration 4. Genetic algorithms 5. Machine learning 6. Natural language processing 7. Signal processing 8. Simulation 9. Time series analysis and visualisation Example Vendors IBM – Netezza EMC – Greenplum Oracle – Exadata
  • 17. Application Of Big Data analytics Homeland Security Smarter Healthcare Multi-channel sales Telecom Manufacturing Traffic Control Trading Analytics Search Quality
  • 18. RISKS OF BIG DATA • Will be so overwhelmed • Need the right people and solve the right problems • Costs escalate too fast • Isn’t necessary to capture 100% • Many sources of big data is privacy • self-regulation • Legal regulation 18
  • 19. POTENTIAL VALUE OF BIG DATA  $300 billion potential annual value to US health care.  $600 billion potential annual consumer surplus from using personal location data.  60% potential in retailers’ operating margins.
  • 20. BENEFITS OF BIG DATA Real-time big data isn’t just a process for storing petabytes or exabytes of data in a data warehouse, It’s about the ability to make better decisions and take meaningful actions at the right time. Fast forward to the present and technologies like Hadoop give you the scale and flexibility to store data before you know how you are going to process it. Technologies such as MapReduce,Hive and Impala enable you to run queries without changing the data structures underneath. Our newest research finds that organizations are using big data to target customer-centric outcomes, tap into internal data and build a better information ecosystem. Big Data is already an important part of the $64 billion database and data analytics market It offers commercial opportunities of a comparable scale to enterprise software in the late 1980s
  • 21. LEADING TECHNOLOGY VENDORS Example Vendors  IBM – Netezza  EMC – Greenplum  Oracle – Exadata Commonality • MPP architectures • Commodity Hardware • RDBMS based • Full SQL compliance
  • 22. FUTURE OF BIG DATA  $15 billion on software firms only specializing in data management and analytics.  This industry on its own is worth more than $100 billion and growing at almost 10% a year which is roughly twice as fast as the software business as a whole.  In February 2012, the open source analyst firm Wikibon released the first market forecast for Big Data , listing $5.1B revenue in 2012 with growth to $53.4B in 2017  The McKinsey Global Institute estimates that data volume is growing 40% per year, and will grow 44x between 2009 and 2020.
  • 23. MOST PEOPLE DON’T KNOW WHAT TO DO WITH ALL THE DATA THAT THEY ALREADY HAVE…
  • 24. BIG DATA ISN’T BIG, IF YOU KNOW HOW TO USE IT.

Editor's Notes

  1. Acco.to IBM
  2. Quote practical examples