The Twitter data firehose delivers hundreds of millions of Tweets every day. This data flood comes with many ‘big data’ challenges in terms of both data volumes and velocities. This presentation will focus on tools that help you find your data ‘signal’ of interest, and will include several demos that focus on using Twitter for flood early-warning systems. These demos will highlight the real-time, public broadcast nature of Twitter, examples of real-time firehose filtering, as well as recent Internet of Things (IoT) Twitter integrations.
Report
Share
Report
Share
1 of 36
More Related Content
Floods of Twitter Data - StampedeCon 2016
1. Floods of Twitter Data
Jim Moffitt
Developer Advocate @Twitter
@snowman
StampedeCon 2016
2. Gnip develops the enterprise-grade data APIs and services to
help unlock the power of Twitter data
3. agenda
(6 characters)
• Twitter 101 - Public, mobile, real-
time and big data.
• Finding your signal of interest.
• Twitter, early-warning systems
and IoT.
5. Firehose volume:
• Hundreds of millions of
Tweets every day.
• Billions of Tweets per
month.
Twitter is big data
6. Twitter is big data
Firehose velocity:
• Many thousands of
Tweets per second.
• 20-25K per second
now common during
events.
• Current record: 140K
tweets per second.
8. Twitter is really fast
#StampedeCon OR #realtime OR @snowman OR
(point_radius:[-90.202222 38.624419 1mi] (@StampedeCon OR
#demo OR "big data" OR #BigData))
13. Publisher Data Platform
Sample Analytics Providers
Brands
Acting on Insights from Twitter Data
Providing Analysis
from Twitter Data
Filtering, Enriching
& Distributing
Content
Creating Social
Media Content
TWITTER DATA ECOSYSTEM
17. • Agnostic APIs.
• Most used languages: Java, Python, Ruby, C#, Node.js
• Understanding Twitter metadata and filtering syntax.
• Stream consumer is lightweight and writes Tweets to a queue.
• JSON parser is flexible and tolerant.
• Bending and not breaking.
• Storage.
Client-side details