SlideShare a Scribd company logo
David De Roure
 @dder


New Forms of Data:
New Computational Needs
DIRECTOR, UNIVERSITY OF OXFORD E-RESEARCH CENTRE
http://zubitubi.blogspot.co.uk/2016/03/school-funny-picture-prank-memories.html
TORCH	DH	
Zooniverse	
IoT															
Engineering	
Cyber	
Linguis7cs	
English	
Oxford	
Mar7n	
School	
Saïd	
ARC	 IT	Services	
ECI	
Geography	
SKA	
CUDA	
Physics	
Computer	Science	
Maths	
History	
Oxford	
Internet	
Ins7tute	
Music	
Archaeology	
Classics	
Zoology	
Museums	
Wolfson	
Law	
Bodleian	Libraries	
Pharmacology	
Biochemistry	
Social
Sciences
Mathematical,
Physical &
Life Sciences
Humanities
Medical
Sciences
Colleges
GLAM
More	people	
More	machines	
HPC	
	
	
Conven7onal	
Computa7on		
Social	
Networks	
Science	2.0	
Internet	of	Things	
Big	
Data	 AI	
Computa7onal	
Linguis7cs,	
Musicology,	
Social	Science,	
Humani7es	
Ci7zen	Science	
Volunteer	
Compu7ng	
SKA	
Visualiza7on	
Bio	Data	sharing	
Machine	
Learning	
Scien7fic	
Compu7ng	
Cybersecurity
Energy Efficient
Computing
Infrastructure
(STFC)
De-identified admin
(inc. health) data
Business
data
Open data
(public sector)
Social media
data
Research
data
Longitudinal
survey data
Open data
Securely held data
Environment
data
Business and LG Data
Research Centres
(ESRC)
Admin Data Research
Centres (ESRC)
High Performance
Data Environment
(NERC)
Clinical
data
Medical Bioinformatics (MRC)
Understanding Populations
(ESRC)
Clinical Practice Datalink
(MHRA, NIHR)
100,000 Genome Project NHS)
Research Data Facility (EPSRC)
European Bioinformatics
Institute (EMBL)
Bioscience E-Infrastructure
(BBSRC)
Square Kilometre Array (STFC)
Digital Transformations
(AHRC)
Archive
data
Open Data
Institute
Commercial
Research
Understanding Populations
(ESRC)
New Research Questions
▶ Social media data offers
the possibility of studying
social processes as they
unfold at the level of
populations, as an
alternative to traditional
surveys or interviews.
▶ The data from social media
is described as "qualitative
data on a quantitative
scale" and requires
innovative analysis
techniques.
Social media data and
real time analytics
https://twitter.com/CR_UK/status/446223117841494016/
Some people's smartphones
had autocorrected the word
"BEAT" to instead read
"BEAR".
"Thank you for choosing an
adorable polar bear," the
reply from the WWF said.
"We will call you today to set
up your adoption."
http://www.bbc.com/news/technology-26723457
http://www.parliament.uk/business/committees/committees-a-z/commons-select/science-and-technology-committee/news/report-responsible-use-of-data/
theODI.org
Social Media Triangle
social media
data and
analytics
social media for
engagement with
research
social media
as a subject
of research
Sam McGregor
Tweets per day
http://www.internetlivestats.com/twitter-statistics/
New Forms of Data
▶ Internet data, derived from social
media and other online interactions
(including data gathered by
connected people and devices, eg
mobile devices, wearable
technology, Internet of Things)
▶ Tracking data, monitoring the
movement of people and objects
(including GPS/geolocation data,
traffic and other transport sensor
data, CCTV images etc)
▶ Satellite and aerial imagery (eg
Google Earth, Landsat, infrared,
radar mapping etc) http://www.oecd.org/sti/sci-tech/new-data-for-
understanding-the-human-condition.htm
New Forms of Data Centre for Doctoral Training
Much of the value of ‘new forms of data’ lie in the
potential for them to be analysed in near real-time,
which presents opportunities for revealing
phenomena as they unfold, enabling timely response
with immediate influence.
Such analysis brings distinct new computational
requirements, requires new skills, and makes new
demands on the ease of use and capability of the
national e-Infrastructure.
http://www.esrc.ac.uk/funding-and-guidance/postgraduates/dtc/
dtc-policy/commissioning-of-centres-for-doctoral-training.aspx
A rehearsal for the future
▶  The Internet of Things
describes a world in which
everyday objects are
connected to a network so that
data can be shared
▶  But it is really as much about
people as the inanimate object
▶  It is impossible to anticipate
all the social changes that
could be created by connecting
billions of devices
https://www.gov.uk/government/publications/internet-of-things-blackett-review
There is no such thing as the Internet of Things
There is no such thing as a closed system
Humans are creative and subversive
The Rise of the Bots A Swarm of Drones
Accidents happen (in the lab, bin)
Holding machines to account Software vulnerability
Where are the throttle points?
@dder
PETRAS Privacy, Ethics, Trust, Reliability, Acceptability, and Security
for the Internet of Things
•  The fusion of the cyber, physical and human elements
•  Scale: from 1mm3 devices to large infrastructure systems
•  Managing devices throughout their (decades long) lifetimes
•  New and evolving threat landscape
•  Continue to operate when partially compromised
The Challenges are numerous
•  Safety vs Security
•  Security vs Efficiency
•  Hardening vs Adaptive Response
Tradeoffs
New Data `New Computation
Data Detect Store AnalyticsFilter Analysts
Real life is and must be full of all kinds of social
constraint – the very processes from which society
arises. Computers can help if we use them to create
abstract social machines on the Web: processes in
which the people do the creative work and the
machine does the administration... The stage is set
for an evolutionary growth of new social engines.
The ability to create new forms of social process
would be given to the world at large, and
development would be rapid.
Berners-Lee, Weaving the Web, 1999 (pp. 172–175)
Social Machines
The Macroscope
Observer of
one social
machine
Observers using third
party observatory
Observer of
multiple social
machines
Human
participants in
Social
Machine
Human participants in
multiple Social Machines
Observer of Social
Machine infrastructure
1	
4	
2	
3	
5	
6	
SM
SM
SM
Social Machine
Observing Social
Machines
7	
@dder
De Roure, D.,
Hooper, C., Page,
K., Tarte, S., and
Willcox, P. 2015.
Observing Social
Machines Part 2:
How to Observe?
ACM Web Science
New Data `New Computation
Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research
Challenges. Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
A computationally-enabled
sense-making network of
expertise, data, software,
models and narratives
Big Data, in a
Big Data Centre
The	R	Dimensions	
reproducible	repeatable	replicable	reusable	
referenceable	retrievable	reviewable	rerunnable	
replayable	re-interpretable	reprocessable	
recomposable	reconstructable	repurposable	
reliable	respec]ul	reputable	revealable	
recoverable	restorable	reparable	refreshable	
@dder 14 April 2014
sci	method	
access	
understand	
new	use	
social	
cura7on	
De Roure, D. 2014. The future
of scholarly communications.
Insights: the UKSG journal,
27, (3), 233-238.
DOI 10.1629/2048-7754.171
Human	Digital	Physical	Triangle	
human
digital physical
social
media
IoT
automation
and scale
New Data New Computation
▶ From Large Hadron Collider to Large People Collider
▶ New forms of data, new social processes
▶ Internet of Things is a thing
▶ Growing requirement for realtime data fusion and
analytics
▶ Increasing automation: growth of machine learning
▶ Fourth quadrant is underpinned by emerging
computational capability
David De Roure
david.deroure@oerc.ox.ac.uk
Thanks to Christine Borgman, Emil Lupu, Sam McGregor,
Nigel Shadbolt, Vonu Thakuriah. 
http://www.slideshare.net/davidderoure/new-data-new-computation

More Related Content

New Data `New Computation