The document discusses the EOSC-hub project, which aims to create an integration and management system called "the Hub" for the future European Open Science Cloud (EOSC). The Hub will engage providers from major European digital infrastructures to offer services, software, and data for advanced research. EOSC-hub will integrate production-ready services, operate and provide access to resources, and support the utilization of resources for open science, open innovation, and being open to the world. It will support the EOSC Declaration by providing a service integrator and federator for the EOSC and developing expertise in procuring digital services.
Report
Share
Report
Share
1 of 40
More Related Content
EOSC-hub: first steps towards realising EOSC vision
1. CSC – Suomalainen tutkimuksen, koulutuksen, kulttuurin ja julkishallinnon ICT-osaamiskeskus
EOSC-hub: first steps towards realising
EOSC vision
Per Öster, CSC – IT Center for Science Ltd
3. EOSC Declaration
• RECOGNISING the challenges of data driven research in
pursuing excellent science;
• GRANTING that the vision of European Open Science is
that of a research data commons, widely inclusive of all
disciplines and Member States, sustainable in the long-
term,
• CONFIRMING that the implementation of the EOSC is a
process, not a project, by its nature iterative and based on
constant learning and mutual alignment;
• UPHOLDING that the EOSC Summit marked the beginning
and not the end of this process, one based on continuous
engagement with scientific stakeholders, the European
Commission,
• PROPOSES that all EOSC stakeholders consider sharing the
following intents and will actively support their
implementation in the respective capacities: …
EUROPEAN COMMISSION
DIRECTORATE-GENERAL FOR RESEARCH & INNOVATION
The Director-General
Brussels, 10 July 2017
EOSC Declaration
RECOGNISING the challenges of data driven research in pursuing excellent science;
GRANTING that the vision of European Open Science is that of a research data commons,
widely inclusive of all disciplines and Member States, sustainable in the long-term,
CONFIRMING that the implementation of the EOSC is a process, not a project, by its nature
iterative and based on constant learning and mutual alignment;
UPHOLDING that the EOSC Summit marked the beginning and not the end of this process,
one based on continuous engagement with scientific stakeholders, the European Commission,
PROPOSES that all EOSC stakeholders consider sharing the following intents and will
actively support their implementation in the respective capacities:
Data culture and FAIR data
[Data culture] European science must be grounded in a common culture of data stewardship, so
that research data is recognised as a significant output of research and is appropriately curated
throughout and after the period conducting the research. Only a considerable cultural change will
enable long-term reuse for science and for innovation of data created by research activities: no
disciplines, institutions or countries must be left behind.
[Open access by-default] All researchers in Europe must enjoy access to an open-by-default,
efficient and cross-disciplinary research data environment supported by FAIR data principles.
Open access must be the default setting for all results of publicly funded research in Europe,
allowing for proportionate limitations only in duly justified cases of personal data protection,
confidentiality, IPR concerns, national security or similar (e.g. 'as open as possible and as closed
as necessary').
[Skills] The necessary skills and education in research data management, data stewardship and
data science should be provided throughout the EU as part of higher education, the training
system and on-the-job best practice in the industry. University associations, research
organisations, research libraries and other educational brokers play an important role but they
need substantial support from the European Commission and the Member States.
[Data stewardship] Researchers need the support of adequately trained data stewards. The
European Commission and Member States should invest in the education of data stewards via
career programmes delivered by universities, research institutions and other trans-European
agents.
[Rewards and incentives] Rewarding research data sharing is essential. Researchers who make
research data open and FAIR for reuse and/or reuse and reproduce data should be rewarded, both
Ref. Ares(2017)3488418 - 11/07/2017
4. Data Commons?
commons |ˈkɒmənz|
Plural noun
1 (the Commons) short for House of Commons.• historical the common people
regarded as a part of a political system, especially in Britain: the state was
divided into clergy, nobility, and commons | both lords and commons
won some important concessions.
2 [treated as singular] land or resources belonging to or affecting the whole of a
community: the mismanagement of a commons | the global commons of
Antarctica.• US a dining hall in a school or college.3 archaic provisions shared in
common; rations.
commons |ˈkɒmənz|
Plural noun
1 (the Commons) short for House of Commons.• historical the common people
regarded as a part of a political system, especially in Britain: the state was
divided into clergy, nobility, and commons | both lords and commons
won some important concessions.
2 [treated as singular] land or resources belonging to or affecting the whole of
a community: the mismanagement of a commons | the global commons of
Antarctica.• US a dining hall in a school or college.3 archaic provisions shared in
common; rations.
5. “The e-Infrastructure
Commons is the political,
technological, and
administrative framework
for an easy and cost-
effective shared use of
distributed electronic
resources across Europe.”
6. • ”…an "e-Infrastructure Commons" should liberate scientists
from the often complex and distracting business of moving,
storing and processing data. They need services that are
coherent, managed and above all integrated so that they can
get on with the business of research and science.”
• “…we must be careful to not become constrained and stifle
innovation in the development, provision and use of these
services. The idea that there will be just "one way" of
supplying or using any service through an "efficient"
mandated or "voluntary" monopoly, has to be avoided.”
7. Set-up of an High Level Group of Experts on EOSC
to advise on its implementation
Launch of a Pilot action for EOSC in 2016 under the H2020
Research Infrastructure part
To demonstrate how to ensure availability of scientific data and data-analysis
services through a cloud infrastructure and design a stakeholder driven
governance framework.
E-Infrastructures topics in 2017 H2020 calls for federation &
interoperability of data infrastructures
Further substantial investments in the 2018-2020 WP
Involvement of MS & engagement of science communities is crucial
European Open Science Cloud: how
Acknowledgement: Lorenza Saracco, European Commission, DG RTD.B4
9. • Drivers of power consumption
o Number of users
o Number of devices
o Number of communicating apps
31.1.20189
10% of UK
power
consumption
due to ICT
1/3
network
1/3
devices
1/3
datacentres
Amount of data
12. Secure Compute Clouds
Supporting sample
logistics
• Federated Authentication
• Authorization
• Dataset registry
• Data transfer hub
• Policy and Legal Framework
Services and
Coordination
High speed encrypted
data transfer
GridFTP/Globus/Aspera
Secure data access remote API
( GA4GH )
Sequencing centers
Data
Users
EGA
at
Data Archiving
Bringing users
to data
Data Generation
Managing Access
Data Owner
Data Access Agreement
Data Access Committee
Data Request
Authorization Management Tools
( EGA and CSC REMS )
12
13. From field measurements to open data
31.1.201813 Questions:
Sensitive data?
Requirements on Authentication and authorization?
14. Instrument
Measuring PCs:
Raw data
at the stations
File servers at
stations:
Raw data and
field diaries,
cal documents
File servers
in Helsinki:
Raw and
intermediate
data,
documents,
scripts
SMEAR database:
Processed data
in Helsinki
ICOS, EBAS,...
databases:
Near real time
and processed data
outside UH
Routine data processing =
(- unit conversion)
- calibration correction
- quality check, gapfilling
- averaging over space or time
SMEAR
data flow
A/D conversion
unit conversion
IDA (CSC data
service):
Raw data &
document archive,
database datasets
Field
documentation
Researchers,
Data processing
server
Feedback on
data quality
Metadata
Metadata
Metadata
31.1.201814
https://avaa.tdata.fi/web/smart/smear
15. Data growth in
the life sciences
Data growth at EMBL-EBI
Source: Charles E. Cook et al. Nucl. Acids
Res. 2016;44:D20-D26
16. Data resources in life science
Nucleic Acids Research annual Database Issue
and the NAR online Molecular Biology Database Collection in 2012.
MY Galperin, GR Cochrane – Nucleic Acids Research, 2011
~1800
molecular biology
data resources
17. Support in All Phases of Research Process
201617
Plan
Customer Portal
Experts
Guides
Websites
Training
Service Desk
Produce
& Collect
Data
International
resources
Modelling
Software
Supercomputers
Analyse
Cloud Services
Training
Data science
Computing
Software
Store
B2SAFE
B2SHARE
HPCArchive
IDA
Databases
Research long-
term preservation
(LTP)
Share &
Publish
AVAA
B2DROP
B2SHARE
Databank
Etsin
Funet FileSender
19. Termination. Company may terminate your access to all or any part of the Service
at any time, with or without cause, with or without notice, effective immediately,
which may result in the forfeiture and destruction of all information associated with
your account, including User Submissions. If you wish to terminate your account,
you may do so by following instructions available on the Site. Any fees paid
hereunder are non-refundable. All provisions of the Terms of Use which by their
nature should survive termination shall survive termination, including, without
limitation, ownership provisions, warranty disclaimers, indemnity and limitations of
liability.
20. ARTICLE 2: DISCLAIMER
1. The Service is provided "as is" and the Provider disclaims any and all
representations and warranties, whether express or implied, including;- but
not limited to;- implied warranties of title, merchantability, fitness for any
particular purpose or non-infringement. The Provider does not promise any
specific results, effects or outcome from the use of the Service.
2. …
3. The Provider reserves the right to change, reduce, interrupt or discontinue
the Service or parts of it at any time.
4. No one has a right to use the Service; the Provider reserves the right to
exclude certain Users.
21. Are the commercial services sufficient?
• Nice complement but can not serve as the fundamental infrastructure for research
data of national and international interest
• Need for publicly funded and operated infrastructure
31.1.201821
e-Science Data Factory
22. EOSC-hub receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement
No. 777536.
EOSC-hub
Integrating and managing services for the
European Open Science Cloud
23 January 2018
23. 1/31/2018 23
EOSC-hub engages providers from 20 major digital
infrastructures, EGI, EUDAT CDI and INDIGO-DataCloud
jointly offering services, software and data for advanced
data-driven research and innovation.
24. 1/31/2018 24
Project in figures
• 100 Partners, 76 beneficiaries (75 funded)
• 3874 PMs, 108 FTEs, more than 150 technical and scientific
staff involved
– €33,331,18 of which the European Commission funds €30,000,000
• Duration: 36 months, Jan 2018 – Dec 2020
25. 1/31/2018 25
Mission
The project will create the Hub, a
federated integration and management
system for the future EOSC
Resources
and
Services
Federation
“core”
services
Federated
operations
Processes
and
policies
26. 1/31/2018 26
What does the Hub provide?
• Contact point for researchers and innovators to discover,
access, use and reuse a broad spectrum of resources for
advanced data-driven research
• Catalogue of resources and services
• Humanities, Engineering, Medical and Health Sciences, Natural
Sciences
• Corpus of policies, processes and federation services
(community-defined)
• Principles of engagement and EOSC “core” services
• Quality assurance reviews
• Manage end-to-end service level management performance
• Competence Centres and a Joint Digital Innovation Hub
• Specialized Technical support, training
27. 1/31/2018 27
The Hub and the Three Os
• Open Science
– Resources and services for sharing, discovery access, use
and reuse
• Collaboration with OpenAIRE-Advance
• Open Innovation
– A open collaborative effort of service providers and user
communities
• Co-design of process with Competence Centres and a Joint Digital
Innovation hub
• Open to the world
– Aggregates services from local, regional and national e-
Infrastructures in Europe and other regions of the world
28. 1/31/2018 28
From integration to utilization
Integrate
production-
ready
services
Operate and
Provide
Access and
Consume
30. EOSC – Service Architecture
Federation
Services
AAI,
Accounting,
Monitoring,
Operations,
Security Coord.
Basic Infrastructure
Compute and Storage
Open
Collaboration
Platforms
Application
Repository,
Configuration
Management,
Marketplace
Common services
Thematic
Service
Thematic
Service
Thematic
Service
Thematic
Service
Thematic
Service
Community Support services
Thematic
Service
Added Value Services
Compute, Data, Software
Management and Preservation
31. 1/31/2018 31
Research infrastructures & Communities
Thematic service providers
• CMS, astronomy/astro-particle physics
• Climate change/ENES
• GEO/Global Earth Observation System
of Systems
• Costal protection/LNEC
• Structural biology/WeNMR
• Earth Observation core data
resources/ESA, Copernicus sentinel
data, LANDSAT and other major EO
data archives
• Biodiversity / Lifewatch
Competence Centres
• Marine research/Euro-Argo
• LOFAR
• EuroFusion/ITER
• EPOS/EIDA seismic data and
computational seismology services
• EISCAT-3D
• ELIXIR/BBMRI/ECRIN
• DARIAH
• ICOS
32. 1/31/2018 34
Support to the Declaration:
Governance and Funding
EOSC-hub
– Provides an EOSC service integrator and federator.
– Develops the know-how and the prototype
procurement and purchasing framework to
acquire digital services from publicly funded
infrastructures and commercial providers.
EUROPEAN COMMISSION
DIRECTORATE-GENERAL FOR RESEARCH & INNOVATION
The Director-General
Brussels, 10 July 2017
EOSC Declaration
RECOGNISING the challenges of data driven research in pursuing excellent science;
GRANTING that the vision of European Open Science is that of a research data commons,
widely inclusive of all disciplines and Member States, sustainable in the long-term,
CONFIRMING that the implementation of the EOSC is a process, not a project, by its nature
iterative and based on constant learning and mutual alignment;
UPHOLDING that the EOSC Summit marked the beginning and not the end of this process,
one based on continuous engagement with scientific stakeholders, the European Commission,
PROPOSES that all EOSC stakeholders consider sharing the following intents and will
actively support their implementation in the respective capacities:
Data culture and FAIR data
[Data culture] European science must be grounded in a common culture of data stewardship, so
that research data is recognised as a significant output of research and is appropriately curated
throughout and after the period conducting the research. Only a considerable cultural change will
enable long-term reuse for science and for innovation of data created by research activities: no
disciplines, institutions or countries must be left behind.
[Open access by-default] All researchers in Europe must enjoy access to an open-by-default,
efficient and cross-disciplinary research data environment supported by FAIR data principles.
Open access must be the default setting for all results of publicly funded research in Europe,
allowing for proportionate limitations only in duly justified cases of personal data protection,
confidentiality, IPR concerns, national security or similar (e.g. 'as open as possible and as closed
as necessary').
[Skills] The necessary skills and education in research data management, data stewardship and
data science should be provided throughout the EU as part of higher education, the training
system and on-the-job best practice in the industry. University associations, research
organisations, research libraries and other educational brokers play an important role but they
need substantial support from the European Commission and the Member States.
[Data stewardship] Researchers need the support of adequately trained data stewards. The
European Commission and Member States should invest in the education of data stewards via
career programmes delivered by universities, research institutions and other trans-European
agents.
[Rewards and incentives] Rewarding research data sharing is essential. Researchers who make
research data open and FAIR for reuse and/or reuse and reproduce data should be rewarded, both
Ref. Ares(2017)3488418 - 11/07/2017
33. 1/31/2018 35
Support to the Declaration:
Data Culture and FAIR
EOSC-hub
–Provides production-quality FAIR data and
services through the participating Domains
–Ensures data can be used as widely as
possible across scientific disciplines and
between the private and public sector by
• Making core data resources discoverable and accessible
• Federating existing data infrastructures
EUROPEAN COMMISSION
DIRECTORATE-GENERAL FOR RESEARCH & INNOVATION
The Director-General
Brussels, 10 July 2017
EOSC Declaration
RECOGNISING the challenges of data driven research in pursuing excellent science;
GRANTING that the vision of European Open Science is that of a research data commons,
widely inclusive of all disciplines and Member States, sustainable in the long-term,
CONFIRMING that the implementation of the EOSC is a process, not a project, by its nature
iterative and based on constant learning and mutual alignment;
UPHOLDING that the EOSC Summit marked the beginning and not the end of this process,
one based on continuous engagement with scientific stakeholders, the European Commission,
PROPOSES that all EOSC stakeholders consider sharing the following intents and will
actively support their implementation in the respective capacities:
Data culture and FAIR data
[Data culture] European science must be grounded in a common culture of data stewardship, so
that research data is recognised as a significant output of research and is appropriately curated
throughout and after the period conducting the research. Only a considerable cultural change will
enable long-term reuse for science and for innovation of data created by research activities: no
disciplines, institutions or countries must be left behind.
[Open access by-default] All researchers in Europe must enjoy access to an open-by-default,
efficient and cross-disciplinary research data environment supported by FAIR data principles.
Open access must be the default setting for all results of publicly funded research in Europe,
allowing for proportionate limitations only in duly justified cases of personal data protection,
confidentiality, IPR concerns, national security or similar (e.g. 'as open as possible and as closed
as necessary').
[Skills] The necessary skills and education in research data management, data stewardship and
data science should be provided throughout the EU as part of higher education, the training
system and on-the-job best practice in the industry. University associations, research
organisations, research libraries and other educational brokers play an important role but they
need substantial support from the European Commission and the Member States.
[Data stewardship] Researchers need the support of adequately trained data stewards. The
European Commission and Member States should invest in the education of data stewards via
career programmes delivered by universities, research institutions and other trans-European
agents.
[Rewards and incentives] Rewarding research data sharing is essential. Researchers who make
research data open and FAIR for reuse and/or reuse and reproduce data should be rewarded, both
Ref. Ares(2017)3488418 - 11/07/2017
34. 1/31/2018 36
Support to the Declaration:
Research Data Services and Architecture
EOSC-hub
– Realizes a system of EOSC domains
– Delivers the Hub as a delivery channel of
resources and services
– Improves skills and knowledge among researchers
and service operators by delivering specialised
trainings and co-design
EUROPEAN COMMISSION
DIRECTORATE-GENERAL FOR RESEARCH & INNOVATION
The Director-General
Brussels, 10 July 2017
EOSC Declaration
RECOGNISING the challenges of data driven research in pursuing excellent science;
GRANTING that the vision of European Open Science is that of a research data commons,
widely inclusive of all disciplines and Member States, sustainable in the long-term,
CONFIRMING that the implementation of the EOSC is a process, not a project, by its nature
iterative and based on constant learning and mutual alignment;
UPHOLDING that the EOSC Summit marked the beginning and not the end of this process,
one based on continuous engagement with scientific stakeholders, the European Commission,
PROPOSES that all EOSC stakeholders consider sharing the following intents and will
actively support their implementation in the respective capacities:
Data culture and FAIR data
[Data culture] European science must be grounded in a common culture of data stewardship, so
that research data is recognised as a significant output of research and is appropriately curated
throughout and after the period conducting the research. Only a considerable cultural change will
enable long-term reuse for science and for innovation of data created by research activities: no
disciplines, institutions or countries must be left behind.
[Open access by-default] All researchers in Europe must enjoy access to an open-by-default,
efficient and cross-disciplinary research data environment supported by FAIR data principles.
Open access must be the default setting for all results of publicly funded research in Europe,
allowing for proportionate limitations only in duly justified cases of personal data protection,
confidentiality, IPR concerns, national security or similar (e.g. 'as open as possible and as closed
as necessary').
[Skills] The necessary skills and education in research data management, data stewardship and
data science should be provided throughout the EU as part of higher education, the training
system and on-the-job best practice in the industry. University associations, research
organisations, research libraries and other educational brokers play an important role but they
need substantial support from the European Commission and the Member States.
[Data stewardship] Researchers need the support of adequately trained data stewards. The
European Commission and Member States should invest in the education of data stewards via
career programmes delivered by universities, research institutions and other trans-European
agents.
[Rewards and incentives] Rewarding research data sharing is essential. Researchers who make
research data open and FAIR for reuse and/or reuse and reproduce data should be rewarded, both
Ref. Ares(2017)3488418 - 11/07/2017
35. 1/31/2018 37
Summary
EOSC-hub will
–Create and providing the Hub as federated integration
and management system for the future EOSC
–Implement the Hub as open community-lead
framework
–Provide a wide range of services from many major
digital infrastructures and research communities
36. EUDAT Data Domain modeled on the ANDS1 Data Curation Continiuum
1. Australian National Data Service organization – www.ands.org.au
38
o “The EOSC will integrate and consolidate horizontal e-infrastructures.”
37. EOSC-hub Data is FAIR
• Findable
– assign persistent IDs, provide rich metadata, register in a searchable resource...
• Accessible
– Retrievable by their ID using a standard protocol, metadata remain accessible even if data
aren’t...
• Interoperable
– Use formal, broadly applicable languages, use standard vocabularies, qualified references...
• Reusable
– Rich, accurate metadata, clear licences, provenance, use of community standards...
www.force11.org/group/fairgroup/fairprinciples
39. Acknowledgment
• Tiziana Ferrari, EGI Foundation
• Damien Lecarpentier, EUDAT Project Director
• Mikael Linden, CSC
• Tommi Nyrönen, ELIXIR-Finland Head of Node
• TimmoVesala, INAR RI, Helsinki University
31.1.201841