FSCI Persistent Identifiers

FORCE11 Scholarly Communication Institute (FSCI)
Summer School
31 July – 4 August 2017
University of California, San Diego
Data in the Scholarly Communications Lifecycle
Natasha Simons
Senior Research Data Specialist

Wednesday 2 August
Session two – persistent identifiers for research (data)
• Why do we need PiDs?
• What are PiDs?
• Why use PiDs?
• Why are there so many PiDs?
• Examples: Handles, DOIs, ORCIDs
• Which PiD to choose?
• Power of linking PiDs
• PiD fails
• PiD community
Duration: 30 mins

What are Persistent Identifiers (PiDs)?
A persistent identifier is a long-lasting reference to a digital resource
Photo attribution: Jan Hettenhausen - j.hettenhausen@griffith.edu.au (reproduced with permission)

Use PiDs to connect…
• Researchers
• Publications
• Data
• Software
• Methods
• Equipment
• ???

Why use PiDs?
PiDs play a key role in the discoverability,
accessibility and reproducibility of research.

Why are there so many PiDs?
PiD systems differ in:
• Purpose
• Scope
• Underlying technology
• Governance and social infrastructure
• Metadata collected
• Cost
• Extent of use
Other PiD systems include: ARK, PURL, NLA party ID

Example: The Handle System
• Run by CNRI (Corporation for National Research Initiatives)
• Robust system
• Widely used in publication repositories
• Used to identify research datasets

How do Handles work?
Example: http://hdl.handle.net/11343/130078
http://hdl.handle.net = resolver service
/
11343 = prefix identifying the assigning body (University of Melbourne)
/
130078 = suffix identifying the resource (a University of Melbourne report)
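
The breakdown above can be sketched in a few lines of Python. This is a minimal illustration only, not part of the Handle System software: the function name `split_handle_url` is hypothetical, and it assumes the Handle is presented as a resolver URL like the example on this slide.

```python
from urllib.parse import urlparse


def split_handle_url(url):
    """Split a Handle resolver URL into resolver, prefix and suffix.

    Hypothetical helper for illustration; assumes input shaped like
    http://hdl.handle.net/<prefix>/<suffix>.
    """
    parsed = urlparse(url)
    resolver = f"{parsed.scheme}://{parsed.netloc}"
    # The Handle is the URL path; the first "/" separates prefix from suffix.
    prefix, sep, suffix = parsed.path.lstrip("/").partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a Handle URL of the form resolver/prefix/suffix: {url!r}")
    return {"resolver": resolver, "prefix": prefix, "suffix": suffix}


parts = split_handle_url("http://hdl.handle.net/11343/130078")
print(parts)
```

Only the first slash after the resolver is treated as the separator, so a suffix that itself contains slashes stays in one piece.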

Example: Digital Object Identifiers (DOIs)
• Run by the International DOI Foundation
• Robust – built on the Handle System
• Origins in publishing industry
• Used to identify and cite publications and
research datasets
• The most widely used PiD for research data

How do DOIs work?
This is an example from Griffith University:
http://doi.org = resolver service
/
10.4225 = prefix identifying the assigning body (ANDS)
/
01 = Suffix 1 – the institution identifier (Griffith University)
/
4F3DB08617645 = Suffix 2 – the resource item or collection
identifier (a dataset held in the Griffith data repository)
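
As a rough sketch, the same breakdown can be done in code. The function names below are hypothetical, and the full DOI string is an assumption: it is assembled from the three parts shown on this slide. Note that only the prefix/suffix split belongs to DOI syntax in general; dividing the suffix into an institution part and an item part is the convention described on this slide, not a rule for all DOIs.

```python
from urllib.parse import urlparse


def split_doi(doi):
    """Split a DOI (bare or as a resolver URL) into (prefix, suffix)."""
    if doi.lower().startswith(("http://", "https://")):
        doi = urlparse(doi).path.lstrip("/")
    # The first "/" separates the registrant prefix from the suffix.
    prefix, sep, suffix = doi.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI of the form 10.xxxx/suffix: {doi!r}")
    return prefix, suffix


def split_two_part_suffix(suffix):
    """Split a suffix into (institution, item), following the slide's convention.

    Assumption: the suffix has the two-part layout shown in the example.
    """
    institution, sep, item = suffix.partition("/")
    if not sep or not institution or not item:
        raise ValueError(f"suffix does not have two parts: {suffix!r}")
    return institution, item


# DOI assembled from the parts on the slide (assumed full form).
prefix, suffix = split_doi("http://doi.org/10.4225/01/4F3DB08617645")
institution, item = split_two_part_suffix(suffix)
print(prefix, institution, item)
```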

More about DOIs
• Metadata required! Example: DataCite Metadata Schema
https://schema.datacite.org/
• DOI search services e.g. DataCite
https://search.datacite.org/
• There is a cost involved, but some agencies such as ANDS offer a free
service
• To get a DOI through the ANDS service: machine-to-machine (m2m) or
manual minting

Example: ORCIDs
• Run by the ORCID organisation
• Identifier for people (researchers)
• Links people with their research ‘works’
• Widely used internationally
• Australian research sector-wide endorsement
• Embedded in scholarly workflows

How do ORCIDs work?
https://orcid.org/0000-0003-0635-1998
• 16-digit identifier drawn from a block reserved within the ISNI number range
• Prototype: Thomson Reuters ResearcherID
• Most metadata fields are optional
• Free for researchers, fee for members
(organisations)
• Public API (free) and premium API
(members)
• Transparent governance and development
process
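
One detail not shown on the slide: the last character of an ORCID iD is a check character, calculated with the ISO 7064 MOD 11-2 algorithm, and it can be "X" as well as a digit. The sketch below shows how that check can be applied to the iD above. The function names are hypothetical, and this only tests that an iD is well formed, not that it is registered with ORCID.

```python
import re


def orcid_check_character(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_well_formed_orcid(orcid):
    """Return True if the iD has 16 characters and a matching check character.

    Accepts either a bare iD or the https://orcid.org/ URL form.
    """
    compact = orcid.rsplit("/", 1)[-1].replace("-", "").upper()
    if not re.fullmatch(r"[0-9]{15}[0-9X]", compact):
        return False
    return orcid_check_character(compact[:15]) == compact[15]


print(is_well_formed_orcid("https://orcid.org/0000-0003-0635-1998"))
```

A check like this catches most typing errors (a mistyped or swapped digit) before an iD is stored or used to build a link.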

The power of linking PiDs
• International efforts to link ORCIDs
(researchers) with DOIs (publications and
data)
• The Scholix initiative:
• a global framework to improve the links
between publications and data
• beneficial for all, especially publishers
(display this link in journals) and
repositories (link back to data held in
repositories)

Which PiD to choose?
Evaluate the PiD service:
• Purpose
• Scope
• Underlying technology
• Governance and social
infrastructure
• Metadata collected
• Cost
• Extent of use
• Trustworthiness?
Choose the best-fit PiD for the type of resource and its
point in the research lifecycle.
Better to choose one than none!

PiDs sound great - but hang on…?
Erm…
• Recent PiD crises: PURL, LSID
• “Zombie PiDs”?
Remember:
• PiDs are both social and technical
systems
• Governance/organisations can be the
Achilles heel of PiD systems
See: Klump, J. & Huber, R. (2017). 20 Years of
Persistent Identifiers – Which Systems are Here
to Stay? Data Science Journal, 16, p. 9.
DOI: http://doi.org/10.5334/dsj-2017-009
Have PiD systems ever failed? What’s the
guarantee they will stay “long lasting”?

Cool and groovy international PiD community

Summary
• PiDs play a key role in the discoverability, accessibility and
reproducibility of research.
• There are many PiD systems, which vary in purpose, scope,
underlying technology, governance and social infrastructure,
metadata collected, cost and extent of use.
• When evaluating which PiD to assign to a resource, consider the
differences above and, importantly, trustworthiness.
• Better to assign one or more PiDs than no PiD at all.
• Remember that PiDs are about social as well as technical
infrastructure. It is the responsibility of the PiD owner (e.g. a
university) to update the PiD if the resource location changes.
• PiDs are evolving so get your geek on and join in the discussions!

Want more?
Have a go at:
• Thing 14 – Identifiers and linked data
Read:
• ANDS website for PiD Guides, DOI service, Handle service
• More about DataCite
• More about ORCID
• ICSU/CODATA Data Science Journal special issue: 20 years of
Persistent Identifiers
Watch:
• ANDS PiDs short bites webinar series
(persistent identifiers playlist) - more to come in this series!
• THOR Project webinar series

With the exception of logos, third party images or where otherwise indicated, this
work is licensed under the Creative Commons Australia Attribution 3.0 Licence.
ANDS is supported by the Australian
Government through the National Collaborative
Research Infrastructure Strategy Program.
Monash University leads the partnership with
the Australian National University and CSIRO.
Natasha Simons
natasha.simons@ands.org.au
Tw: @n_simons
ORCID: https://orcid.org/0000-0003-0635-1998
