Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
- 1. International perspectives and cooperation
Archiving The World's E-Journals:
The Keepers Registry As Global Monitor
Peter Burnhill, Francoise Pelle, Pierre Godefroy,
Fred Guy, Morag Macgregor & Adam Rusbridge
EDINA (JISC & University of Edinburgh) & ISSN-IC (Paris)
28 September, 2012
UNESCO Digital Memory of The World Conference,
Vancouver, Canada
- 2. Digital Preservation of Serial Content
Helping to ensure Identifying
continuity of access the stream
http://www.flickr.com/photos/shinez/5000985919/
- 3. The Basics of a Shared Concern
[The Good News]
What was once availably locally <on shelf, in drawer/datacentre>
is now online & accessed remotely, anytime/anywhere
– Studies show that scholarly literature is now nearly all online
– Libraries moving to an e-only environment for journals
[The Bad News]
The role of libraries as trusted keepers of information and
culture has been disrupted
Libraries no longer take physical custody of digital
– Publishers license content online remotely
Risk of loss for future scholars, citizens & our children
- 4. Our Shared Understanding
World heritage & scientific understanding is global
Scholarship & science has global literature
Researchers in any one country are dependent upon
content written and published in other countries
International/National Reports & Activity: 10 Years On
Draft Charter on the Preservation of the Digital Heritage, 2003
Increasingly born digital,
or re-born digitized
Archiving E-Journals (JISC: Maggie Jones, 2003)
Need to ensure continuity
Archiving Electronic Journals of
(L. Cantara (Ed) DLF/CLR, 2003)
access
E-Journal Archiving Metes and Bounds: A Survey of the
Landscape (Anne Kenney et al, 2006)
…
- 5. The real heroes in the story are …
The Keepers, organisations that act as our
digital shelves:
a. web-scale not-for-profit organizations
* e.g. CLOCKSS Archive & Portico
a. national libraries
*e.g. British Library, e-Depot (Netherlands)
& National Science Library of China
a. library consortia
*e.g. HathiTrust & Global LOCKSS Network
- 6. Many archiving initiatives is a Good Thing
“Digital information is best preserved by replicating it at
multiple archives run by autonomous organizations”
B. Cooper and H. Garcia-Molina (2002)
- 7. A Registry to discover ‘who is looking after what’
• Idea mooted in UK Report (JISC: Maggie Jones, 2003/4);
Call in USA/Canada for “clarity of public statement by each
agency or through a registry” (CLIR Report, 2006)
• UK scoping study recommended an e-journals preservation
registry be built (JISC: Rightscom/U. of Loughborough 2007)
• JISC funded EDINA & ISSN-IC as partners to Pilot an
E-Journal Preservation Registry Services (PEPRS)
– Phase 1: August 2008 – July 2010
‘investigate, prototype and build’ [evaluation in Feb. 2010]
– Phase 2: August 2010 – July 2012
‘preparing for service & governance’
- 8. Abstract Data Model: Figure 1 in reference paper in Serials, March 2009
SERVICES: user requirements
E-J Preservation Registry Service
Piloting an
E-journals E-Journal METADATA
Preservation Preservation on preservation action
(b)
Registry Registry
Service
(a)
METADATA Digital Preservation Agencies
e.g. CLOCKSS, Portico; BL, KB;
on extant e-journals UK LOCKSS Alliance etc.
Data dependency
ISSN
Register
- 9. Partners have 15+ years of association
• ISSN International Centre (ISSN IC)
linked to national libraries and publishers
– an intergovernmental institution governed by statutes/
convention between UNESCO & France (as host country)
– coordinates the ISSN Network of national centres, operating
an automated system for the registration of serials, via
assignment of International Standard Serial Numbers (ISSNs)
http://www.issn.org
• EDINA linked to needs of research & teaching
– part of The University of Edinburgh (Scotland, UK)
– designated in 1995 to act as a national data centre by JISC,
the ICT agency for UK universities and colleges
http://edina.ac.uk
- 10. The Keepers Registry
• Initial scope was upon what is published in digital form
– in serials and periodicals (journals, magazines, newspapers etc)
• Content of significance for each & every country:
– publishers & archiving agencies are international
We can already report some good progress, having
launched a Beta service a year ago, as noted on our blog,
http://thekeepers.blogs.edina.ac.uk/
- 11. http://thekeepers.org
and their activity
search on
title or
ISSN
a showcase
for archiving
organisations
- 12. Example search for: Origins of Life
This e-journal is being archived
by 5 archiving agencies …
… but coverage
of volumes is
partial & patchy
12
- 13. Reminder about the real heroes in the story
The Keepers of e-journal content
National Science Library,
Chinese Academy of Sciences
And others, as they tell all via the Keepers Registry …
http://thekeepers.blogs.edina.ac.uk/2012/06/27/draft-inclusion-criteria-released-for-review/
- 14. Sidebar note on National Libraries
Should we wait upon Legal Deposit?
– 94% of libraries have some form of legal deposit for print.
• Only 44% national libraries had legislation in 2011 for
e-books or e-journals; expected to rise to 58% by June 2012.
• Only 27% [expected to rise to 37% by June 2012] actually
ingesting via legal deposit
Total national libraries collecting =
those 14 via legal deposit + 9 by other means
(Netherlands, UK/BL & Switzerland have voluntary deposit )
2 (KB e-Depot & BL) participate in the Keepers Registry
– Only when the other 21 join will all know about their activity
from presentation, CENL 2011 Survey by Lynne Brindley
to CDNL Annual Meeting Puerto Rico, 15/8/11
- 15. Sidebar note on Identifiers
Our focus when building the Registry was online content
that had an ISSN, as project convenience
– Big increase in assignment of ISSN to electronic continuing
resources: about 100,000
Now as a matter of principle: if it is worth preserving
then it needs/deserves an identifier
• ISSN-L has been devised to link the different ISSN for
digital and for print
• Rules now agreed for assigning ISSN to digitised
content from print journals
• ISSN assignment rules for integrating resources
(websites and online databases that change over time)
- 16. Work-in-progress for the Keepers Registry
1. Revise schema for archival action for self-statement by the
archiving organisations: [*The Keepers Registry is not an audit*]
– Brief summary
– Ingest & preservation workflow: ingest & preserve
– Library access to content: conditions
– Policies & procedures (inc. external audit): authenticity/integrity
– Latest data: report of serials/journals being archived, with volume detail.
1. Establish Governance, Business Model & Sustainability:
JISC & international (UNESCO, EU; LIBER, ARL; …)
2. Inter-working between archiving agencies [safe places network]
3. Engage/inspire others to do advocacy: for coverage & use
– http://www.eifl.net/ (as example)
1. Functionality: Upload / cross-check facility for libraries etc
2. Work on methodological problems (serials, interoperability etc)
1. ‘universal holdings statements/records’, ONIX for Preservations etc
- 17. Strategies -> Action Plans -> Delivery
“develop strategies that will contribute to greater protection of digital
assets and help to define an implementation methodology that is
appropriate for developing countries, in particular.” UNESCO
• Principles into practice [even for unique/special objects]
a. Assign an identifier at ‘point of issue’ [ISSN for the stream]
b. Archive routinely (preferably have others/peers do that for you too)
c. Tell someone what you are doing (and how) [e.g. Keepers Registry]
d. Publish terms of access (now and when triggered as orphaned) [OA]
• Make Copies, Establish Safe Places & Monitor Progress
– What is different about the digital includes the ease with which digital
content can be copied, cheaply and exactly.
– Strategy for a safe places network in which the responsibility for
custody of the new digital content is shared.
http://www.eifl.net/
- 18. Use Seriality for Preservation Monitoring
Thank you for listening
p.burnhill@ed.ac.uk
http://thekeepers.org
http://thekeepers.blogs.edina.ac.uk/
http://www.flickr.com/photos/shinez/5000985919/
- 19. We have been busy building international support…
2008: JISC Journals WG, London; ISSN National Directors Meeting
2009: NASIG Annual Conference, Ashville NC, USA;
Lib. of Academy of Science, Beijing; ISSN Directors, Beijing;
PARSE.Insight, Germany; Knowledge Exchange, Edinburgh
2010: E-journals are Forever Workshop, JISC/DPC, London;
IFLA 2010 Gothenburg; RLUK Conference, Edinburgh;
Columbia Univ., NYC
2011: UKSG; ISSN Governing Body; ARL, Montreal; ALA; UNESCO …
P.Burnhill, F.Pelle, P.Godefroy, F.Guy, M.Macgregor, A.Rusbridge & C.Rees
Piloting an e-journals preservation registry service.
Serials 22(1) March 2009. [UK Serials Group]
P.Burnhill. Monitoring Archiving with Issues about Serials on the Web:
Keepers Registry. Serials Review (submitted)
19
Editor's Notes
- T: So we could regard each journal as a data stream 'as a continuing resource' P: Yes, trying to preserve the content of journals seems a much more sensible approach. T: Put simply, there are fewer ISSNs than DOIs P: That’s good Theo. And remember that those Authors’ Final Copies rarely have any identifier and often have rubbish metadata. [60]
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- 10/01/12
- T: So we could regard each journal as a data stream 'as a continuing resource' P: Yes, trying to preserve the content of journals seems a much more sensible approach. T: Put simply, there are fewer ISSNs than DOIs P: That’s good Theo. And remember that those Authors’ Final Copies rarely have any identifier and often have rubbish metadata. [60]