Presentation given by Chris Higgens at the Annual Infrastructure for Spatial Information in European (INSPIRE) Conference Krakow, Poland. 22 June 2010.
The Go-Geo! Spatial Data Portal provides a discovery and research tool for UK academics to find geospatial resources. It includes over 2,820 searchable geospatial datasets and metadata records. Go-Geo! also provides geospatial metadata best practices and guidelines, metadata editing and publishing tools, workshops and training to support the use and sharing of spatial data across UK academia.
Addy Pope demonstrates how a suite of EDINA and Edinburgh University Data Library tools and apps can make curating your spatial data a breeze. Presented at the Open Repositories 2014, June 9-13, Helsinki, Finland http://or2014.helsinki.fi
Presentation given by Peter Burnhill, director of EDINA, at #ReCon_15 : Beyond the paper: publishing data, software and more. Edinburgh, 19 June 2015
Peter Burnhill
http://reconevent.com/
This document summarizes the agenda and key discussion points from a kickoff meeting for the Interoperable Geographic Information for Biosphere Study (IGIBS) project. The agenda included introductions and presentations on the project context from various stakeholders. Discussion focused on aligning objectives, prioritizing deliverables, and planning the project workflow, including stakeholder engagement, application development, and access control using Shibboleth authentication. The goal is to improve interoperability between public and academic sector geospatial data and services.
Overview of the world of geospatial metadata, and the role of the EDINA service GoGeo in creating, saving, and discovering it. Presented on 19 June 2014 by Tony Mathys in Aberdeen, Scotland.
The document discusses the EU INSPIRE Directive and its implications for UK academia. The INSPIRE Directive aims to create a European Spatial Data Infrastructure (SDI) to improve sharing of spatial information between public authorities and accessibility for the public. This will allow better environmental policies and outcomes. While initially for environmental policy, INSPIRE intends to extend to other domains. The directive may apply to UK universities as they are considered public authorities. This could mean universities would need to make certain spatial datasets available according to INSPIRE specifications. The directive presents both obligations and opportunities for UK academia as data providers and data users.
Designing and delivering an international MOOC on Research Data Management an...Robin Rice
This document discusses the design and delivery of an international MOOC on research data management and sharing. The MOOC was created by Dr. Helen Tibbo of UNC-Chapel Hill and Robin Rice of the University of Edinburgh. It was partially funded by several organizations and aimed to educate librarians, researchers, and students on best practices for data management, sharing, and archiving. The MOOC covered topics like understanding research data, data management planning, working with data, sharing data, and archiving data. Feedback from participants showed they found the course informative and useful for learning about research data management.
The document discusses open educational resources (OER) and the MANTRA project. It defines OER as teaching, learning, and research materials that are freely available or have an open license allowing free use. The MANTRA project aims to create online learning modules about research data management and make them available as OER. Key lessons from creating the modules included underestimating the time needed, challenges of authoring content, and ensuring consistency across materials.
Robin Rice presented on guiding users through data deposit from the perspective of a data librarian. The presentation covered supporting data users throughout the research lifecycle, fostering a culture of responsible sharing through training, and incentivizing open data sharing through easy-to-use repository services. It also discussed balancing the needs of a multi-disciplinary institutional repository to accommodate different domains while striving for FAIR data principles and lowering barriers to deposit. An online demo of user documentation for the Edinburgh DataShare repository was provided.
The document discusses the UK LOCKSS Alliance program which aims to help libraries preserve electronic journals and books through local archiving. It describes how the collaborative organization coordinates support for members to build and maintain their own archives of web published content. The UK LOCKSS Alliance works to ensure long-term sustainable access to scholarly works.
This document maps the landscape of research literature repositories by exploring the key stakeholders and workflows. It shows researchers submitting manuscripts to journals after receiving grants from funders. It also depicts authors depositing final copies in institutional repositories, which are the focus of open access. Various stakeholders like funders, institutions, publishers and readers are represented along with systems like CRIS and metrics collection. The document aims to understand the ecosystem and identify areas for improvement or cross-pollination between workflows.
The document discusses managing research data and digital repositories in difficult economic times. It provides an overview of policies, strategies, technologies and infrastructure used to manage research and teaching materials. It also discusses funding from JISC and other organizations for repository services and projects in the UK.
Presented by Peter Burnhill, Director of EDINA, Beyond Books: What STM & Social Science publishing should learn from each other, London. Conference programme. 22 April 2010.
The document reports on the progress of the IASSIST Latin Engagement Strategic Action Group. It summarizes the group's findings from surveying data professionals in Spain. It found that while data library roles are not prominent, interest in research data management is growing. The document recommends that IASSIST provide multilingual resources, training events in Spain, and opportunities for Latin American members to attend conferences to further engage Latin members.
The University of Edinburgh has taken several steps to improve research data management:
1. They developed the first research data management policy in the UK to provide guidelines for storing, sharing, and preserving research data.
2. They created online training and guidance materials called MANTRA to teach researchers best practices for data organization, documentation, and long-term access.
3. They are developing research data services including a data library, repository, and storage strategies to support researchers in managing their data throughout the research lifecycle and ensuring access over time.
Slides accompanying the University of Edinburgh Digital Day of Ideas 2016 (#DigScholEd) workshop on Tweeting and Blogging for Academics run by Nicola Osborne (EDINA) and Lorna Campbell (EDINA/LTW). The workshop took place on 18th May 2016. Read more about the event here: http://www.digital.hss.ed.ac.uk/ddi/ddi-2016/
Big Just Got Bigger! discusses the challenges of managing large map collections through the Digimap service. Digimap provides access to geospatial data from various sources, including Ordnance Survey, British Geological Survey, aerial imagery, and more. It has grown significantly over time to include more data sources and users. Managing such large datasets and meeting user expectations of current data and performance presents challenges. Issues include keeping data current while sharing across platforms, disk storage needs increasing exponentially over time, and ensuring data can be accessed and used through various tools and formats.
The document discusses two phases of the AddressingHistory project, which aimed to crowdsource the georeferencing of historical Scottish Post Office Directories. Phase 1 focused on creating an online tool to allow users to georeference directories from 1784-1885 and 1905-1906. Phase 2 expanded the tool's functionality and added content from 1881-1891. The project demonstrated how crowdsourcing could be used at both the individual record level and higher levels to improve optical character recognition of directories and make the historical data more accessible and searchable.
This document provides an overview of a geospatial metadata and spatial data workshop. It discusses the importance of metadata for discovering and managing spatial datasets. It introduces common geospatial metadata standards like FGDC, ISO 19115, and INSPIRE and the concept of application profiles. The document outlines tools and resources for UK academics to create and publish metadata, including the UK AGMAP profile, Geodoc editor, GoGeo portal, and ShareGeo repository. Hands-on sessions demonstrate using these resources to generate metadata and access open spatial data.
Presented by Robin Rice at the "IRs dealing with data" workshop at the Open Repositories 2013 Conference in Charlottetown, Prince Edward Island, Canada, on 8 July 2013.
This was a guest lecture presented online at 12.30pm, Monday 14th October 2013, as part of Session 2: Co-creation in the University of Edinburgh Institute for Academic Development's Online Tutoring course (Autumn 2013).
The document introduces COBWEB, a research project that develops a crowdsourcing infrastructure for collecting and analyzing environmental data provided by citizens. The project aims to address data quality issues and support policy decisions. It has several pilot sites and partners, including UNESCO biosphere reserves. The framework includes mobile apps, QA processes, and a portal to view and analyze citizen-submitted data. It uses open standards and aims to be customizable for different use cases involving topics like biological monitoring and flooding.
This document discusses drivers and organizational responses to research data management (RDM) maturity from transatlantic perspectives. It describes external funder mandates in the US and UK that require open sharing of research publications and data. Universities have responded by developing RDM policies, tools, expertise, and education/outreach for researchers. Key RDM components discussed include policies, storage and repository tools, expertise and staffing models, and outreach/education activities. Connecting electronic lab notebooks to other RDM infrastructure is presented as an approach to better integrate researcher workflows with institutional RDM. The document concludes with an invitation to provide comments on RDM maturity through an online survey.
Presented by Peter Burnhill, Director of EDINA, at PARSE.insight workshop on Preservation, Access and Re-use of Scientific Data, Darmstadt, Germany, 22 September 2009.
Presented by Natasha Aburrow-Jones at the CILIP Cataloguing and Indexing Group Conference 2014 at Canterbury on 8 September 2014. Poor quality, non-standardised metadata may not lead directly to the end of the world, but it won't help!
The MIMAS workshop discussed the RepositoryNet infrastructure and components including aggregation, text mining, search, benchmarking and statistics, registries, deposit tools, and metadata quality. It provided updates on components outside RepositoryNet like IRS Search and NAMES 2. A demonstration of IRUS showed its current functionality for benchmarking and statistics and future plans for funding, APIs, international scope, and business models. Developing service level agreements for RepNet services was also discussed.
The document summarizes a meeting at the University of Edinburgh about library roles in research data management. It includes:
- Welcome and introduction from Robin Rice about data library services and projects related to research data management.
- A talk from Sam Searle of Monash University about library roles in research data management based on experiences at Monash.
- A discussion session.
- A presentation by Sheila Cannell about getting a research data management policy for the University of Edinburgh.
- A presentation by Jeff Haywood about the University of Edinburgh's storage strategy.
The document discusses the importance of promoting online resources to library users. It notes that without active promotion, usage of subscription resources will be lower. It also suggests that libraries should alert users to useful free resources to create engaging sites and establish connections with users. The document then examines some challenges to promoting resources, such as information overload and competition from other online sources. It provides suggestions on how to effectively promote resources, including using metrics to monitor impact, ensuring accurate resource descriptions, and employing search engine optimization strategies.
The document describes a "DIY" research data management training kit for librarians created by Stuart Macdonald. The kit was designed to train librarians to support their institution's research data management needs. It includes open educational materials that guide librarians through topics like data management planning, storage, and sharing. The training involves self-paced reading, reflective writing, and group exercises. The goal is to empower librarians to learn RDM skills and help researchers comply with their university's new RDM policy.
Presentation by Stuart Macdonald of the Edinburgh University Data Library at the Graduate School of Social and Political Science Induction, 15 and 16 Septeber, 2011, University of Edinburgh
This document provides an overview of several data resources and services available through EDINA and the National Data Service in the UK. It summarizes the Data Library which houses large-scale survey, census and other data. It also describes the Applied Quantitative Methods Network (AQMeN) which provides data analysis tools, software and teaching resources. Finally, it briefly outlines additional data services including EDINA, UKBORDERS, Digimap, the UK Data Archive and Economic and Social Data Service.
Presentation made at the 'Towards linked science - Open Data and DataCite Esrtonia seminar as part of the Estonian Open Access Week at University of Tartu
Slides presented at the Spanish Agency of Science and Technology (FECYT) and the network of Spanish repositories (RECOLECTA) Research Data Management Webinar Series - see url:
http://www.recolecta.net/buscador/webminars.jsp
The document discusses open data initiatives and tools for data sharing. It describes projects from the EDINA National Data Centre, DISC-UK DataShare project which investigated legal and technical issues around research data sharing, and tools for visualizing and sharing numeric and spatial data online like Many Eyes, Gapminder and OpenStreetMap. It also covers barriers to data sharing, harnessing collective intelligence through open science, and citizens contributing geographic data through tools like geograph.
Research Data Support at the University of EdinburghRobin Rice
The document summarizes the research data support services at the University of Edinburgh. It describes the university's background and information services department. It then outlines the maturity model that guides the research data management (RDM) services, the governance structure overseeing the RDM service, and the funding model that supports it. The document also summarizes the university's RDM policy and the various tools and support provided across the research data lifecycle, from creating data management plans and storing data to publishing and preserving data in the long term.
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryRobin Rice
1) The document discusses Edinburgh DataShare, a data repository at the University of Edinburgh that was established as part of the DISC-UK DataShare project to explore new ways for academics to share research data over the internet.
2) It describes lessons learned from establishing the repository, including that top-down drivers are important for data sharing, and that data libraries can help bridge communication between researchers and repository managers.
3) The document recommends that institutions develop research data policies to clarify rights and responsibilities regarding data sharing and management.
The document provides background information on RDM services at the University of Edinburgh. It summarizes that EDINA and the University Data Library provide research data management support and online resources. It then overviews key RDM services including DataStore for active research data storage, DataShare for open data publication, and plans for a long-term DataVault archive. The document also discusses RDM training and the university's RDM policy implemented through a multi-phase roadmap.
In this presentation we were talking about preparing documentation and adapting work processes for acquiring DSA. We briefly introduced ADP as the national data repository for social sciences. We explained the brief history of policy development. The changes on national and international level and internal and external changes that caused new challenges. Finnally we explained the process of preparing and developing policy.
Related link: http://www.dans.knaw.nl/nl/actueel/agenda/cessda-expert-seminar-2015
EOSC-hub: first steps towards realising EOSC visionEUDAT
The document discusses the EOSC-hub project, which aims to create an integration and management system called "the Hub" for the future European Open Science Cloud (EOSC). The Hub will engage providers from major European digital infrastructures to offer services, software, and data for advanced research. EOSC-hub will integrate production-ready services, operate and provide access to resources, and support the utilization of resources for open science, open innovation, and being open to the world. It will support the EOSC Declaration by providing a service integrator and federator for the EOSC and developing expertise in procuring digital services.
Research data support: a growth area for academic libraries?Robin Rice
This document summarizes a presentation given by Robin Rice from the University of Edinburgh on research data management and the role of academic libraries. The presentation covered open science and the FAIR data principles, drivers for research data management policy changes, examples of research data management services, and the changing skills needed in academic libraries to support research data. It provided an overview of the University of Edinburgh's research data services, which include tools and support across the data lifecycle from writing data management plans to long-term data preservation. The presentation also discussed the skills important for data librarians and ways for librarians to develop skills in open science and research data management.
RISE background for project board mtg 2011 04-01Liz Work
The RISE project aims to use data from the Open University's EZProxy system to provide personalized recommendations to students within the university's EBSCO Discovery search tool. The project has several work packages, including creating a recommendations database from EZProxy log files, developing algorithms to generate recommendations, and building a Google Gadget to integrate recommendations into searches. The project will evaluate whether recommendations are useful for students and help enhance the search experience. It is governed by a Project Board and reports monthly to the Library Leadership Team.
Presenter: Peter Burnhill, Director, EDINA national academic data centre, University of Edinburgh, Scotland UK
Presentation given at Beyond Books: What STM & Social Science publishing should learn from each other Marriott Hotel/Kensington, London, 22 April 2010
How the Research Data Service supports Open Research (aka Open Science) at the University of Edinburgh. Abridged slides used for presentation to Open Access Scotland meeting in Edinburgh on Wednesday 27th of March 2019.
Presented by Stuart Macdonald at the IT Professionals Forum (20/5/14) and the PPLS (School of Philosophy, Psychology and Language Sciences) RDM Workshop (6/5/14).
Similar to Introduction to Edinburgh University Data Library and national data services (20)
A look at the research being carried out by Dr Stuart Dunn at Kings College London. This includes his work on rediscovering Corpse Paths in Great Britain.
The Land Cover Map 2015 (LCM2015) is a map of land cover classes across the UK produced every 5-10 years. It is based on classification of Landsat satellite imagery from the summer and winter and additional data layers. The LCM2015 contains over 7.5 million land parcels classified into 21 land cover classes. It is an important resource used widely in research, commercial, government and nonprofit applications related to agriculture, ecology, climate, planning and more.
A presentation by John Murray from Fusion Data Science given at EDINA's GeoForum 2017 about the use of Lidar Data and the technology and techniques that can be used on it to create useful datasets.
Slides accompanying the presentation:"Reference Rot in Theses: A HiberActive Pilot", a 10x10 session (10 slides over 10 minutes) presented by Nicola Osborne (EDINA, University of Edinburgh). This presentation was part of Repository Fringe 2017 (#rfringe17) held on 3rd August 2017 in Edinburgh. The slides describe a project to develop Site2Cite, a new (pilot) tool for researchers to archive their web citations and ensure their readers can access that archive copy should the website change over time (including "Reference Rot" and "Content Drift").
This document provides an overview of managing digital footprints. It discusses what a digital footprint is, research conducted at the University of Edinburgh on digital footprints, and factors that contribute to one's digital footprint such as social media, location data, and online searches. The document notes that digital footprints can impact professional and personal reputation. It provides tips for taking ownership of one's digital footprint such as regularly searching for oneself online and reviewing privacy settings. Resources for further information and managing digital footprints are also listed.
The document discusses using digital technology and maps to represent the HMS Iolaire tragedy, a maritime disaster in 1919 where 205 men from the Isle of Lewis died after returning from World War I. It describes adding photos, text, and showing change over time to maps to help tell the story and create a sense of place. Specific details are provided about the journey the men took from England to the Western Isles on New Year's Day 1919 and how maps at different scales can portray events in different ways.
This document introduces Digimap for Schools, an online mapping service designed for schools to use in geography and other subjects. It has Ordnance Survey maps of Great Britain at different zoom levels, as well as historic maps and aerial photography. Students can add their own labels, markers, and other elements to maps. The service allows measuring distances and areas. It is browser-based and can be accessed from school or home. Over 2,690 schools in Britain currently use the service, including 185 Scottish secondary schools. The document outlines how Digimap for Schools can support teaching and learning in subjects beyond geography like numeracy, social studies, sciences, and more. Examples of lessons and activities using the mapping service are provided.
This document provides an introduction to Digimap for Schools, an online mapping service designed for use in UK schools. It highlights key features such as access to historic maps from the 1890s and 1950s, aerial photography, and tools for annotating, measuring, and analyzing maps. Schools subscribe to the service, which allows unlimited users per school to access maps and tools through a web browser on any device. The presenter emphasizes how Digimap for Schools can support teaching and learning across the Scottish curriculum, particularly for geography, by facilitating hands-on activities with maps, data, and spatial analysis. Examples are given of how schools have used the service for topics like land use change, density calculations, and proportional mapping. Teachers observing the presentation
"Managing your Digital Footprint : Taking control of the metadata and tracks and traces that define us online" invited presentation for CIG Scotland's 7th Metadata & Web 2.0 Seminar: "Somewhere over the Rainbow: our metadata online, past, present & future", which took place at the National Library of Scotland, 5th April 2017.
Slides accompanying Nicola Osborne's(EDINA Digital Education Manager) session on "Social media and blogging to develop and communicate research in the arts and humanities" at the "Academic Publishing: Routes to Success" event held at the University of Stirling on 23rd January 2017.
"Enhancing your research impact through social media" - presentation given by Nicola Osborne, EDINA Digital Education Manager, at the Edinburgh Postgraduate Law Conference 2017 (19th January 2017).
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne, EDINA Digital Education Manager, for Abertay University (Dundee) 4th Year Marketing Students.
Best Practice for Social Media in Teaching & Learning Contexts, slides accompanying a presentation by Nicola Osborne, EDINA Digital Education Manager, for Abertay University (Dundee). The hashtag for this event was #AbTLEJan2017.
This document summarizes new and enhanced features in Digimap services from 2015-2016. Key updates include a refreshed homepage, responsive design for tablets, a new historic downloader application, marine chart roam with updated data, additions to ancient roam, land cover vector data, and improvements to geology, marine, and OS data. Usability and performance enhancements were also made, such as improved geo-referencing, easier use of 3D data, and a more reliable backend system. Feedback from users helped inform priority quality improvements.
A talk by Dr. Phil Bartie about Spatial Data, how he has used it, issues of quality and how Digimap has helped him by making it available throughout his academic career.
### Data Description and Analysis Summary for Presentation
#### 1. **Importing Libraries**
Libraries used:
- `pandas`, `numpy`: Data manipulation
- `matplotlib`, `seaborn`: Data visualization
- `scikit-learn`: Machine learning utilities
- `statsmodels`, `pmdarima`: Statistical modeling
- `keras`: Deep learning models
#### 2. **Loading and Exploring the Dataset**
**Dataset Overview:**
- **Source:** CSV file (`mumbai-monthly-rains.csv`)
- **Columns:**
- `Year`: The year of the recorded data.
- `Jan` to `Dec`: Monthly rainfall data.
- `Total`: Total annual rainfall.
**Initial Data Checks:**
- Displayed first few rows.
- Summary statistics (mean, standard deviation, min, max).
- Checked for missing values.
- Verified data types.
**Visualizations:**
- **Annual Rainfall Time Series:** Trends in annual rainfall over the years.
- **Monthly Rainfall Over Years:** Patterns and variations in monthly rainfall.
- **Yearly Total Rainfall Distribution:** Distribution and frequency of annual rainfall.
- **Box Plots for Monthly Data:** Spread and outliers in monthly rainfall.
- **Correlation Matrix of Monthly Rainfall:** Relationships between different months' rainfall.
#### 3. **Data Transformation**
**Steps:**
- Ensured 'Year' column is of integer type.
- Created a datetime index.
- Converted monthly data to a time series format.
- Created lag features to capture past values.
- Generated rolling statistics (mean, standard deviation) for different window sizes.
- Added seasonal indicators (dummy variables for months).
- Dropped rows with NaN values.
**Result:**
- Transformed dataset with additional features ready for time series analysis.
#### 4. **Data Splitting**
**Procedure:**
- Split the data into features (`X`) and target (`y`).
- Further split into training (80%) and testing (20%) sets without shuffling to preserve time series order.
**Result:**
- Training set: `(X_train, y_train)`
- Testing set: `(X_test, y_test)`
#### 5. **Automated Hyperparameter Tuning**
**Tool Used:** `pmdarima`
- Automatically selected the best parameters for the SARIMA model.
- Evaluated using metrics such as AIC and BIC.
**Output:**
- Best SARIMA model parameters and statistical summary.
#### 6. **SARIMA Model**
**Steps:**
- Fit the SARIMA model using the training data.
- Evaluated on both training and testing sets using MAE and RMSE.
**Output:**
- **Train MAE:** Indicates accuracy on training data.
- **Test MAE:** Indicates accuracy on unseen data.
- **Train RMSE:** Measures average error magnitude on training data.
- **Test RMSE:** Measures average error magnitude on testing data.
#### 7. **LSTM Model**
**Preparation:**
- Reshaped data for LSTM input.
- Converted data to `float32`.
**Model Building and Training:**
- Built an LSTM model with one LSTM layer and one Dense layer.
- Trained the model on the training data.
**Evaluation:**
- Evaluated on both training and testing sets using MAE and RMSE.
**Output:**
- **Train MAE:** Accuracy on training data.
- **T
How We Added Replication to QuestDB - JonTheBeachjavier ramirez
Building a database that can beat industry benchmarks is hard work, and we had to use every trick in the book to keep as close to the hardware as possible. In doing so, we initially decided QuestDB would scale only vertically, on a single instance.
A few years later, data replication —for horizontally scaling reads and for high availability— became one of the most demanded features, especially for enterprise and cloud environments. So, we rolled up our sleeves and made it happen.
Today, QuestDB supports an unbounded number of geographically distributed read-replicas without slowing down reads on the primary node, which can ingest data at over 4 million rows per second.
In this talk, I will tell you about the technical decisions we made, and their trade offs. You'll learn how we had to revamp the whole ingestion layer, and how we actually made the primary faster than before when we added multi-threaded Write Ahead Logs to deal with data replication. I'll also discuss how we are leveraging object storage as a central part of the process. And of course, I'll show you a live demo of high-performance multi-region replication in action.
Amazon DocumentDB(MongoDB와 호환됨)는 빠르고 안정적이며 완전 관리형 데이터베이스 서비스입니다. Amazon DocumentDB를 사용하면 클라우드에서 MongoDB 호환 데이터베이스를 쉽게 설치, 운영 및 규모를 조정할 수 있습니다. Amazon DocumentDB를 사용하면 MongoDB에서 사용하는 것과 동일한 애플리케이션 코드를 실행하고 동일한 드라이버와 도구를 사용하는 것을 실습합니다.
Amul goes international: Desi dairy giant to launch fresh ...
Introduction to Edinburgh University Data Library and national data services
1. Introduction to the
University Data Library
and national data services
Stuart Macdonald
Associate Data Librarian
Graduate School of Social & Political
Science Induction
September, 2016
2. Outline
About the Data Library
Data Library services & resources
National data services for social sciences
UKDS
EDINA
ADRN
Non-academic data sources & tools
Skills and training
3. EDINA and Data Library within
Information Services
Divisions
User Services
Library and University Collections
IT Infrastructure
Applications
EDINA and Data Library
Digital Curation Centre
Learning, Teaching and Web Services
4. What is a data library?
A data library refers to both the content and the services that
foster use of collections of numeric and/or geospatial data sets
for secondary use in research. A data library is normally part of a
larger institution (academic, scientific, medical, governmental,
etc.) established to serve the data users of that organisation.
The data library tends to house local data collections and
provides access through various means (CD-/DVD-ROMs or
central server for download). A data library may also maintain
subscriptions to licensed data resources.
5. Data Library: History
Established out of the Program Library Unit in early
1980s to provide access to data on mainframes, e.g.
1981 population census data. Was part of EUCS,
now Information Services
Part of long tradition of sharing machine-readable
data for secondary analysis in the social sciences
Became a national data centre as EDINA in 1996 -
data library continues University remit
celebrated 30th anniversary in 2013
7. Data Library & Consultancy
Finding…
“I need to analyse some data for a project, but all I can find are published
papers with tables and graphs, not the original data source.”
Accessing …
“I’ve found the data I need, but I’m not sure how to gain access to it.”
Using …
“I’ve got the data I need, but I’m not sure how to analyse it in my chosen
software.”
Managing …
“I have collected my own data and I’d like to document and preserve it and
make it available to others.”
Reference Interviews
Training and resources:
http://www.ed.ac.uk/information-services/research-support/data-
library/training-resources
8. Data Library Resources
Large-scale social science survey data
Country and regional level time series data
Population and Agricultural Census data
Data for mapping
Resources for teaching
Opening hours: 9.30am – 5.30pm (Mon. – Fri.)
Argyle House
3 Lady Lawson Street
EH3 9DR
Tel.: 0131 651 1431 or 0131 651 1744
Email: datalib@ed.ac.uk
10. Research Data Management
Services
Research data management training:
Research Data MANTRA - Open online course for researchers
managing digital data as part of the research process
Research Data Management & Sharing MOOC -
https://www.coursera.org/learn/data-management
Institutional data repository provision
Edinburgh DataShare
Assistance with deposit in national archives
Assistance with Data Management Plans (DMPs)
11. Research Data MANTRA
• Organising data
• Documentation & metadata
• Storage & security
• File formats & transformations
• Data management planning
• PLUS software-specific data handling practicals
• Open online, self-paced course for researchers
• http://datalib.edina.ac.uk/mantra
15. Applied Quantitative Methods Network (AQMeN) -
http://aqmen.ac.uk/
AQMeN is a Research Centre that aims to develop projects to improve our
understanding of current social issues in the UK and provide policy makers
and practitioners with independent research-based evidence to build a
better future.
Funded by ESRC (2013-2016) AQMeN has three primary strands of
research involving a multidisciplinary team of researchers from the UK and
abroad :
• Crime and victimisation
• Education and social stratification
• Urban segregation and inequality
Original aim was to build capacity in the use of quantitative methods
amongst Scotland’s social science community and beyond.
AQMeN offer practical quantitative methods training on a range of
techniques developed as part of their programme of research
Data Library host and manage the AQMeN website
18. UK Data Archive
Founded in 1967 and based at the University of Essex
ESRC / JISC-funded
Work closely with CESSDA, ICPSR, NatCen social research, NCRM, ONS
Houses several thousand social science and humanities datasets
Provides resource discovery and support for secondary use of quantitative and
qualitative data in learning, teaching and research.
More recently it has worked with environmental and medical data sources.
Host to the UK Data Service which provides the following facilities:
• History Data Service
• Census.ac.uk
• Variable and question bank
• Secure Lab
• Qualibank
19. UK Data Service (UKDS)
£17 million investment over 5 years
UKDS - a comprehensive resource funded by the ESRC to support
researchers, teachers and policymakers who depend on high-quality social
and economic data.
A single point of access to high quality census data, government-funded
surveys, longitudinal studies, international macro-data, qualitative data and
business micro-data.
Distributed service led by Univ. of Essex in collaboration with:
Jisc Manchester, Cathie Marsh Centre for Census & Survey Research (Univ. of Manchester)
School of Geography (Univ. of Leeds)
Geography and Environment (Univ. of Southampton)
EDINA (Univ. of Edinburgh)
Centre for Advanced Spatial Analysis (UCL)
Commenced 1 October 2012 – integrated ESDS, Census Programme,
Secure Data Service, and other elements of the data service infrastructure
provided by ESRC, including UKDA
21. UKDS features include:
Access – free for academic use, registration required
Data – full/partial catalogue search, browse by subject, theme, and
geography, also links to major studies, new releases
Support – finding data, online guides, learning and teaching
resources, online data analysis tools (NESSTAR), external statistical
sources, FAQ
Resources – Online data browsing tools, metadata tools, software
guides, qualitative tools
Advice for using, managing, depositing, sharing research data -
best practice for ESRC researchers and beyond, preservation
guidelines
News & Events – workshops, new data, publications (incl. good
practice guides)
24. • Part of "EDINA and Data Library" division of Information Services at the
University of Edinburgh
• Develops and delivers shared online services and infrastructure to
support research and education in the UK
• Services free at the point of use for use by staff and students in
learning, teaching and research through institutional subscription
• Focus is on service delivery and support but also undertake R&D
(projects services)
delivers about 10 online services
employs about 50 staff (Edinburgh & St Helens)
EDINA – a Jisc centre for digital expertise and online service
delivery – http://edina.ac.uk
27. Government data sources
National Records of Scotland (NRS) –
http://www.nrscotland.gov.uk/
Scottish Census 2011 – Data Explorer –
http://www.scotlandscensus.gov.uk/
Office for National Statistics (ONS) –
http://www.ons.gov.uk/ons/index.html
ONS Neighbourhood Statistics –
http://www.neighbourhood.statistics.gov.uk/
Statistics.Gov.Scot (replacing Scottish
Neighbourhood Statistics) - http://statistics.gov.scot/
NISRA - http://www.nisra.gov.uk/
Welsh Government Statistics - http://wales.gov.uk/topics/statistics/?lang=en
Data.Gov.UK - http://data.gov.uk/
Eurostat – http://ec.europa.eu/eurostat
28. DataShine
The DataShine mapping platform is part of the ESRC Big Open Data:
Mining and Synthesis (BODMAS) project.
Developed and maintained by UCL’s Centre for Advanced Spatial Analysis
The overall project seeks promote and develop the use of large and open
datasets amongst the social science community.
Promote visualisation of these data in new and informative ways to
inspire new uses and generate insights.
Phase one has been to create the mapping platform with data from the
2011 Census.
The next phases developing tools that will enable the synthesis of data
across multiple sources.
32. Information Services: Digital Skills and Training - http://www.ed.ac.uk/information-
services/help-consultancy/is-skills/classroom-courses
Classroom
courses include:
ArcGIS Quantum
GIS
SPSS
NVivo
Unix
Skills and Training
33. An online skills provider offering a
library of high quality video courses
including:
Digital citizenship; Developing with
Python; Pixel playground
Plus many more catering for all
levels from beginner to expert.
From the soft launch in June 2016,
all staff and students will be entitled
to access Lynda.com.
Lynda.com - http://www.ed.ac.uk/information-services/help-consultancy/is-skills/lynda
34. MOOCS
There are many free Massive Open Online
Courses out there on a range of data-related
subjects
Coursera are one of the MOOC platforms
and offer a range of courses hosted by a
number of leading research-intensive
universities:
The Data Scientist’s Toolbox, Data Structures
and Algorithms, Statistics with R, Python
Data Structures, Data Visualisation, R
Programming, Getting and Cleaning Data,
Exploratory Data Analysis –
See: https://www.coursera.org/browse/data-science/data-analysis?languages=en
35. The Software
Sustainability
Institute (SSI)
(based in University of
Edinburgh, JCMB) offer
a number of training
opportunities including
Software Carpentry and
Data Carpentry
workshops – see:
http://www.software.ac
.uk/what-do-we-
do/training
36. New skills:
● (Big Data) Data analysts (R, Python, SPSS, SAS, PHP)
● Data carpentry (software skills & tools for effectively working with
data)
● Data journalists (journalism specialty reflecting the role of numerical
data in the production and distribution of information in the digital
era)
● Data wranglers (munging, mining, handling, manipulating)
● Data technicians, Data scientists
38. Pics from Flickr (creative commons attribution) – credits
include: D Sharon Pruitt
Ask us about data
stuart.macdonald@ed.ac.uk
datalib@ed.ac.uk
Editor's Notes
Apps Div – developing computer applications and bespoke computing tool solutions for the university as a whole
IT infrastructure – Networks, Audio-visual support, telephony/ISDN connectivity
One was because disk storage was a scarce and expensive resource. Researchers wanted and petitioned for centrally-managed university wide provision of access to large-scale datasets, typically the decennial population censuses for Scotland, the annual agricultural censuses for England & Wales and for Scotland, the General Household Surveys and a range of digitized boundaries being used in what was still path-breaking ways to do computerized mapping.
Reference interview
GEO-REFER – geographical referencing resources for social scientists, Go-Geo!, ESRI Virtual Campus plus Macro and Microdata teaching resources
Security, backing up, data storage, data documentation and metadata, file formatting, versioning, preservation and sharing, copyright and ethical concerns, IPR etc
Research Data MANTRA is an online, self-paced course developed by the Data Library with IAD for University PhD students. There are eight interactive learning units with pictures, video clips and quizzes, as well as four data handling practical exercises with open datasets for use in R, SPSS, NVivo and ArcGIS.
Increasingly, research funders are requiring good data management practice for the research they fund. Data management skills will help you organise and document your data, keep your data safe, make your research more reproducible, and preserve it for future use.
The Secure Lab is funded by the ESRC (Economic and Social Research Council) and enables safe and secure remote access by researchers to data deemed too sensitive, detailed, confidential or potentially disclosive to be made available under standard licensing and dissemination arrangements.
GDP, birth rate, illiteracy rates, population, household consumption, external trade indicators, import-export figures, earnings, labour costs, employment, health care expenditure etc
Eurostat, IMF, OECD, UN, World Bank etc
Thematic user guides – food studies, ethnicity, scotland, introductory guides to using govt surveys for health / crime research
user guides for the Beyond 20/20 and CommonGIS, Nesstar WebView, and ESDS Qualidata online data services
Statistical software user guides (SPSS, Stata)
Dataset guides (as per each of the 4 ESDS areas)
British social attitudes survey, annual popuation survey,national food surveys, health surveys ! Eurobarometer, euro social survey / euro election study! GUS, MCS, BHPS! Eurostat, IMF, OECD, UN, ! Annual Survey of hours and earnings via secure data service
UKBORDERS
Digimap Collection
Go-Geo!
Agcensus
Moving pictures and sound services - EIG, newsFilm Online
Plus A&I databases
The Depot
HILT
GetRef
LOCKSS
PePRS
Geology Digimap - largest scale geology data (1:50,000) available with national coverage for the UK, A mid-scale (1:250,000) product with Bedrock and Faults layers, Small scale mapping (1:625,000) for the whole of the UK.
SNS will diseminate a range of small area statistics including information on health, education, poverty, unemployment, housing, population, crime and social / community issues at the data zone level and above which support a number of the Governments targets and commitments aimed at closing the gap between disadvantaged areas and the rest of Scotland.