Qs1 group a

Shining a light on our analytics and usage data
Ben Showers, Jisc
Joy Palmer, Mimas
Graham Stone, University of HuddersfieldThis work is licensed under a
Creative Commons Attribution 3.0
Unported License

To support the hypothesis that…
“There is a statistically significant correlation across a
number of universities between library activity data
and student attainment”

Library Impact Data Project 1
Original data requirements for each student:
• Final grade achieved
• Number of books borrowed
• Number of times e-resources were accessed
• Number of times each student entered the library, e.g. via a
turnstile system that requires identity card access
• School/Faculty

– Showed a statistical
significance between:
• Final grade achieved
• Number of books
borrowed
• Number of times e-
resources were accessed
– Across all 8 partners

Library Impact Data Project
Phase I looked at over 33,000 students across 8 universities
Phase II looked at around 2,000 FT undergraduate students at
Huddersfield

Now with additional data:
• Demographics
• Discipline
• Retention
• On/off campus use
• Breadth and depth of e-resource usage
• UCAS points (entry data)
• Correlations for Phase 1

Conclusions:
• We showed statistical significance for demographics such as
age, gender, ethnicity and country of origin
• We showed statistical significance across top level subjects
and within these disciplines
• We showed a connection between library use and retention
• We showed the depth and breadth of a collection may make
a difference

Drivers for change 1And we know all this is firmly on
Libraries’ radars
Our survey:
How important will analytics be to
academic libraries now and in the
future, and what is the potential for a
service in this area?

What about sharing your data about usage with other
institutions?
There’s a significant appetite for analytics services….But hesitation over
sharing entry data and other student data than other forms of usage
data.
Only 46% would be willing to share data if the institution was named.
But if institutional identity can be anonymised, that changes to 91%

And is this a current strategic priority?

What about in the next five years?

Can we collect data from institutions and create tools that allow
libraries to analyze how their resources are being used, when
and by whom?

Can this dashboard also give institutions the tools to
compare or even benchmark usage
against other institutions?
What about the benefits of scale?

What data can we use and get a hold of?
UCAS data, loan data, eResource logins..
(but not data on usage of individual items)
(yet)

Our collaborators and data providers:

data wrangling:
Getting, analyzing, cleaning, and presenting that data

A brief (important) word on ethics
Should we be holding and analyzing this kind of data?
• Data protection issues & ‘Big brother’ concerns
• All students pay the same fees – shouldn’t they be treated the same?
But what if we didn’t do this
• What would the reaction be if it was found that we had this data but didn’t act
on it?
• We have a duty to care for the individual wellbeing of our students

Working with the API to present the data…
How should users work with the data?
What do they want to be able to do?
What do they do?What does the system do?

The Epic User Stories
• connect the library with the university
mission
• contribute to the institutional analytics
effort
• demonstrate value added to users
• ensure value from major investments
• develop investment business cases
• impact student measures of
satisfaction, such as NSS
• address measures of equality and
diversity of opportunity
• inform / justify library policy and
decisions as evidence led
• engage stakeholders in productive
dialogue
• identify basket of measures covering
all key areas
• inform librarian professional
development
• enable the sector to understand the
questions to be answered

JiscLAMP –What did we achieve?
• LAMP project outputs
– We managed to clean up and process the data from all of
the partners
– We created a prototype – our analytics engine
– We performed a benchmarking exercise
• We showed that the idea of a shared library analytics service
was feasible

What can we do with the data?
» We can demonstrate usage by cohorts:
› Department
› Degree name
› Course
› Course ‘type’?
› Gender/Ethnicity/Nationality/Disability/Age
› Level of attainment
› Attendance mode (full time/part time)
› UCAS points

What can we do with the data?
And we can demonstrate correlations between usage and
attainment, usage and cohort (and attainment and cohort)

And we can potentially signal if
findings are statistically significant or not…

But where exactly does the user journey and workflow begin?
Within the system or outside of it?

How much do we assume the user is analysing the data?
And much analysis should the tool perform on behalf of the end
user?

» Here’s a simple question:
How do humanities and social
science students use books? *
*All the data used in this presentation is completely made up.Any resemblance
to real university library usage data, living or dead, is purely coincidental.

» And a simple answer
Humanities bigger users than
social sciences

» But wait!
Humanities still bigger users,
but the difference isn’t so stark

» What about other factors?
In both disciplines, full-time
students borrow more books
than part-time ones
The difference is smaller for
social scientists than for
humanities students
Part-time students seem quite
similar across disciplines

» Who’s not borrowing?
So although the means were
fairly similar (2.9 to 2.3),
proportionally there are lots
more social scientists who have
never borrowed a book.

There’s clearly a big problem
with part-time students in the
humanities
In the social sciences, everyone
is equally in need of help
Social science part-timers are
more likely to borrow books
than humanities part-timers –
why might that be?

Shows median borrowing
(humanities = 3, social sciences = 2)
Shows upper and lower quartiles
Shows max and min values
Standard way of describing data –
but is it useful here?

Where do we go from here? Phase 2
» We have funding for
Phase 2
» We’re now testing the
‘ugly prototype’
» Currently putting
together the new
project and marcomms
plan

» Make the data beautiful and
compelling….
› Develop a dashboard UI
through iterative testing
and development
Our key areas for focus 2014-15

» Usage data to ‘profile’
individuals, e.g. for REF
or intervention
purposes?
› What are the ethical
or legal issues?

» eResource item level
usage and the current
approach of the UK
Access Management
Federation
› Is it possible to crack
that nut?

» NSS data and SCONUL
stats. Integration would
be of major value.
› How can we bring
that data into scope?

» Data literacy
› What does it mean?
Who needs it?
› What needs to be
automated and what
needs to be taught as
a skillset?

» Benchmarking.
› The killer app?
› Is there a business
case for the service if
it doesn’t provide the
capability to compare
across institutions?
› How would this work?

JiscLAMP – Phase 2
• Workshop with SCONUL (London 7 May 2014)
• Key contacts/relationships for next phase
– HESA (NSS)
– Shibboleth/Athens
– SCONUL (performance group)
• Developing the business case and model

How can you get involved?
» Follow and comment on our
blog:
http://jisclamp.mimas.ac.uk
» Attend a LAMP workshop
(tba)
» Become a data contributor!
› email: b.showers@jisc.ac.uk
› email joy.palmer@manchester.ac.uk
› email g.stone@hud.ac.uk

Qs1 group a

Related slideshows

More Related Content

Qs1 group a

Editor's Notes