2019:GLAM/Institutional ingestion of Wikimedia Data: Trust, Tooling and Expectations
This is an Accepted submission for the GLAM space at Wikimania 2019. |
Title
[edit | edit source]Institutional ingestion of Wikimedia Data: Trust, Tooling and Expectations
Description
[edit | edit source]The Swedish National Heritage Board has facilitated a project during the spring of 2019* where three museums piloted the ingestion of data from Wikidata and Wikimedia Commons. A survey was carried out before the pilots with GLAMs globally aiming to identify technical, trust/institutional, and quality barriers to data ingestion. The result of this survey and complementary interviews has guided the pilots executed during late spring.
Two of the pilots used Wikimedia Commons as a platform for targeted crowdsourcing campaigns. One campaign aimed to engage users in translations of image descriptions, while the other took advantage of Structured Data on Commons to invite users to add Depicts statements to images. The third pilot downloaded and ingested Wikidata authorities into their own collection management system.
After the campaigns and the data acquisition each institution performed quality reviews about ingesting data into their collection management systems.
Each pilot addressed a different data storage method in Wikimedia. One dealt with Wikitext, the second with Structured data on Commons and the third with Wikidata. This allowed us to choose the best strategies for access and tooling for each pilot.
Relationship to the theme
[edit | edit source]This session will address the conference theme — Wikimedia, Free Knowledge and the Sustainable Development Goals — in the following manner: Partnerships (SDG 17) with learned institutions around the world are the best way to improve availability of expert knowledge (SDG 4) to all and to ensure that it is accessible to as many as possible.
Session outcomes
[edit | edit source]At the end of the session, the following will have been achieved: Attendees will:
- Have an insight in the challenges institutions faces when reusing and ingesting data from Wikimedia platforms.
- Have an insight in how Structured Data on Commons benefits the tool and third party ecosystem around Wikimedia Commons.
- Have an insight in the expectations and challenges GLAMs sees in the Wikimedia platforms when it comes to reusing and ingesting data hosted there.
- Have inspiration for running their own crowdsourcing campaigns on Wikimedia Platforms for GLAMs as well as for non GLAMs.
Session leader(s)
[edit | edit source]- Albin Larsson, Business Developer, The Swedish National Heritage Board
Usernames
[edit | edit source]- Abbe98
Affiliation/country
[edit | edit source]- The Swedish National Heritage Board
E-mail contact
[edit | edit source]- albin.larsson@raa.se
Session type
[edit | edit source]Each Space at Wikimania 2019 will have specific format requests. The program design prioritises submissions which are future-oriented and directly engage the audience. The format of this submission is a:
- Lecture
- Other
Length of session
[edit | edit source]If other than 20 minutes, specify how long
Supporting work
[edit | edit source]Optional:
- Research Report: Returning Commons community metadata additions and corrections to source
- Project midterm report_-_Wikimedia Commons Data Roundtripping
- Midterm brief in GLAMwiki Newsletter
- The final report was announced in June 2019: outreach:GLAM/Newsletter/June 2019/Contents/Sweden report
Requirements
[edit | edit source]The session will work best with these conditions:
- Room:
Large classroom or a lecture hall equipped with a projector.
- Audience:
The target audience is people working at third party institutions, minor prior knowledge about GLAMwiki is recommended.
- Recording:
Single fixed-location camera should do the trick and the presentation will be licensed under CC-BY or CC0.