This hands-on workshop will introduce participants to tools and methods for extracting, analyzing, and preparing MARC (MAchine-Readable Cataloging) bibliographic and authority data and EAD (Encoded Archival Description) data for Wikidata reconciliation and entity/statement creation using OpenRefine, with an emphasis on the conversion of local authority file data and textual data in Biography/History notes (information about persons, families, or corporate bodies) in descriptions of archival resources.
Participants are encouraged to bring their own data sets to work on, but data will also be provided for hands on practice.
This session will address the conference theme — Wikimedia, Free Knowledge and the Sustainable Development Goals — in the following manner:
This session intersects with SDG #4, Quality education, by building capacity within librarie
s and archives to make their data about persons, families, and corporate bodies more widely available in service of increased discovery of collections and primary source material for use in educational contexts.
By focusing on archival collections and local authority files, we also see see this as promoting the appreciation of cultural diversity by providing tools to increase access to and information about resources held by libraries and archives, including resources created by communities and individuals whose voices are less frequently found in mainstream publishing and scholarly production.
To a lesser extent, this proposal intersects with SDG #16, Peace, justice and strong institutions, by building skills with open source tools to enable librarians and archivists to increase public access to information that is often obscured by data storage and display systems.
At the end of the session, the following will have been achieved:
Participants will have:
- A basic understanding of OpenRefine, including the use of facets/filtering and transformations using regular expressions for data clean up and analysis
- Experience reconciling data against Wikidata and other sources (SNAC, VIAF)
- Successfully created Wikidata statements using OpenRefine
- SPARQL queries for use in supporting Wikidata/Wikipedia editathon planning by libraries and archives
Honor Moody (User:H.Mary), Metadata Creation Manager, Harvard Library
Christine Fernsebner Eslao, (User:fernsebner), Metadata Technologies Program Manager, Harvard Library (maybe)
Each Space at Wikimania 2019 will have specific format requests. The program design prioritises submissions which are future-oriented and directly engage the audience. The format of this submission is a:
- Workshop/Computer-based training
The session will work best with these conditions:
- Room: Classroom style, with outlet access for folks with laptops.
- Audience: Participants should have some experience editing and using Wikidata and familiarity with MARC and/or EAD; knowledge of OpenRefine and/or MarcEdit helpful, but not required.
- Recording: Some portions might be appropriate for recording, but there will be an emphasis on hands-on participation that probably isn't appropriate for recording.
- Time requested: 90 minutes