Photo by Museum of Veterinary Anatomy FMVZ USP/Wagner Souza e Silva, CC BY-SA 4.0.

Wikimedia Commons is one of the world’s largest free-licensed media repositories with over 40 million image, audio, and video files. But the MediaWiki software platform that Commons is built on was designed for text, not rich media. This creates challenges for everyone who uses Wikimedia Commons: media contributors, volunteer curators, and those who use media hosted on Commons—on Wikimedia projects and beyond.

One of the main challenges for people who want to contribute media to Commons, or find things that are already there, is the lack of consistent metadata—information about a media file such as who created it, what it is, where it’s from, what it shows, and how it relates to the rest of the files in Commons’ massive archive.

The Structured Data on Commons (SDC) program aims to address this metadata problem by creating a more consistent, structured way of entering and retrieving the important metadata. This structured data functionality, based on the same technology that powers WikiData, will allow people to describe media files in greater detail, find relevant content more easily, and keep track of what happens to a piece of media after it’s uploaded.

One of the challenges SDC faces in this massive and ambitious redesign project is prioritization: What problems are we trying to solve? Who experiences those problems? Which ones should we tackle first? How can we avoid breaking other things in the process?

To answer these questions, we have been performing user research with different kinds of Commons participants, starting with GLAM projects—Galleries, Libraries, Archives, and Museums. We chose GLAM as the initial user research focus for a few reasons:

In other words, the motivations, needs, and workflows of GLAM project participants are diverse enough to potentially apply to many other kinds of people who contribute and consume media on Commons every day. Improving the Commons experience for GLAMs will likely benefit other users as well.

Our research into how to support GLAM projects during the transition to structured data on Commons began with a workshop in February 2017 at the European GLAM coordinators meeting. Between July and October 2017 we interviewed a dozen GLAM project participants from Africa, the Americas, Europe, and South Asia and ran surveys of GLAM participants and Commons editors.

We organized findings from the workshop, interviews, and surveys into five themes. Each theme represents a set of challenges and opportunities related to the way GLAM projects currently interact with Wikimedia Commons:

Through our research we were able to document a rich diversity of roles, goals, tools, and activities across GLAM projects, as well as identify motivations, unmet needs, and pain points that many GLAMs have in common. We also observed a number of inventive workarounds that GLAM project participants used to capture important metadata that wasn’t easy to record using the current systems like categories and templates. These workarounds illustrate the importance of metadata for making a file or a collection findable, useful, and usable, and the need for better ways to record all of that vital contextual information.

In the Structured Data on Commons program, we also regularly consult with Wikimedia Commons editors about how structured data will impact their work. With input from the Commons community, we developed a prioritized list of important community-developed tools for organizing media on Commons, which also helps us to understand typical workflows and to prioritize functionalities.

Research findings and community feedback will be combined into personas, journey maps, and user stories to help product teams set development priorities and define requirements for improving Commons file pages, upload tools, and search interfaces that use structured data.

A full report of our GLAM interview and survey research is available on the Research portal on meta.wikimedia.org, along with slides and a video of a recent presentation of findings.

The next steps of this project include additional interviews with Commons editors to understand how structured data will impact ongoing curation activities. We are also interested in speaking with re-users of Commons media outside of the Wikimedia movement to learn how structured data can make Commons an even more valuable global resource for high-quality free-licenced media—get in touch with us at jmorgan[at]wikimedia[dot]org and sfauconnier[at]wikimedia[dot]org.

Jonathan T. Morgan, Senior Design Researcher
Sandra Fauconnier, Community Liaison 
Wikimedia Foundation

Related

Read further in the pursuit of knowledge

Communications Community Picture of the Year Wikimedia Commons

‘Unfrogettable’ picture of the year announced

Whichever frog pun (or caption) you choose to label it with, the photo above is this year’s Wikimedia Commons picture of the year. It features two Phyllomedusa rohdei, frogs endemic to Brazil, with one stepping on the other’s head, seemingly reaching for something just out of the frame. The photo was taken by biologist Renato Augusto….

Foundation From the archives MediaWiki Technology

Evolving the MediaWiki platform: Why we replaced Tidy with a HTML5 parser

Three years ago, the Wikimedia Foundation's Parsing Team decided to replace Tidy, a tool to fix HTML errors, with a HTML5-based tool. Here's what we did in that time period, and what kind of complexities we faced in changing pieces of the technical infrastructure powering Wikimedia wikis.

A mockup of the new page previews feature, using the English Wikipedia's article on the Andromedia Galaxy.

Features From the archives Technology Wikipedia

Here’s everything we published from the design, development, and data process for the page previews feature

As an open and transparent organization, most of our documentation is placed online, able to be viewed and emulated by anyone. Here's a list of the documentation for one of our recently released features.

Help us unlock the world’s knowledge.

As a nonprofit, Wikipedia and our related free knowledge projects are powered primarily through donations.

Donate

Connect —

Stay up-to-date about the Wikimedia Foundation

Get email updates

Subscribe to news about ongoing projects and initiatives.

Contact a human

Questions about the Wikimedia Foundation or our projects? Get in touch with our team.

Photo credits

Domestic pig heart

Museum of Veterinary Anatomy FMVZ USP/Wagner Souza e Silva

CC BY-SA 4.0

Perereca-macaco - Phyllomedusa rohdei

Renato Augusto Martins

CC BY-SA 4.0

Castle_Cary_catch_points_-_02

A mockup of the new page previews feature, using the English Wikipedia's article on the Andromedia Galaxy.

504A8061-4