2019:Hackathon/Outcomes
Appearance
Video recording:
Projects presented at the showcase session at Wikimania Hackathon 2019 Stockholm
[edit | edit source]No | Name | Task | Brief description | URL | Showcase presentation notes | Team | Development status | Audience | Slides |
---|---|---|---|---|---|---|---|---|---|
1 | PagePile Visual Filter | phab:T230598 | Auxiliary tool for PagePile to visually select a subset of a pile, in a manner similar to the Cat-a-lot gadget, and save it as a new pile. | https://tools.wmflabs.org/pagepile-visual-filter | Structured Data on Commons lets you declare which figure skater is shown in a file, and in which pose. But selecting all the images in a certain pose is hard if the figure skater’s category is split, e. g. by year. PetScan can collect all the files from the subcategories, but not let you select the right ones. You can export the result as a PagePile and feed that into the new PagePile Visual Filter tool, where you click on the images which show that pose, then export the result into a new PagePile. This one can then be fed into other tools like AC/DC (add structured data statements) or QuickCategories (add categories). | @LucasWerkmeister, @Harmonia_Amanda | development | Editors | File:Wikimania_Hackathon_2019_showcase_–_PagePile_Visual_Filter.pdf |
2 | Item creation helper for Wikidata | phab:T230599 | Some concepts need several items on Wikidata to describe them, each time created the same way. For example, for an annual iteration of an figure skating competition, we need a general item for the competition, one for each of the ladies/men/couple/dance part, each of them further having two further items for the technical and free programs. This tool help create all these items, and link them together, with a simple form. Furthermore, it is a proof of concept for similar forms for other similar needs. | https://www.wikidata.org/wiki/User:Ash_Crow/creation-helpers | Gadget for Wikidata that allows you to create a group of statements (includes references), and then creates a batch of new items related to each other. | @Ash_Crow, @Harmonia_Amanda | implementation | Editors, Developers | File:Wikimania_2019_creation_helper.pdf |
3 | Knowledge crystals / Wikibase for structured discussions | phab:T230558 | We want to structure discussions and debates so that they can crystallise knowledge. Each comment has its own wikibase item and connects to other items to form a network that represents the discussion. We set up the wikibase and created visualisations and a visual editor. Next, we want to find good use cases such as factual debate on Wikipedia's Talk pages. | http://dev.tietokide.fi/ and URL2: http://dev.tietokide.fi/?Q10 | Organizing discussions by linking posts as attacking/agreeing with other arguments. Structure is based on argumentation theory, now implemented on Wikibase. Can hide irrelevant or less important details, to highligh most important arguments so that the participants can focus on those. The demonstration tool works, looking for collaborators who might use it. | @Jtuom @TuukkaH | development | Readers, Editors | |
4 | Mortar: Interactive documentaion for GLAM Uploads | phab:T223608 | Create and document a unified, user-friendly workflow for GLAM uploads on both Wikidata and Commons. We created a workflow using Pattypan and OpenRefine, defined the missing steps needed, and started an easily translatable and updatable documentation. | Tool link: https://tools.wmflabs.org/mortar/?lang=en | The general idea is that we have many tools to do some parts of importing items collections and photographs of these items on Wikidata and Commons. Wanted to see how we can add some mortar to ease the process and link the different tools toghether. Two basic use cases: Institution want to upload their items, or volunteer who takes own photos and wants to upload them. Implemented in node.js on toolforge. Can select the kind of item, e.g. picture; then upload. Second part is [?]. This is Mh-3110's first tool. (!!) | @Lena, @Anne-LaureM, @Ecritures, @Ash_Crow, @Mh-3110, @PierreSelim | Development | Readers, Editors | Slides: https://docs.google.com/presentation/d/1ANtFM7wGGCelSr7tUYv4bp_pyG6SwCHi1BDqNFtbEj0/edit?usp=sharing |
5 | iNaturalist Import Tool for Commons | phab:T222648 | A UserScript that allows you to browse and import free-license images of a plant or animal from iNaturalist from within the corresponding Commons category or gallery page | Load a page like https://commons.wikimedia.org/wiki/Category:Tauraco_corythaix with this User Script installed: https://commons.wikimedia.org/wiki/User:Kaldari/inat2commons.js http://inaturalist.org | iNaturalist ( https://www.inaturalist.org/ ) explanation: project that is similar to WM projects, in that it's open-source crowdbuilt database of plants and animals; take pictures in your area, upload them to central DB, people can ID them and comment. Has a nice interface. They encourage the use of free licenses, and have 25 million of photos of plants and animals. Wrote user script to allow user to upload inaturalist photos to commons. [kaldari can't remember his password :-D] [bell rings] Once installed, you can go to any category or gallery page, and you will have an upload link that will allow you to select inaturalist photos that match. It is then uploaded and yay! | @kaldari | testing | Commons Editors | |
6 | Scribe | phab:T230613 | Supporting editors of low resource languages to create new articles on their Wikipedia, focusing on mobile devices | https://www.youtube.com/watch?v=-XZj0h5-hV0 | Scribe on mobile is a tool that supports new editors on unresourced Wikipedias to write new articles. On mobile, you select a structure for an article. you select some references [lags] it suggests then building blocks to create an article. | Lucie-Aimée Kaffee, Hady Elsahar, Joe Reeve | development | (new) Editors | |
7 | Parliament diagram tool | Allowing users to re-use previous party lists for new diagrams | https://tools.wmflabs.org/parliamentdiagram/parlitest.php | Based on experience volunteering on the Graphic lab. A previous tool was used to create PNG diagrams which were frequently requested to convert to SVG, and the tool that was developed a few years ago does this. A frequent request was to simplify the creation of diagrams. In this Hackathon, we allowed two new options: 1) Use Wikidata through this tool: select a party or multiple parties, and generate a list of elected in an assembly-shaped diagram in SVG. 2) Copy list of parties from a previous diagram. Now you can generale 20 diagrams very easily to cover the evolution of the parliament. | @slashme ( David Richfield) @ranjithsiji (Ranjith Siji) @Ambadyanands (Ambady Anand S) | maintenance | Readers, Editors | ||
8 | pywikisource | phab:T230529 | Allowed Developers to get Wikisource related data like number of book pages, Current page quality status, proofreader, and validator in python. | https://github.com/indictechcom/pywikisource#Demo | There is a lot of activity on Wikisource in India, and they have a lot of requests to create tools for Wikisource related. Quality, proofreading, and validation data can be got by this python library. | @Jayprakash12345 @Suyash.dwivedi | implementation | Developers | |
9 | Maps, maps, maps! | phab:T227036 | We're making it easier to discover maps on Wikimedia Commons, and to better store their geospatial metadata! | https://github.com/bertspaan/wikimania-hackathon-2019 | Finding maps on Commons is hard. You can use category:maps or rectified maps via Wikimaps Warper ( https://warper.wmflabs.org/ ). Georectifying uses fitted points on a map to show how the map fits over new maps. This can be used to drive much nicer discovery. This kind of tool should be on Commons too;and the metadata it uses should be available on WD/Commons. Maps are georectified, and the tool indexes the maps based on where they are in the world, and presents their outlines. 5 master repositories of maps have been used. The project created data specs, prototyped a service. Next is moving it to WMF labs, and to get the WD properties approved. Follow us on Github. | @bert, @JHeald, @thisismattmiller, @Abbe98 | development | Editors, Developers | Slides: https://bertspaan.nl/wikimania-hackathon-2019/presentation |
10 | InternetArchiveBot | m:InternetArchiveBot | Better less demanding OAuth for privacy focused users, and UI enhancements. | https://tools.wmflabs.org/iabot | Internet Archive archives Webpages. It is really important to keep track of references on Wikipedia, and we use Internet Archive to do so. Making updates to admin interface. IABot requires lots of permissions to allow you to fix deadlinks; normally not needed. If you don't need to run the bot on your own, you can still queue up the bot, it will attribute the edits to your name. Changing Domain Data is time consuming and it can timeout on your browser. The tool allows you to make the queries on a separated server. If you want to actually run the tool, however, you can upgrade your permissions. UI has also been improved. | @cyberpower678, @skalman, @thedj | testing and enhancement | Editors and readers | |
11 | Hostbot-AI Experiment Analysis | phab:T211160 | Did an AI-powered version of enwiki's hostbot retain more newcomers than before. | http://isalix.hestia.feralhosting.com/hostbot-analysis.png | Teahouse is a place on English Wikipedia ( https://en.wikipedia.org/wiki/Wikipedia:Teahouse ) where newbies can get mentoring. The main way that people come there is that HostBot invites them. There are limited hosts on the Teahouse, so they only want 300 newcomers per day. The problem was solved by using heuristics like edit count to find suitable newbies to invite. Now we have ORES to determine the quality of people's edits ( https://www.mediawiki.org/wiki/ORES#Edit_quality ). We ran an A/B test to see the difference in how long people stay on WP depending on whether they're invited by heuristic-based or ORES-based inviter. AI bot was consistently better than heuristic-based. | @notconfusing | analysis | AI-developers / researchers/ mentors | |
12 | Multilingual JavaScript | phab:T230665 | A global project deserves a multilingual scripting/templating language. | https://docs.google.com/presentation/d/1DhFIYK6MmKwp0rLnSZ9M2uufzvx442BOa82dFvcn6yE/edit | Works for WMF, but this is a personal project. Although it’s likely that not everything will always be translated, knowledge of English should not be a prerequisite for participation in our projects. Right now English is used for writing Template code and Scribunto modules. But we could truely localize JavaScript and then use it to write templates. Rewrite all the strings and variable names and comments and method calls to translatethe code. Gives an option to have template code translated in various languages. | @cscott | design | Developers | |
13 | Copyvio bot for Commons | phab:T230561 (see also phab:T31793) | A bot to help commons patrollers to find files require additional attention for copyright issues | https://commons.wikimedia.org/wiki/User:%D7%A2%D7%A8%D7%9F/Automated_copyright_check | To help editors and patrollers on Commons to find copyright problems. The bot goes over new files by new editors, without exif data, and creates a record. The record has a score based on a [] model that looks at the metadata of the file, and decides which need more attention. For example this new image, which is already flagged for speedy deletion. The metadata contains a string that indicates that it's from Facebook. This wasn't hard-coded. Google doesn't always scan FB, so a google search won't always find the duplicates. Hopefully this will help the commons community to find such copyvios. | @eranroz | development (POC) | Editors, Administrators | |
14 | Various i18n tasks | phab:T229922 | Fix a "lego" message to enable gramatically correct translations in Advanced Search. Took wikidata's FAQ page from 11% to 100% in Bulgarian | https://commons.wikimedia.org/wiki/File:I18n_lego_string.png | Worked on various small fixes. One example: Bug in Advanced Search Interface. Tag in search that says "sort by relevance"; it was two messages that were concatenated during runtime, and in some languages this isn't grammatically correct. This is now fixed. Call to audience: get into the Wikidata Help pages and get translating! | @Tonina_Zhelyazkova_WMDE | implementation | Editors, Developers | |
15 | Mentorship tools | phab:T228210 | Newcomer homepage <https://www.mediawiki.org/wiki/Special:Mylanguage/Growth/Personalized_first_day/Newcomer_homepage> automatically assign a mentor to all newcomers, but it doesn't allow to change for eg. wikicourses or similar. This feature allows users to change the mentor. | https://docs.google.com/presentation/d/1yh3rmpCJlrHR94ycOrBaDxUrO6thV6LyXk1mrTpAJvk/edit | The mentorship system created by the Growth team programmatically assigns mentors to all newbies. Problem is that the mentor is assigned automatically, but if you're on a course, you want your instructor. This tool allows the mentor to take a mentee specifically. | Team:@Urbanecm | testing | Editors | |
16 | Lightweight tool to visualise Wikipedia article contest flow & results | phab:T230464 | Article contests and edit-a-thons might be fun for participants, but for rest of us it's hard to get insight into what exactly happened during these events. For that we visualise the flow and results of these events in data story style to be presentable for general public as well as media in after event press releses etc. During the hackathon we created basic structure to gather and store the data need and made an example visualisation to be easily expanded for other visualisations and additional data. | https://infoaed.ee/vagavesi/ | Article contest: if you look through the articles written for the contest you don't see it easily, need to go through all the articles. [?] Article contest with users and articles [?] Visualisation goes through all the users and articles and articles written during the time of the contest. Example visualisation: articles by length shown as bubble diagram, so don't have to go through pages and text to figure it out. Second visualisation is a bit more tricky: users and number of articles written by them. Some registered participants and other users affecting article, so visualisation to split these groups. Can be useful when setting up press releases. | @tramm | development | Readers, Developers | |
17 | Category overview | phab:T230678 | There are often lots of articles in a category but no easy way to get an overview of the articles at a glance. This tool is for having a quick overview of a category on the wiki. Right now it only works on enwiki but it's easy to make it work on other projects too. | https://niharika29.github.io/wikitopics/ | Work as product manager on tools team but this isn't relevant to this. Quick overview of topic on WP? Are category pages useful? Lots of info there, but hard to get overview from it. Javascript tool to get info about categories and present it in a usable way. [Demo: "fire"] Shows article and page extract with pics. [Demo "goats"] same: now we know that there's a rent-a-goat company ( https://en.wikipedia.org/wiki/Rent_A_Goat ). Can make category pages more useful. Might at some point be converted into a user-script | @Niharika29 | analysis / design / development | Readers | |
18 | Bookmarklet for URL Shortener | phab:T222360#5420068 | The URL shortener is a great tool for generating short links to wiki pages, but you have to navigate to a specific page, then copy and paste your URL in to use this tool, which is time consuming. This bookmarklet turns that into a one-click process, giving you your short URL instantly. | https://phabricator.wikimedia.org/T222360#5420068 | Usually works on editing team on VisualEditor, but now something different: URL Shortener (example: https://w.wiki/7Ed ) takes any URL from a WM site, even etherpad; go to w.wiki, click "shorten" and get a short URL. Could be easier, though: make bookmarklet that makes an API request and presents it for you. If you're on etherpad or phab, you're redirected because they don't have access to your credentials. | @ESanders | implementation | Readers, Developers | |
19 | Offline voice based Wikipedia | If you don't have internet and maybe struggle with text it might be nice to hear Wikipedia content anyway | Cool if you could have WP offline with just your voice. Built on Mozilla DeepSpeech ( https://github.com/mozilla/DeepSpeech ) and Common Voice ( https://voice.mozilla.org/en ) projects, powered by TensorFlow. [Demo] Hello Sweden, will you please go offline. Waitu waitu waitu waitu waitu OK 1231 Cornflakes.Here is what I heard you say: Cornflakes. Let me ask you [?] {And not much happens due to the curse of live demos.} Here is what I heard you say: Solar system Let me ask Wikipedia. Hang on. Let me see if Wikipedia can help you hang on. I do have one file. It is too small. Perhaps we should wait a few weeks and try later. [Demo fail, author is happy to show later.] | @dr0ptp4kt | analysis / design / development / testing | listeners / voiced (for now) |
Not presented in the showcase, but here's some info documenting our project
[edit | edit source]Name | Task | Brief description | URL | Team | Development status | Audience |
---|---|---|---|---|---|---|
SQID 2.0 | phab:T230467 | This is a from-scratch rewrite of SQID, the Wikidata data browser. The old version is nearly unusable now, since several architectural assumptions have been invalidated by the massive growth of Wikidata. The new version should stay usable even in the face of continued growth. | https://tools.wmflabs.org/sqid/sqid-ng/ | @mmarx | implementation | Readers, Editors |
The Exploration Game | phab:T230468 | We present (the first protoype of) an interactive game, where players can select some Wikidata properties and interactively explore the implicational theory that emerges from the subset of Wikidata spanning these properties. Such implications can lead to the discovery of new insights over the stored data, as well as point to missing or incorrect information. | https://tools.wmflabs.org/teg/ | @mmarx, @thanika | implementation | Readers, Editors |
Track module registry size over time / Improve module registery overhead | phab:T229836 and phab:T229791 | We need to track resource module registery overhead that have overhead in every request to wikimedia | https://grafana.wikimedia.org/d/BvWJlaDWk/startup-module-size | @Krinkle @Ladsgroup | maintenance | Developers |
RFC: Abstract schemas and schema changes | phab:T191231 | Using a better system to do schema changes in mediawiki core | @Ladsgroup | maintenance | Developers | |
WikiBooster / Booster | Tool to help editors make tedious actions semi-automatically. Bot scans Wikipedia for some problem (e.g., articles about people eswithout DEFAULTSORT) and proposes changes. The user then presses "Yes"/"No" to accept or reject change. Kind of similar to Wikidata Game. | https://tools.wmflabs.org/booster/#/ | @Edgars2007 | development | Editors | |
Wikidocumentaries | phab:T229724 | Wikidocumentaries is a website where users can find media in GLAM collections around the world about topics in Wikidata. By engaging with interesting content, they will enrich the media and import that content and data to Wikimedia projects. To allow users to write data and media to Wikimedia projects, we need to make registration easier and allow OAuth. | http://wikidocumentaries-demo.wmflabs.org/Q3885577 | @MikkoRinne, @Susannaanas... | development | Readers, Editors, Administrators, Developers |
New Wikitext for Discussions (pls schedule after 1445) | phab:T230654 | Crazy ideas for the wikitext markup language to make discussions better! | https://docs.google.com/presentation/d/1mI9FHi8y1ivx9skMx7bIKJsSF0UoV0dtE15Lc_1qT3A/edit | @cscott, @ssastry | design | Editors |
Clean up part of common.css | phab:T228604 | Making Wikipedia faster by cleaning up the unused common.css and/or moving them to the right place. If you want to help it for your wiki, read the ticket! | NA | @Ladsgroup and lots of contributors <3 | maintenance | Administrators, Developers |
Gadget size summary | not sure if we have a task, or if we're going to have a task. At some point we'll have a Gerrit change number | Each gadget registered in MediaWiki:Gadgets-definition adds some bytes to each MW page request, and each gadget enabled (or made default i.e. enabled for all users by default) adds quite a bit of hidden content on each page request. This is fine for first-world users on decent connections and with decent devices, but not so much in parts of the world with poorer internet connections and devices. I would like information about the size of gadgets to appear to interface administrators. | Not available yet. If/When approved it will eventually show up at beta.wmflabs.org sites like https://en.wikipedia.beta.wmflabs.org/wiki/MediaWiki:Gadgets-definition for a few days before appearing on live wikis | @Krenair (Alex Monk), idea from @Jdforrester-PERSONAL (James Forrester) | development | Interface Administrators, Developers |
Changing internationalisation of API messages | phab:T189982#5408201 | API messages have been internationalisable for a few years, but the messages from extensions are grouped with the rest of the extension's messages. The problem with this is that they tend to require translators with more technical knowledge, and likely have lower visibility than user interface messages, but count against the extension's normal translation group statistics. I have written scripts to create commits for this and have begun uploading for review. | N/A | @Krenair (Alex Monk) | implementation | Translators, Developers |
Dynamic navboxes / wikitext syntax | WMF parsing team sat down with fandom/wikia to talk about alternative wikitext syntax for infobox/navbox elements | https://etherpad.wikimedia.org/p/wikimania2019wikitext | @cscott, @ssastry, Isaac Fisher | planning / analysis / design / development / testing / implementation / maintenance | Readers, Editors, Administrators, Developers |