MIT Technology Review, 13 January 2014 link
Bots vs. Wikipedians, Anons vs. Logged-InsMuch of the editing work on Wikipedia is too mind-numbingly repetitive for humans, so automated bots do it instead. But keeping track of automated editing has always been hard…until now. In a little over a decade, Wikipedia has evolved from an internet experiment into a global crowdsourcing phenomenon. [...] Less well known is Wikidata, an information repository designed to share basic facts for use on different language versions of Wikipedia. [...] Behind the scenes, automated bots scan Wikipedia and Wikidata pages continually polishing the content for human consumption.
But that raises an important question. How much bot activity is there? What are these bots doing and how does it compare to human activity? Today, we get an answer thanks to the work of Thomas Steiner at Google’s German operation in Hamburg. Steiner has created an application that monitors editing activity across all 287 language versions of Wikipedia and on Wikidata. And he publishes the results in real time online so that anybody can see exactly how many bots and humans are editing any of these sites at any instant. [...] Steiner’s page also lists the most active bots. Wikipedia and Wikidata have long recognised the damage that bots can do and so have strict guidelines about their behaviour. Wikidata even lists bots with approved tasks
What’s curious about the automated edits on Wikidata is that the most active bots are not on this list. For example, at the time of writing a bot called Succubot is making 5797 edits to Wikidata entries and yet appears to be unknown to Wikidata. What is this bot doing? Steiner’s page will give administrators a useful window into this seemingly shadowy behaviour. [...] An interesting corollary is that bots are becoming much more capable at producing articles of all kinds. [...] All that’s required is to cut and paste the relevant information into the correct places. [...] And while this kind of automated writing can be hugely useful, particularly for Wikipedia and its well documented problems with manpower, it could also be used maliciously too. So ways of monitoring automated changes to text are likely to become more important in future.
Cornell University Library, 5 February 2014 link
PDF of study: linkAbstract
Wikipedia is a global crowdsourced encyclopedia that at time of writing is available in 287 languages. Wikidata is a likewise global crowdsourced knowledge base that provides shared facts to be used by Wikipedias. In the context of this research, we have developed an application and an underlying Application Programming Interface (API) capable of monitoring realtime edit activity of all language versions of Wikipedia and Wikidata. This application allows us to easily analyze edits in order to answer questions such as "Bots vs. Wikipedians, who edits more?", "Which is the most anonymously edited Wikipedia?", or "Who are the bots and what do they edit?". To the best of our knowledge, this is the first time such an analysis could be done in realtime for Wikidata and for really all Wikipedias--large and small. Our application is available publicly online at this http URL, its code has been open-sourced under the Apache 2.0 license.
Wikipedia and Wikidata Realtime Edit Stats: link