-
English Wikipedia pageviews by second
This file contains a count of pageviews to the English-language Wikipedia from 2015-03-16T00:00:00 to 2015-04-25T15:59:59, grouped by timestamp (down to a one-second resolution... -
Wikipedia Clickstream
This project contains data sets containing counts of (referer, resource) pairs extracted from the request logs of Wikipedia. A referer is an HTTP header field that identifies... -
Wikidata
The free knowledge base anyone can edit https://wikidata.org -
Teahouse corpus
The Teahouse corpus is a set of questions asked at the Wikipedia Teahouse, a peer support forum for new Wikipedia editors. This corpus contains data from its first two years of... -
Wikimedia user agents
A dataset of parsed reader and editor browser agents from the Wikimedia web properties. The intent behind releasing the parsed agents is to make it easier for Wikimedia... -
Scholarly article citations in Wikipedia
About This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia. License All files included in this datasets are released under... -
Wikipedia Article Feedback corpus
This dataset contains the entire corpus of feedback submitted on the English, French and German Wikipedia during the Article Feedback v.5 pilot (AFT). The Wikimedia Foundation... -
English Wikipedia Reverts
Reverting and reverted revisions from the English Language Wikipedia. See https://meta.wikimedia.org/wiki/Research:Revert This dataset was last updated in on Aug. 23rd, 2014. -
Wikipedia new user registrations
Historical data on new user account registrations to the English Wikipedia and other large Wikipedias. -
Wikipedia pageview stats
This is real, accurate hourly snapshot data on the access to Wikipedia captured from the Wikimedia Squid servers. Project counts show the total access in a time period to the... -
Wikipedia user preferences
Data on user preferences set by active Wikipedia editors. Active editors are defined as registered users with at least 5 edits per month in a given project. The dumps were... -
Wikipedia Templates
This dataset shows the top 60 Wikipedia templates that editors, both new and experienced, receive on their Talk pages. The dataset covers the period 2007 - 2011. -
Wikipedia Editor Engagement Experiments: Timestamp position modification
This experiment looks at the effects of linking to the revision history of Wikipedia articles with a prominent "last modified" timestamp. Currently, the only way for readers to... -
Wikipedia Banner Challenge: Votes file
This file has one row for each vote. For a more detailed file layout, see http://blog.allourideas.org/post/2739358388/download-your-data -
Wikipedia Banner Challenge: Non-votes file
This file has one row for each non-vote (e.g., a voter clicking "I can't decide"). For full file layout details, see:... -
Wikipedia Banner Challenge: Banner file
This file has one row for each banner. For a full file layout, see http://blog.allourideas.org/post/2739358388/download-your-data. -
Wikipedia article ratings
A complete anonymized dump of 11M article ratings collected over 1 year (July 2011 - July 2012) from the English Wikipedia. Read more... -
Wikipedia, the free encyclopaedia
Wikipedia dumps of full content of wikipedia. Database backup dumps - A complete copy of all Wikimedia wikis, in the form of wikitext source and metadata embedded in XML. A... -
Wikimedia Research Newsletter corpus
A curated corpus of references on Wikipedia and Wikimedia research, reviewed in the monthly Wikimedia Research Newsletter. -
Wikimedia Fundraiser Public Data
Public data about the Wikimedia Fundraiser. Data is refreshed every 15 minutes and includes the complete historical series since 2006.