User talk:Junjulien

From Wikipedia, the free encyclopedia

Hello,
Sorry for the late answer (holidays...). It is true that I compile part of the Multilingual statistics, but my contribution is limited to taking the current copy of http://meta.wikimedia.org/wiki/List_of_Wikipedias and feeding it to a script that generates the table. The list of Wikipedias itself is bot-generated, as far as I know, but I have only a vague idea of how (Wikipedia's ways can be strange at times... :-)
Your project would need a great deal of data about editors and readers. Data about readers is probably unavailable: it would require collecting server logs, and the Wikimedia servers cannot record visitor logs at our current load. I remember seeing on wikitech-l that someone is recording decimated data, e.g. one in 10 or 100 visitors, but with personal information such as the originating IP deleted, which would defeat geolocation.
As for editors, the IP addresses of logged-in users are not collected either. For anonymous editors, however, the IP is recorded in the page history, so you could download a full history dump from http://download.wikimedia.org and see what you can recover. In short, I don't really know how to help you. Try writing to wikitech-l (see http://lists.wikimedia.org/mailman/listinfo/wikitech-l) and see if someone has the data you need.

Cheers,
Alfio
