Wikipedia:WikiProject Wikidemia/Quant/Arch
Parser[edit]
- This converts the zipped xml database dumps into csv files with file specification:....
Stats[edit]
- csv files of header information can be read into Statistical software packages R and Stata
Analysis[edit]
Figure Production[edit]
Table Production[edit]
Data Anomalies[edit]
In the Indonesian Wikipedia dump occasionally usernames appear in the <ip> tag (e.g. user:Vyasa). These appear to be localized to 2003. It is not clear why this occurs.