* Ashton Anderson
In my project, I plan to visualize the quality of Wikipedia pages, using the recently introduced feature that lets visitors rate pages along several dimensions of quality (e.g., "complete", "well-written", "neutral"). Article quality is a subject of increasing importance as Wikipedia is used more widely in classrooms and elsewhere. I want users to see at a glance how quality varies across Wikipedia: whether some "areas" of Wikipedia are of generally higher quality than others, which of these areas could use work, and how the different dimensions of quality relate to each other (does one lead to another? Are they simply equivalent?).
To my knowledge, no one has attempted to visualize the quality of Wikipedia articles.
However, there has been a lot of work on (1) Wikipedia visualization and (2) Wikipedia article quality:
In 2005, a Nature investigation made waves when it reported that Wikipedia and Encyclopaedia Britannica science articles were of similar quality (http://www.nature.com/nature/journal/v438/n7070/full/438900a.html?&$NMW_TRANS$=ext).
In their paper "Measuring Article Quality in Wikipedia: Models and Evaluation" (http://dl.acm.org/citation.cfm?id=1321440.1321476), the authors propose and evaluate several different metrics for measuring article quality directly.
Wikipedia article quality has been an issue for a long time. Jimmy Wales discussed it in 2005, and earlier as well (http://www.theregister.co.uk/2005/10/18/wikipedia_quality_problem/).
In "Assessing the value of cooperation in Wikipedia" (http://arxiv.org/abs/cs/0702140), the authors show that, generally speaking, article quality improves as articles accumulate edits.
Wikipedia visualizations are abundant; see, e.g., the examples in WikiViz 2011 (http://www.wikisym.org/ws2011/wikiviz:presentation), an entire conference devoted to Wikipedia visualizations.
- 11/29 -- Presentation
- 12/3 -- Finish downloading, cleaning, sorting, organizing, aggregating, and pre-processing the data. This includes computing all pairwise article similarities.
- 12/6 -- Finish projection of articles to 2-D space.
- 12/12 -- Finish all other aspects of visualization (outlined in slides).
- 12/13 -- Poster
- 12/15 -- Paper
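The 12/3 and 12/6 milestones (pairwise article similarities, then projection to 2-D) can be sketched roughly as below. This is only an illustration under assumptions the plan does not fix: each article is represented by a hypothetical feature vector (e.g., term counts), similarity is taken to be cosine similarity, and the projection is classical multidimensional scaling computed with simple power iteration. All function names and the toy data are invented for the example.

```python
import math
import random
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pairwise_similarities(vectors):
    """Symmetric n x n matrix of cosine similarities (diagonal set to 1)."""
    n = len(vectors)
    sim = [[1.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        sim[i][j] = sim[j][i] = cosine_similarity(vectors[i], vectors[j])
    return sim


def matvec(matrix, vec):
    return [sum(a * b for a, b in zip(row, vec)) for row in matrix]


def top_eigenpair(matrix, iterations=500, seed=0):
    """Power iteration: largest-magnitude eigenvalue and a unit eigenvector
    of a symmetric matrix (approximate; assumes a reasonable spectral gap)."""
    rng = random.Random(seed)
    vec = [rng.random() + 0.1 for _ in matrix]
    for _ in range(iterations):
        nxt = matvec(matrix, vec)
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0:
            return 0.0, vec
        vec = [x / norm for x in nxt]
    # Rayleigh quotient gives the eigenvalue estimate for the unit vector.
    value = sum(a * b for a, b in zip(vec, matvec(matrix, vec)))
    return value, vec


def classical_mds_2d(sim):
    """Project items to 2-D from a similarity matrix via classical MDS,
    treating (1 - similarity) as a distance (an assumption of this sketch)."""
    n = len(sim)
    d2 = [[(1.0 - sim[i][j]) ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(row) / n for row in d2]
    grand_mean = sum(row_mean) / n
    # Double-centre the squared distances to get the Gram matrix.
    b = [[-0.5 * (d2[i][j] - row_mean[i] - row_mean[j] + grand_mean)
          for j in range(n)] for i in range(n)]
    coords = [[0.0, 0.0] for _ in range(n)]
    for axis in range(2):
        value, vec = top_eigenpair(b)
        scale = math.sqrt(max(value, 0.0))  # ignore negative eigenvalues
        for i in range(n):
            coords[i][axis] = scale * vec[i]
        # Deflate so the next pass finds the next eigenpair.
        b = [[b[i][j] - value * vec[i] * vec[j] for j in range(n)]
             for i in range(n)]
    return coords


# Toy data: four hypothetical articles forming two topical groups.
vectors = [[5, 1, 0], [4, 1, 0], [0, 1, 5], [0, 2, 4]]
sim = pairwise_similarities(vectors)
coords = classical_mds_2d(sim)
for article, (x, y) in enumerate(coords):
    print(article, round(x, 3), round(y, 3))
```

In the toy data, the first two articles should land close together and far from the last two. For the full article set, an all-pairs comparison grows quadratically with the number of articles, so a sampled or sparse variant would likely be needed.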
CS448B_AA_windows.zip -- Windows source and executable
CS448B_AA_macosx.zip -- Mac OS X source and executable
CS448B_AA_linux.zip -- Linux source and executable
CS448B_AA_paper.zip -- Final paper (PDF)
CS448B_AA_poster.zip -- Final poster (PDF)