Understanding Open Collaboration of Wikipedia Good Articles with Factor Analysis
Abstract
This research aims at understanding the open collaboration involved in producing Wikipedia Good Articles (GA). To achieve this goal, it is necessary to analyse who contributes to the collaborative creation of GA and how they are involved in the collaboration process. We propose an approach that first employs factor analysis to identify editing abilities and then uses these editing abilities scores to distinguish editors. Then, we generate sequence of editors participating in the work process to analyse the patterns of collaboration. Without loss of generality, we use GA of three Wikipedia categories covering two general topics and a science topic to demonstrate our approach. The result shows that we can successfully generate editor abilities and identify different types of editors. Then we observe the sequence of different editor involved in the creation process. For the three GA categories examined, we found that GA exhibited the characteristic of highly scored content-shaping ability editors involved in the later stage of the collaboration process. The result demonstrates that our approach provides a clearer understanding of how Wikipedia GA are created through open collaboration.
References
- 2011] Information quality in Wikipedia: The effects of group composition and task conflict. Journal of Management Information Systems, 27(4), 71–98. Crossref, Web of Science, Google Scholar [
- 2008] Size matters: Word count as a measure of quality on Wikipedia. In Proceedings of the 17th International Conference on World Wide Web, pp. 1095–1096. Crossref, Google Scholar [
- 1993] Human Cognitive Abilities: A Survey of Factor-Analytic Studies, Cambridge: Cambridge University Press. Crossref, Google Scholar [
- 2020] Understanding open collaboration of Wikipedia Good Articles. In International Conference on Human–Computer Interaction, pp. 29–43. Cham: Springer. Crossref, Google Scholar [
- 2008] Collaboration in context: Comparing article evolution among subject disciplines in Wikipedia. First Monday, https://firstmonday.org/article/view/2217/2034. Crossref, Google Scholar [
- 2005] Internet encyclopedias go head to head. Nature, 438, 900–901. Crossref, Web of Science, Google Scholar [
- 2008] An analysis of topical coverage of Wikipedia. Journal of Computer-Mediated Communication, 13(2), 429. Crossref, Web of Science, Google Scholar [
- 2008] Patterns of revision in online writing: A study of Wikipedia’s featured articles. Written Communication, 25(2), 262. Crossref, Web of Science, Google Scholar [
- 2011] A multimethod study of information quality in wiki collaboration. ACM Transactions on Management Information Systems, 2(1), 1–16. Crossref, Google Scholar [
- 2007] Power of the few vs. wisdom of the crowd: Wikipedia and the rise of the bourgeoisie. World Wide Web, 1(2), 19. Google Scholar [
- 2009] Coordination in collective intelligence: The role of team structure and task interdependence. In SIGCHI Conference on Human Actors in Computing Systems, pp. 1495–1504. Boston: ACM. Crossref, Google Scholar [
- 2008] Harnessing the wisdom of crowds in Wikipedia: Quality through coordination. In International Conference on Computer Supported Cooperative Work, pp. 37–46. ACM CSCW. Crossref, Google Scholar [
- 2015] The virtuous circle of Wikipedia: Recursive measures of collaboration structures. In Proceedings of 18th Conference on Computer Supported Cooperative Work and Social Computing, pp. 1106–1115. Crossref, Google Scholar [
- 2014] The wisdom of minority: Discovering and targeting the right group of workers for crowdsourcing. In Proceedings of the 23rd International Conference on World Wide Web, pp. 165–176. Crossref, Google Scholar [
- 2020] Wisdom of crowds: The effect of participant composition and contribution behaviour on Wikipedia article quality. Journal of Knowledge Management, 24(2), 324–345. Crossref, Web of Science, Google Scholar [
- 2011] Who does what: Collaboration patterns in the Wikipedia and their impact on article quality. ACM Transactions on Management Information Systems, 2(2), 1–23. Crossref, Google Scholar [
- 2010] Wisdom of the crowd or technicity of content? Wikipedia as a sociotechnical system. New Media and Society, 12(8), 1368–1387. Crossref, Web of Science, Google Scholar [
- 2006] Cultural differences in collaborative authoring of Wikipedia. Journal of Computer-Mediated Communication, 12(1), 88–113. Crossref, Web of Science, Google Scholar [
- Ren, R and B Yan (2017). Crowd Diversity and Performance in Wikipedia. In The Mediating Effects of Task Conflict and Communication, In CHI Conference on Human Factors in Computing Systems, pp. 6342–6351. ACM. Google Scholar
- 2015] Crowd size, diversity and performance. In ACM Conference on Human Factors in Computing Systems, pp. 1379–1382. ACM, Seoul. Crossref, Google Scholar [
- 2008] Information quality work organization in Wikipedia. Journal of the American Society for Information Science and Technology, 59(6), 983–1001. Crossref, Web of Science, Google Scholar [
- Wikipedia (2007). Wikipedia Good Articles, https://en.wikipedia.org/wiki/Wikipedia:Good_articles. Google Scholar
- Wikipedia (2008). Wikipedia Assessment, https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Wikipedia/Assessment. Google Scholar
- Wikipedia (2018). Wikipedia: Statistics, https://en.wikipedia.org/wiki/Wikipedia:Statistics. Google Scholar
- 2020] Assessing the contribution of subject-matter experts to Wikipedia. ACM Transactions on Social Computing, 3(4), 1–36. Crossref, Google Scholar [
- 2017] Crowd development: The interplay between crowd evaluation and collaborative dynamics in Wikipedia. ACM on Human-Computer Interaction 1(CSCW), 119. Crossref, Google Scholar [