Wikipedia and the Second Renaissance

Download Report

Transcript Wikipedia and the Second Renaissance

Collaborative Writing: Wiki and
Wikipedia
Keshava P Subramanya ([email protected])
Roopa Kannan ([email protected])
Today’s Talk






Quick introduction about the wiki and
collaborative writing idea.
Wikipedia
Two views of how Wikipedia works
Criticisms
Details about the Community
Future
What is collaborative writing?



Projects where written works are created by multiple
people together (collaboratively) rather than
individually
Some projects are overseen by an editor or editorial
team
Many grow without any top-down oversight.
Computer based collaborative
writing




Revision control software providing check-in/out
( example subversion, cvs )
Enterprise information portal, Content management
system
SharePoint
Wikis
Some Collab projects




Novel Twists Online collaborative novel where each of
the 150 pages is written one at a time by a different
person.
co-write.me.uk
The Linux documentation project
OOoAuthors
What is a Wiki



Essentially a dynamic, collectively authored set of
web pages.
Invented in 1995 by Ward Cunningham to facilitate
online collaboration about programming and design
best practices.
Evolved by the early 2000’s into a way to facilitate all
kinds of online collaboration.
Wiki – Definition


A wiki (according to Ward Cunningham) is a type of
website that allows users to add and edit content and
is especially suited for constructive collaborative
authoring.
In essence, a wiki is a simplification of the process of
creating HTML pages combined with a system that
records each individual change that occurs over time,
so that at any time, a page can be reverted to any of
its previous states.
As defined in Wikipedia.
How the Wiki Got Its Name


Wiki is the Hawaiian word meaning “quick”, “fast”, or
“to hasten”.
Wiki-Wiki is the name of the bus line in the Honolulu
International Airport.
How the Wiki Got Its Name
How the Wiki Got Its Name
“Wiki-wiki to the beach.” - Elvis
Presley (as Chad Gates) in the
movie Blue Hawaii (1961). The
line was said with a snap of the
fingers.
Some more …
Wiki (according to UIC Prof. Steve Jones)
Web-based
Interactive
Kollaborative (collaborative)
Iterative
Wiki is sometimes interpreted as the backronym
for “What I Know Is”, which describes the
knowledge contribution, storage and exchange
function.
More Uses for a Wiki






100 things to do before you die
The world’s largest “How-To” manual – wikiHow
Things to do in Seattle
World-wide travel guide – wikitravel.org
Everything you want to know about VoIP
All about the flu – Flu Wiki
Free Hosting of Wikis
wikihost.org
free-wiki-hosting.com
wikicities.com
educational.blogs.com
duckcomputing.com
pbwiki.com
wikispaces.com
What is Wikipedia?




Wikipedia is a freely licensed encyclopaedia written
by thousands of volunteers in many languages
Free license allows others to freely copy,
redistribute, and modify work commercially or noncommercially
Founded January 15, 2001
Run by the wikimedia foundation.
wikipedia.org
What is the Wikimedia
Foundation?

Non-profit foundation
Its 4th Quarter 2005 costs were $321,000 USD, with hardware
making up almost 60% of the budget
Where does it get the money ?


Aim: to distribute a free encyclopaedia to every single person on the
planet in their own language
Wikipedia and its sister projects
wikimediafoundation.org
Advantages of Freely Licensed
Content





GNU Free Documentation Licence
Remains non-proprietary
Enhances the popularity of Wikipedia
Decreases individual sense of ownership
Increases a sense of shared ownership
Free Software






MediaWiki is GPL
Uses all free software on the website
GNU/Linux
Apache
MySQL
Php
How big is Wikipedia?



English Wikipedia is largest and has over 260 million
words
English Wikipedia larger than Britannica and Microsoft
Encarta combined
In 15 months the publicly distributed compressed
database dumps may reach 1 terabyte total size
How big is Wikipedia Globally?
Total more than 5 million articles!
 English – 1,412,000 articles
 German – 172,000 articles
 Japanese – 87,000 articles
 French – 66,000 articles
 Swedish –53,000 articles
 Over 5 million across 250 languages
 19 with >10,000. 52 with >1000
(statistics could be dated)
How popular is Wikipedia?






According to Alexa.com, Wikipedia (ranked ~ 20th) is more
popular than the websites of:
IBM
Paypal
Open Directory Project
Geocities
~400 Million page views monthly
Wikipedia vs. Britannica

AP article on CNN website
This study was challenged by Encyclopædia
Britannica, who described it as "fatally flawed.“
source www.wikipedia.org
Wikimedia Projects







Wikipedia
Wiktionary
Wikibooks
Wikiquote
Wikispecies
Wikimedia Commons
Wikinews
Wikinews



Community edited news along the same principles of
Wikipedia
Fairly new project
Aim of the project
wikinews.org
Wikimedia’s Hardware




30+ servers
Squid caching servers in front to serve cached
objects quickly
Apache/PHP webservers in the middle
Database backend (MySql)
MediaWiki





MediaWiki is one of many wiki engines
Collaborative software that allows users to add or
edit content
Primarily developed for Wikipedia from 2002 onwards
Scalable and multilingual
Free license
MediaWiki features



Quality control features (versioning)
Editing features (simple markup)
Community features (talk pages, profiles, access
levels)
Page History
DEMO
DEMO
Interlanguage linking
Criticism Workshop 
Hints:




Can Wikipedia Content Be Trusted?
Systematic bias
Reliability of Information
Technology requirement
Can Wikipedia Content Be
Trusted?





Review processes
Partly post-moderation, partly reactive moderation
Linking to particular revisions
Development of a stable version
Free license allows you to modify it
Reliability of Information



Criticism
The community contribution
approach allows for too much false
information.
Without an expert background a
person can not present an unbiased,
factual position.




Rebuttal
The open source approach allows
for new information to be added on
a daily basis.
The articles that exist on Wikipedia
are a group effort where any wrong
information can be edited.
The group editing also lets people
combine information to get a broad
background.
Reliability of Information



Criticism
The large quantity of daily
information added prevents proper
fact checking.
The daily edits allow too many
mistakes to go unnoticed or be
reintroduced.




Rebuttal
Wikipedia does maintain a staff
whose sole purpose is to review and
edit articles.
Each day articles are viewed by
thousands of people, any one
person can implement changes to
correct mistakes.
Printed encyclopedias can not fix
errors once released, while
Wikipedia is always able to make
corrections.
Systematic Bias



Criticism
Systematic bias exposes WIkipedia
to unbalanced amounts of
information.
People are more likely to write
about topics that interest them as
opposed to more historically
significant topics.




Rebuttal
Past requests for information have
been met with quick action.
These responses have created huge
increases in the amount of coverage
of topics.
Wikipedia also includes a inquiry
page. Any topic can be requested
and the Wiki community is quick to
respond.
Technology Requirements




Criticism
Wikipedia faces technology
constraints as an online
encyclopedia.
A reader must have Internet access
at all times.
The possibility of tech failure on the
Wikipedia’s end also presents
problems.



Rebuttal
The technology constraints
constantly decrease as the world
becomes more advanced.
The student population has almost
100% Internet access due to school
resources and class requirements.
Latest Information


Wikipedia is built on the belief that collaboration
among users will improve articles over time.
The software of Wikipedia allows for rapid updating
of existing articles, as well as constant introduction of
new topics.
Quick Vandalism Response



Most vandalisms on Wikipedia are reverted within five
minutes.
There is a record of change made to every page and
Wikipedia volunteers watch the list of recent
changes.
If a user constantly vandalizes pages of Wikipedia,
individuals can be blocked and pages can be locked
down.
Neutral Point of View



Three sides to everything, your version, my version,
and the truth
Editors are asked to maintain a neutral point of view
when writing for Wikipedia.
When editing wars break out and neutral points of
view are not maintained, Wikipedia volunteers usually
remove the information posted.
Click here
Two Views of Wikipedia
•Emergent
•Community
of thoughtful users
Emergent


Thousands of individual
users who don’t know each
other each contribute a little
bit
Out of this emerges a
coherent body of work
A Community?
Berlin
London
Genoa
A dedicated group of a few
hundred volunteers who know
each other and work to guarantee
the quality and integrity of the
content.
Implications
Emergent Model
 Need reputation mechanisms
like Ebay, Slashdot
 Users are tiny, have no
power
Community Model
 Reputation is a natural
outcome of human
interactions
 Users are powerful, must be
respected
80/10 Rule




Counting only logged in users, and even excluding
some prominent approved bot users
10 percent of all users make 80% of all edits
5 percent of all users make 66% of edits
Half of all edits are made by just 2 1/2 percent of all
users
Edits by Anons





Controversial, intriguing
Yes, you can edit this page
Without logging in!
Anonymous ip numbers can edit Wikipedia
But these edits make up a total of around 18% of all
edits, with some evidence of a downward trend over
time
Edits across namespaces
Articles 85%
 Talk pages 8%
 User Page 3%
 User Talk Pages 4%
These percentages are stable in 2003
And 2004

Wikipedia is a community…
•How
does it work?
•Who are the users?
•How do they self-regulate?
Many types of users


As in any society, there are many types of people -these types are reflected in editing patterns
Individual users may not fit cleanly into a single type,
but thinking about editing patterns is a helpful way to
understand the community
Broad Types





Worker Bees, POV pushers
Police, Judges
Controversy lovers - Moths
Pseudo-users - Sock puppets, Vandals
Extra-Wiki - Mailing list, IRC, Board activities,
Developers
Bees





The most important users at
Wikipedia
But may go unnoticed unless
special attention is given
Generalists
Specialists
Proof-readers
Question: What attracts the bees??
Sock Puppet




Not all sock puppets are bad
Privacy
The chance to start over
But when used wrongly, is
one of the worst offenses
Moth


Drawn to flames
Not necessarily a bad
thing - some people thrive
on controversy
Vandal


Less of a problem for the community than most people
assume
Vandalism is easy to revert, and blocking vandals
(temporarily) slows them down and takes the fun away
Outside the Wiki



Developers - coders and system admins
IRC Channels
Mailing lists
Wikipedia Governance





A confusing but workable mix of
Consensus
Democracy
Aristocracy
Monarchy
Community Challenges

How can such a large community scale?
– Through software features
– Through policy (mediation, arbitration)
– Through an atmosphere of love and respect
Community Self-Regulation


Quality control features: recent changes,
watchlists, related changes, page histories, user
contributions lists
Community features: talk pages, user profiles,
access levels, user-to-user email, message
notification.
International Community




Interlanguage linking of articles
Choice of language interface
Global newsletter: Quarto
“Translation of the week”
Conclusion



Wikipedia is a community
Automated and artificial Slashdot-style
reputation metrics are not needed and may
not be desirable
Achieving quality levels equalling or
exceeding traditional publishing models can
be expected without “emergent” magic
Credits


http://www.wikipedia.org and related sites
Some slides adapted from
– Jimmy Wales President, Wikimedia Foundation Wikipedia Founder
– Prof. Burks Oakley II Prof of E.C.E University of Illinois