From Wikipedia, the free encyclopedia
In the news

WikiLeaks and Wikipedia; Google–WP collaboration to translate health information

Difficult relationship between WikiLeaks and Wikipedia

The logo of Wikileaks

As reported in last week's "In the news", the continuing media attention for Julian Assange's website WikiLeaks (which recently published thousands of documents revealing what The Guardian newspaper called "the true Afghan war") has had an adverse effect on Jimmy Wales and Sue Gardner [1] who both reported that they frequently had to deal with people who mistakenly assume that they are involved in Wikileaks. In an interview with The Independent, Wales said he was "fed up" with the volume of e-mails he was receiving that attacked him for "putting the lives of thousands of US troops at risk", but that his reaction was to "just roll my eyes, chuckle to myself and tell them they've got the wrong man." Somehow in agreement with Assange, who had responded to Gardner's statements by pointing out that "wiki" was around a long time before "Wikipedia", Wales said that he could not just copyright the word 'wiki', because he did not want to prevent other people from starting wikis. He said it is not the Wikimedia Foundation's "style" to "come to blows with Wikileaks in court". He did say, however that he thinks "the issue of having to protect our name is something I can anticipate coming up again in the future."

Both Wales' concern and the public confusion between the two "web behemoths" (The Independent) are nothing new. On January 3, 2007, (the day of the first public report of Wikileaks' existence), the domains wikileaks.com, wikileaks.us, wikileaks.biz and wikileaks.net were registered, apparently by Wales out of concern about the name. To this day, they belong to Wales' company Wikia, although they now appear to be used by Wikileaks in addition to its main site wikileaks.org.

Around that time, Wikileaks described itself as "an uncensorable version of Wikipedia for untraceable mass document leaking and analysis." [2] According to internal emails published by Wikileaks co-founder John Young on his own leaking site, Cryptome, the Wikileaks founders deliberately used Wikipedia's fame to draw attention to themselves among "1000 other organizations jostling for time" at the January 2007 World Social Forum in Kenya, where Assange lived around that time: "even those [people who aren't net-savvy] can't but help hearing about the wikpedia."

Julian Assange (2009)
Although Wikileaks' founders were sometimes quoted as having "no ties" with Wikipedia at the time, there was widespread confusion between the two, as noted by German Wikipedian Avatar, who described a particularly egregious example in his blog: A commentary published in the January 25, 2007, print edition of Die Welt, one of Germany's major broadsheet newspapers, which called Wikileaks "the youngest branch" of Wikipedia, and went on to explain that "the Internet encyclopedists with their exhibitionist will for subversion now want to spread inaccessible, allegedly suppressed knowledge, too", which "perfectly matches the Wikipedians' weird image, who always communicate the image of being a grassroots democracy in their content, but often appear to be anarchist or even chaotic in their methods." (After Avatar complained to the newspaper, the online version was quickly corrected.)

Back to 2010: In a post on his "The Wikipedian" blog some weeks ago, titled " WikiLeaks: No Wiki, Just Leaks", William Beutler ( User:WWB) speculated "that the site was so named to borrow from the credibility enjoyed (and earned) by Wikipedia". He observed that while Wikileaks was running on the MediaWiki software, all participatory functions of the software seemed to be disabled, along with those providing transparency (history and recent changes).

However, Assange initially did intend to model Wikileaks after Wikipedia's collaborative processes, according to his remarks at a symposium at the Berkeley Graduate School of Journalism on April 18 ( video, from 36:30, or on YouTube), where he explained what made him change his mind:

Wikipedia has at least twice been the subject of leaks published by Wikileaks. In 2009, an archive file was uploaded to Wikileaks that was described as containing postings from a private mailing list of some Wikipedia editors, several of whom were later sanctioned by ArbCom due to their involvement in the list. In November of the same year, during a debate in the German public about deletionism on Wikipedia (see Signpost coverage: " German Wikipedia under fire from inclusionists"), Wikileaks published a copy of one of the articles in question, which had been deleted for notability reasons. And in December 2007, Assange had generated media coverage with a report titled Wikileaks busts Gitmo propaganda team, largely containing Wikipedia edits carried out under an IP belonging to the Joint Task Force Guantanamo (but that connection had already been noted four months earlier by User:Computerjoe).

At the same time, Wikileaks and Assange appear to have been concerned about their coverage on Wikipedia. On April 9, a Twitter message by Wikileaks read "WL opponents seem to have created Julian's Wikipedia page. For ethical reasons we can't edit. Please fix". ( Privatemusings, who had started the article Julian Assange, replied that he was " not sure where that's coming from".) On April 21, Assange posted comments on the article's talk page itself, again acknowledging that "it would not be ethical for me to edit this article directly" but stating that "The nature of my work, exposing abuses by powerful organizations and nation states, tends to attract attacks on my person as a way to color debate. The history of this page has numerous examples" (without naming any) and voicing concern that a portrait photo used in the article "tends to undermine my message" (he later donated a publicity image under a free license). On August 16, Assange said "there are frequent attempts by military apologists and others to manipulate our Wikipedia pages".


Major Google–Wikipedia translation project: Health Speaks

Hospitals and doctors are unavailable to much of humanity, so providing free, accessible health-related information in local languages could have a major impact on health outcomes.
Gaining free access to online information could make a huge difference to the lives of many people in developing countries. Jennifer Haroon, Google.org's Manager of Health Initiatives, says that "in most parts of the world, ... quality information that would help people improve their health is not available online in local languages". She points out that as far back as 2004, the prestigious British medical journal The Lancet ran an article describing the lack of access to health information in local languages as a "major barrier to knowledge-based healthcare in developing countries [and that] among currently available technologies, only the Internet has the potential to deliver universal access to up-to-date healthcare information."

Fast-forward to 2010, and Google.org has announced on its home page a new collaboration with the English Wikipedia's WikiProject Medicine on an initiative called Health Speaks. Haroon says the aim is to support pilot translation projects in which "volunteers translate health articles into [Arabic, Hindi and Swahili] and publish them online on Wikipedia for all to access". The project will explore active cooperation between professional medical editors (hired by Google) and any interested Wikipedians to further improve the quality of articles selected by the Google Foundation. Haroon says, "We have chosen hundreds of good quality English language health articles from Wikipedia that we hope will be translated with the assistance of Google Translator Toolkit, made locally relevant, reviewed and then published to the corresponding local language Wikipedia site." The countries targeted are Kenya, Egypt, Tanzania, and India; among those encouraged to register for volunteer translation and translation review are medical, nursing and public health students; health professionals; health NGO employees; and other students.

Volunteers will work from a dedicated site for each language, with access to an ongoing table of articles and their translators'/reviewers' usernames. Volunteers may use the online Google translator toolkit to assist their task, and can register for news updates and information about training events. For the first two months, Google will donate 3 cents (US) to three non-profit medical institutions in India, Egypt, and east Africa for each English word translated, up to $50,000 each; these institutions will provide local support for the program.

The collaboration follows activities by Google.org's parent company Google, which likewise uses the Translator Toolkit to translate Wikipedia articles (Signpost coverage: Google uses machine translation to increase content on smaller Wikipedias and Wikipedia, Google Translate and Wikimedia's India strategy).

Wikipedia's health-science experts comment

Tim Vickers is a US-based biochemist and a key player in our Google Project Taskforce. He says the main issues for Wikipedia will be "improving the quality of the articles as much as possible before they are translated, and trying to tone down the US-centred approach we commonly take to topics." Fvasconcellos is a professional translator and an experienced member of both WP:MED and WikiProject Pharmacology. As a translator, he is glad of two things: the first is Google's choice to use Google Translator Toolkit rather than Google Translate. "The Toolkit is basically a postediting suite, and thus allows articles to be translated more easily and quickly than if the job was done manually, while reducing much of the inevitable inaccuracy of machine translation." The second thing is Google's use of "folks who know what they're doing—the usernames suggest many translators and reviewers are medical professionals. Professional input is of the utmost importance when translating medical and health-related content, as the pitfalls and potential implications of inaccuracy can be far more significant than in other fields.... when you're talking about drug dosage, symptoms, or how to prevent infectious diseases, quality assurance can become a matter of life or death."

Vasconcellos says, "I do think there is a risk that talk pages will become magnets for misguided medical advice, possibly dangerous anecdotal information, and quackery. This happens in the English Wikipedia all the time [and] should certainly be on the minds of all those involved. What's the user base like? Will knowledgeable users be watching the articles for misguided edits, and the article talk pages for inappropriate content?" Kilbad (of the Dermatology task force) says: "As the coverage of medicine-related content continues to grow and improve on Wikipedia, the number of talk-page requests for medical advice will likely increase. [If this occurs, it will be] important to make Wikipedia's medical disclaimer ( WP:MEDICAL) a prominent feature of these pages to discourage the giving and/or asking of advice". However, Tim Vickers feels the "talk-page advice" problem is not that common on enwiki pages, and is likely to be "similarly rare in other languages. Most of the important medical articles are on multiple watchlists, so this won't be a big job."

To Vasconcellos, there are more significant issues. "What will happen when well-meaning users start adding sources that enwiki would not use due to quality concerns? Arguments on the appropriateness of sources happen every day at enwiki. How will they be handled? Will they happen at all, or will articles grow and evolve in an unorganized fashion, with questionable content added and left to 'fester' unchecked?" And "how will traditional medical practices be dealt with? How do these Wikipedias deal with this sort of information at the moment? Is it allowed? Is it given equal footing with (Western) science-based content? We strive to make the medical content of enwiki evidence-based, and by that I mean based on recent, high-quality scientific sources. Is that practice followed at Hindi Wikipedia, for instance, and if not, should it be? Should content based on traditional medical views of malaria or diabetes, for instance, be added/allowed/encouraged for cultural reasons?"

For the Wikimedia Foundation, working through these and related issues is likely to make Health Speaks a fascinating venture into cross-cultural and -linguistic management. If it succeeds, Google hopes the program will grow into a larger scheme involving more languages and more articles.


Briefly

  • Indian Wikimedia office: An article by Indian business newspaper Live Mint ( Wikimedia to open office in India in six months) reported on the Wikimedia Foundation's plans to open its first ever presence outside the US. The article quoted chief global development officer Barry Newstead, who will visit India this month, and Board member Bishakha Datta. A small local team will be built for the office. (The WMF already announced a job opening for a "National Program Director, India", as reported in last week's News and notes.) The office will likely be in Mumbai, in Bangalore (where the newly formed Indian Wikimedia chapter is located) or in Pune. In an interview with The Guardian, the WMF's Executive Director Sue Gardner addressed concerns that its presence in India (a country that recently threatened to ban BlackBerry communication devices) would expose Wikimedia to state censorship pressure: "Our stance is that we won't compromise. We haven't, for example, made any concessions to the Chinese government and we were blocked there up to the [2008 Beijing] Olympics. As a result, our share is still very, very small in China."
  • Jimbo on collaboration skills: Jimmy Wales was interviewed by Howard Rheingold about collaboration skills in an 11-minute video. Wales said much of such skills consisted of basic "kindergarten ethics", but described three things that he had "learned over the years, the hard way": Assuming good faith, but realizing that sometimes a group needed to hold firm to its principles and ask malicious members to leave, in which case they should be offered alternatives, i.e. other projects that might be more suited to their goals. Wales named two conditions for starting successful online collaboration projects: First, it "must be immediately obvious" to a new contributor how to help out or what to do, a condition fulfilled by Wikipedia. He said that his Campaigns Wikia project failed for this reason, as its stated purpose was ambiguous. Secondly, the project has to have a worthwhile goal, even if this needs to be understood in a wider sense such as in the case of the Muppets wiki.
  • COI editing advice: Manny Hernandez, an US activist who founded the Diabetes Hands Foundation, gives other non-profit organizations some quick tips on how to get an article on Wikipedia. [3]
  • Houellebecq plagiarizes Wikipedia: The new novel of French writer Michel Houellebecq contains several passages that have been plagiarized from Wikipedia, according to news website Slate.fr [4]. It noted the similarity of three passages from the book with the French Wikipedia's articles about Frédéric Nihous, the housefly and the town Beauvais. Wikimedia France stated that the passages indeed appeared to have been borrowed from Wikipedia, although the text was of a certain "banality". Slate.fr quoted the chapter's president, Adrienne Alix, who clarified the relevant legal issues concerning free licenses.
  • Singaporean wiki course: The Chronicle of Higher Education reported on a course in wiki-based collaboration for Singaporean students ( Wiki project brings some student wariness at Singapore Management U.).
  • Wikipedia more credible than newspapers: A study about information usage and Web 2.0 in Austria, Slovenia and Croatia, commissioned by telecommunications firm Telekom Austria ( English press release), surveyed 500 Internet users in each country and found that among young users (born 1980 or later), Wikipedia is regarded as at least as credible as traditional news sources: "Wikipedia is turning into the new TV and the new newspaper." In Austria, 63 percent regarded Wikipedia as "credible" or "very credible", compared to only 53 percent for print media and 50 percent for online newspapers and magazines, but slightly behind TV, which 65 percent regarded as trustworthy. [5]
  • The Correctional Service of Canada is investigating a Wikipedia entry made on a government-owned computer renaming Canada's Official Languages Act as the "Quebec Nazi Act." Fed to Investigate 'disgusting' wikipedia entry- Canada
  • News agency Associated Press asked Jimmy Wales about his opinions on the future of paid news content last week. He was skeptical about paywalled newspaper websites, but optimistic about paid apps on the iPad or Kindle. Concerning Wikipedia, "he joked about how his relatively modest means has misled people into thinking he's an ideological opponent of commerce. Far from it; his new [sic] venture Wikia, which builds online communities around shared interests, is ad-supported. 'I don't think it's right for Wikipedia but I think it's perfectly fine for other contexts," he said. 'When you go to Wikipedia you should be there to learn, reflect, think contribute in a positive way ... We don't need to commercialize it.'"