Post Wikipedia

Wikimedia advertising (soft) drive

Tuesday, October 23rd, 2007

Wikipedia (actually the Wikimedia Foundation) started another fundraiser yesterday. I’ll just reference what I’ve said in the past:

I am convinced by comments on the above posts and conversations since that it will take a huge shift in Wikipedia community opinion for advertising to have a chance. The time for direct argument in relevant venues is distant. If you agree with me that advertising on Wikipedia will allow the foundation to greatly speed the fulfillment of its commitment, you can make your support known without rancor:

1) When you donate, leave a comment that says “I support advertising on Wikipedia.”

2) On your Wikipedia user page (mine), add the following code, with obvious meaning (|{{PAGENAME}} may not be obvious–it’s a hack to make your name sort correctly in the relevant category listings):

[[Category:Wikipedians for optional advertisements|{{PAGENAME}}]]
[[Category:Wikipedians who think that the Wikimedia Foundation should use advertising|{{PAGENAME}}]]

Fortuitously Mozilla posted their 2006 financial statements today:

Mozilla’s revenues (including both Mozilla Foundation and Mozilla Corporation) for 2006 were $66,840,850, up approximately 26% from 2005 revenue of $52,906,602. As in 2005 the vast majority of this revenue is associated with the search functionality in Mozilla Firefox, and the majority of that is from Google. The Firefox userbase and search revenue have both increased from 2005. Search revenue increased at a lesser rate than Firefox usage growth as the rate of payment declines with volume.

Congratulations to Mozilla. The Open Web’s prospects would look far worse if Mozilla did not have the wisdom to exploit this revenue source. Now, what about the prospects for Free Knowledge?

Addendum 20071123: The Wikimedia Fundraiser Blog is running Why Wikipedia Does Not Run Ads, a post linked to in the fundraising ad now running on Wikipedia.

Ridiculous simplicity

Monday, May 21st, 2007

Wikiclock is so ridiculous I’m not surprised it took so long for someone to invent it. But it is a thing of sublime beauty. Reminds me of some of the projects at last weekend’s .

pageoftext.com, which hosts wikiclock, is only ridiculous in its simplicity. Why didn’t I think of that?

Both projects via Evan Prodromou reporting on RoCoCo. I’m sad that I couldn’t make it to Montreal but glad to hear it’s coming to the SF Bay Area next year.

SXSW: Semantic Web 2.0 and Scientific Publishing

Saturday, March 10th, 2007

Web 2.0 and Semantic Web: The Impact on Scientific Publishing, probably the densest panel I attended today (and again expertly moderated by Science Commons’ John Wilbanks), covered , new business models for scientific publishers, and how web technologies can help with these and data problems, but kept coming back to how officious Semantic Web technologies and controlled ontologies (which are not the same at all, but are often lumped together) and microformats and tagging (also distinct) complement each other (all four of ’em!), even within a single application. I agree.

Nearly on point, this comment elsewhere by Denny Vrandecic of the Semantic MediaWiki project:

You are supposed to change the vocabulary in a wiki-way, just as well as the data itself. Need a new relation? Invent it. Figured out it’s wrong? Rename. Want a new category of things? Make it.

Via Danny Ayers; originally posted to O’Reilly Radar, which doesn’t offer permalinks for comments. This just needs a catchy name. Web 2.0 ontology engineering? Fonktology?

SXSW: Commercialization of Wikis

Saturday, March 10th, 2007

Evan Prodromou gave an excellent presentation on Commercialization of Wikis: Open Community That Pays the Bills. Check out his slides.

A few points:

  • Other stuff will be recognized as having wiki nature, e.g., .
  • Four categories of wiki businesses: service provider (Wikispaces, Wetpaint, PBWiki), content hosting (wikiHow, Wikitravel, Wikia), consulting (SocialText), content development (WikiBiz). My comment: at first blush Wikia would seem to be a service provider, but they are also deeply involved in content creation and community management.
  • Down with crowdsourcing and the notion that wiki contributors are suckers or sharecroppers. Better to think of wikis (and wiki businesses) as platforms for knowledge. Contributors use your wiki to help each other, not to give you free content. My comment: I’m not so down on crowdsourcing. Yes, it is MBA language, but the examples usually involve compensating contributors. Crowdsourcing shouldn’t be conflated with sharecropping, nor confused with community purpose.
  • For wikis, purpose is more important than friends or ego (cf. blogs and social networking).

Seven rules for commercial wikis:

  1. Have a noble purpose — e.g., shared knowledge (use a free license), help a community.
  2. Demonstrate value — most interesting example is “carry the torch”; wiki communities can be transient, an entity that keeps focus helps.
  3. Be Transparent.
  4. Extract value where you provide value — most obviously, advertising for hosting.
  5. Set boundaries.
  6. Be personally involved.
  7. Run with the right crowd — e.g., open source and open content, or you will be suspect of being a crowdsourcer.

It appears that Prodromou’s Wikitravel lives by these rules and has succeeded.

Update 20070317: Prodromou has a roundup of blog responses to his presentation. It was great indeed catching up with him.

“Querying Wikipedia like a Database”

Tuesday, January 23rd, 2007

I’ve mentioned Semantic MediaWiki several times as having the potential to tremendously increase the value of Wikipedia by unlocking (in the sense of making queryable) all of the data in the encyclopedia.

dbpedia.org has taken a different approach to “Querying Wikipedia like a Database” (their excellent tagline): extract datasets from Wikipedia, presumably with a manual mapping of relevant categories and of the data populating infoboxes to triples (described in What have Innsbruck and Leipzig in common? Extracting Semantics from Wiki Content).
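To make “infoboxes to triples” concrete, here is a minimal sketch of the idea. It is not dbpedia.org’s actual extraction code: the function names, the simplified infobox syntax it accepts, and the sample page (including the made-up population figure) are all assumptions for illustration.

```python
import re

# Matches [[Target]] and [[Target|label]] wiki links, capturing the target.
WIKI_LINK = re.compile(r"\[\[([^\]|]+)(?:\|[^\]]*)?\]\]")


def infobox_to_triples(page_title, wikitext):
    """Turn `| key = value` lines of the first infobox into
    (subject, predicate, object) triples. Hypothetical helper."""
    triples = []
    in_infobox = False
    for line in wikitext.splitlines():
        stripped = line.strip()
        if stripped.lower().startswith("{{infobox"):
            in_infobox = True
            continue
        if in_infobox and stripped == "}}":
            break  # end of the infobox template
        if in_infobox and stripped.startswith("|") and "=" in stripped:
            key, value = stripped[1:].split("=", 1)
            key = key.strip()
            # Reduce wiki links to the title of the page they point at.
            value = WIKI_LINK.sub(r"\1", value).strip()
            if key and value:  # skip empty fields
                triples.append((page_title, key, value))
    return triples


def subjects_with(triples, predicate, obj):
    """A toy 'query': every subject that has the given predicate/object."""
    return [s for s, p, o in triples if p == predicate and o == obj]


# Made-up sample wikitext; the population value is a placeholder.
SAMPLE = """{{Infobox city
| name = Innsbruck
| country = [[Austria]]
| population = 100000
| mayor =
}}
Innsbruck is the capital of [[Tyrol (state)|Tyrol]]."""

TRIPLES = infobox_to_triples("Innsbruck", SAMPLE)
```

With the triples in hand, `subjects_with(TRIPLES, "country", "Austria")` is the sort of question a plain encyclopedia page cannot answer directly. A real extractor would also have to cope with nested templates, units, and inconsistent field names, which is presumably where the manual mapping comes in.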

I suspect Wikipedia implementation of Semantic MediaWiki would only help dbpedia.org, but the latter is already impressive, requiring no changes at Wikipedia. In addition to making some of the data in Wikipedia queryable they’re exposing non-Wikipedia datasets.

The Semantic Web is so here, now. Doubters repent! ;-) Like I said before:

Once people get hooked on access to a semantic encyclopedia, perhaps they’ll want similar access to the entire web.

Wikipedia and Linking 2.0

Monday, January 22nd, 2007

Tim Bray has reasons for linking to a Wikipedia article about an organization rather than the organization’s site:

[A] lot of institutional sites are pathetic self-serving fluff served up in anodyne marketing-speak with horrible URIs that are apt to vanish.

Linking to the Wikipedia instead is tempting, and I’ve succumbed a lot recently. In fact, that’s what I did for the Canada Line. After all, the train is still under construction and there’s no real reason to expect today’s links to last; on top of which, the Line’s own site is mostly about selling the project to the residents and businesses who (like me) are getting disrupted by it, and the taxpayers who (like me) are paying for it.

Wikipedia entries, on the other hand, are typically in stable locations, have a decent track record for outliving transient events, are pretty good at presenting the essential facts in a clear, no-nonsense way, and tend to be richly linked to relevant information, including whatever the “official” Web site might currently happen to be.

I wrote something similar about a year ago:

I consider a Wikipedia link more usable than a link to an organization home page. An organization article will link directly to an organization home page, if the latter exists. The reverse is almost never true (though doing so is a great idea). An organization article at Wikipedia is more likely to be objective, succinct, and informational than an organizational home page (not to mention there is no chance of encountering Flash, window resizing, or other annoying distractions — less charitably, attempts to control my browser — at Wikipedia). When I hear about something new these days, I nearly always check for a Wikipedia article before looking for an actual website. Finally, I have more confidence that the content of a Wikipedia article will be relevant to the content of my post many years from now.

Why not preferentially link to Wikipedia? Bray feels bad about not linking directly to original content and says Wikipedia could go off the rails, though later provides a reason to not worry about the latter:

I’d be willing to bet that if Wikipedia goes off the rails and some new online reference resource comes along to compete, there’ll be an automated mapping between Wikipedia links and the new thing; so the actual URIs may retain some value.

Indeed; and the first argument explains why linking to Wikipedia is superior to linking to an institution. But what about “original content”? If the content isn’t simply a home page (of an organization, person, or product significant enough to be in Wikipedia), Wikipedia doesn’t help. For example, I linked to Bray’s post “On Linking”; only providing a link to his Wikipedia article would have been unhelpful. The Wikipedia article link in this case is merely supplementary.

So what to do to help with broken and crappy links to items not described in Wikipedia? Bray suggests “multi-ended links”. I think he’s on the right track, but this is not something a web content creator should need to worry about — robust linking need not involve choosing several typed (e.g., official, reference, search) links. The content creator’s CMS and the user’s browser ought to be able to figure this stuff out; the content creator should just use the best link available, as always.

Last year I wrote:

I predict that in the foreseeable future your browser will be able to convert a Wikipedia article link into a home page link if that is your preference, aided by Semantic Mediawiki annotations or similar.

In the case of non-Wikipedia links (and those too), combatting linkrot and providing alternate and related (e.g., reference, reply, archival) links is an obvious feature add for social bookmarking services and can be made available to a CMS or browser via the usual web API/feed/scraping mechanisms.
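A rough sketch of what a CMS or browser extension might do with such alternates, so the author only ever writes one link. Everything here is hypothetical: the `resolve_link` function, the `alternates` table (standing in for whatever a bookmarking service or wiki annotations would supply), and the example.org URLs. The liveness check is passed in as a function so the sketch needs no network access.

```python
from typing import Callable, Dict, List, Optional


def resolve_link(
    url: str,
    alternates: Dict[str, Dict[str, str]],
    is_alive: Callable[[str], bool],
    preference: Optional[List[str]] = None,
) -> str:
    """Return the best available URL for a link the author wrote."""
    known = alternates.get(url, {})
    # Reader preference first, e.g. "official" instead of a reference article.
    for kind in preference or []:
        candidate = known.get(kind)
        if candidate and is_alive(candidate):
            return candidate
    # Otherwise the author's own choice, as long as it still works.
    if is_alive(url):
        return url
    # Linkrot fallback: any live alternate, trying archival copies last.
    for kind in sorted(known, key=lambda k: k == "archival"):
        if is_alive(known[kind]):
            return known[kind]
    return url  # nothing better is known; leave the link alone


# Hypothetical alternates table and a fake liveness check for illustration.
WIKI = "https://en.wikipedia.org/wiki/Canada_Line"
OLD_POST = "https://example.org/old-post"
ALTERNATES = {
    WIKI: {
        "official": "https://example.org/canada-line",
        "archival": "https://example.org/archive/canada-line",
    },
    OLD_POST: {
        "reference": "https://example.org/ref",
        "archival": "https://example.org/archive/old-post",
    },
}
DEAD = {OLD_POST, "https://example.org/ref"}


def fake_is_alive(u: str) -> bool:
    return u not in DEAD
```

The point of the design is that the typed alternates live with the service or the reader’s tools, not in the post: the content creator links once, and the preference list is the reader’s.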

Wiki search advertising

Tuesday, January 16th, 2007

Wikiseek has launched. It’s a reasonable idea, searching Wikipedia and sites Wikipedia links to (recalling search engines that have used to seed crawls). It’s much faster than Wikipedia’s built-in search, but doesn’t satisfy me, as its Wikipedia results are out of date and incomplete (indicators of the former include turning up deleted articles and finding nothing for ‘wikiseek’).

I find it interesting that Wikiseek’s footer says:

The majority of the revenue generated by Wikiseek advertising is donated to the Wikimedia Foundation.

That’s nice (apparently Searchme, Inc., intends to use Wikiseek to demonstrate its vertical search prowess), and it inspires a potential non-intrusive revenue model for Wikipedia that precisely copies Mozilla’s: sell inclusion in the search box/search page.

This wouldn’t be worth the hundreds of millions annually that tasteful text ads on articles could be (and the ability to fully fund* the Wikimedia Foundation’s mission), but it would surely obviate the need for begging to cover the costs of running Wikipedia.

* If politicians can use that vacuous phrase to indicate they “support education” I can use it in support of funding free knowledge projects.

Wikipedia advertising redux

Monday, January 8th, 2007

Many good comments regarding supporting advertising on Wikipedia (or not) here and also on Slashdot and other blogs. I may further characterize and respond to these in aggregate (see the update to my first post for some of this). For now I want to call out or respond to a few particularly worthy comments and criticisms.

Evan Prodromou’s comment:

One thing I wanted to respond to was that a couple of people seemed to think it incorrect on my part to refer to Wikipedia’s Web traffic as a “resource”. I’m not sure what else to call a potential source of tens, maybe hundreds of millions of dollars annually in income. But if people know a better word for it, please substitute that in.

Let me also point out that wikipedia.org’s current huge Web traffic is not a long-term sure thing. As Open Content, the encyclopedia can be copied onto any other Web site on the Internet, and sites like answers.com show that this can be lucrative. Anyone familiar with the Open Directory (http://dmoz.org/) knows that it’s copied to Google Categories, Yahoo Directory, and dozens of other high-profile sites. In 5 years, will there be thousands of mirrors of Wikipedia on the Web? Will wikipedia.org become more like editors.dmoz.org — an editorial interface for a data set served from many other servers?

If that’s the case, will we look back on the high-traffic days of 2005-2008 as the time when we wasted somewhere around half a billion dollars in potential revenue? Will the WMF really be glad at that point that it did so?

I hadn’t thought of this scenario and don’t consider it likely, but do think it is an important consideration. I think the canonical was seriously disadvantaged in two ways Wikipedia is not — a fairly closed editorial process (e.g., I’ve applied a few times over the years and don’t recall getting any feedback, not even rejection) and probably a horrible editor interface (e.g., I was accepted as an editor at Chef Moz, a sister site of dmoz.org — and ran away screaming).

How could Wikimedia sites lose traffic to copies? Presumably much of Wikipedia traffic comes from Google. If Google published a branded copy (with ads of course) and promoted it in (or above) search results, Wikipedia would presumably lose lots of traffic (and many people would call Google evil for it, at least for a while). I’m sure there are more creative scenarios in which Wikimedia sites lose traffic.

Peter McCluskey:

Mike Linksvayer has a fairly good argument that raising X dollars by running ads on Wikipedia won’t create more conflict of interest than raising X dollars some other way.

Almost. The second X is Y and an order of magnitude or so smaller than X. McCluskey:

But the amount of money an organization handles has important effects on its behavior that are somewhat independent of the source of the money, and the purpose of ads seems to be to drastically increase the money that they raise.

I can’t provide a single example that provides compelling evidence in isolation, but I think that looking at a broad range of organizations with $100 million revenues versus a broad range of organizations that are run by volunteers who only raise enough money to pay for hardware costs, I see signs of big differences in the attitudes of the people in charge.

Wealthy organizations tend to attract people who want (or corrupt people into wanting) job security or revenue maximization, whereas low-budget volunteer organizations tend to be run by people motivated by reputation. If reputational motivations have worked rather well for an organization (as I suspect they have for Wikipedia), we should be cautious about replacing those with financial incentives.

It’s possible that the Wikimedia Foundation could spend hundreds of millions of dollars wisely on charity, but the track record of large foundations does not suggest that should be the default expectation.

Yes, this could be a major problem. As I said last year, “[advertising] could fund a staggering Wikimedia Foundation bureaucracy, or it could fund additional free knowledge projects.” The possibility that new funds will not be used effectively lowers the expected benefit of running ads. Two items give me some confidence that the Wikimedia Foundation would be less susceptible to waste than the average foundation:

  • Wikimedia Foundation’s history of transparency sets the tone for what would become a much larger organization
  • An incomparable set of watchdogs (Wikipedians)

Regarding subversion of current volunteer motivations and ethics (which is really the point of McCluskey’s post), I would not advocate financial incentives for functions currently carried out by volunteers, certainly not any content-related function. Of course given large amounts of money there would be pressure to convert an ever larger group of volunteers into employees, regardless of what advocates of advertising on Wikipedia might have wanted. The possibility that this would occur and go badly should also weigh against advertising.

Addressing this possibility, I concur with Per Abrahamsen’s recommendation segregating Wikimedia projects and foundation funding of compatible projects:

Wikipedia is clearly able to earn its own money, begging for donations on the front (and every other) page is an insult to both visitors, and to the many worthy cases that are not in that lucky position.

So I support advertising on Wikipedia.

The ads should be non-intrusive, textual, clearly separated from content, and selected algorithmically, similar to the ads known from Google.

However, if the money are really that big (more than the current need), additional precautions would have to be taken. The most important would be to split the foundation into two, with watertight boundaries between. One that ran the current Wikimedia projects, and another solely responsible for distribution the ad-money between causes that promote the goals of the foundation, but had no say in the running of any of the projects. Money do corrupt, hence the separation.

Slashdot commenter FooAtWFU (and others) suggested that the real problem with advertising is that large numbers of contributors would leave in protest, seriously damaging Wikipedia. I doubt it. A very vocal minority would raise hell and some of them would leave, at least temporarily. I suspect most contributors would not even notice the presence of ads. I conjecture that Wikipedia contributors, however superior some may feel, are not that different from MySpace “contributors” (who seem not to be deterred by gratuitous advertising). In a relatively short time (a year is my wild guess) a majority of contributors would have become contributors after advertising had begun. Such is the nature of a rapidly growing site.

A 2002 fork of Spanish Wikipedia (Enciclopedia Libre) could be interpreted as evidence in either direction. The fork apparently occurred in part due to “our rejection of censorship, of an editorial line, and of including advertising.” Whatever the merits of these claims, article counts show the fork growing more quickly for about a year and a half. From 2004 on Spanish Wikipedia grew much more quickly and currently is over five times the size of Enciclopedia Libre. So the loss of those ideologically motivated against advertising and perhaps with other complaints could be seen as a terrible blow to Spanish Wikipedia (a year or more delayed progress) or no big deal, considering current relative sizes. Is there any reason to think the proportion of Spanish Wikipedians disgusted by advertising is significantly different than that of any other language?

Of course it is possible if Wikipedia had taken ads in 2002, many more may have left, and perhaps the fork would now be five times the size of the parent instead of vice versa. This would not necessarily be a horrible thing. After all, the two sites (and any potential Wikipedia fork) use the same license, so work done on one is not entirely lost to the other.

This does suggest an experiment however — run ads on Spanish Wikipedia and see how many contributors move to Enciclopedia Libre. The existence of the latter would make it both easier for ad objectors to move and easier to determine who had moved, indicating a probable maximum negative impact on contributions to other Wikipedias, should they run ads, as no other language has an alternative as viable as Enciclopedia Libre — at least not viable for those who hate ads! The largest encyclopedic wikis outside Wikimedia run Google AdSense, e.g., (Russian) and (Swedish).

BlackNet is a wiki?

Sunday, January 7th, 2007

Wikileaks, currently vapor, may be a joke. If Wikileaks is not a joke and if it successfully exposes a large number of secrets, I’d find it hilarious to see this happening on a public website and without financial incentives. P2P, digital cash, information markets, and crypto anarchy? Nope, just a wiki and a community.

Wikileaks FAQ:

WikiLeaks will be the outlet for every government official, every bureaucrat, every corporate worker, who becomes privy to embarrassing information which the institution wants to hide but the public needs to know. What conscience cannot contain, and institutional secrecy unjustly conceals, WikiLeaks can broadcast to the world.

Untraceable Digital Cash, Information Markets, and BlackNet (1997, but these ideas spread widely in the early 1990s):

One of the most interesting applications is that of “information markets,” where information of various kinds is bought and sold. Anonymity offers major protections for both buyers and sellers, in terms of sales which may be illegal or regulated. Some examples: corporate secrets, military secrets, credit data, medical data, banned religious or other material, pornography, etc.

Why is more information not leaked on the net already? The technology exists to do so anonymously and has for a long time. Why is there not (or to what extent is there) a market for secrets? Again, the technology exists.

Perhaps lack of the relevant institutions in each case. One could email secrets or post to a blog anonymously, but what then? Will anyone notice? One could want to sell secrets, but how to find a buyer?

If Wikileaks succeeds it will be because it will provide, or rather its community will be, the relevant institution. Again from the Wikileaks FAQ:

WikiLeaks opens leaked documents up to a much more exacting scrutiny than any media organization or intelligence agency could provide: the scrutiny of a worldwide community of informed wiki editors.

Instead of a couple of academic specialists, WikiLeaks will provide a forum for the entire global community to examine any document relentlessly for credibility, plausibility, veracity and falsifiability. They will be able to interpret documents and explain their relevance to the public. If a document is leaked from the Chinese government, the entire Chinese dissident community can freely scrutinize and discuss it; if a document is leaked from Somalia, the entire Somali refugee community can analyze it and put it in context.

I have not read the Wikileaks email archived at cryptome.

I support advertising on Wikipedia

Tuesday, January 2nd, 2007

Wikimedia Foundation is over halfway through a fund drive. I hope that when you give, you leave the following public comment:

I support advertising on Wikipedia.

Evan Prodromou summarizes a completely unwarranted controversy regarding a matching fund (bottom of page):

All fine so far, right? But a small logo in the donations notice — seen by non-logged-in users on every page of every WMF site — was considered by many Wikipedians and other WMF editors as dangerously close to the line on advertising — or over it. There have been several prominent users who have left the project because of it.

I’m not sympathetic with these folks; in fact, I’m in solid opposition. I think that Wikipedia’s huge amount of Web traffic is a resource that the Foundation is squandering. Traffic like Wikipedia’s is worth tens of millions if not hundreds of millions of dollars in ad revenue per year. That’s money that could go to disseminate free (libre and gratis) paperback pocket encyclopedias to millions of schools and millions of children, in their own language, around the world.

It’s irresponsible to abuse that opportunity.

I strongly agree and will repeat exactly what I said during last year’s Wikimedia fund drive:

Wikipedia chief considers taking ads (via Boing Boing) says that at current traffic levels, Wikipedia could generate hundreds of millions of dollars a year by running ads. There are strong objections to running ads from the community, but that is a staggering number for a tiny nonprofit, an annual amount that would be surpassed only by the wealthiest foundations. It could fund a staggering Wikimedia Foundation bureaucracy, or it could fund additional free knowledge projects. Wikipedia founder Jimmy Wales has asked what will be free. Would an annual hundred million dollar budget increase the odds of those predictions? One way to find out before actually trying.

In somewhat related news, Mozilla just reported 2005 financial information, showing 800% revenue growth:

In 2005 the Mozilla Foundation and Mozilla Corporation combined had revenue from all sources of $52.9M. $29.8M of this was associated with the Foundation (both before and after the creation of the Corporation). The bulk of this revenue was related to our search engine relationships, with the remainder coming from a combination of contributions, sales from the Mozilla store, interest income, and other sources. These figures compare with 2003 and 2004 revenues of $2.4M and $5.8M respectively, and reflect the tremendous growth in the popularity of Firefox after its launch in November 2004.

The combined expenses of the Mozilla Foundation and Corporation were approximately $8.2M in 2005, of which approximately $3M was associated with the Foundation. By far the biggest portion of these expenses went to support the large and growing group of people dedicated to creating and promoting Firefox, Thunderbird, and other Mozilla open source products and technologies. The rate of expenses increased over the year as new employees came on board. The unspent revenue provides a reserve fund that allows the Mozilla Foundation flexibility and long term stability.

An advertising-fueled Wikimedia Foundation could fund dozens of much needed Mozilla Firefox sized projects. And many Creative Commons (which just successfully completed its much more modest annual funding campaign) initiatives. :)

Update: Welcome Slashdot readers. The major objection to ads on Wikipedia takes two forms:

  • Advertising is profane.
  • Advertising would compromise Wikipedia’s neutrality.

A common response to the first is that those who don’t like ads can run an ad blocker. Easier still, those who don’t like ads can log in — there’s little reason to display ads to logged in users, who probably generate a tiny fraction of pageviews. But I don’t think either of these responses will satisfy this form of the objection, as it is basically emotional. Some people object to the knowledge that ads exist, even if not experienced personally. I suppose these people don’t use search engines. It’s a wonder they can stand to use the net at all. I discount them completely.

The second is completely unrealistic. How would third party text ads, e.g., via AdSense, compromise neutrality? There’s simply no vector for an advertiser to demand changes and zero reason for Wikipedians to comply. Wikipedia is not a small town newspaper beholden to the local department store, not even close. It isn’t even Slashdot, which as far as I can tell has not been compromised by years of running ads. To people with this objection: show me a community site that has gone astray due to advertiser influence.

Sponsors, “being managed by Wikipedia staff (like in newspaper ads, i.e. no uncontrolled 3rd party feeds)”, as suggested by Kuba Ober, are far more dangerous than third party ads, because then there is a vector between advertiser and someone with power at Wikipedia.

There may be an opportunity for Wikipedia to completely rethink and remake advertising, or merely compete in some fashion with what some are calling Google’s near monopoly, but now it would make tremendous sense to use AdSense or Yahoo! or both — and I suspect Wikipedia could manage to keep a greater share of revenue than a normal web publisher. Rick Yorgason mocked up what AdSense would look like in the place of the current fundraiser’s donation banner.

Slashdot commenter jklooserman summarizes objections from Wikiproject no ads:

  1. Wikipedia’s philosophy is non-commercial
  2. Ads put at risk Wikipedia’s principle of Neutral Point of View (NPOV)
  3. The information that constitutes Wikipedia is wealth for the community

I don’t see “non-commercial” in any form on the Wikimedia Foundation home page. I do see this, in large text:

Imagine a world in which every single human being can freely share in the sum of all knowledge. That’s our commitment.

The next line, all bold, asks for help in the form of donations.

Much more money, hundreds of millions, would speed the arrival of that world and fulfillment of that commitment.

As above, there is no realistic scenario for ads undermining neutrality on Wikipedia.

The third objection strikes me as a non-sequitur. In any case, the point of obtaining more resources would be to increase the wealth of the community — of all human beings.

jklooserman also pointed out that there’s a category of Wikipedians who think that the Wikimedia Foundation should use advertising. Add it to your user page if you agree.