<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Mike Linksvayer &#187; Microformats</title>
	<atom:link href="http://gondwanaland.com/mlog/category/microformats/feed/" rel="self" type="application/rss+xml" />
	<link>http://gondwanaland.com/mlog</link>
	<description>My opinions only. I do not represent any organization in this publication.</description>
	<lastBuildDate>Wed, 30 Jun 2010 16:46:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Semantic Web Web Web</title>
		<link>http://gondwanaland.com/mlog/2007/11/21/semantic-www/</link>
		<comments>http://gondwanaland.com/mlog/2007/11/21/semantic-www/#comments</comments>
		<pubDate>Wed, 21 Nov 2007 22:01:12 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Peeves]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2007/11/21/semantic-www/</guid>
		<description><![CDATA[The World Wide Web Consortium and particularly its Semantic Web efforts do great, valuable work. I have one massive complaint, particularly about the latter: they ignore the Web at their peril. Yes, it&#8217;s true, as far as I can tell (but mind that I&#8217;m one or two steps removed from actually working on the problems), [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://en.wikipedia.org/wiki/World_Wide_Web_Consortium" rel="tag">World Wide Web Consortium</a> and particularly its <a href="http://en.wikipedia.org/wiki/Semantic_Web" rel="tag">Semantic Web</a> efforts do great, valuable work. I have one massive complaint, particularly about the latter: they ignore the Web at their peril. Yes, it&#8217;s true, as far as I can tell (but mind that I&#8217;m <a href="http://benlog.com/">one</a> or <a href="http://yergler.net/blog/">two</a> steps removed from actually working on the problems), that the W3C and Semantic Web activities do not appreciate the importance of nor dedicate appropriate resources to the Web. Not just the theoretical Web of URIs, but the Web that billions of people use and see.</p>
<p>I&#8217;m reminded of this by Ian Davis&#8217; post <a href="http://iandavis.com/blog/2007/11/is-the-semantic-web-destined-to-be-a-shadow">Is the Semantic Web Destined to be a Shadow?</a>:</p>
<blockquote><p>My belief is that trust must be considered far earlier and that it largely comes from usage and the wisdom of the crowds, not from technology. Trust is a social problem and the best solution is one that involves people making informed judgements on the metadata they encounter. To make an effective evaluation they need to have the ability to view and explore metadata with as few barriers as possible. In practice this means that the web of data needs to be as accessible and visible as the web of documents is today and it needs to interweave transparently. A separate, dry, web of data is unlikely to attract meaningful attention, whereas one that is a full part of the visible and interactive web that the majority of the population enjoys is far more likely to undergo scrutiny and analysis. This means that HTML and RDF need to be much more connected than many people expect. In fact I think that the two should never be separate and it’s not enough that you can publish RDF documents, you need to publish visible, browseable and <em>engaging</em> RDF that is meaningful to people. Tabular views are a weak substitute for a rich, readable description.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2007/11/21/semantic-www/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SXSW: Growth of Microformats</title>
		<link>http://gondwanaland.com/mlog/2007/03/17/jillion-hcards/</link>
		<comments>http://gondwanaland.com/mlog/2007/03/17/jillion-hcards/#comments</comments>
		<pubDate>Sat, 17 Mar 2007 07:12:15 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Microformats]]></category>
		<category><![CDATA[SXSW]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2007/03/17/jillion-hcards/</guid>
		<description><![CDATA[Monday afternoon&#8217;s packed The Growth and Evolution of Microformats didn&#8217;t strike me as terribly different from last year&#8217;s Microformats: Evolving the Web. Last year&#8217;s highlight was a Flock demo, this year&#8217;s was an Operator demo. My capsule summary of the growth and (not much) evolution of Microformats over the past twelve months: a jillion names, [...]]]></description>
			<content:encoded><![CDATA[<p>Monday afternoon&#8217;s packed <a href="http://microformats.org/wiki/events/2007-03-12-sxsw-growth-evolution-of">The Growth and Evolution of Microformats</a> didn&#8217;t strike me as terribly different from last year&#8217;s <a href="http://microformats.org/wiki/events/2006-03-13-sxsw-microformats">Microformats: Evolving the Web</a>. Last year&#8217;s highlight was a <a href="http://en.wikipedia.org/wiki/Flock_%28web_browser%29">Flock</a> demo, this year&#8217;s was an <a href="https://addons.mozilla.org/firefox/4106/">Operator</a> demo.</p>
<p>My capsule summary of the growth and (not much) evolution of Microformats over the past twelve months: a jillion names, addresses, and events have been marked up with <a href="http://microformats.org/wiki/hcard">hCard</a> and <a href="http://microformats.org/wiki/hcalendar">hCalendar</a> formatting.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2007/03/17/jillion-hcards/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SXSW: Semantic Web 2.0 and Scientific Publishing</title>
		<link>http://gondwanaland.com/mlog/2007/03/10/semweb2-science/</link>
		<comments>http://gondwanaland.com/mlog/2007/03/10/semweb2-science/#comments</comments>
		<pubDate>Sun, 11 Mar 2007 05:54:01 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[SXSW]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2007/03/10/semweb2-science/</guid>
		<description><![CDATA[Web 2.0 and Semantic Web: The Impact on Scientific Publishing, probably the densest panel I attended today (and again expertly moderated by Science Commons&#8217; John Wilbanks), covered open access, new business models for scientific publishers, and how web technologies can help with these and data problems, but kept coming back to how officious Semantic Web [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://2007.sxsw.com/interactive/programming/panels/?action=show&#038;id=IAP060260">Web 2.0 and Semantic Web: The Impact on Scientific Publishing</a>, probably the densest panel I attended today (and <a href="http://gondwanaland.com/mlog/2006/03/18/sxswi-wrap/">again</a> expertly moderated by Science Commons&#8217; John Wilbanks), covered <a href="http://en.wikipedia.org/wiki/Open_access" rel="tag">open access</a>, new business models for scientific publishers, and how web technologies can help with these and data problems, but kept coming back to how officious Semantic Web technologies and controlled ontologies (which are <a href="http://gondwanaland.com/mlog/2005/03/13/semweb-not-by-committee/">not the same at all</a>, but are often lumped together) and microformats and tagging (also distinct) complement each other (all four of &#8216;em!), even within a single application. I agree.</p>
<p>Nearly on point, this comment elsewhere by Denny Vrandecic of the Semantic MediaWiki project:<br />
<blockquote>You are supposed to change the vocabulary in a wiki-way, just as well as the data itself. Need a new relation? Invent it. Figured out it&#8217;s wrong? Rename. Want a new category of things? Make it.</p></blockquote>
<p><a href="http://dannyayers.com/2007/03/10/clarification-from">Via Danny Ayers</a>, oringal posted to <a href="http://radar.oreilly.com/archives/2007/03/freebase_will_p_1.html">O&#8217;Reilly Radar</a>, which doesn&#8217;t offer permalinks for comments. This just needs a catchy name. Web 2.0 ontology engineering? Fonktology?</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2007/03/10/semweb2-science/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Perils of a too cool name</title>
		<link>http://gondwanaland.com/mlog/2007/02/14/xmp-microformat/</link>
		<comments>http://gondwanaland.com/mlog/2007/02/14/xmp-microformat/#comments</comments>
		<pubDate>Thu, 15 Feb 2007 01:29:03 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Peeves]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2007/02/14/xmp-microformat/</guid>
		<description><![CDATA[I&#8217;ve seen lots of confusion about microformats, but Jon Udell takes the cake in describing XMP: It’s a bit of a mish-mash, to say the least. There’s RDF (Resource Description Framework) syntax, Adobe-style metadata syntax, and Microsoft-style metadata syntax. But it works. And when I look at this it strikes me that here, finally, is [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve seen lots of confusion about microformat<strong><a href="http://gondwanaland.com/mlog/2006/10/22/microformats-worse/">s</a></strong>, but <a href="http://blog.jonudell.net/2007/02/14/truth-files-microformats-and-xmp/">Jon Udell takes the cake in describing XMP</a>:<br />
<blockquote>It’s a bit of a mish-mash, to say the least. There’s RDF (Resource Description Framework) syntax, Adobe-style metadata syntax, and Microsoft-style metadata syntax. But it works. And when I look at this it strikes me that here, finally, is a microformat that has a shot at reaching critical mass.</p></blockquote>
<p>How someone as massively clued-in as <a href="http://en.wikipedia.org/wiki/Jon_Udell">Jon Udell</a> could be so misled as to describe XMP as a microformat is beyond me.</p>
<p><a href="http://en.wikipedia.org/wiki/Extensible_Metadata_Platform" rel="tag">XMP</a>, which is basically a constrained RDF/XML serialization following super-ugly conventions that may be embedded in a number of file formats (most prominently PDF and JPEG, but potentially almost anything), is about as far from a <a href="http://en.wikipedia.org/wiki/Microformats" rel="tag">microformat</a> as one could possibly get. Off the top:</p>
<ul>
<li>XMP is RDF/XML and as such is arbitrarily extensible; each microformat covers a specific use case and goes through great lengths to favor interoperability among publishers of each microformat (sometime I will write about how microformat and RDF people mean completely different things by &#8220;interoperability&#8221;) at the expense of extensibility.</li>
<li>XMP is embedded in a binary file, completely opaque to nearly all users; microformats put a premium on (practically require) colocation of metadata with human-visible HTML.</li>
<li>XMP would be extremely painful to write by hand and there are very few tools that support publishing it; microformats, to a fault, put a premium on publisher ease&#8211;anyone with a passing familiarity with HTML could be writing microformats.</li>
</ul>
<p>I don&#8217;t agree with everything the microformats folk have done, but they do have a pretty self-consistent <a href="http://microformats.org/wiki/microformats">approach</a>, if one bothers to try to understand it.  XMP ain&#8217;t it.</p>
<p>XMP is <strong>by far</strong> the most promising embedded metadata format for &#8220;media&#8221; files &#8212; which is mostly a testament to how terribly useless to non-existent  the alternatives are (by some definitions there are none).</p>
<p><b>Addendum:</b> I&#8217;m really only picking on one word from Udell&#8217;s post, the remainder of which is recommended. It is <a href="http://wiki.creativecommons.org/XMP" rel="tag">useful to me</a> to learn that &#8220;There’s also good support in .NET Framework 3.0 for reading and writing XMP metadata.&#8221;</p>
<p><b>Update 20070215:</b> <a href="http://blog.jonudell.net/2007/02/15/xmp-and-microformats-revisited/">Udell explains</a>:</p>
<blockquote><p>Now there is, as Mike points out, a big philosophical difference between XMP, which aims for arbitrary extensibility, and fixed-function microformats that target specific things like calendar events. But in practice, from the programmer’s perspective, here’s what I observe.</p>
<p>Hand me an HTML document containing a microformat instance and I will cast about in search of tools to parse it, find a variety of ones that sort of work, and then wrestle with the details.</p>
<p>Hand me an image file containing an XMP fragment and, lo and behold, it’s the same story!</p></blockquote>
<p>Yes, for 99% of the .01% of the world that cares at all, microformats and XMP are the same: metadata, embedded data, or even just data. That&#8217;s what I was hinting at in the title of this post &#8212; in the minds of 99% of the .01%, microformats are becoming synonymous with metadata, i.e., genericized. This is either a marketing and naming coup or disaster, depending on one&#8217;s perspective (I don&#8217;t particularly care).</p>
<p>I considered this headline: If XMP is a microformat, <a href="http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/">RDFa</a> sure the heck is a microformat.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2007/02/14/xmp-microformat/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Microformats are worse</title>
		<link>http://gondwanaland.com/mlog/2006/10/22/microformats-worse/</link>
		<comments>http://gondwanaland.com/mlog/2006/10/22/microformats-worse/#comments</comments>
		<pubDate>Mon, 23 Oct 2006 02:58:59 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Peeves]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2006/10/22/microformats-worse/</guid>
		<description><![CDATA[I almost entirely agree with Mark Birbeck&#8217;s comparison of RDFa and microformats. The only thing to be said in defense of microformats is that a few of the problems Birbeck calls out are also features, from the microformats perspective. But worse is better. I will reveal what this means later. Another quip: My problem with [...]]]></description>
			<content:encoded><![CDATA[<p>I almost entirely agree with Mark Birbeck&#8217;s <a href="http://internet-apps.blogspot.com/2006/10/rdfa-and-microformats.html">comparison of RDFa and microformats</a>. The only thing to be said in defense of <a href="http://en.wikipedia.org/wiki/Microformats" rel="tag">microformats</a> is that a few of the problems Birbeck calls out are also features, from the microformats perspective.</p>
<p>But <a href="http://en.wikipedia.org/wiki/Worse_is_better" rel="tag">worse is better</a>.</p>
<p>I will reveal what this means later.</p>
<p>Another quip: My problem with microformats is the <strong>s</strong>.</p>
<p>Evan Prodromou provided a still-good <a href="http://evan.prodromou.name/Journal/11_Prairial_CCXIV">RDFa vs Microformats roundup</a> (better title: &#8220;RDFa and Microformats, please work together&#8221;) in May. I somehow missed it until now.</p>
<p>Ah, <a href="http://gondwanaland.com/mlog/2006/10/12/meta-those-who-cant/">metadata</a>.</p>
<p><b>Update 20061204:</b> I didn&#8217;t miss Prodromou&#8217;s roundup in May, I <a href="http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/">blogged about it</a>. And forgot.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2006/10/22/microformats-worse/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Long tail of metadata</title>
		<link>http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/</link>
		<comments>http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/#comments</comments>
		<pubDate>Tue, 30 May 2006 05:03:20 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/</guid>
		<description><![CDATA[Ben Adida notes that people are writing about RDFa, which is great, and envisioning conflict with microformats, which is not. As Ben says: Microformats are useful for expressing a few, common, well-defined vocabularies. RDFa is useful for letting publishers mix and match any vocabularies they choose. Both are useful. In other words RDFa is a [...]]]></description>
			<content:encoded><![CDATA[<p>Ben Adida notes that people are <a href="http://rdfa.info/2006/05/29/people-writing-about-rdfa/">writing about RDFa</a>, which is great, and envisioning conflict with microformats, which is not.  As Ben says:</p>
<blockquote><p>Microformats are useful for expressing a few, common, well-defined vocabularies. RDFa is useful for letting publishers mix and match any vocabularies they choose. Both are useful.</p></blockquote>
<p>In other words RDFa is a <a rel="tag" href="http://en.wikipedia.org/wiki/Long_tail">long tail</a> technology.</p>
<p><a href="http://evan.prodromou.name/RDFa_vs_microformats">Evan Prodromou</a> thinks the future is bleak without cooperation. I like his proposed way forward (strikeout added for obvious reasons):</p>
<blockquote>
<ol>
<li>RDFa gets acknowledged and embraced by microformats.org as the future of semantic-data-in-XHTML</li>
<li>The RDFa group makes an effort to encompass existing microformats with a minimum of changes</li>
<li>microformats.org leaders join in on the RDFa authorship process</li>
<li>microformats.org becomes a focus for developing real-world RDF<s>a</s> vocabularies</li>
</ol>
</blockquote>
<p>I see <a href="http://microformats.org/discuss/mail/microformats-discuss/2006-May/004144.html">little chance</a> of points one and three occuring. However, I don&#8217;t see this as a particularly bad thing. Point three will occur, almost by default: the simplest and most widely deployed microformats (e.g., <a rel="tag" href="http://microformats.org/wiki/reltag">reltag</a>, <a rel="tag" href="http://microformats.org/wiki/relnofollow">relnofollow</a> and <a href="http://microformats.org/wiki/rellicense">rellicense</a>) are also valid RDFa &#8212; the predicate (e.g., tag, nofollow, license) appearing in the default namespace to a RDFa application. More complex microformats may be handled by <a href="http://lists.w3.org/Archives/Public/public-rdf-in-xhtml-tf/2006Apr/0069.html">hGRDDL</a>, which is no big deal as a microformat-aware application needs to parse each microformat it cares about <em>anyway</em>. From an RDF perspective any well-crafted metadata is a plus (and the microformats group do very <a href="http://microformats.org/wiki/process">careful</a> work) as RDF&#8217;s killer app is <a href="http://gondwanaland.com/mlog/2006/03/11/semtech-wrap/">integrating heterogenous data sources</a>.</p>
<p>From a microformats perspecitve RDFa might well be ignored. While transformation of any microformat to RDF is relatively straightforward, transformation of RDF (which is a model, not a format) to microformats is nonsensical (well, I suppose the endpoint of such a transformation could be <a rel="tag" href="http://microformats.org/wiki/xoxo">xoxo</a>, though I&#8217;m not sure what the point would be). Microformats, probably wisely, is <a href="http://rbach.priv.at/Microformats-IRC/2006-05-25#T193311">not reinventing RDF</a> (as many do, usually badly).</p>
<p>So why would RDFa be of interest to developers? In a word, laziness. There is no <a href="http://microformats.org/wiki/process">process</a> to follow for developing an RDF vocabulary (<a href="http://gondwanaland.com/mlog/2005/03/13/semweb-not-by-committee/">ironic</a>), you can freely reuse existing vocabularies and tools, not write your own parsers, and trust that really smart people are figuring out the hard stuff for you (I believe the formal background of the Semantic Web is a long-term win). Or you might just want to, as Ben says &#8220;express metadata about other documents (embedded images)&#8221; which is trivial for RDF as images have URIs.</p>
<p><strong>Addendum 20060601:</strong> The &#8220;simplest&#8221; microformats mentioned above have a name: <a href="http://microformats.org/wiki/elemental-microformat">elemental microformats</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2006/05/29/longtail-metadata/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Semantic Technology Conference wrap</title>
		<link>http://gondwanaland.com/mlog/2006/03/11/semtech-wrap/</link>
		<comments>http://gondwanaland.com/mlog/2006/03/11/semtech-wrap/#comments</comments>
		<pubDate>Sat, 11 Mar 2006 08:24:58 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[SXSW]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2006/03/11/semtech-wrap/</guid>
		<description><![CDATA[The 2006 Semantic Technology Conference was more interesting than I expected. The crowd was older and much more formally dressed and there was far less emphasis on open source solutions than any conference I&#8217;ve attended in a long time but it wasn&#8217;t merely a vendor schmoozefest. James Hendler and Ora Lassila&#8217;s Semantic Web @5 keynote [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.semantic-conference.com">2006 Semantic Technology Conference</a> was more interesting than I expected. The crowd was older and much more formally dressed and there was far less emphasis on open source solutions than any conference I&#8217;ve attended in a long time but it wasn&#8217;t merely a vendor schmoozefest.</p>
<p>James Hendler and Ora Lassila&#8217;s <a href="http://www.semantic-conference.com/2up_BW/Lassila_Hendler-SemTech2006-keynote_bw.pdf">Semantic Web @5</a> keynote claimed that Semantic Web technologies have made great strides over the past five years. They pointed out that middle levels of the <a href="http://www.w3.org/2001/09/06-ecdl/slide17-0.html">Semantic Web layer cake</a> are mature and higher levels are subjects of funded research (in 2001 lower and middle levels were mature and research respectively). Near the end they made a strong call to &#8220;share; give it away!&#8221; &#8212; open source tools, datasets, and harvesters are needed to grow the Semantic Web.</p>
<p>My presentation on <a href="http://www.semantic-conference.com/2up_BW/Linksvayer-Mike-bw.pdf">Semantic Search on the Public Web with Creative Commons</a> went fairly well apart from some audio problems. I began with a hastily added segue (not in the slides) from the keynote, highlighting Science Commons&#8217; <a href="http://creativecommons.org/weblog/entry/5778">database licenseing FAQ and Uniprot</a>.  Questions were all over the map, befitting the topic.</p>
<p>I think Uche Ogbuji&#8217;s <a href="http://www.semantic-conference.com/2up_BW/Ogbuji-Uche-bw.pdf">Microformats: Partial Bridge from XML to the Semantic Web</a> is the first talk I&#8217;ve heard on <a rel="tag" href="http://en.wikipedia.org/wiki/Microformats">microformats</a> that I&#8217;ve heard from a non-cheerleader and was a pretty good introduction to the upsides and downsides of microformats and how <a rel="tag" href="http://en.wikipedia.org/wiki/GRDDL">GRDDL</a> can leverage microformats for officious Semantic Web purposes. My opinion is that the value in microformats hype is in encouraging people to take advantage of XHTML semantics in however a conventional in non-rigorous fashion they may. It is a pipe dream to think that most pages containing microformats will include the correct <code>profile</code> references to allow a spec-following crawler to extract much useful data via GRDDL. Add some convention-following heuristics a crawler may get lots of interesting data from microformatted pages. The big search engines are great at tolerating ambiguity and non-conformance, as they <a href="http://code.google.com/webstats/index.html">must</a>.</p>
<p>Ogbuji&#8217;s talk was the ideal lead in to Ben Adida&#8217;s <a href="http://ben.adida.net/presentations/semantic-2006-03-08.pdf">Interoperable Metadata for a Bottom-Up Semantic Web</a> which hammered home five principles of metadata interoperability: publisher independence, data reuse, self-containment, schema modularity, and schema evolvability. <a rel="tag" href="http://en.wikipedia.org/wiki/XML">XML</a>, <a rel="tag" href="http://en.wikipedia.org/wiki/RDF">RDF/XML</a>, Microformats, GRDDL, and <a href="http://www.w3.org/2001/sw/BestPractices/HTML/2006-01-24-rdfa-primer">RDF/A</a> were evaluated against the principles. It is no surprise that RDF/A came out looking best &#8212; Adida has been chairing the relevant <a href="http://www.w3.org/2001/sw/BestPractices/HTML/">W3C taskforce</a>. I think RDF/A has great promise &#8212; it feels like microformats minus annoyances, or microformats with a model &#8212; but <a rel="tag" href="http://en.wikipedia.org/wiki/Worse_is_better">worse is better</a> may say otherwise. The oddest response to the talk came from someone of the opinion that [X]HTML is irrelevant &#8212; everything should be custom XML rendered with custom XSLT when necessary.</p>
<p>I was somewhat surprised by the strong focus of most talks and vendors on RDF and friends rather than any other &#8220;semantic technologies.&#8221; <a rel="tag" href="http://en.wikipedia.org/wiki/Douglas_Lenat">Doug Lenat</a> was one exception. He apparently claimed last year that <a rel="tag" href="http://en.wikipedia.org/wiki/Cyc">Cyc</a> by this year would be growing primarily through machine learning rather than input by knowledge engineers. A questioner called Lenat on this prediction. Lenat claimed the prediction came true but did not offer any quantatative measure. It looked like from the slides (unavailable) that Cyc can have databases and similar described to it and may access same (e.g., via JDBC), giving it access to an arbitrary number of &#8220;facts.&#8221;</p>
<p>If there was a theme that flowed through the conference it was first integrating heterogenous data sources (I don&#8217;t recall who, but someone characterized semantic technologies as liberating enterprises from <a rel="tag" href="http://en.wikipedia.org/wiki/Enterprise_resource_planning">ERP</a> vendors) and second multiplying the value of that data through linking and inference.</p>
<p>Mills Davis&#8217; <a href="http://www.semantic-conference.com/2up_BW/Davis-Mills-Keynote-2up-bw.pdf">closing keynote</a> blew up these themes, claiming outrageous productivity improvements are coming very shortly due to semantic technologies, including a <a rel="tag" href="http://en.wikipedia.org/wiki/Raymond_Kurzweil">Ray Kurzweil</a> slide. The conference hotel fire alarm went off during the keynote, serving as a hype alert to any willing to hear.</p>
<p>SemTech06 reinforces my confidence in what I said in the <a href="http://gondwanaland.com/mlog/2005/03/18/semweb-ai-java-the-ontological-parallels/">SemWeb, AI, Java: The Ontological Parallels </a> mini-rant given at SXSW last year. Too bad they rejected my proposal for this year:</p>
<blockquote><p><strong>Semantic Web vs. Tags Deathmatch:</strong> Tags are hot, but are they a dead end? The Semantic Web is still a research project, but will it awaken in August, 2009? People in the trenches fight over the benefits and limits of tags, the viability of officious Semantic Web technologies, what the eventual semantic web will look like, and how to bridge the gap between the two.</p></blockquote>
<p>I&#8217;m off to SXSW tomorrow anyway. My <a href="http://creativecommons.org/weblog/entry/5814">schedule</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2006/03/11/semtech-wrap/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Search 2006</title>
		<link>http://gondwanaland.com/mlog/2006/01/14/search-2006/</link>
		<comments>http://gondwanaland.com/mlog/2006/01/14/search-2006/#comments</comments>
		<pubDate>Sun, 15 Jan 2006 03:44:31 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Computers]]></category>
		<category><![CDATA[Creative Commons]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Prediction Markets]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2006/01/14/search-2006/</guid>
		<description><![CDATA[I&#8217;m not going to make new predictions for search this year &#8212; it&#8217;s already underway, and my predictions for 2005 mostly did not come true. I predict that most of them will, in the fullness of time: Metadata-enhanced search. Yahoo! and Google opened Creative Commons windows on their web indices. Interest in semantic markup (e.g., [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m not going to make new predictions for search this year &#8212; it&#8217;s already underway, and my <a href="http://gondwanaland.com/mlog/2004/12/23/search-2005/">predictions for 2005</a> mostly did not come true. I predict that most of them will, in the fullness of time:</p>
<p><strong>Metadata-enhanced search.</strong> <a href="http://creativecommons.org/weblog/entry/5356">Yahoo!</a> and <a href="http://creativecommons.org/weblog/entry/5693">Google</a> opened Creative Commons windows on their web indices. Interest in semantic markup (e.g., <a href="http://microformats.org/blog/2006/01/04/2005-year-in-review/">microformats</a>) increased greatly, but search that really takes advantage of this is a future item. (NB I consider the services enabled by <a rel="tag" href="http://en.wikipedia.org/wiki/Tags">tags</a> more akin to browse than search and as far as I know they don&#8217;t allow combinging tag and keyword queries.)</p>
<p><strong>Proliferation of niche web scale search engines.</strong> Other than a few blog search services, which are very important, I don&#8217;t know of anything that could be called &#8220;web scale&#8221; &#8212; and I don&#8217;t know if blog search could really be called niche. One place to watch is <a href="http://wiki.apache.org/nutch/PublicServers">public search engines using Nutch</a>. Mozdex is attempting to <a href="http://www.libervis.com/modules/wordpress/?p=10">scale up</a>, but I don&#8217;t know that they really have a niche, unless &#8220;using open source software&#8221; is one. Another place is Wikipedia&#8217;s <a href="http://en.wikipedia.org/wiki/List_of_internet_search_engines">list of internet search engines</a>.</p>
<p>On the other hand, weblications (as <a href="http://gondwanaland.com/mlog/2005/12/09/web-20/">Web 2.0</a>) did take off.</p>
<p>I said lots of desktop search innovation was a near certainty, but if so, it wasn&#8217;t very visible. I predicted slow progress on making multimedia work with the web, and I guess there was very slow progress. If there was forward progress on usable security it was slow indeed. Open source did slog toward world domination (e.g., Firefox is <em>the</em> exciting platform for web development, but barely made a dent in <a href="http://www.websidestory.com/news-events/press-releases/view-release.html?id=309&#038;lid=Press+Release+-+309&#038;lpos=News">Internet Explorer&#8217;s market share</a>) with Apple&#8217;s success perhaps being a speed bump. Most things did get cheaper and more efficient, with the visible focus of the semiconductor industry swinging strongly in that direction (they knew about it before 2005).</p>
<p>Last year I riffed on John Battelle&#8217;s predictions. He has a <a href="http://battellemedia.com/archives/002149.php">new round for 2006</a>, one of which was <a href="http://creativecommons.org/weblog/entry/5755">worth noting at Creative Commons</a>.</p>
<p>Speaking of predictions, of course Google <a href="http://gondwanaland.com/mlog/2005/09/22/google-futures/">began using prediction markets internally</a>.  Yahoo!s <a href="http://gondwanaland.com/mlog/2005/03/16/collective-market-intelligence/">Tech Buzz Game</a> has some markets relevant to search but I don&#8217;t know how to interpret the game&#8217;s prices.</p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2006/01/14/search-2006/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Going overboard with Wikipedia tags</title>
		<link>http://gondwanaland.com/mlog/2006/01/12/wikipedia-tags/</link>
		<comments>http://gondwanaland.com/mlog/2006/01/12/wikipedia-tags/#comments</comments>
		<pubDate>Thu, 12 Jan 2006 08:00:59 +0000</pubDate>
		<dc:creator>Mike Linksvayer</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[Peeves]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://gondwanaland.com/mlog/2006/01/12/wikipedia-tags/</guid>
		<description><![CDATA[A frequent correspondent recently complained that my linking to Wikipedia articles about organizations rather than the home pages of organizations is detrimental to the usability of this site, probably spurred by my linking to a stub article about Webjay. I do so for roughly two reasons. First, I consider a Wikipedia link more usable than [...]]]></description>
			<content:encoded><![CDATA[<p>A <a href="http://www.chrisfmasse.com/">frequent correspondent</a> recently complained that my linking to <a rel="tag" href="http://en.wikipedia.org/wiki/Wikipedia">Wikipedia</a> articles about organizations rather than the home pages of organizations is detrimental to the <a rel="tag" href="http://en.wikipedia.org/wiki/Usability">usability</a> of this site, probably spurred by my <a href="http://gondwanaland.com/mlog/2006/01/09/yahoo-lightnet/">linking to a stub article about Webjay</a>.</p>
<p>I do so for roughly two reasons. First, I consider a Wikipedia link more usable than a link to an organization home page. An organization article will link directly to an organization home page, if the latter exists. The reverse is almost never true (though doing so is a great idea). An organization article at Wikipedia is more likely to be objective, succinct, and informational than an organizational home page (not to mention there is no chance of encountering <a rel="tag" href="http://en.wikipedia.org/wiki/Macromedia_Flash">Flash</a>, window resizing, or other annoying distractions &#8212; less charitably, attempts to control my browser &#8212; at Wikipedia). When I hear about something new these days, I nearly always check for a Wikipedia article before looking for an actual website. Finally, I have more confidence that the content of a Wikipedia article will be relevant to the content of my post many years from now.</p>
<p><a rel="tag" href="http://en.wikipedia.org/wiki/Webjay">Webjay</a> (link to <a href="http://webjay.org">webjay.org</a>) is actually a good example of these usability issues. Perhaps I have an unusually strong preference for words, but I think its still very brief Wikipedia article should allow one to understand exactly what Webjay is in under a minute.<sup>1</sup> If I were visiting the Webjay site for the first time, I&#8217;d need to click around awhile to figure the service out &#8212; and Webjay&#8217;s interface is very to the point, unlike many other sites. Years from now I&#8217;d expect webjay.org to be a yet another <a rel="tag" href="http://en.wikipedia.org/wiki/Link_farm">link farm</a> site &#8212; or since the Yahoo! acquisition, to redirect to some Yahoo! property &#8212; or the property of whatever entities own Yahoo! in the future. (Smart browser integration with the <a rel="tag" href="http://en.wikipedia.org/wiki/Internet_Archive">Internet Archive</a>&#8216;s <a href="http://www.archive.org/web/web.php">Wayback Machine</a> could mitigate this problem.)</p>
<p>Anyway, I predict that in the forseeable future your browser will be able to convert a Wikipedia article link into a home page link if that is your preference, aided by <a href="http://gondwanaland.com/mlog/2005/09/03/annotating-wikipedia/">Semantic Mediawiki</a> annotations or similar.</p>
<p>The second reason I link to Wikipedia preferentially<sup>2</sup> is that Wikipedia article URLs conveniently serve as &#8220;<a rel="tag" href="http://en.wikipedia.org/wiki/Tags">tags</a>, as specified by the <code><a rel="tag" href="http://microformats.org/wiki/reltag">rel="tag"</a></code> <a rel="tag" href="http://en.wikipedia.org/wiki/Microformat">microformat</a>. If <a href="http://gondwanaland.com/mlog/2005/09/02/blog-search-stinks/">Technorati</a> and its competitors happen to index this blog this month, it will show up in their tag-based searches, the names of the various Wikipedia articles I&#8217;ve linked to serving to name tags. I&#8217;ve never been enthusiastic about the overall <a href="http://gondwanaland.com/mlog/2005/01/20/not-following-tags/">utility of author applied tags</a>, but I figure linking to Wikipedia is not as bad as linking to a tagreggator.</p>
<p>Also, Wikipedia serves as a tag <a href="http://en.wikipedia.org/wiki/Wikipedia:Disambiguation">disambiguator</a>. Some tagging service is going to use Wikipedia data to disambiguate, cluster, merge, and otherwise enhance tags. I think this is pretty low hanging fruit &#8212; I&#8217;d work on it if I had concentration to spare.</p>
<p><strong>Update:</strong> Chris Masse <a href="http://www.chrisfmasse.com/2/2006/2006-01-11_website_&#038;_RSS_feed.html">responds</a> (see bottom of page). Approximate answer to his question: <a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fwww.tradesports.com">14,000 links to www.tradesports.com</a>, <a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fen.wikipedia.org/wiki/Tradesports">17 links to en.wikipedia.org/wiki/Tradesports</a> (guess where from). I&#8217;ll give Masse convention.</p>
<p>In the same post Masse claims that his own &#8220;following of Jakob Nielsen&#8217;s guidelines is responsible for the very high intergalactic popularity of my Internet presence.&#8221; How very humble of Masse to attribute the modest success of his site to mere guideline following rather than his own content and personality. Unfortunately I think there&#8217;s a missing counterfactual.</p>
<p><small><sup>1</sup> I would think that, having written most of the current Webjay article.</small></p>
<p><small><sup>2</sup> Actually my first link preference is for my past posts to this blog. I figure that if someone is bothering to read my ramblings, they may be interested in my past related ramblings &#8212; and I can use the memory aid.</small></p>
]]></content:encoded>
			<wfw:commentRss>http://gondwanaland.com/mlog/2006/01/12/wikipedia-tags/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
