Piggy-Bank is a new extension for the Mozilla Firefox browser that allows you to easily browse the semantic data linked to from regular web pages. I’ve seen some other projects along these lines, but they tend to be focused on a particular flavor of RDF data (such as Joel’s FOAFer extension, or Christopher Schmidt’s DOAP Viewer extension).
I’m still not quite sure how Piggy-Bank works, but at the least it’s scraping web pages for any embedded links that have well-recognized types in the Semantic Web, such as “application/rss+xml” (for RSS feeds) and “application/xml+rdf”. It then follows those links and parses out the “information tidbits” from those sources, and presents that information to you in a sidebar. Piggy-Bank attempts to categorize the tidbits into high-level categories, such as “News” for RSS Channel and Item resources, or “Contacts” for FOAF’s “Person” resources. You can save the tidbits of interest in a local database (“My Piggy Bank”) and search through them later; Piggy-Bank remembers the original source of the data and allows you to annotate them with comments as you desire.
In response to the question, “Why was [Piggy-Bank] built?” the developers offer the simple answer:
“Already many researchers and organizations have gotten excited about the promises by the Semantic Web. Through the deployment of this Piggy-Bank extension, we would like to get end-users excited about the Semantic Web, too.”
I’m not sure that Piggy-Bank is the tipping point that’s going to get end-users “excited” about the Semantic Web, but it certainly was easy to install and use.
There are still a few kinks in the system. For example, I found that a lot of pages that do in fact contain links to bunches of semantic data (such as Bob DuCharme’s rdfdata.org site) aren’t properly “scraped,” presumably since they don’t annotate the links using the expected link types. The developers might reasonably argue that this is by design, and that it would be too costly to follow all possible outgoing hyperlinks to try to determine whether they point to any useful RDF data. Nevertheless, it might prove useful to find ways to relax whatever rules that Piggy-Bank is currently using so that it’s able harvest even more semantic data from pages that aren’t necessarily playing by the rules. Another kink is that (at present) it only seems to handle RSS 1.0 feeds. Again, the developers might reasonably argue that since RSS 1.0 is the only version of RSS that parses as “legitimate” RDF, it’s OK for Piggy-Bank to stumble over the RSS 0.9x and 2.0 flavors. That was little solace to me as I had to wade through a number of my friends’ blogs’ RSS feeds before finding one that was actually available in RSS 1.0 format.
I’m anxious to see where this project goes over the coming year.