The question keeps rearing its confusing head: What for Pete's sake is the Semantic Web, and what will it do for the average user? Yet another attempt was last week's panel discussion held by Stanford's VLAB. The names of the panelists sounded promising, but after mulling over my notes I've come to the conclusion that I could have saved myself and the planet several pounds of CO2 by not going.
In the ring were futurist Paul Saffo, Radar Networks founder Nova Spivack, Alex Iskold from Adaptive Blue, VC-blogger Paul Kedrosky and Freebase cofounder Robert Cook. When they weren't entangled in all those geeky acronyms like RDF and OWL, they mostly talked about how search should and could be better and how they will do just that.
Spivack, whose company has been working on its service called Twine for several years now, gave another tantalizing preview of what feats Twine will perform to automate collaboration. But the beta is still limited to a few hundred people and reminds me more and more of the ongoing vaporware performance for alleged Google challenger Powerset than of a real launchable product. Let's finally play with Twine's automated Facebook-ish features -- that's what a recent demo looked like to me -- and see what new meaning of online life we get.
Spivack, whose experience with semantic web technologies and concepts goes back to SRI's and DARPA's CALO Project, predicted that "in 20 to 30 years, almost all human knowledge will be accessible and stored in a machine-understandable fashion." And that appears to be the key to all attempts at harnessing AI for a nice and useful human computer interface: let machines and agents peruse the web so they can haul in, structure, sort and analyze all the stuff for us, bring back the nuggets, perhaps even pre-emptively suggest things we like, care about or should be alerted to.
Things a good butler, assistant or tavel agent would do -- if the geeks and lean and mean management types hadn't killed them off! I tried to make that point in an Economist article several months ago, only to start a debate among Tim O'Reilly and Co. what the proper definition of Web3.0 and the semantic web was.
It's not just standards -- that much I know after last week's disjointed discussion at Stanford -- it's all about meaning and better, i.e. meaning-ful usability for machines and humans alike. I agree with Alex Iskold's superbly insightful posts and comments that it makes little sense to go for the all-out AI assault when there is so much low-hanging fruit out there. We can harvest them by leveraging human ingenuity, our playfulness, self-service motivation, or even vanity in conjunction with existing data sets. Layer the human factor on top or underneath the new technologies and the results can already be pretty amazing.
Adaptive Blue's Blue Organizer understands dozens of concepts and preferences without many bells and whistles or techno-jargon. It's not so much about search and the next Google, but about finally having an able virtual clerk who hovers nearby, who can remind me, remember things, file to-dos, make connections and pull up items when I ask for them. In that sense: Yes, services like Sandy in combination with Jott's transcription offering are definitely part of the rising semantic web, just as TripIt and its trip organizer on steroids is. To deny that is sticking to techie jargon.
Paul Kedrosky (of Infectious Greed fame) made a good point when he said that financial markets are already swirling about in an overlooked semantic web. On any given day, 17 percent of all business and finance stories on the Reuters newswire are machine-produced, with no human involved -- they are not even intended for human consumption. A whopping 55 percent of Reuters finance stories are only "read" by machines, according to the VC. "Wall Street is way out front," claimed Kedrosky, in the way autonomous software consumes, processes and extracts signals from information and turn it into actionable bits and pieces. That type of semantic web technology makes humans money today.
The AI hype has a sounder basis this time around because the data and the processing are decentralized and not just going on in a few big labs. Which raises an intricately connected problem.
How nice would it be if the data, my data, was actually open? If I could get my ratings on Netflix or any other site as a flow of personal insights, preferences and transactions that other services could reach into, as Dave Winer pointed out? Right now, chances are slim that such openness will really spread. It's valuable, proprietary stuff. And it lowers the barrier to entry and exit to let users or their software agents wander about with a cornucopia of transactions and a history of their engagement. Facebook's Beacon has it backwards, since it assumes opt-in to semi-open data and exploits the very users who entrusted their "social graph" to it. It's the latest twist on the Faustian bargain of "free Web services."
It's shame, because without such portability into which software and eventually their human masters can tap, neither the semantic web nor the so-called attention economy have a chance to go beyond baby steps and hyped-up announcements from start-ups.
In the meantime, the big Net players monetize our data by offering allegedly "free services" like beads to savages. We leave tons of functionality, meaning and money on the table. If we can harvest and control access to our own data and presence streams and give new apps access to it so they can wring meaning out of it (instead of Comcast & Co. selling our clickstreams behind our backs) -- then we can rightfully speak of the advent of the semantic web.
Or, to use Alex Iskold's closing words at the Stanford event: "My biggest fear is that AI will eat us." That evening at the VLAB event, it certainly didn't. It just yawned and showed some baby teeth.