It seems like it is a daily dose of semantic web on the tech blogs of late. Today it was Textwise’s Million Dollar Semantic Hacker Challenge and a few days ago it was Yahoo opening their search platform to support a wide variety of semantic web standards. This has lead to a good bit of proselytizing, mostly in the comments, that this heralds the arrival of the Semantic Web, or Web 3.0 or the Next Generation Web. All of which sounds like the circling of the marketing band wagons.

Unfortunately when the wagons circle everything starts picking up the label - in this case semantic. This is especially dangerous when you have a word like “semantics” that can be defined, so many different ways. Just look at the definition tree created by Wikipedia:

*Semantics is the study of meaning in communication.
*In computer science semantics reflects the meaning of programs or functions.
*The Semantic Web refers to the extension of the World Wide Web through the embedding of additional semantic metadata

More often I see folks labeling things semantic that are really syntax. “Syntax” being the rules to construct and define something like a sentence or line of code and “semantics” the meaning of those rules or definitions. Syntax is fairly easy and semantics are fairly hard, as most folks in artificial intelligence would argue. Even going so far as saying all programming languages other than LISP are syntax and not semantic.

This is a bit more clear with an example. Lets take the Textwise announcement - a technology that will parse plain text on a website or elsewhere and categorizes it to predefined topics. One example in the Techcrunch comments was the following:

input text:
Call us crazy, but we think there are some brilliant minds out there that can find some really amazing uses for this incredibly powerful and scalable technology. Think you’re up to the Challenge? We think you are!

categories (ranked from 0 (worst) to 100 (best)):
Shopping/Health/Alternative/Hypnotherapy/Audio_and_Video 43 Business/Telecommunications/Services/Wireless/Software 33 Arts/Music/Bands_and_Artists/311/Tablature 28
Computers/Internet/Consultants/Research 26 Shopping/Health/Alternative/Meditation/Audio_and_Video 25

The output is really not telling me anything about the meaning of the text just setting up rules to provide categorization. So I would definitely put this in the syntax and not semantic category. I would also say what Yahoo! is doing is really more syntax than semantics although there is the possibility of building truly semantic technologies on top of what they are enabling. They’ve created a set of rules based on rich standards to allow applications to be built. Remains to be seen what will come of it, but in rush of market buzz I think it is easy to miss that building truly semantic technologies is quite hard. Some folks in AI (the Chinese room) would argue machines are not even capable of semantic meaning or understanding.

From this perspective I think we’ll see a lot of people building applications based on syntax that reorganize and categorize content by giving the “page web” a bit of structure. Oddly its like we’ve gone full circle back to DMOZ. While these technologies may be clever and useful I do not think they will fundamentally change the Web. In the other category I think we’ll see a few companies pushing towards something more sophisticated (call it a semantic, implicit, computational web) where new data and services are mixed with existing web content to provide answers to users questions.

Popularity: 33% [?]

The Utility of Maps in Hazard Forecasting

The recent wildfires in Southern California remind of us of just how important hazard forecasting has become in helping to ensure the safety and welfare of the public and the role that mapping can play in the process. Short-term forecasts of fire direction and intensity were pivotal in containment and evacuation efforts; Mapping played a prominent role in generating forecasts and in disseminating and sharing information about potential risk.

The usefulness of maps in visualizing and and generating forecasts extends well beyond the California fire event. In the area of climate prediction, numerous sites provide regularly updated maps of long-term and short-term forecasts of a variety of conditions and in some cases, valuable watches and warnings to the public based on the forecasts.

Some Points for Discussion

While the information that is currently out there provides great utility, there are some limitations in the way that the information is is disseminated and formatted that are worth noting. The points are intended to be food for thought and to get us thinking about how we can increase the value of forecasting even further - particularly in an interactive, web-based mapping environment.

First, forecasts are scattered across multiple websites and even within websites, requiring some effort and time on the part of the consumer to find, extract and process information. The sites and links vary in terms of the information they provide. In terms of fire forecasting, some sites focus on drought conditions, others on smoke generation and yet others on combinations of factors to characteristic future fire potential. The forecasting horizons also vary considerably from site to site.

Second, much of the maps provided on the web are in a “hard copy” format and not in an interactive mode where the user can pan, zoom and perform other functions. Some sites do have map viewers however, they are currently limited in the amount and type of data that can be displayed.

Third, and related to the second point, is that the possibility for “layering” data to create custom maps with richer information relevant to the needs of the user is limited. For example, someone may be interested in seeing if an environmentally sensitive or protected area is in the path of a projected wildfire.

Fourth, there lacks a mechanism for consumers and providers of the forecasts to interact and share information. Interaction could be very useful in understanding forecasts but also in terms of improving current predictive models. In the book Making Climate Forecasts Better, Stern and Easterling write: “The utility of forecasts can be increased by systematic efforts to bring scientific output and users’ needs closer together. These efforts may include both analytic efforts to identify the climatic parameters to which particular sectors or groups are highly sensitive or vulnerable and social processes that foster continual interaction between the producers and the consumers of forecasts.”

Fifth, not all information is publicly available and perhaps it should be? In climate forecasting, having access to the “best” information is in the national interest: it can save lives. And in some cases, the private sector is the keeper of such information. A recent study by ForecastWatch, found that in terms of recent historical forecasting of next day rain and snow, government sites had a 21% greater error rate than some of the private companies that do similar projections.

What Could the Future Hold?

The new web is fertile for the development of a system by which forecasts can be provided to the public in a more usable, digestible and efficient manner. Sites like Geocommons could be a one-stop location for viewing forecasts, such as those related to hazards and climatic conditions. In such an environment, visitors could interact with each other or the producers of the forecasts, discuss the validity of the forecasts or provide additional information to augment the projections, all through a wiki or blog-style environment. They could also create custom forecast maps with overlays of additional information that is of most useful to them for solving a problem, understanding a situation or simply planning ahead.

Popularity: 21% [?]

A few weeks ago James Fee wrote a blog post about a debate he had with Steven Citron-Pousty concerning the usefulness of GeoCommons. From a high level the argument came down to the GIS vs. Neogeogrpahy debate. There were great quotes on both sides like “freaking sweet” in support and “pretty worthless” in the bashing category. Over all we were excited to see that GeoCommons had started a debate in the GIS community. The intent of GeoCommons had originally not been to provide a resource to the GIS community, but to provide access to GIS data and a few tools to the rest of the world. The GIS community always had access to the data and the tools, so I had figured GeoCommons would not even pop up on the radar. In hindsight I think we should be providing mutual resources to each other, so with that in mind here are few thoughts on the topic.

Read the rest of this entry »

Popularity: 9% [?]