Which Future Web?

The semantic Web will be a considerable part of the future Web. What is the difference between the semantic Web and artificial intelligence? And what about Web 2.0? Frank van Harmelen, computer scientist in the Netherlands and a specialist in the semantic Web, answers some questions.

The semantic Web initiative is often said to address the same issues that have already been approached for thirty years in Artificial Intelligence. What makes the semantic Web, with its focus on ontologies and reasoning so different?

There is indeed a widespread misconception that the semantic Web is ‘AI all over again’. Even though the two may have tools in common (eg ontologies, reasoning, logic), the goals of the two programmes are entirely different. In fact, the goals of the semantic Web are much more technical and modest. Rather than seeking to build a general-purpose all-encompassing global Internet-based intelligence, the goal is instead to achieve interoperability between data sets that are exposed to the Web, whether they consist of structured, unstructured or semi-structured data.

Tim Berners-Lee (W3C) devoted an entire presentation to the confusion between AI and semantic Web in July 2006 (see the link below). This presentation also does a very good job of busting some of the other myths surrounding the semantic Web, such as that the semantic Web is mainly concerned with hand-annotated text documents, or that the semantic Web requires a single universal ontology to be adopted by all.

Web 2.0 appears to be the new kid on the block - everybody’s darling, loved both by academia and industry. The semantic Web, on the other hand, has fallen from grace, owing to numerous unmet promises. How do you regard the coexistence of these two Webs and what role will Web 2.0 assume in the semantic Web’s story?

My feeling is that this question is based on a false premise, namely that “the semantic Web has fallen from grace, owing to numerous unmet promises”. The SemTech conference, an annual industry-oriented event organised in the past three years in San Jose, California, attracted 300 attendants in 2005, 500 attendants in 2006, and 700+ attendants in 2007. Its European counterpart, the European Semantic Technologies Conference, attracted 200+ attendants to its first event in Vienna in May 2007, of whom 75% were from companies. This would suggest that research and interest in the semantic Web are alive and well.

In fact, semantic technology is in the process of an industrial breakthrough. Here is a quote from a recent (May 2007) Gartner report, the industry watcher not known for its love of short-lived hypes:

“Key finding: During the next ten years, Web-based technologies will improve the ability to embed semantic structures in documents, and create structured vocabularies and ontologies to define terms, concepts and relationships. This will offer extraordinary advances in the visibility and exploitation of information - especially in the ability of systems to interpret documents and infer meaning without human intervention.” And: “The grand vision of the semantic Web will occur in multiple evolutionary steps, and small-scale initiatives are often the best starting points.”

Turning to the substance of your question: There is widespread agreement in the research world that Web 2.0 and semantic Web (or Web 3.0) are complementary rather than in competition. For example, a science panel at the WWW07 conference in May 2006 in Edinburgh came to the following consensus: Web 2.0 has a low threshold (it’s easy to start using it), but also has a low ceiling (folksonomies only get you so far), while Web 3.0 has a higher threshold (higher startup investments), but has a much higher ceiling (more is possible).

The aforementioned Gartner report has useful things to say here as well. It advises the combination of semantic Web with Web 2.0 techniques, and predicts a gradual growth path from the current Web via semantically lightweight but easy to use Web 2.0 techniques to higher-cost/higher-yield Web 3.0 techniques.

And what about automated means of learning ontologies, relationships between entities, and so forth - that is, resorting to natural language processing, text mining, and statistical means of knowledge extraction and inference. Do you regard these techniques as complementary to the manual composition of ontologies or rather inhibitory?

My attitude towards the acquisition of ontologies and the classification of data objects in these ontologies is: if it works, it’s fine. Clearly relying solely on the manual construction of ontologies puts a high cost and a low ceiling on the volume of knowledge that can be coded and classified. Hence, I expect that the techniques you mention will play an ever-bigger role in the range of semantic technologies. I see no reason why such techniques are ‘bound to fail’ ? instead I am rather optimistic about their increasingly valuable contribution.

All great technological inventions and milestones are marked by the advent of a killer application. What could/will be the semantic Web’s killer app? Will there be one at all?

I find the perennial question for the ‘killer app’ always a bit naive. For example: we can surely agree that the widespread adoption of XML was an important technical innovation. But what was XML’s ‘killer app’? Was there a single one? No. Instead there are many places where XML facilitates progress ‘under the hood’. Semantic Web technology is primarily infrastructure technology. And infrastructure technology is under the hood, or in other words, not directly visible to users. One simply notices a number of improvements. Web sites become more personalized, because under the hood semantic Web technology is allowing your personal interest profile to be interoperable with the data sources of the Web site. Search engines provide a better clustering of results, because under the hood they have classified search results in a meaningful ontology. Desktop search tools become able to link the author names on documents with email addresses in your address book, because under the hood, these data formats have been made to interoperate by exposing their semantics. However, none of these applications will have ’semantic Web technology’ written on their interface. Semantic Web technology is like Nikasil coating in the cylinders of a car: very few car drivers are aware of it, but they are aware of reduced fuel consumption, higher top speeds and the extended lifetime of the engine. Semantic Web technology is the Nikasil of the next generation of human-friendly computer applications that are being developed right now.

标签:, , ,


互联网上已有的本体库数量调查

研究语义网的朋友一定想知道,世界上到底已经存在多少本体库了?在这里您可以找到答案 阅读全文 »

标签:, , , ,


语义网的研究问题所在

在目前看来,语义网涉及的主要研究问题包含如下几个方面。

(1)语义网知识表示模型。语义网采用本体作为知识表示模型。本体形式化定义了领域内共同认可的知识,是语义网体系中的核心。因此,如何创建和管理本体是实现语义网上知识表示的基础。此外,本体的扩充、集成和演化等问题也常常需要考虑。

阅读全文 »

标签:, ,


语义网的诞生

为克服自身的不足,Web在其发展过程中也不断进行改进。针对HTML语言所暴露出来的各种局限性。W3C于1996年开始开发一种超越HTML能力的新语言,这就是XML。XMl是Web发展中的一个重要的里程碑,它的贡献主要在于解决了结构化和半结构化数据在语法层次上的互操作问题,为信息语义层的互操作打下了基础,从而大大促进了Web服务、电子商务的发展。但针对Web的改进并不局限于XML,因为虽然XML解决了语法层次的互操作问题,但Web应用仍然难以实现智能化。

阅读全文 »

标签:, ,


Web 3.0为何盖不过 Web 2.0 的风头

在Google搜索中输入“我想去夏威夷度假,但希望花销不超过3000美元,哪条线路和酒店更适合我?”敲回车,共搜索出3460条包含其词语的文档,但没有一条是理想中的答案。

不是说语义搜索能解决这些问题吗?不是到处疯炒Web 3.0的概念吗?语义网到底什么时候才能来到老百姓的身边? 阅读全文 »

标签:, , , , , , ,


The Semantic Web Vision:Where Are We?

这是一篇对语义网现状调查的文章,调查内容有如下:
• demographics (organization type, role in organization,and years of professional experience),
• tools and languages,
• ontology specifics (such as methodology used and the ontology’s purpose),
• ontology size (the number of concepts used in the smallest, largest, and typical ontologies), and
• production (the time frame for deploying developed ontologies and associated systems).

从调查中看出,工具的使用Protégé占68.2%,Swoop和OntoEdit各占13.6%和12.2%,使用语言OWL和RDF Schema占大比重,DAML+OIL只有12%了,其他还有关于语义网用在何处、语义网构建的方法学等等调查。

Download: The Semantic Web Vision Where Are We.pdf  The Semantic Web Vision Where Are We.pdf (224.1 KiB, 155 hits)

标签:, , , , , , ,


《计算机世界》某篇文章如是说——

在万维网日益普及的今天,人们充分体会到网络的巨大魅力。现在,我们可以与处于地球上遥远地方的人进行交流,浏览世界各地的信息,享受网上冲浪的乐趣。但是上过网的人都知道,现在所使用的万维网的功能并不尽如人意,如网页单调枯燥、搜索引擎智能化程度低等。但不管怎样,我们还是可以说,万维网是空前而不绝后的。那么,您是否想过万维网的未来又是什么样的呢? 阅读全文 »

标签:, , ,


忘掉web2.0,抓住web3.0

Web2.0在不到3年的时间迅速掀起了互联网新的狂潮,虽然2.0使得互联网具有更多丰富的内容,比如搜索、照片、音乐、视频、混搭式应用(Mash-Ups)、维基(Wiki)、网络日志(Blog)、社区等等,但是Web2.0本质上并没有创造出更新的盈利模式,仍是靠点击和流量换取广告。甚至,由于互联网创业者和投资者对于Web2.0的大力吹捧,导致越来越多的雷同的互联网公司前呼后拥的出现了,而这些Web2.0企业既缺乏盈利模式,又缺乏稳定真实的服务,在一阵“追星”的热浪之后,消费者开始视觉疲劳,投资者开始失望,互联网再次陷入对于泡沫的探讨与争论之中。

阅读全文 »

标签:, , , ,