Semantic Web: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Justin C. Klein Keane
(added wordpress links)
imported>Justin C. Klein Keane
(Added more data about what semantic web is.)
Line 3: Line 3:
{{TOC|right}}
{{TOC|right}}


The '''Semantic web''' is a concept, first named by [[Tim Berners-Lee]], for a "web of knowledge" in which [[world wide web]] documents' contents would be annotated and classified so that computers can parse the classifications and provide search results based on the semantic information (what the content means), rather than simply on matching of text strings.  There is also a W3C standards effort<ref name="SemanticWebFaq">
The '''Semantic web''' (often referred to as  Web 3.0<ref name="web3.0">{{cite web|url=http://www.nytimes.com/2006/11/12/business/12web.html|title=Entrepreneurs See a Web Guided by Common Sense|publisher=[http://www.nytimes.com/ New York Times]|year=2006|accessdat=2010-08-04}}</ref>) is a concept, first named by [[Tim Berners-Lee]], for a "web of knowledge" in which [[world wide web]] documents' contents would be annotated and classified so that computers can parse the classifications and provide search results based on the semantic information (what the content means), rather than simply on matching of text strings.  There is also a W3C standards effort<ref name="SemanticWebFaq">
{{cite web|
{{cite web|
url=http://www.w3.org/2001/sw/SW-FAQ|
url=http://www.w3.org/2001/sw/SW-FAQ|
title=W3C Semantic Web Frequently Asked Questions|
title=W3C Semantic Web Frequently Asked Questions|
publisher=[[http://www.w3.org/]]|
publisher=[http://www.w3.org/ W3C]|
year=2010|
year=2010|
accessdate=2010-07-11}}
accessdate=2010-07-11}}
</ref> related to this concept.
</ref> related to this concept.  Semantic web was developed to meet a specific deficiency in web based communications.  Although well defined in [[RFC]]'s, [[HTTP]] is architected to perform exchange of information that is delimited and optimized for presentation.  That is, the use of [HTML] is designed to communicate the appearance of documents within web browsers.  This is wonderful when attempting to create a document that will render in the same form across multiple platforms (or web browsers) but is problematic for transmitting meaning of data.  There are a few HTML specifications (notably META tags and other document head elements<ref name="HTML head">{{cite web|url=http://www.w3.org/TR/html401/struct/global.html#h-7.4.4|title=The global structure of an HTML document|publisher=[http://www.w3.org/ W3C]}}</ref>) that convey meaning, but these are precious few. 
 
In order to associate meaning with content, Semantic Web utilizes structures for categorization.  While a web page about soccer might specify how pictures and text should be arranged, what colors and font to use, and other presentation data, a similar Semantic Web document would convey the fact that the data pertained to the sport of soccer, perhaps a list of teams, scores of recent matches, and other data in categorization containers.  This presentation allows other consumers (mainly programs) of the data to parse and utilize the data in meaningful ways.  As opposed to modern web crawlers which must catalogue, index, and apply a certain amount of artificial intelligence to derive the meaning of documents on the web, semantic web allows data to be parsed easily for meaning - ultimately resulting in greater ability to share information.


What differentiates the Semantic Web from existing data structures is the use of [[Unique Resource Indicator|URIs]] to uniquely identify things, and relationships between things. The sort of problem scenario that Semantic Web technologies try to solve are those involving multiple disparate source of data - for instance, hooking together train timetables and class timetables, so a student can automatically plan their travel itinerary without having to manually match the data together.
What differentiates the Semantic Web from existing data structures is the use of [[Unique Resource Indicator|URIs]] to uniquely identify things, and relationships between things. The sort of problem scenario that Semantic Web technologies try to solve are those involving multiple disparate source of data - for instance, hooking together train timetables and class timetables, so a student can automatically plan their travel itinerary without having to manually match the data together.
Semantic web is closely tied to [[microformats]] with are an alternative way to embed meaning into HTML documents.  Microformats use standard HTML tags along with generally agreed upon conventions for attributes, in order to delineate certain data within documents.  For instance, microformats can be used to embed contact data or calendar data in web pages for easy integration with other programs.  This can allow users of popular calendaring or contact management software to simply click on elements within web pages and import calendar events, or contacts, directly into their calendaring or address book software.<ref name="microformats">{{cite web|url=http://microformats.org/wiki/hcalendar|title=Microformats hCal example|publisher=[http://microformats.org/ Microformats.org]|year=2010}}</ref>


The [[World Wide Web Consortium|W3C]] have put forward a variety of standards built on top of the [[Resource Description Framework]], a formal semantic model for representing things and the relationships between them.
The [[World Wide Web Consortium|W3C]] have put forward a variety of standards built on top of the [[Resource Description Framework]], a formal semantic model for representing things and the relationships between them.

Revision as of 09:13, 7 August 2010

All unapproved Citizendium articles may contain errors of fact, bias, grammar etc. A version of an article is unapproved unless it is marked as citable with a dedicated green template at the top of the page, as in this version of the 'Biology' article. Citable articles are intended to be of reasonably high quality. The participants in the Citizendium project make no representations about the reliability of Citizendium articles or, generally, their suitability for any purpose.

Nuvola apps kbounce green.png
Nuvola apps kbounce green.png
This article is currently being developed as part of an Eduzendium student project. The course homepage can be found at CZ:Special_Topics_2010.
To provide students with experience in collaboration, you are warmly invited to join in here, or to leave comments on the discussion page. The anticipated date of course completion is 13 August 2010. One month after that date at the latest, this notice shall be removed.
Besides, many other Citizendium articles welcome your collaboration!


This article is a stub and thus not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
This editable Main Article is under development and subject to a disclaimer.

The Semantic web (often referred to as Web 3.0[1]) is a concept, first named by Tim Berners-Lee, for a "web of knowledge" in which world wide web documents' contents would be annotated and classified so that computers can parse the classifications and provide search results based on the semantic information (what the content means), rather than simply on matching of text strings. There is also a W3C standards effort[2] related to this concept. Semantic web was developed to meet a specific deficiency in web based communications. Although well defined in RFC's, HTTP is architected to perform exchange of information that is delimited and optimized for presentation. That is, the use of [HTML] is designed to communicate the appearance of documents within web browsers. This is wonderful when attempting to create a document that will render in the same form across multiple platforms (or web browsers) but is problematic for transmitting meaning of data. There are a few HTML specifications (notably META tags and other document head elements[3]) that convey meaning, but these are precious few.

In order to associate meaning with content, Semantic Web utilizes structures for categorization. While a web page about soccer might specify how pictures and text should be arranged, what colors and font to use, and other presentation data, a similar Semantic Web document would convey the fact that the data pertained to the sport of soccer, perhaps a list of teams, scores of recent matches, and other data in categorization containers. This presentation allows other consumers (mainly programs) of the data to parse and utilize the data in meaningful ways. As opposed to modern web crawlers which must catalogue, index, and apply a certain amount of artificial intelligence to derive the meaning of documents on the web, semantic web allows data to be parsed easily for meaning - ultimately resulting in greater ability to share information.

What differentiates the Semantic Web from existing data structures is the use of URIs to uniquely identify things, and relationships between things. The sort of problem scenario that Semantic Web technologies try to solve are those involving multiple disparate source of data - for instance, hooking together train timetables and class timetables, so a student can automatically plan their travel itinerary without having to manually match the data together.

Semantic web is closely tied to microformats with are an alternative way to embed meaning into HTML documents. Microformats use standard HTML tags along with generally agreed upon conventions for attributes, in order to delineate certain data within documents. For instance, microformats can be used to embed contact data or calendar data in web pages for easy integration with other programs. This can allow users of popular calendaring or contact management software to simply click on elements within web pages and import calendar events, or contacts, directly into their calendaring or address book software.[4]

The W3C have put forward a variety of standards built on top of the Resource Description Framework, a formal semantic model for representing things and the relationships between them.

Domain-specific semantic models

Medicine

Semantic models seem the major trend in expert support to medicine. As an example of how semantic methodologies are used, consider several isolated concepts, which could be considered "nouns":

One of the notations for relationships is the Unified Medical Language System® (UMLS®). Informally, some of the "verb" semantic relationships among the above could be:

  • beta-adrenergic antagonists TREAT hypertension and benign hand tremor
  • beta-adrenergic antagonists CAUSE bradycardia
  • beta-adrenergic antagonists TRIGGER asthma

"Hypertension" would have a number of other TREATS relations, from drug classes such as thiazide diuretics, angiotensin-II converting enzyme antagonists, calcium channel blockers, angiotensin-II receptor blockers, etc.

ULMS is now being extended with formal ontologies: [5]

Semantic Web in CMS

Content management systems (CMS) can benefit greatly from RDF features. RDF is an expressive means by which CMS can both publish and consume data. Because RDF makes data more easily machine readable it is perfect for systems that integrate data (such as CMS).

Drupal

The Drupal content management system is making a big push to include RDF and semantic web as part of the upcoming Drupal 7 release. There is a Drupal group devoted to semantic web as well as a code sprint devoted to the topic. With over significant and growing market share of CMS, Drupal's support of semantic web will mean a vast increase in implementation of RDF.

Wordpress

Wordpress has several third party plugins that implement RDF.

Other notable uses

Facebook recently announced support for open graph protocol which is an RDF implementation of semantic web.


References