World Wide Web: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Konstantin Tchernov
m (→‎Web Addresses: cite with RFC link)
mNo edit summary
 
(84 intermediate revisions by 16 users not shown)
Line 1: Line 1:
:''"The Web" and "WWW" redirect here. For other uses, see [[Web]] and [[WWW (disambiguation)]]. For the world's first browser, see [[WorldWideWeb]].''
{{subpages}}
 
:''"The Web" redirects here.''  
[[Image:WWWlogo.png|thumb|120px|right|WWW's historical logo designed by [[Robert Cailliau]]]]
[[Image:WWWlogo.png|thumb|120px|right|WWW's historical logo designed by [[Robert Cailliau]]]]
The '''World Wide Web''' ("'''WWW'''" or simply the  "'''Web'''") is a global collection of documents hosted on computers and available to the public.  These documents include text files, images, videos, sound files and many other types of information.  Individual piece of informations on the Web referred to as ''resources'', and each is identified by a short, unique, global identifier called [[Uniform Resource Identifier]] (URI) so that each can be found, accessed and cross referenced in the simplest possible way.
The '''World Wide Web''' ("'''WWW'''" or simply "'''the Web'''") is a global collection of information presented in the form of documents hosted on networked [[computer|computers]] and available to the public.  The information includes text files, images, videos, sound files and many other types of information.  Born in the early 1990's, it quickly became a way to share electronic documents publicly and freely.  The documents containing information are delivered to individuals over a global computer network called the [[Internet]] and using a communications protocol called [[HTTP]].  After connecting to the Internet, people can ''browse the Web'' by running a [[web browser]] on their local computer and using the web browser to ''surf'' (move around at will) through the vast array of available web documentsConnecting to the Internet also enables people to use many other services which are not necessarily considered to be part of the World Wide Web, such as [[email]], instant messaging or digital telephony.
 
The term is often mistakenly used as a synonym for the [[Internet]] itself, but the Web is actually only a ''collection of documents''.  While the Internet is the global ''[[network]]'' that connects a vast number of smaller networks.  The Web is accessed over the Internet, as are other services that are not part of the Web - like [[e-mail]], [[instant messaging]] and [[Voice over IP]].<ref>Halsal, p. 359, 568</ref>
 
==Basic terms==
The World Wide Web is the combination of 4 basic ideas:
*[[Hypertext]]: a format of information which allows one, in a computer environment, to move from one part of a document to another or from one document to another through internal connections among these documents (called "[[hyperlink]]s");
*[[Uniform Resource Identifier|Resource Identifiers]]: unique identifiers used to locate a particular resource (computer file, document or other resource) on the network;
*The [[Client-server]] model of computing: a system in which client software or a client computer makes requests of server software or a server computer that provides the client with resources or services, such as data or files; and
*[[Markup language]]: characters or codes embedded in text which indicate structure, [[Semantic Web|semantic meaning]], or advice on presentation.
 
Web pages are often arranged in collections of related material called "[[website]]s." The act of following hyperlinks from one website to another is referred to as ''"[[Web browser|browsing]]"'' or sometimes as ''"surfing"'' the Web. 
 
The phrase "surfing the Internet" was first popularized in print by [[Jean Armour Polly]], a [[librarian]], in an article called ''Surfing the INTERNET'', published in the [[University of Minnesota]] ''Wilson Library Bulletin'' in June, 1992. Although Polly may have developed the phrase independently, slightly earlier uses of similar terms have been found on the [[Usenet]] from 1991 and 1992, and some recollections claim it was also used verbally in the hacker community for a couple years before that. Polly is famous as "[[NetMom]]" in the history of the Internet.
 
Although the [[English (language)|English]] word ''worldwide'' is normally written as one word (without a space or hyphen), the proper name '''World Wide Web''' and abbreviation '''WWW''' are now well-established even in formal English.  The earliest references to the Web called it the [[WorldWideWeb]] (an example of computer programmers' fondness for [[CamelCase]]) or the ''World-Wide Web'' (with a hyphen, this version of the name is the closest to normal English usage).
 
Ironically, the abbreviation "WWW" contains two or three times as many [[syllables]] (depending on accent) as the full term "World Wide Web".  There are multiple [[Pronunciation of "www"|pronunciations of "www']].
 
==How the Web works ==
Web pages, and other files (such as images and videos), on the World Wide Web are stored on many different [[web server]]s located all around the world.  These web servers are all connected to the [[the Internet]].  The user accesses the Web through the Internet using a [[user agent]] program, such as a [[web browser]] (a type of user agent that [[rendering (computer graphics)|renders]] and displays the requested web page to the user).  The user can navigate to different pages on the web by either typing in the [[Uniform Resource Identifier]] (URI)<ref>For historical reasons, the [[URI]] (Uniform Resource Identifier) is often referred to as the [[URL]] (Uniform Resource Locator), however this is not always the correct term to use - see [[URL#A popular synonym for "URI"]] for more information.</ref>, also referred to as the "web address" (for example ''<nowiki>http://pilot.citizendium.org/wiki/World_Wide_Web</nowiki>'') into the web browser, or by following a [[hyperlink]] from another web page.  The user agent uses this URI to figure out which web server to ask for which resource and using which [[communication protocol]].  Unless a problem has occured, the server sends back the requested resource.
 
===Caching===
If the user returns to a page fairly soon, it is likely that the data will not be retrieved from the source web server, as above, again.  By default, browsers [[cache]] all web resources on the local hard drive.  An [[HTTP]] request will be sent by the browser that asks for the data ''only if it has been updated since the last download''. If it has not, the cached version will be reused in the rendering step. 
 
This is particularly valuable in reducing the amount of web traffic on the internet.  The decision about expiry is made independently for each resource (image, [[Cascading Style Sheets|stylesheet]], [[JavaScript]] file etc, as well as for the HTML itself).  Thus even on sites with highly dynamic content, many of the basic resources are only supplied once per session or less.  It is worth it for any web site designer to collect all the CSS and JavaScript into a few site-wide files so that they can be downloaded into users' caches and reduce page download times and demands on the server.
 
There are other components of the internet that can cache web contentThe most common in practice are often built into corporate and academic firewalls where they cache web resources requested by one user for the benefit of all.
 
Apart from the facilities built into web servers that can ascertain when physical files have been updated, it is possible for designers of dynamically generated web pages to control the HTTP headers sent back to requesting users, so that pages are not cached when they should not be — for example internet banking and news pages.
 
This helps with understanding the difference between the HTTP 'GET' and 'POST' verbs — data requested with a GET may be cached, if other conditions are met, whereas data obtained after POSTing information to the server usually will not.
 
==Web Addresses==
{{for2|a more technical description of URI syntax|[[URI scheme#Generic syntax]]}}
On the Internet, a site is commonly identified to users by a [[domain name]] (such as [http://www.google.com www.google.com]), while computers usually use [[Internet Protocol]] (IP) addresses (such as [http://209.85.135.104 209.85.135.104]).  To avoid forcing users to remember complex strings of numbers, browsers use [[Domain Name Service]] (DNS) routers, which act as phonebooks, matching the domain name with an IP address.
 
A web address consists of several parts.  An example URI might be:<ref>{{cite web|title=RFC 2616|publisher=[[The Internet Society]]|year=1999|url=http://www.rfc-editor.org/rfc/rfc2616.txt|work=Uniform Resource Identifer (URI) Schemes|accessdate=2007-01-17}}</ref>
<pre>http://www.w3.org:80/Consortium/activities.html#HTMLActivity</pre>


*'''<nowiki>http://</nowiki>''' tells the browser to use the [[HTTP]] [[protocol]].
==Web technical specifications==
*'''<nowiki>www.w3.org</nowiki>''' refers to the specific server. The browser would look this up with DNS.
The World Wide Web is implemented by software which adheres at least to the following three standards:
*'''<nowiki>:80</nowiki>''' refers to the [[port (networking) port]]. The default HTTP port is 80, and generally isn't included in an address.
*the ''Uniform Resource Identifier'' (URI)<ref name="URIprotocol">{{cite web|url=http://www.ietf.org/rfc/rfc3986.txt|title=Request for Comments: 3986, Uniform Resource Identifier (URI): Generic Syntax|publisher=IETF Network Working Group|date=January 2005|accessdate=2007-04-02}}</ref>, which is a universal system for referencing resources (i.e., pages or files) on the Web
*'''<nowiki>/Consortium/</nowiki>''' is the [[file path|path]] on the server.  Like on desktops, most websites are organized in a folder-based [[heirarchy]].
*the ''HyperText Transfer Protocol'' (HTTP)<ref name="HTTP10protocol">{{cite web|url=http://www.w3.org/Protocols/rfc1945/rfc1945|title=Request for Comments: 1945, Hypertext Transfer Protocol -- HTTP/1.0|publisher=IETF Network Working Group|date=May 1996|accessdate=2007-04-02}}</ref>, which specifies how the browser and server communicate with each other
*'''<nowiki>activities.html</nowiki>''' refers to the specific web page.
*the ''HyperText Markup Language'' (HTML)<ref name="HTML32spec">{{cite web|url=http://www.w3.org/TR/REC-html32.html|title=HTML 3.2 Reference Specification|publisher=World Wide Web Consortium (W3C)|date=14-Jan-1997|accessdate=2007-04-02}}</ref>, used to define the structure and content of hypertext documents.
*'''<nowiki>#HTMLActivity</nowiki>''' is called a "fragment".  It directs the browser to a section on the actual page.
 
Web addresses frequently start with "www".  The first part of the web address indicates the specific service you wish to access.  In the case of [http://news.google.com news.google.com], this means "Google.com's website, specifically the servers that handle <nowiki>'news'</nowiki>."  The reason that "www" is so common is that customarily, different services or protocols are handled under different hostnames.  For instance, public [[File transfer protocol|FTP]] was traditionally done from "ftp.name.org", while [[Gopher protocol|Gopher]] would have been handled by "gopher.name.org".  This convention predated the World Wide Web, and so organizations began calling web servers "www" servers.  Some sites still require the "www", while other sites don't.


==Origins==
==Origins==
{{main|History of the World Wide Web}}
By 1990, an entire economy had already grown around the demand for [[Internet]] access, but much functionality on the Internet was not available to non-technical users.
{{seealso|History of the Internet}}
The primary author, and the person who first implemented (or organized the implementation of) the above [[Request for Comments|RFC's]] was [[Tim Berners-Lee]] of [[CERN]], although several other people were also marginally involved. The Internet had existed in some form for perhaps twenty years before this--and it had grown--but the means of sharing information over the Internet were neither available to large numbers of people, nor friendly enough to accommodate non-technical users.  Berners-Lee's proposals allowed information to be shared easily to a non-technical audience.  
[[Image:FirstWebServer.jpg|thumb|This NeXTcube used by [[Tim Berners-Lee|Berners-Lee]] at CERN became the first Web server.]]
The underlying ideas of the Web can be traced as far back as [[1980]], when, at [[CERN]] in [[Switzerland]], [[Tim Berners-Lee]] built [[ENQUIRE]] (referring to ''[[Enquire Within Upon Everything]]'', a book he recalled from his youth). While it was rather different from the Web we use today, it contained many of the same core ideas (and even some of the ideas of Berners-Lee's next project after the WWW, the [[Semantic Web]]).  


In March 1989, Tim Berners-Lee wrote ''[http://www.w3.org/History/1989/proposal.html Information Management: A Proposal]'', which referenced ENQUIRE and described a more elaborate information management system. With help from [[Robert Cailliau]], he published a more formal [http://www.w3.org/Proposal proposal for the World Wide Web] on [[November 12]], [[1990]].  
Berners-Lee devised the first web browser to fetch and display documents containing [[hyperlink]]s, which when clicked brought down additional documents over the Internet. His version of [[hypertext]], which was an area of active research at many institutions during the 1980's, was combined with an elegant and simple protocol ([[HTTP]]) for using the existing Internet as transport, and the ideas took off very quickly.  The availability of HTML documents and free web browsers accelerated the growth of the Internet even more.


A [[NeXTcube]] was used by Berners-Lee as the world's first [[web server]] and also to write the first [[web browser]], [[WorldWideWeb]] in 1990.
Equally important with his technical expertise was Berners-Lee's vision that the Internet should remain a free service available to anyone.  The Web made its debut as a publicly available service on August 6, 1991.<ref name="BriefHistoryOfWeb">{{cite newsgroup|title=Re: Qualifiers on Hypertext links...|author=[[Tim Berners-Lee]]|date=1991-08-06|newsgroup=alt.hypertext|id=6484@cernvax.cern.ch|url = http://groups.google.com/group/alt.hypertext/msg/06dad279804cb3ba?dmode=source&hl=en| accessdate = 2007-01-19
}}
:A first release of the code followed [http://groups.google.com/group/comp.sys.next.announce/browse_thread/thread/6af5808c84a771fc/42c02b1b5992dd3 two weeks later].
</ref>  On April 30, 1993, [[CERN]] announced that the World Wide Web would be free to anyone.<ref>{{cite press release|title=Statement concerning CERN W3 software release into public domain|publisher=[[CERN]]|date=1993-04-30|url=http://intranet.cern.ch/Chronological/Announcements/CERNAnnouncements/2003/04-30TenYearsWWW/Welcome.html|accessdate=2007-01-19}}</ref> This came two months after the announcement that [[Gopher protocol#Decline|Gopher]], the older distributed document protocol, was no longer free to use.<ref>{{cite newsgroup|senddate=1993-03-11|email=http://www.nic.funet.fi/pub/vms/networking/gopher/gopher-software-licensing-policy.ancient|title=University of Minnesota Gopher software licensing policy|author=The Minnesota Gopher Team|accessdate=2007-01-19}}</ref>  In the intervening years, the World Wide Web has managed to survived in a low-cost form available to large numbers of people, but not without considerable social struggle to keep it so.  Multiple attempts have been made (and likely will be made again) to tax commerce performed through the Web, or to censor its content (in fact, some countries do attempt to censor information flow from the Web, with varied degrees of success).


By Christmas [[1990]], Berners-Lee had built all the tools necessary for a working Web [http://www.w3.org/People/Berners-Lee/WorldWideWeb]: the [[WorldWideWeb|first Web browser]] (which was a Web editor as well), the first Web server and the [http://www.w3.org/History/19921103-hypertext/hypertext/WWW/TheProject.html first Web pages] which described the project itself.
The World Wide Web began an astronomical growth in popularity after the 1993 release of the graphical [[Mosaic web browser]] by the [[National Center for Supercomputing Applications]] (NCSA) developed by [[Marc Andreessen]]. Prior to the release of Mosaic, graphics were not commonly mixed with text in web pages and its popularity was less than older protocols in use over the Internet, such as [[Gopher protocol]] and [[Wide Area Information Server]]. Mosaic's graphical user interface allowed the Web to become by far the most popular Internet application.


On [[August 6]], [[1991]], he posted a [http://groups.google.com/groups?selm=6487%40cernvax.cern.ch short summary of the World Wide Web project] on the <tt>alt.hypertext</tt> [[newsgroup]]. This date also marked the debut of the Web as a publicly available service on the Internet.
==A disruptive technology==
The crucial underlying concept of [[hypertext]] originated with older projects from the 1960s, such as [[Ted Nelson]]'s [[Project Xanadu]] and [[Douglas Engelbart]]'s [[NLS (computer system)|oN-Line System]] (NLS)Both Nelson and Engelbart were in turn inspired by [[Vannevar Bush]]'s [[microfilm]]-based "[[memex]]," which was described in the 1945 essay "[[As We May Think]]".
The fact that individuals outside [[mass media]] can easily publish on the Web has made the World Wide Web a ''disruptive''<ref>A ''disruptive technology'' is one which shakes up the status quo by invoking rapid, sometimes painful, change.</ref> influence on journalism. In order to "publish" a web page, one does not have to go through a [[publisher]] or other media institution, and potential readers could be found in all corners of the globe. The increased opportunity to publish materials is certainly observable in the countless personal pages, as well as pages by families, small shops, etc., facilitated by the emergence of free [[web hosting]] servicesIt's free to post some smaller web pages, and even larger sites are inexpensive in comparison to traditional media.


Berners-Lee's breakthrough was to marry hypertext to the Internet. In his book ''Weaving The Web,'' he explains that he had repeatedly suggested that a marriage between the two technologies was possible to members of ''both'' technical communities, but when no one took up his invitation, he finally tackled the project himself. In the process, he developed a system of globally unique identifiers for resources on the Web and elsewhere: the [[Uniform Resource Identifier]].
Unlike [[book]]s and documents, hypertext does not have a linear order from beginning to end. It is not broken down into the hierarchy of chapters, sections, subsections, etc. This allows readers to easily find more on a topic, move to other related topics, or skip sections they're uninterested in.


The World Wide Web had a number of differences from other hypertext systems that were then available:
Many different kinds of information are now available on the Web, and for those who wish to know other societies, their cultures and peoples, it has become easier. When traveling in a foreign country or a remote town, one might be able to find some information about the place on the Web, especially if the place is in one of the developed countries. Local newspapers, government publications, and other materials are easier to access, and therefore the variety of information obtainable with the same effort may be said to have increased, for the users of the Internet.  
*The WWW required only [[unidirectional]] links rather than [[bidirectional]] ones. This made it possible for someone to link to another resource without action by the owner of that resource. It also significantly reduced the difficulty of implementing Web servers and browsers (in comparison to earlier systems), but in turn presented the chronic problem of broken links.
*Unlike certain applications, such as [[HyperCard]], the World Wide Web was non-proprietary, making it possible to develop servers and clients independently and to add extensions without licensing restrictions.


On [[April 30]], [[1993]], [[CERN]] [http://intranet.cern.ch/Chronological/Announcements/CERNAnnouncements/2003/04-30TenYearsWWW/Welcome.html announced] that the World Wide Web would be free to anyone, with no fees due. Coming two months after the announcement that [[Gopher protocol#Decline|gopher]] was no longer free to use, this produced a rapid shift away from gopher and towards the Web.
Although some websites are available in multiple languages, many are in the local language only. Also, not all software supports all special characters, and [[Bi-directional text|RTL]] languages. These factors would challenge the notion that the World Wide Web will bring a unity to the world.


An early popular web browser was [[ViolaWWW]] which was based upon [[HyperCard]]. The World Wide Web, however, only gained critical mass with the 1993 release of the graphical [[Mosaic web browser]] by the [[National Center for Supercomputing Applications]] (NCSA) developed by [[Marc Andreessen]]. Prior to the release of Mosaic, graphics were not commonly mixed with text in Web pages and its popularity was less than older protocols in use over the Internet, such as [[Gopher protocol]] and [[Wide area information server]]. Mosaic's graphical user interface allowed the Web to become by far the most popular Internet protocol.
===Sociological implications===
 
The World Wide Web has allowed global interpersonal exchange on a scale unprecedented in human history.  People separated by vast distances, or even large amounts of time, can use the Web to exchange &mdash; or even mutually develop &mdash; their most intimate and extensive thoughts, or alternately their most casual attitudes and spirits.  Emotional experiences, political ideas, cultural customs, musical idioms,  business advice, artwork, photographs, literature, can all be shared and disseminated digitally with less individual investment than ever before in human history.  Although the existence and use of the Web relies upon material technology, which comes with its own disadvantages, its information does not use physical resources in the way that [[library|libraries]] or the [[printing press]] have.  Therefore, propagation of information via the Web (via the Internet, in turn) is not constrained by movement of physical volumes, or by manual or material copying of information.  And by virtue of being [[digital]], the information of the Web can be searched more easily and efficiently than any library or physical volume, and vastly more quickly than a person could retrieve information about the world by way of physical travel or by way of [[mail]], [[telephone]], [[telegraphy|telegraph]], or any other communicative medium.
==Web standards==
At its core, the Web is made up of three standards:
*the ''Uniform Resource Identifier'' ([[Uniform Resource Identifier|URI]]), which is a universal system for referencing resources on the Web, such as Web pages;
*the ''HyperText Transfer Protocol'' ([[HTTP]]), which specifies how the browser and server communicate with each other; and
*the ''HyperText Markup Language'' ([[HTML]]), used to define the structure and content of [[hypertext]] documents.
 
Berners-Lee now heads the [[World Wide Web Consortium]] (W3C), which develops and maintains these and other standards that enable computers on the Web to effectively store and communicate different forms of information.
 
==Java and JavaScript==
Another significant advance in the technology was [[Sun Microsystems|Sun Microsystems']] [[Java Platform|Java platform]]. It initially enabled Web servers to embed small programs (called [[applet]]s) directly into the information being served, and these applets would run on the end-user's computer, allowing faster and richer user interaction. Eventually, it came to be more widely used as a tool for generating complex [[server-side]] content as it is requested. Java never gained as much acceptance as Sun had hoped as a platform for client-side applets for a variety of reasons, including lack of integration with other content (applets were confined to small boxes within the rendered page) and poor performance (particularly start up delays) of Java [[virtual machine|VM]]s on PC hardware of that time.
 
[[JavaScript]], however, is a [[Scripting programming language|scripting language]] that was developed for Web pages. The standardised version is [[ECMAScript]]. While its name is similar to Java, it was developed by [[Netscape Communications Corporation|Netscape]] and not Sun Microsystems, and it has almost nothing to do with Java, with the only exception being that like Java its syntax is derived from the [[C programming language]]. Like Java, JavaScript is also [[Object-oriented programming|object oriented]], but like [[C++]] and unlike Java, it allows mixed code — both object-oriented as well as [[procedural programming|procedural]]. In conjunction with the [[Document Object Model]], JavaScript has become a much more powerful language than its creators originally envisioned. Sometimes its usage is expressed under the term [[Dynamic HTML]] (DHTML), to emphasise a shift away from ''static'' HTML pages.
 
[[Ajax (programming)|Ajax]] (Asynchronous JavaScript And XML) is a JavaScript-based technology that may have a significant effect on the development of the World Wide Web. By providing a method where only part of a page need be updated when required, rather than the whole, Ajax allows such updates to be much faster and more efficient. Ajax is seen as an important aspect of [[Web 2.0]]. Examples of Ajax techniques currently in use can be seen in [[Gmail]], [[Google Maps]] etc.
 
==Sociological implications==
The Web, as it stands today, has allowed global interpersonal exchange on a scale unprecedented in human history.  People separated by vast distances, or even large amounts of time, can use the Web to exchange &mdash; or even mutually develop &mdash; their most intimate and extensive thoughts, or alternately their most casual attitudes and spirits.  Emotional experiences, political ideas, cultural customs, musical idioms,  business advice, artwork, photographs, literature, can all be shared and disseminated digitally with less individual investment than ever before in human history.  Although the existence and use of the Web relies upon material technology, which comes with its own disadvantages, its information does not use physical resources in the way that [[library|libraries]] or the [[printing press]] have.  Therefore, propagation of information via the Web (via the Internet, in turn) is not constrained by movement of physical volumes, or by manual or material copying of information.  And by virtue of being [[digital]], the information of the Web can be searched more easily and efficiently than any library or physical volume, and vastly more quickly than a person could retrieve information about the world by way of physical travel or by way of [[mail]], [[telephone]], [[telegraphy|telegraph]], or any other communicative medium.


The Web is the most far-reaching and extensive medium of personal exchange to appear on [[Earth]].  It has probably allowed many of its users to interact with many more groups of people, dispersed around the planet in time and space, than is possible when limited by physical contact or even when limited by every other existing medium of communication combined.
The Web is the most far-reaching and extensive medium of personal exchange to appear on [[Earth]].  It has probably allowed many of its users to interact with many more groups of people, dispersed around the planet in time and space, than is possible when limited by physical contact or even when limited by every other existing medium of communication combined.
Line 97: Line 40:
Because the Web is global in scale, some have suggested that it will nurture mutual understanding on a global scale.  By definition or by necessity, the Web has such a massive potential for social exchange, it has the potential to nurture empathy and symbiosis, but it also has the potential to incite belligerence on a global scale, or even to empower [[demagogy|demagogues]] and repressive regimes in ways that were historically impossible to achieve.
Because the Web is global in scale, some have suggested that it will nurture mutual understanding on a global scale.  By definition or by necessity, the Web has such a massive potential for social exchange, it has the potential to nurture empathy and symbiosis, but it also has the potential to incite belligerence on a global scale, or even to empower [[demagogy|demagogues]] and repressive regimes in ways that were historically impossible to achieve.


==Content==
==Web 2.0==
The Web is available to individuals outside [[mass media]]. In order to "publish" a web page, one does not have to go through a [[publisher]] or other media institution, and potential readers could be found in all corners of the globeThe increased opportunity to publish materials is certainly observable in the countless personal pages, as well as pages by families, small shops, etc., facilitated by the emergence of free [[web hosting]] servicesIt's free to post some smaller webpages, and even larger sites are inexpensive in comparison to traditional media.
Since around 2000, all major web browsers have moved quickly to support additional standards, including [[CSS]], [[XHTML]] (and its Document Object Model, or DOM), and [[JavaScript]]Use of these standards in devising web pages has led to a better usability experience for users, in which parts of web pages are updated by background programs that talk to the web server without the user having to browse further to an additional web pageThese new technologies are often called [[Ajax]] as a comglomerate.
 
Unlike [[book]]s and documents, hypertext does not have a linear order from beginning to end. It is not broken down into the hierarchy of chapters, sections, subsections, etc.  This allows readers to easily find more on a topic, move to other related topics, or skip sections they're uninterested in.


Many different kinds of information are now available on the Web, and for those who wish to know other societies, their cultures and peoples, it has become easier. When travelling in a foreign country or a remote town, one might be able to find some information about the place on the Web, especially if the place is in one of the developed countries. Local newspapers, government publications, and other materials are easier to access, and therefore the variety of information obtainable with the same effort may be said to have increased, for the users of the Internet.  
Further advances in website design include user-generated content, social networking aspects such as user ratings or page customization, so-called "mashups" which allow data to be visualized more easily, and new ways of generating revenue such as use of a small-face, discreet, text-only advertizing column a la Google. Such trends are frequently referred to under the all-encompassing term "Web 2.0", meaning the second version of how web pages can be created for a richer user experience. For developers, creating "Web 2.0" pages is more complex, and the technology is still costly enough in terms of human effort that many websites cannot yet afford the implementation overhead.  But tools support for Web 2.0 design approaches continues to evolve rapidly as of 2008, and this has resulted in higher expectations for web page friendliness among users.


Although some websites are available in multiple languages, many are in the local language only. Also, not all software supports all special characters, and [[Bi-directional text|RTL]] languages. These factors would challenge the notion that the World Wide Web will bring a unity to the world.
Websites which heavily employ so-called "Web 2.0" technology are also sometimes referred to as Rich Internet Applications, or simply RIA's.


==Statistics==
==References==
According to a 2001 study [http://www.brightplanet.com/technology/deepweb.asp], there were more than 550 billion documents on the Web, mostly in the "[[Deep web|invisible Web]]".  A 2002 survey of 2,024 million web pages [http://www.netz-tipp.de/languages.html] determined that by far the most Web content was in English: 56.4%; next were pages in German (7.7%), French (5.6%) and Japanese (4.9%). A more recent study [http://www.cs.uiowa.edu/~asignori/web-size/] which used web searches in 75 different languages to sample the Web determined that there were over 11.5 billion web pages in the [[Surface web|publicly-indexable Web]] as of January 2005.
 
==Speed issues==
Frustration over [[congestion]] issues in the [[Internet]] infrastructure and the high [[Latency (engineering)|latency]] that results in slow browsing has led to an alternative name for the World Wide Web: the ''World Wide Wait''. Speeding up the Internet is an ongoing discussion over the use of [[peering]] and [[Quality of service|QoS]] technologies. Other suggested solutions to reduce the World Wide Wait can be found from [http://www.w3.org/Protocols/NL-PerfNote.html the W3C].
 
Standard [[guideline]]s for ideal web response times are (Nielsen 1999, page 42):
* 0.1 second (one tenth of a second). Ideal response time. The user doesn't sense any interruption.
* 1 second. Highest acceptable response time. Download times above 1 second interrupt the user experience.
* 10 seconds. Unacceptable response time. The user experience is interrupted and the user is likely to leave the site or system.
These numbers are useful for planning server capacity.
 
==Link rot==
[[Link rot]] is when web links become [[dead link|broken]] due to resources moving or ceasing to exist.  The ephemeral nature of the Web has prompted many efforts to archive the Web.  The [[Internet Archive]] is one of the most well-known efforts; they have been archiving the Web since 1996.
 
==Academic conferences==
The major academic event covering the WWW is the World Wide Web series of conferences, promoted by [http://www.iw3c2.org IW3C2]. There is a [http://www.iw3c2.org/conferences/ list] with links to all conferences in the series.
 
==Standards==
The following is a cursory list of the documents that define the World Wide Web's three core standards:
 
*'''Uniform Resource Locators (URL)'''
**RFC 1738, Uniform Resource Locators (URL) (December 1994)
**RFC 3986, Uniform Resource Identifier (URI): Generic Syntax (January 2005)
*'''HyperText Transfer Protocol (HTTP)'''
**RFC 1945, HTTP/1.0 specification (May 1996)
**RFC 2616, HTTP/1.1 specification (June 1999)
**RFC 2617, HTTP Authentication
**[http://purl.org/NET/http-errata HTTP/1.1 specification errata]
*'''Hypertext Markup Language (HTML)'''
**[http://www.w3.org/MarkUp/draft-ietf-iiir-html-01.txt Internet Draft, HTML version 1]
**RFC 1866, HTML version 2.0
**[http://www.w3.org/TR/REC-html32 HTML 3.2 Reference Specification]
**[http://www.w3.org/TR/html4/ HTML 4.01 Specification]
**[http://www.w3.org/TR/html/ Extensible HTML (XHTML) Specification]
 
==See also==
*[[Deep web]]
*[[First image on the Web]]
*[[Search engine]]
*[[Streaming media]]
*[[Web directory]]
 
==Notes==
<references/>
<references/>
==References==
<div class="references-small">
*{{cite paper | author=Berners-Lee, Tim; Bray, Tim; Connolly, Dan; Cotton, Paul; Fielding, Roy; Jeckle, Mario; Lilley, Chris; Mendelsohn, Noah; Orchard, David; Walsh, Norman; Williams, Stuart | title=Architecture of the World Wide Web, Volume One | publisher=W3C | date=December 15, 2004 | version=Version 20041215 | url=http://www.w3.org/TR/webarch/}}
*{{cite book | last=Halsall | first = Fred | title=Computer Networking and the Internet | edition = fifth edition | year = 2005 | origyear = 1985 | isbn = 0-321-26358-8 | publisher = Pearson Education }}
*{{cite paper | author=Fielding, R.; Gettys, J.; Mogul, J.; Frystyk, H.; Masinter, L.; Leach, P.; Berners-Lee, T. | title=Hypertext Transfer Protocol — HTTP/1.1 | publisher=Information Sciences Institute | date=June 1999 | version=Request For Comments 2616 | url=ftp://ftp.isi.edu/in-notes/rfc2616.txt}}
*{{cite web | author=Polo, Luciano  | title=World Wide Web Technology Architecture: A Conceptual Analysis | publisher= | year=2003 | work=New Devices | url=http://newdevices.com/publicaciones/www/ | accessdate=July 31 | accessyear=2005 }}
</div>
==External links==
*[http://dmoz.org/Computers/Internet/Web_Design_and_Development/ Open Directory — Computers: Internet: Web Design and Development]
*[http://vlib.iue.it/history/internet/ WWW-Virtual Library: History of the Internet & W3]
*[http://www.w3.org/History/19921103-hypertext/hypertext/WWW/ Early archive of the first web site]
*[http://www.mit.edu/people/mkgray/net/ Internet Statistics: Growth and Usage of the Web and the Internet]
*[http://hixie.ch/commentary/web/history The History of the Web]
*[http://www.webology.ir/ Webology]
*[http://www.vlib.us/websitetools.html The World Wide Web Virtual Library: Web Site Tools] from the [[World Wide Web Virtual Library]]
*[http://www.livinginternet.com/w/w.htm A comprehensive history with people, concepts and many interesting quotations]
*[http://www.articlehorizon.com Article Directory]Some articles about World  Wide Web
{{step|[[ENQUIRE]]|}}
[[Category:World Wide Web|World Wide Web]]
[[Category:English inventions]]
[[Category:Internet history]]
[[Category:Computers Workgroup]]
[[Category:CZ_Live]]

Latest revision as of 08:48, 22 April 2024

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
Video [?]
 
This editable Main Article is under development and subject to a disclaimer.
"The Web" redirects here.
WWW's historical logo designed by Robert Cailliau

The World Wide Web ("WWW" or simply "the Web") is a global collection of information presented in the form of documents hosted on networked computers and available to the public. The information includes text files, images, videos, sound files and many other types of information. Born in the early 1990's, it quickly became a way to share electronic documents publicly and freely. The documents containing information are delivered to individuals over a global computer network called the Internet and using a communications protocol called HTTP. After connecting to the Internet, people can browse the Web by running a web browser on their local computer and using the web browser to surf (move around at will) through the vast array of available web documents. Connecting to the Internet also enables people to use many other services which are not necessarily considered to be part of the World Wide Web, such as email, instant messaging or digital telephony.

Web technical specifications

The World Wide Web is implemented by software which adheres at least to the following three standards:

  • the Uniform Resource Identifier (URI)[1], which is a universal system for referencing resources (i.e., pages or files) on the Web
  • the HyperText Transfer Protocol (HTTP)[2], which specifies how the browser and server communicate with each other
  • the HyperText Markup Language (HTML)[3], used to define the structure and content of hypertext documents.

Origins

By 1990, an entire economy had already grown around the demand for Internet access, but much functionality on the Internet was not available to non-technical users. The primary author, and the person who first implemented (or organized the implementation of) the above RFC's was Tim Berners-Lee of CERN, although several other people were also marginally involved. The Internet had existed in some form for perhaps twenty years before this--and it had grown--but the means of sharing information over the Internet were neither available to large numbers of people, nor friendly enough to accommodate non-technical users. Berners-Lee's proposals allowed information to be shared easily to a non-technical audience.

Berners-Lee devised the first web browser to fetch and display documents containing hyperlinks, which when clicked brought down additional documents over the Internet. His version of hypertext, which was an area of active research at many institutions during the 1980's, was combined with an elegant and simple protocol (HTTP) for using the existing Internet as transport, and the ideas took off very quickly. The availability of HTML documents and free web browsers accelerated the growth of the Internet even more.

Equally important with his technical expertise was Berners-Lee's vision that the Internet should remain a free service available to anyone. The Web made its debut as a publicly available service on August 6, 1991.[4] On April 30, 1993, CERN announced that the World Wide Web would be free to anyone.[5] This came two months after the announcement that Gopher, the older distributed document protocol, was no longer free to use.[6] In the intervening years, the World Wide Web has managed to survived in a low-cost form available to large numbers of people, but not without considerable social struggle to keep it so. Multiple attempts have been made (and likely will be made again) to tax commerce performed through the Web, or to censor its content (in fact, some countries do attempt to censor information flow from the Web, with varied degrees of success).

The World Wide Web began an astronomical growth in popularity after the 1993 release of the graphical Mosaic web browser by the National Center for Supercomputing Applications (NCSA) developed by Marc Andreessen. Prior to the release of Mosaic, graphics were not commonly mixed with text in web pages and its popularity was less than older protocols in use over the Internet, such as Gopher protocol and Wide Area Information Server. Mosaic's graphical user interface allowed the Web to become by far the most popular Internet application.

A disruptive technology

The fact that individuals outside mass media can easily publish on the Web has made the World Wide Web a disruptive[7] influence on journalism. In order to "publish" a web page, one does not have to go through a publisher or other media institution, and potential readers could be found in all corners of the globe. The increased opportunity to publish materials is certainly observable in the countless personal pages, as well as pages by families, small shops, etc., facilitated by the emergence of free web hosting services. It's free to post some smaller web pages, and even larger sites are inexpensive in comparison to traditional media.

Unlike books and documents, hypertext does not have a linear order from beginning to end. It is not broken down into the hierarchy of chapters, sections, subsections, etc. This allows readers to easily find more on a topic, move to other related topics, or skip sections they're uninterested in.

Many different kinds of information are now available on the Web, and for those who wish to know other societies, their cultures and peoples, it has become easier. When traveling in a foreign country or a remote town, one might be able to find some information about the place on the Web, especially if the place is in one of the developed countries. Local newspapers, government publications, and other materials are easier to access, and therefore the variety of information obtainable with the same effort may be said to have increased, for the users of the Internet.

Although some websites are available in multiple languages, many are in the local language only. Also, not all software supports all special characters, and RTL languages. These factors would challenge the notion that the World Wide Web will bring a unity to the world.

Sociological implications

The World Wide Web has allowed global interpersonal exchange on a scale unprecedented in human history. People separated by vast distances, or even large amounts of time, can use the Web to exchange — or even mutually develop — their most intimate and extensive thoughts, or alternately their most casual attitudes and spirits. Emotional experiences, political ideas, cultural customs, musical idioms, business advice, artwork, photographs, literature, can all be shared and disseminated digitally with less individual investment than ever before in human history. Although the existence and use of the Web relies upon material technology, which comes with its own disadvantages, its information does not use physical resources in the way that libraries or the printing press have. Therefore, propagation of information via the Web (via the Internet, in turn) is not constrained by movement of physical volumes, or by manual or material copying of information. And by virtue of being digital, the information of the Web can be searched more easily and efficiently than any library or physical volume, and vastly more quickly than a person could retrieve information about the world by way of physical travel or by way of mail, telephone, telegraph, or any other communicative medium.

The Web is the most far-reaching and extensive medium of personal exchange to appear on Earth. It has probably allowed many of its users to interact with many more groups of people, dispersed around the planet in time and space, than is possible when limited by physical contact or even when limited by every other existing medium of communication combined.

Because the Web is global in scale, some have suggested that it will nurture mutual understanding on a global scale. By definition or by necessity, the Web has such a massive potential for social exchange, it has the potential to nurture empathy and symbiosis, but it also has the potential to incite belligerence on a global scale, or even to empower demagogues and repressive regimes in ways that were historically impossible to achieve.

Web 2.0

Since around 2000, all major web browsers have moved quickly to support additional standards, including CSS, XHTML (and its Document Object Model, or DOM), and JavaScript. Use of these standards in devising web pages has led to a better usability experience for users, in which parts of web pages are updated by background programs that talk to the web server without the user having to browse further to an additional web page. These new technologies are often called Ajax as a comglomerate.

Further advances in website design include user-generated content, social networking aspects such as user ratings or page customization, so-called "mashups" which allow data to be visualized more easily, and new ways of generating revenue such as use of a small-face, discreet, text-only advertizing column a la Google. Such trends are frequently referred to under the all-encompassing term "Web 2.0", meaning the second version of how web pages can be created for a richer user experience. For developers, creating "Web 2.0" pages is more complex, and the technology is still costly enough in terms of human effort that many websites cannot yet afford the implementation overhead. But tools support for Web 2.0 design approaches continues to evolve rapidly as of 2008, and this has resulted in higher expectations for web page friendliness among users.

Websites which heavily employ so-called "Web 2.0" technology are also sometimes referred to as Rich Internet Applications, or simply RIA's.

References

  1. Request for Comments: 3986, Uniform Resource Identifier (URI): Generic Syntax. IETF Network Working Group (January 2005). Retrieved on 2007-04-02.
  2. Request for Comments: 1945, Hypertext Transfer Protocol -- HTTP/1.0. IETF Network Working Group (May 1996). Retrieved on 2007-04-02.
  3. HTML 3.2 Reference Specification. World Wide Web Consortium (W3C) (14-Jan-1997). Retrieved on 2007-04-02.
  4. Template:Cite newsgroup
    A first release of the code followed two weeks later.
  5. CERN (1993-04-30). Statement concerning CERN W3 software release into public domain. Press release. Retrieved on 2007-01-19.
  6. Template:Cite newsgroup
  7. A disruptive technology is one which shakes up the status quo by invoking rapid, sometimes painful, change.