| Computer Education | |||||||
|
|
|
How Internet addresses (URLs) are constructed and regulated
and the relation to numerical Internet protocol addresses (IPs) is discussed
.
Internet Protocol (IP) AddressesHuman memory being what it is, some form of mnemonic aid is necessary for us but computers need no such help and use numbers called IP addresses. Ultimately, computers work with 32-bit binary representations but there are several equivalent formats for IP (Internet protocol) addresses. The one generally used is the so-called "dotted-decimal" form (also known as "dotted quad"). A dotted-decimal IP address has 4 numeric segments, each separated by a period. The numbers must range from 0 to 255 (eight bits). In this representation, Microsoft's real Web address is 207.46.230.229. (At least it was when I wrote this. Try it in your browser.) There are also other formats that you may encounter. For example, spammers may try to obfuscate an address with octal or hex formats. A good discussion of the multitude of ways to make addresses obscure is at http://www.pc-help.org/obscure.htm. If you encounter a long address with a number of % signs, you have likely run into one of these other formats. One place to go to get these types of addresses deciphered is http://www.samspade.org/. Coordinating Internet AddressesObviously, there must be some consistent system to the form that both IPs and URLs have. Also every computer on the Internet has to have a unique address assigned to it. Keeping order and coordinating addresses on the Internet was originally done under the auspices of the US Federal government through organizations like the Internet Assigned Numbers Authority (IANA) and the Inter-Networking Information Center (InterNic). The coordination has been privatized and is now administered by The Internet Corporation for Assigned Names and Numbers (ICANN). The Structure of an URLThe formats that are used for URLs come from standards proposed and agreed to in the beginnings of the Internet. Those with a historical and technical bent can read the paper where Tim Berners-Lee and others set forth the form for URLs. In schematic form, the prescription for an URL is The “user:password@“ section is rarely encountered by the average user but allows you to enter your user name and password for sites where that is required. It can also be used to obscure an URL. For example, http://myname:mypassword@www.microsoft.com works the same as http://www.microsoft.com. Any fake name and password you want can be put in. Try it. We almost always encounter the “host:port” segment without the port specification. The average user need never think about “ports” in this context and discussing exactly what they are would lead us too far afield. Suffice it to say that certain ports are assigned by default to each protocol when none is specifically designated. For example, HTTP is assigned port number 80. More information on ports is on another page. The “host” is the computer where the information that we want resides. As is the case for the example www.microsoft.com there are often three parts to the host name separated by periods or “dots”. The first part refers to a name for a specific computer. It is often called “www” for World Wide Web but other names are common. Also, the practice is growing to omit this part entirely. The second part (“microsoft”) is the local network where the computer resides, and the third (“com”) is one of the so-called “generic top-level domains.” Top-level domainsFor some time, these domains were limited to com, edu, org, net, mil, gov, and int. The com, net, and org domains are available to persons and businesses worldwide (after proper registration). Originally, they were supposed to be descriptive with com, net, and org meaning “commercial”, “network”, and “(non-profit) organization”, respectively. However, these meanings have become blurred. Anyone can apply for a domain containing org, for example. The edu (education) domain is for properly registered four-year institutions of higher learning, i.e. colleges and universities. The mil domain is reserved exclusively for the United States Military and gov is reserved exclusively for the US Government. Certain organizations established by international treaty use int (for example, the UN). In recent years ICANN has added new domains. Check at http://www.icann.org/tlds/ or at http://www.iana.net/domain-names.htm for more discussion of top-level domains. Two of the more common of the new domains are .biz and.info In addition to the generic top-level domains, there are domains assigned to a specific country (some are very small countries) to use as they may see fit. France is fr, Germany is de, and so forth. A list of country domains is at http://www.iana.net/cctld/cctld-whois.htm. (Incidentally, the little Pacific island nation of Tuvalu decided to cash in on its domain tv and sold the rights to it for $50 million.) Recently, a domain .eu for Europe was approved by ICANN. URL-pathFinally we come to the last part of our address “/url-path”.
This is the path on the host computer to the particular page or file that
we wish to download. This part will often have the name of a directory, then
some sub-directories and then perhaps an html file or one of the
server generated file-types such as asp. The naming will be similar
to paths on your own computer with the big exception that forward slashes
are used instead of back slashes. For example in the URL Name or DNS ServersWhen your browser sends off a request to your Internet service provider (ISP) asking to connect you to www.microsoft.com, the computer at AOL or Comcast or wherever has to go look up what those letters mean. The ISP will make use of a special computer (or probably computers) called a "name server" or sometimes a "DNS server." DNS stands for "Domain Naming System". These computers have databases that allow them to translate the human-friendly form into something that computers understand. These translation processes takes a certain amount of time and if the name servers used by your ISP happen to be busy, the delay may be noticeable. If the DNS computer can't find the address you send it, then you will get an error message. An error message could also mean that the computer at the site you are addressing is busy, that the ISP server is busy, or come from a variety of other causes. If you are reasonably sure you have entered a correct URL, try again. Find Your Computer's Internet Address (IP)Note that your own computer must have an IP address when it is connected to the Internet. Dial-up ISPs are assigned a block of these addresses and they temporarily provide your computer with one of them each time you sign on. Thus the address will vary each time you go on-line (but always from the ISP's assigned block). However, cable and DSL connections often have a more permanently assigned IP address. The address may stay the same for some days. If you are curious about which address you have when you are on the Internet, you can make use of the command-line accessory. ipconfig. Its use is described here. Note that, if you are on a local network, your computer will have a local IP address and the Internet IP address will refer to the network as a whole. Your browser broadcasts your IP to any Web site that you go to. (It has to. How else could you get information back?) There are many sites that will tell you what IP you have. An easy one to remember is http://whatismyip.com |
| << Home page | ©2002-2010 Victor Laurie | Home page >> |