| Suntek White Papers |
Metadata is specified with the <meta> HTML tag within the header of a web page. Therefore, this information will not be displayed on the browser but you can examine the HTML source of the page to find out the metatags defined in the web page.
There are several common metatags used in web pages. The most common ones are title, keywords, and description tags. The less common but useful tags are author and date tags. The following is an example showing the use of metatags. Notice that most web pages use the <title></title> tag in place of the title metatag.
<head>
<meta name="title"
content="Best Chinese Search Engine">
<meta name="author"
content="Suntek Computer Systems Ltd.">
<meta name="description"
content="The best Chinese search engine in the world was launched ...">
</head>
You may reference the
standard element set defined in the
Dublin Core Metadata Initiative (commonly known as the Dublin Core),
which is an open forum for the development of metadata standards
for a broad range of applications.
The Dublin Core recommends the use of the description metatag to contain the table of contents or an abstract of the web page. However, most web pages do not follow this recommendation. Perhaps it is because the metatags are not displayed by the browser anyway, there does not seem to be any point to put in an abstract which is not going to be displayed. However, this point is invalid since metatags are there to help applications, not human users, to manipulate the pages in a meaningful way. For example, you have written a number of articles and you want to display a list of the articles and a summary for each article on the list. Since there is no automatic way till now to produce a good summary for an article, you have to type it in by yourself and put it in the description metatag. Then, you can easily write a CGI script to extract the summary from each article and display them in the list. It certainly requires some typing, but the convenience to the users resulting from the summaries far outweights the costs.
No matter how great search engines are, there is no magical way (yet!) for search engines to determine the content (or semantics) of web pages and use them for judging the relevance of the web pages. Likewise, there is no easy way to produces precise summaries for web pages automatically. Many search engines just extract the first few lines from the beginning of a web page as the summary. Some search engines such as Google simply report the lines containing the search keywords. Neither of them are satisfactory methods for summarizing the content.
It is too easy to put the blame on search engines and proclaim how stupid they are. Granted that search engines make mistakes, but, as content developers, you can help the search engines, and yourselves by the way, to write your web pages in such a way that the search engines can easily capture the important content of your web pages. In fact, the whole industry, call search engine optimization, is built to help users to write their web pages in the right way and include metatags that would make their web pages rank high on search engines.
You can find out more about the importance of metatags in articles from search engine positioning companies, search engine registration companies, and companies building affiliate programs.
Another example can be found at the City University's CityUToday publication. Try a few queries and you will notice that the results contain the titles, dates, summaries of each matched article. Examining the HTML source of a matched article reveals that metatags are used to define these attributes and that all metatags are indexed and searchable.
You may observe from this life example that summaries are very useful in helping the user to determine if he or she wants to go into the page and browse it. It also makes the result page looks a lot nicer. Without the description tag, most search engine, including Suntek, would simply extract the words from the beginning of a page to serve as the summary. While better than nothing, the first few words don't always represent the summary of an article. Suntek's search engine allows the search engine administrator to specify that a metatag (in this case the description metatag) be used to produce the summary of a web page. For the HTML segment above, the following will be displayed in the result:
Best Chinese Search Engine The best Chinese search engine in the world was launched ...Of course, Suntek allows you to include in the result page other useful information, such as last modification date, URL of the page, size of the page, the category that the page belongs to, etc. With a template-driven approach, you can format the result page in any way you want. You can try out the various web sites built on Suntek search engine and note the diversified designs employed.
<meta name="date" content="CCYY-MM-DDThh:mm:ss+mm:ss">
For example, the following date corresponds to the Hong Kong/Taiwan time 01:02:30 of June 1, 2000.
<meta name="date" content="2000-06-01T01:02:30+08:00">
Why do we need to input a date metatag? If you want to search or display the date of the web page (whether it is the creation date or the last edit date), we recommend you to use the date metatag to specify it. This is because not all web servers return the last modification date of a page (not to mention that the last modification date is not necessarily the same as the date when the document was created or modified).
Last update: May, 2001. Copyright (C) 2001 SUNTEK Computer Systems Ltd. All rights reserved.