|
How to Find Information on the Web
Content
-
Introduction
-
By Subject Classification (or Directories)
-
By Keyword/Key Phrase
-
Searching Hints
-
Lists of Search Engines
-
Search Engines that Accept English
-
What Can Be Searched
-
Five Examples
-
Search Engines that Accept Chinese
-
Other Search Engines that Accept Chinese
-
Meta Search Engines: One Search By Numerous Search Engines At One Time
-
By Natural English Language
-
Subject-Specialized Search Engines
-
References
Free Internet resources are scattered, diffused and dispersed, but sometimes may be very useful. These information and resources can be found by search engines on the World Wide Web.
This document is not a study on or comparison of various search engines on the World Wide Web. Rather, it highlights how relevant information and resources can be found by using them. Several search engines are used as examples to illustrates the searches.
While different search engines have different search capabilities, each search engine has its search guide available on its website. The search guide can be accessed by clicking on the hyperlink "Help", "Search Tips", "Tips", etc. Those who want to know more may go to the search guide.
By Subject Classification (or Directories)
Search engines listed here group Internet resources into broad subject categories. Desired information could be found by following the appropriate category.
-
Excite
Internet resources are grouped into over ten broad subject categories, each of which has sub-categories.
-
WWW Virtual Library
List out more than ten broad subject categories, each of which has sub-categories. Sub-categories point to webpages which in turn collect webpages with resources of the topics concerned.
-
Yahoo
Internet resources are grouped into more than ten broad subject categories, each of which has sub-categories.
-
中文雅虎
Provide similar subject categories to that of Yahoo. Resources collected are primarily on China, Hong Kong, and Taiwan, though some are on United States. Resources are basically in Chinese.
By Keyword/Key Phrase
Two Search Approaches
Input as many relevant keywords as possible, view only the first few pages of the result list.
The reasoning behind is search engines will return webpages that match all or some or one of the keywords inputted. The webpages with the most keywords would be listed first. The more keywords inputted, the more refined the search results that appear top of the result list would be. Those appear lower or at the bottom of the result list would be irrelevant, and may be ignored.
Get a small, highly relevant result set by
-
using phrase searching
-
specifying word/phrase that must be present or must be absent
-
using searching tag
Lists of Search Engines
-
Search Engines by Countries and Regions
-
Search Engines Worldwide
-
Search Engines in Alphabetical Order
While search engines may cover Internet resources of any kind, some of the search engines in these two lists only cover resources of particular topics.
-
From AltaVista
-
From Yahoo
-
From Search Engine Watch
Search Engines that Accept English
-
What Can Be Searched
-
title of the webpage
-
text in the webpage
-
the url address of the webpage
-
host url of the webpage
-
the text in the hyperlink
-
the url address behind the hyperlink
-
domain of the webpage (domain is the last part of the host url of the webpage)
-
the name of the java applet
-
the name of the image
-
Five Examples
Search engines that accept English may not support all of the above searches. Some support more, some support fewer. Of the five examples here, only AltaVista exhibits all the above searching capabilities.
-
AltaVista
-
Indexes the full text in all files in its database
-
Truncation * allows searcher to stem search terms, e.g. quot*, colo*r
-
In simple search, + immediately before a word or phrase requires it to be present, - used in simple search, immediately before a word or phrase excludes documents with its presence
-
Search phrases by quotation marks, e.g. "handover ceremony"
-
Search domain, e.g. domain:de, domain:org
-
Search host, e.g. host:altavista.digital.com
-
Search title, e.g. title:"handover ceremony"
-
Search url, e.g. url:nobel to find all pages on all servers that have the
world nobel in the host name, path or filename
-
Search applet, e.g. applet:comet to find pages using applets called comet.
-
Search image, e.g. image:"jackie chan" to find pages with the filename of the image being
"jackie chan".
-
Search pages that contain the specified text in any part of the page other
than an image tag, link, or URL, e.g. text:comet
-
Search in the text of hyperlinks, e.g. anchor:"handover ceremony"
-
Search pages with a link to a page with the specified url text, e.g. link:www.cuhk.edu.hk
to find all pages linking to CUHK homepage
Find websites or webpages similar to the specified url, e.g. like:www.patents.com will retrieve websites relating to patents
-
In advance search, four operators: AND, OR, NOT, NEAR (find documents containing
both specified words or phrases within 10 words of each other)
- Northern Light
-
Indexes the full text of all files in the database -
Search phrases by quotation marks, e.g. "nobel prize" -
Search pages that contain the specified text within the text of the document or website e.g. text:comet -
Search title, e.g. title:"handover ceremony" to find pages of which the title has the phrase "handover ceremony" -
Search url, e.g. url:nobel to find all pages on all servers that have the word "nobel" in the host name, path or filename -
Truncation * allows searcher to replace multiple characters e.g. chemi* will find pages containing "chemical", "chemistry", etc.; or "psych*ist" will find pages containing "psychologist", "psychiatrist", etc. -
Truncation % allows searcher to replace single character e.g. gene%logy will find pages containing "genealogy" and "geneology" -
+ immediately before a word or phrase requires it to be present, - immediately before a word or phrase excludes documents with its presence -
supports full Boolean capability (AND, OR, NOT), including parenthetical expressions
-
Google
- Returns web pages that contain all the keywords entered but ignores common words and characters
- + to specify a keyword that must be present, - to exclude a keyword
- Put quotation marks around a phrase to do phrase searching
- No truncation search
- Supports OR operator
- site: to restrict the search to a particular website, e.g. "business administration" site:www.cuhk.edu.hk
- link: to find webpages with hyperlinks pointing to the specified URL, but a link: search cannot be combined with a regular keyword search, e.g. link:hkinchip.lib.cuhk.edu.hk
-
Excite
-
Indexes the full text of all files in the database
-
Search phrases by quotation marks, e.g. "nobel prize"
-
+ immediately before a word or phrase requires it to be present, -
immediately before a word or phrase excludes documents with its presence
-
Search title in Power Search
-
Three operators must be capitalized: AND, OR, AND NOT, e.g. "nobel prize"
AND NOT peace
-
Those between parentheses are performed first
-
Yahoo
-
Automatic right truncation
-
Search phrases by quotation marks, e.g. "handover ceremony"
-
Search in the document title, e.g. t:"handover ceremony"
-
Search in the document url, e.g. u:cuhk.edu.hk
-
+ immediately before a word or phrase requires it to be present, -
immediately before a word or phrase excludes documents with its presence
-
GAIS (Global Area Information Servers)
- Accepts traditional Chinese character (Big5) only
- & to stand for AND, | stand for OR, ! stand for NOT
e.g. 香港 & 公務員 & ( 薪酬 | 薪金)
- + to specify the phrase that must be present, -" to specify the phrase that must not be present
e.g. +茶道 -日本
Other Search Engines that Accept Chinese
-
中文雅虎
Accepts traditional (Big5) and simplified (GB) Chinese character
-
搜孤
Accepts simplified (GB) Chinese character
-
AltaVista in Chinese
Can select to accept either traditional (Big5) or simplified (GB) Chinese character -
TOM 站
Can select Hong Kong, Beijing or Shanghai, and Guangzhou stations that accept traditional (Big5) or simplified (GB) Chinese character
- phrase searching, but a long phrase might be broken down to short phrases
- A space to stand for "AND", e.g. 程式設計 薪金 上海 to find webpages talking about the salary of programmer in Shanghai
- | to stand for OR, - to stand for NOT
- . to limit the search to a specific website, e.g. 網上出版 .www.tsinghua.edu.cn
- link: to find webpages that have hyperlinks point to the specified website, e,g. link:www.lib.cuhk.edu.hk
Meta search engines send the keyword/key phrase input to a number of search engines, and return a compilation of results from each of them to the user.
Meta Search Engines
Some search engines accept natural English language. Users do not need to know any searching techniques, but simply enter the kind of English spoken or written in daily life when searching Internet resources via this kind of search engines. However, the search may be less accurate.
-
WebCrawler
e.g. Who are the nobel prize laureates in 2000 ?
-
Ask Jeeves
e.g. How the toilet flushes
All Academic : The Guide to Free Academic Resources On-Line All Academic is an all-discipline academic index to free publications on the Internet. It hosts or provides links to academic e-journals, working papers, convention proceedings, as well as other scholarly works. It supports searching by subject, author, journal or article title; and allows browsing by journal titles and convention proceedings. Edinburgh Engineering Virtual Library EEVL, or Edinburgh Engineering Virtual Library, searches the full text of over one hundred engineering e-journals, which are listed in the EEVL catalogue of engineering resources. In order to be selected, e-journals must be free, full text (or offer most of their content as full text) and available without registration. Arts & Humanities Business Education Government & Politics Health & Medicine Legal Music Science & Technology - Scirus : An index to science, medicine, technology, as well as economics, business and management information
Software & Computer
-
Searching
the Internet: Recommended Sites and Search Techniques
-
Understanding and Comparing Web Search Tools
-
Search Engine Watch -
Search Engine Showdown
Copyright: Reference/YCL/27052002
|