modified 05.10.96
Searching and Wandering the Web
"How do I find X?"
Turn to friendly spiders, robots, wanderers, and other web creepy crawlers...
The Internet is huge, vast, and some might say, in horrific disarry.
How do you find things in all of this? There is no one single index or tool to find everything and anything. However, there are some resources out there to make it a little easier. We've tried to organize the various search engines into similar categories for the ways in which they search and the scope of what they search.
If you do not find a search type here, try the ever-growing list
maintained at Yahoo
or WebCrawler's List of Robots
Or, if you like to be shown the ropes, take a walk with our webhound...
No matter what, you will end up having to do some exploring....
One Page-One Click Meta-Searches
A "Meta-Search" will take your query and search for it by sending your request
to other search engines, all in parallel. The "one-click"
approach means doing all of this work from a single interface.
- Savvy Search
- has a single field for you to enter keywords and sends your request to
more than 18 other major web search sites. SavvySearch has a built in mechanism
to rank your search and send it to the most appropriate sites.
- MetaCrawler
- is a "Multi-Threaded Web Search Service" designed to take your keyword and submit it
to several other search angines, returning a ranked list of sites
in one form. MetaCrawler can also authenticate the returned links, which may take longer
than other search engines, but offers more reliable results. The MetaCrawler is free and differs from other
services in that it doesn't maintain any internal database. Rather, it relies on the databases of eight different services: Open Text,
Lycos, WebCrawler, InfoSeek, Excite, Inktomi, Yahoo, and Galaxy.
- IBM InfoMarket
- can conduct parallel keyword searches by sending the request to Disclose SEC database, McKinley Internet Direct, OpenText Internet Search, USENET Newsgroups, and Yahoo. You have to register for the service to use it, but it is free.
- W3 Catalog
- The W3 catalog is a huge searchable collection of W3 resources. It is created
daily by concatenating a list of resource databases collected by other net providers
including the NCSA What's New Archives
One Page-Several Click Meta-Searches
These are pages that have have set up interfaces to search many other sites. Typically you
will have to enter your keywords in several places and submit them individually. However, this
is much easier than connecting to each search site seperately.
- c|net search.com
- Boasting more than 250 search engines, c|net has neatly organized an interface to the many other search sites on the web.
- Find-It
- offers an extensive array of search sites, nicely organized into tables.
- All-In-One Internet Search
- from its description, "This page is a compilation of various forms-based search tools found on
the Internet. They have been combined here to form a consistent
interface and convenient ALL-IN-ONE search point." The searches are categorized (i.e. Wordl Wide
Web, Software, Publications/Literature, News) and is very comprehensive.
- The Inquirer
- provides a one page access to many of the major web search sites, grouped into
the categories of Current Events, Software Files, Reference Materials,
and People and Computers
- The Internet Sleuth
- provides a subject-oriented directory of hundreds of other searchable indexes and databases.
- CUSI (Configurable Unified Search Engine)
- is a "one-page-does-all"-- a configurable search interface for many searcheable WWW
resources. It allows you to quickly check related resources, without having to navigate an re-type the
keywords.
Robots, Spiders, and Crawlers
All of these creatures are built up by sending out "agents" on the world wide web that
follow every link possible, returning their results to a large, searchable database. For
more information, take a look at Robots, Wanderers, and Spiders
- Alta Vista: The Largest Web Index
- The competition is on as far as what "largest" means, but this server is very efficient and flexible.
It offers access to some 8 billion words found in over 16 million Web pages plus
a full-text index of over 13,000 news groups updated in real-time.
- Inktomi
- also claims to be the "biggest" web search tool. Inktomi uses parallel computing technology to build a scalable web server using
workstations; it is part of the Network of Workstations
(NOW) project at the University of California at Berkeley.
- Excite
- Another claim as the biggest! "The Intelligent way to naviagte the net" includes NetSearch, an exhaustive net wide search of over 1.5 million web pages; NetReviews a searchable and category index of sites that have been reviewed; and Bulletin -- constantly updated news articles and editorials.
- Open Text Index
- is a fast, powerful search engine to a massive index of the Web. The results include
not only links to sites that match the search criteria, but also a short description,
keywords taken from the headers of documents, and an ability to conduct a search
of similar sites. OpenText also offers a "Power Search" page where you can
construct more complex queries.
- Lycos
- created at Carnegie Mellon University, is the catalog of the Internet.
The Lycos web explorer searches the World Wide Web every day
(including Gopher and FTP space), building a database of all the web pages it finds.
The index is updated weekly. The search engine provides retrieval from this catalog,
taking a user's query and returning a sorted list of hits, sorted by match score.
- WebCrawler
- is one of the most popular,
fastest and easy-to-use Internet search tool available in the market today.
- InfoSeek
- "the most powerful and popular way to
search the Web" offers full searches for commercial cleints, but anyone can use its search engine
for a free 10 item search. The results are ranked and include a short description
- World Wide Web Worm
- allows you to locate almost any WWW hypertext or URL. WWWW provides four types of search
databases: citation hypertext, citation addresses (URL), HTML titles and HTML addresses.
- ALIWEB
- searches the web and returns a ranked list with descriptions.
- RBSE Spider
- This index is a collection of url references built up and indexed with a hacked version of WAIS. The index is constructed
by a spider that walks the web, building a graph in an Oracle database, and WAIS indexing the full text of the document.
Open Submission Sites- Yellow Pages
These sites allow public announcing of their web sites and then in turn
, you can search their collections.
- New Rider's Yellow Pages
- can be searched by keyword or subject category
- Virtual Yellow Pages
- is a comprehensive and easy to use directory of Web sites and
information. The VYP® uses a revolutionary patented search engine that gives you the
power of natural language concept searching as well as keyword search.
- What's New Too
- one of the most up-to-date and fastest growing resources on the
net, What's New Too! posts an average of over 300 announcements
daily, all within 36 hours of submission. Search the collection based uopn
desired interest area, length of description, and date of
announcement
Search Big, Major Sites
Many of the large volume/high traffic web sites are worth searching, too.
- Yahoo
- Yahoo is a hierarchical subject-oriented catalogue for the World Wide Web and Internet.
- TradeWave Galaxy
- searches the extensive collections of the TradeWave Galaxy
(formerly known as EINet). It references a large number of Web documents from around the world, including the home
pages of most of the world's Web servers.
W3 InfoPage
Maricopa Center for Learning and Instruction (MCLI)
Maricopa County Community College District
The Internet Connection at MCLI is
Alan Levine --}
Comments to levine@maricopa.edu
URL: http://www.mcli.dist.maricopa.edu/w3info/