WEB SEARCHING Part One - Community informatics

What to Know:
9 Essential Things to Know
About Web Searching
Janet Eke
Graduate School of Library and Information Science
University of Illinois at Urbana-Champaign
2003
Topics
• 3 Essential Conceptual Things to
Know About the Web.
• 3 Essential Practical Things to Know
About General-purpose Search Tools.
• 3 Essential Useful Tips to Know About
Search Strategy.
2
3 Essential Conceptual
Things to Know About the
Web
What to know: Web concepts
1. Understand basic context and
structure.
Know what the World Wide Web is.
4
Key Terms Defined
• The Internet is a global network of
computers.
• The World Wide Web is a service
running on the Internet.
– it is the name given to a collection of
documents stored on computers connected to
the Internet
– these documents are written in a markup
language (usually HTML) and characterized
by ‘hypertext links’
5
Key Terms Defined
• A web browser is a piece of software.
– its purpose is to read and display web pages
6
Key Terms Defined
• To search the web, we use search tools
accessed via web pages.
– Search tools may be as simple as a list of
links, or as complicated as a large database
of information gathered from web pages.
7
HTML tags define document structure.
Web browser software interprets HTML and displays page.
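To make the idea of tags and hypertext links concrete, here is a minimal sketch that reads a small HTML document and lists the links it contains, using Python's standard `html.parser` module. The sample page and the `LinkCollector` and `extract_links` names are made up for illustration; a real browser does far more than this.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the target of every <a href="..."> tag it encounters."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# A made-up page used only for illustration.
SAMPLE_PAGE = """
<html>
  <head><title>Sample page</title></head>
  <body>
    <h1>Search tools</h1>
    <p>See <a href="http://www.searchenginewatch.com">Search Engine Watch</a>
       and <a href="http://www.searchengineshowdown.com">Search Engine Showdown</a>.</p>
  </body>
</html>
"""


def extract_links(html_text):
    """Return the href values of all links in the HTML text, in page order."""
    collector = LinkCollector()
    collector.feed(html_text)
    return collector.links


print(extract_links(SAMPLE_PAGE))
```

The tags (`<h1>`, `<p>`, `<a>`) describe structure; the program, like a browser, decides what to do with each one.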
What to know: Web concepts
2. Know the basic types of general-purpose search tools and how they
work.
– search engines versus subject directories
9
Basic Web Search Tools
• Both Subject Directories and Search
Engines offer access via a web page to
a database of information about web
sites.
• The information in their databases,
however, and the way this information
is gathered, organized and maintained,
is very different.
10
Subject Directories
• A subject directory searches a human-compiled database of web sites,
organized into subject categories.
• The database includes the name and
URL of the web site, plus a brief
description.
• The database does NOT include
individual web pages within the site.
11
Search Engines
• A search engine searches a computer-compiled database of information about
individual web pages. There are no
subject categories. No human examines
the web sites.
• The database includes detailed
information from the web site -- in some
cases every word on every page is
indexed; in others only selected portions
are indexed.
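As a rough illustration of what "every word on every page is indexed" can mean, the sketch below builds a tiny word-to-page index and answers a query from it. The pages, URLs, and function names are invented for this example, and real search engines are far more elaborate.

```python
import re
from collections import defaultdict

# Hypothetical pages, invented for this illustration.
PAGES = {
    "http://example.org/quakes/labs.html": "Earthquake engineering research centres and labs",
    "http://example.org/quakes/news.html": "News about a recent earthquake",
    "http://example.org/coal.html": "Coal production statistics",
}


def build_index(pages):
    """Map each lowercase word to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index


def search(index, query):
    """Return URLs containing every word in the query (an AND search)."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


index = build_index(PAGES)
print(sorted(search(index, "earthquake research")))
```

Note that the search consults the index, not the pages themselves, so results can only be as current as the last time the index was built.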
12
Directories VS Engines
UI LIS Current Clips: http://www.lis.uiuc.edu/clips/2002_12.html
13
What to know: Web concepts
3. Bear in mind the implications of the
structure of the Web environment and
its search tools.
– Web sources must be carefully evaluated.
– Everything is NOT on the Web.
– There is no such thing as a ‘live’ Web search.
– There is no such thing as a fully
comprehensive Web search.
14
3 Essential Practical Things
to Know About General-purpose Search Tools
General-purpose Search Tools
1. Know when directory results may be
more useful than search engine results,
and vice versa.
16
Directories VS Engines
UI LIS Current Clips: http://www.lis.uiuc.edu/clips/2002_12.html
17
E.g., Yahoo!
Search keywords here
Or browse subject categories here
18
Search Results: 5 types
Types of results
Yahoo Directory category matches
Blends site results from Google and Yahoo Directory
19
Search Results: Directory only
20
Yahoo Directory Site
21
Google Directory
22
Example
• Find major earthquake engineering
research centres.
23
Category view
Annotated directory entries
26
General-purpose Search Tools
2. Know advanced search features and
syntax, such as ‘search engine math.’
28
Basic ‘Search Syntax’
Searching Phrases
“” searches enclosed terms as a phrase
• Example:
Find source and completion of quotation
beginning: “went down to the station to look
for her there”
29
Without Quotation Marks
First results NOT relevant.
Terms are scattered in
documents.
30
Search terms as a phrase
Enclose terms in
quotes to search
as a phrase.
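The difference between the two searches can be sketched in a few lines. In this illustration, `contains_all_terms` mimics a search without quotation marks (terms may be scattered), while `contains_phrase` mimics a quoted search. Both function names and both sample documents are invented; real search tools may differ in detail.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into words."""
    return re.findall(r"[a-z0-9']+", text.lower())


def contains_all_terms(text, query):
    """True if every query word appears somewhere in the text, in any order."""
    words = set(tokenize(text))
    return all(term in words for term in tokenize(query))


def contains_phrase(text, query):
    """True if the query words appear consecutively and in order."""
    words = tokenize(text)
    terms = tokenize(query)
    if not terms:
        return False
    span = len(terms)
    return any(words[i:i + span] == terms for i in range(len(words) - span + 1))


# Two invented documents for comparison.
scattered = "Look, the station was down for repairs; she went there to wait for her train."
exact = "He went down to the station to look for her there, but she had gone."

query = "went down to the station to look for her there"

print(contains_all_terms(scattered, query), contains_phrase(scattered, query))
print(contains_all_terms(exact, query), contains_phrase(exact, query))
```

The first document contains every word of the query yet is not relevant; only the phrase test separates it from the second.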
31
Full album is in one web page
32
Use browser to search page
Basic Search Syntax
+ -- include term
- -- exclude term
* -- truncation symbol (AltaVista)
– matches up to 5 additional characters at the end of a word
– use to search plurals / alternate endings
– e.g., computer* retrieves ‘computers,’
‘computerized,’ etc.
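The include, exclude, and truncation rules can be sketched as a small matcher. This is an illustration only: the function names are invented, a term with no prefix is assumed to be required, and the five-character limit follows the AltaVista description above rather than any tool's actual implementation.

```python
import re


def term_matches(term, words):
    """Match one query term against a document's words.

    A trailing * is treated as truncation: the stem plus up to five more
    characters, following the AltaVista behaviour described above.
    """
    if term.endswith("*"):
        stem = term[:-1]
        return any(w.startswith(stem) and len(w) - len(stem) <= 5 for w in words)
    return term in words


def matches(query, text):
    """Apply +term (must appear) and -term (must not appear) to a text.

    Assumption for this sketch: a term with no prefix is treated as required.
    """
    words = set(re.findall(r"[a-z0-9]+", text.lower()))
    for raw in query.lower().split():
        if raw.startswith("-"):
            if term_matches(raw[1:], words):
                return False
        else:
            if not term_matches(raw.lstrip("+"), words):
                return False
    return True


print(matches("+jaguar -car", "The jaguar is a large cat"))
print(matches("+jaguar -car", "Jaguar car dealers"))
print(matches("computer*", "Computerized catalogues"))
```

Under these assumptions, `computer*` would not match "computerization", because the ending is longer than five characters.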
36
Advanced Search Syntax
• Field searching
• Format:
fieldname:TERM
37
Advanced Search Syntax
• Field Searching Examples
title:drucker searches for ‘drucker’ only in title
of web page (Google: intitle)
url:hoovers searches for ‘hoovers’ anywhere in
the web address (Google: inurl)
domain:ca searches for ‘ca’ (Canada) only in the
domain portion of the web address
link:www.hoovers.com searches for pages that
link to www.hoovers.com
lyrics site:j-tull.com searches for the word
‘lyrics’ within the site www.j-tull.com (Google)
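A field search restricts matching to one part of a page's record. The sketch below checks the URL-based fields from the examples against a web address, using Python's standard `urllib.parse`. The function names and the example.ca address are invented, and treating `domain:` as "top-level domain only" is a simplifying assumption; each search tool defines its own field rules.

```python
from urllib.parse import urlparse


def parse_field_query(query):
    """Split 'fieldname:TERM' into its two parts."""
    field, _, term = query.partition(":")
    return field, term


def url_field_matches(field, term, page_url):
    """Check one URL-based fieldname:TERM restriction against a page URL.

    Covers only the URL-based fields from the examples above; title: and link:
    would need the page's content, which this sketch does not fetch.
    """
    term = term.lower()
    host = (urlparse(page_url).hostname or "").lower()
    if field == "url":       # term anywhere in the web address
        return term in page_url.lower()
    if field == "domain":    # assumption: term is the top-level domain, e.g. "ca"
        return host.split(".")[-1] == term
    if field == "site":      # page belongs to the named site
        return host == term or host.endswith("." + term)
    raise ValueError(f"unsupported field: {field}")


field, term = parse_field_query("domain:ca")
print(url_field_matches(field, term, "http://www.example.ca/results.html"))
print(url_field_matches(field, term, "http://www.hoovers.com/ca/"))
```

The second address contains "ca" in its path but not in its domain, so a `domain:` restriction rejects it while a `url:` restriction would accept it.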
38
Example
• Search for Canadian federal election
results.
“federal election” domain:ca
– retrieves documents containing BOTH the
phrase “federal election” in the text, and the
domain “ca” in the web address
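The "BOTH ... and" logic of this query can be sketched as follows. The code splits the query into a quoted phrase and a `domain:` restriction with Python's standard `shlex` module (which expects straight quotation marks), then requires both to hold. The function names and example.ca/example.com pages are invented, and plain substring matching stands in for real phrase search.

```python
import shlex
from urllib.parse import urlparse


def parse_query(query):
    """Split a query into quoted phrases/words and a domain:TERM restriction."""
    phrases, fields = [], {}
    for token in shlex.split(query):
        field, sep, term = token.partition(":")
        if sep and field == "domain":
            fields[field] = term.lower()
        else:
            phrases.append(token.lower())
    return phrases, fields


def matches(query, page_url, page_text):
    """True only if every phrase is in the text AND the domain restriction holds."""
    phrases, fields = parse_query(query)
    text = " ".join(page_text.lower().split())  # normalise whitespace
    if not all(phrase in text for phrase in phrases):
        return False
    if "domain" in fields:
        host = (urlparse(page_url).hostname or "").lower()
        if host.split(".")[-1] != fields["domain"]:
            return False
    return True


QUERY = '"federal election" domain:ca'
print(parse_query(QUERY))
print(matches(QUERY, "http://www.example.ca/results.html", "Federal election results by riding"))
```

A page fails if either condition fails: the right phrase on a .com site, or a .ca site without the phrase.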
39
Example
• Search for a good list of international
phone directories.
– we already know a good Canadian online
phone directory
– strategy: see what other sites link to it
link:canada411.sympatico.ca
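Conceptually, a `link:` search runs the Web's links in reverse: instead of asking where a page points, it asks which pages point to it. The sketch below does this over a small, invented table of links. Every page name other than canada411.sympatico.ca is hypothetical, and ranking by number of outgoing links is only one rough heuristic for spotting a list of directories.

```python
# Hypothetical link data: each page maps to the hosts it links to.
# All page names except canada411.sympatico.ca are invented for illustration.
OUTLINKS = {
    "http://example.org/phone-directories.html": [
        "canada411.sympatico.ca", "phones.example.co.uk", "whitepages.example.com",
    ],
    "http://example.net/canada-travel.html": ["canada411.sympatico.ca"],
    "http://example.com/recipes.html": ["food.example.com"],
}


def pages_linking_to(target, outlinks):
    """Return the pages whose outgoing links include the target (a link: search)."""
    return sorted(page for page, targets in outlinks.items() if target in targets)


def rank_by_link_count(pages, outlinks):
    """Put pages with the most outgoing links first; a long list of links
    is a rough hint that the page is itself a directory of directories."""
    return sorted(pages, key=lambda page: len(outlinks[page]), reverse=True)


hits = pages_linking_to("canada411.sympatico.ca", OUTLINKS)
print(rank_by_link_count(hits, OUTLINKS))
```

Results still need to be examined by hand; the ranking only suggests which pages to look at first.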
40
Google Search
41
Listing of Web Phone Dirs.
44
Google Advanced Search
45
General-purpose Search Tools
3. Know where to go to learn more
quickly.
– what search tools are out there?
– what are their advanced features?
– where do they get their results?
– is there a more specialised search tool for my
topic?
46
Search Engine Watch
www.searchenginewatch.com
47
Search Engine Watch
Search Features Charts
http://www.searchenginewatch.com/facts/ataglance.html
49
Search Engine Showdown
www.searchengineshowdown.com
50
Search Engine Showdown
Search Tool In-depth Reviews
51
3 Essential Useful Tips to
Know About Search
Strategy
Search Strategy
1. For some topics, consider using
general-purpose search tools to search
for sources, not directly for content.
53
Example
• What is the provenance of the
Leonardo da Vinci painting “Virgin of
the Rocks” in the National Gallery
(UK)?
54
Direct search in Google
Results may be useful. Need to
examine and evaluate individually.
No source stands out as useful for
future provenance searches.
56
Search for general topic, to find
source
57
Search Strategy
2. Find expert sources by asking
yourself, Who cares about this topic?
65
The ‘Who Cares?’ strategy
• Ask Who cares about this?
– Rather than searching for the info needed,
find out if someone has already gathered it
together for you
• identify a likely organisation and its web site
• use specialised tools to find ‘guru pages’ and
subject guides
66
Strategy: Who cares?
• Is there an organisation or person
interested in this problem?
– Is there a government agency responsible for
collecting or disseminating this info?
– Would a trade association be interested on
behalf of its members?
– Has a university department or independent
scholar or hobbyist created a subject guide?
67
Example
• Where can I find coal production
statistics for the US?
• Who cares?
– United States Geological Survey (USGS)
– often government agencies are responsible
for compiling statistics
69
Government Agency
US Geological Survey publishes the Minerals Yearbook.
Sample: Coal Product statistics
70
Search Strategy
3. Build a core collection of specialised
search tools beyond general-purpose
subject directories and search engines.
– collect sites useful for searching your subject
area
– develop a workable way to organise and
access them
72
http://www.census.gov
74
Powermarks bookmark utility -- creates
searchable database of bookmarked
sites; easy to organise, weed, and search.
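The idea of a searchable collection of bookmarks can be sketched in a few lines. This is not how Powermarks works internally, which the slides do not describe; the `Bookmark` and `BookmarkCollection` names and the chosen keywords are invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Bookmark:
    url: str
    title: str
    keywords: set = field(default_factory=set)


class BookmarkCollection:
    """A tiny in-memory stand-in for a searchable bookmark database."""

    def __init__(self):
        self._bookmarks = {}

    def add(self, url, title, *keywords):
        """Store a bookmark with any number of descriptive keywords."""
        self._bookmarks[url] = Bookmark(url, title, {k.lower() for k in keywords})

    def remove(self, url):
        """Weed out a bookmark that is no longer useful."""
        self._bookmarks.pop(url, None)

    def search(self, term):
        """Return bookmarks whose title or keywords contain the term."""
        term = term.lower()
        return [
            b for b in self._bookmarks.values()
            if term in b.title.lower() or any(term in k for k in b.keywords)
        ]


tools = BookmarkCollection()
tools.add("http://www.census.gov", "US Census Bureau", "statistics", "population")
tools.add("http://www.searchenginewatch.com", "Search Engine Watch", "search engines", "reviews")

print([b.url for b in tools.search("statistics")])
```

Adding a few keywords at the time a site is saved is what makes the collection searchable by topic later.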
79
Summary
• 3 Essential Conceptual Things to
Know about the Web:
– Know what it is; know basic definitions and
components involved.
– Know that there are two basic types of
general-purpose search tools, and how they
work.
– Bear in mind implications of structure for
how searches work, and limitations of the
Web.
81
Summary
• 3 Essential Practical Things to Know
about General-purpose Search Tools:
– Know when directory results may be more
useful than search engine results, and vice
versa.
– Know advanced search features and syntax,
such as ‘search engine math.’
– Know where to go to quickly learn more.
82
Summary
• 3 Essential Useful Tips to Know About
Search Strategy:
– Consider using general-purpose search tools
to search for sources, not directly for
content.
– Find expert sources by asking yourself, Who
cares about this topic?
– Build a core collection of specialised search
tools beyond general-purpose subject
directories and search engines.
83