Database/Record Structure

Download Report

Transcript Database/Record Structure

Database & Record Structure
Dania Bilal
IS 530
Fall 2005
Definition

A database is a collection of information
organized in a way that a computer
program can quickly retrieve desired
pieces of data.
Database Components



Fields
Records
Files
Database Fields

Pieces of information a user can access






Author
Title
Journal name
Abstract
Descriptors
Other
Database Fields

Fields may have attributes associated
with them:

Numeric


(e.g., accession number)
Textual

(e.g., author name)
Database Records and Files

Record


A collection of fields which constitutes a
complete set of information
File

A collection of records
Hypertext Database


Hypertext was invented by Ted Nelson
in the 1960s.
In a hypertext database, objects (text,
pictures, music, and other media) are
linked to each other.
Data Structure


A scheme for organizing related pieces
of information.
Basic types of data structures

Files, records, trees, tables
Trees

Data is organized in a hierarchical
structure



Each element is attached to one or more
elements that is directly beneath it.
Connections between elements ->branches
Elements at bottom of a tree with no
elements below them -> leaves
Tables

Data is organized in rows and columns


Example: Excel spreadsheet
Relational database management
systems store data in the form of
related tables

Aleph system is based on a relational database management
system (Oracle)
Dialog Database


Documents or surrogates are stored in
a linear file
Linear file is transformed into an
inverted file
Dialog Database Structure

Linear file


Composed of document surrogates stored
in the IR system in their full, native form.
Inverted file

Composed of all words included in
document surrogates excluding stop
words.
Linear File


Documents have to be searched in their
entirety to locate specific information
needed.
Slow and inefficient
Inverted File


Words in all documents can be searched
instead of the whole text of the
documents themselves
Faster and more efficient
Creation of Inverted File



A list of words in document surrogates
is made.
Each word is numbered, including
phrases and excluding stop words.
Words that are numbered are
alphabetized (numbers precede letters)
Creation of Inverted File

Alphabetized entries are followed by the
document number, field (e.g., AB, DE),
and the number of the entry in that
field (e.g., entry in abstract as first
word)
Linear File: Example
101
The origins of Don Giovanni.
Discusses the history and sources Mozart used in
his opera Don Giovanni.
DE: Mozart, Opera, Historical Analysis.
Inverted File
Origins
Don
Giovanni
101
101
101
Ti
Ti
Ti
2
4
5
Discusses
History
Sources
Mozart
Used
…
101
101
101
101
101
Ab
Ab
Ab
Ab
Ab
1
3
5
6
7
Inverted File
Mozart
101
DE
1
Opera
101
DE
2
Historical
101
DE
3
Analysis
101
DE
4
Historical Analysis
101
DE
3,4
Indexing

Words (keywords)


Every important word in document is
indexed
Information systems, for example, is
indexed as 2 separate words and as a
phrase



Information
Systems
Information systems
Record Structure



Dialog record structure shows every
field followed by the information for
that field.
Fields and structure varies among
databases
Demo of a Dialog Record structure
Internet Protocols



Http
Telnet
File Transfer Protocol (FTP)


Secure FTP (SSH)
Web-based FTP (Volspace)
E-mail Protocols

SMTP (Simple Mail Transfer Protocol)




Used to send e-mail between servers and between
a server and client’s machine over the Internet
Email is retrieved by using a POP, IMAP, or
text-based email client
POP (Post Office Protocol)
IMAP (Internet Message Access Protocol)
Introduction to Dialog






Begin or B
Select or S
Select Steps or SS
Display Sets or DS
Logoff hold
Logoff