Using Freeware Computer Programmes for English Language

Download Report

Transcript Using Freeware Computer Programmes for English Language

Using Freeware Computer Programmes
for English Language Teaching and Learning
Deny A. Kwary
www.kwary.net
10 May 2010
1
Presentation Outline
1.
2.
3.
4.
5.
Introduction
Using the American Corpus
Using the BAWE Corpus
Building a Corpus
Creating Term Banks using Range, AntConc,
and TermoStat
6. Creating Vocabulary Exercises, a Pop-Up
Glossary, and a Personal Dictionary
7. What else can We do with a Corpus?
2
1. Introduction
 Three different disciplines:
 Computer programming
 Computational linguistics
 English language teaching
 Do teachers need to master all of those three
disciplines?
 Computer programmers create programs 
Computational linguists modify the programs 
English language teachers use the programs.
3
“Knowledge is of two kinds. We know
a subject ourselves, or we know where
we can find information upon it.”
(Samuel Johnson)
4
2. Using the American Corpus (1)
http://www.americancorpus.org/
5
2. Using the American Corpus (2)
Which one occurs
more frequently in
academic texts:
1. ‘small’ or ‘little’?
2. ‘small difference’ or
‘little difference’?
6
2. Using the American Corpus (3)
7
2. Using the American Corpus (4)
8
3. Using the BAWE Corpus (1)
http://www2.warwick.ac.uk/fac/soc/al/research/collect/bawe/
9
3. Using the BAWE Corpus (2)
http://ca.sketchengine.co.uk/open/
10
11
12
13
14
4. Building a Corpus
http://www.lextutor.ca/tools/corpus_builder2/
15
5. 1. Creating Term Banks using Range
 Four kinds of vocabulary in a text: High frequency words,
Academic words, Technical words, and Low-frequency
words (Nation 2001: 11-13)
 Word Lists used as Stop Lists in the RANGE software:
Word List The first 1000 words from the
One
General Service List (West 1953)
High
Frequency
Words
Word List The second 1000 words from the
High
Two
General Service List (West 1953)
Frequency
Words
Word List 570 headwords from the Academic Academic
Three
Word List (Coxhead, 2000)
Words
16
Results from RANGE Software (1)
The sample text is taken from the CFA (Chartered Financial
Analyst) textbook, Study Session 8, Book 3, Level 1.
WORD LIST
TOKENS/%
TYPES/%
FAMILIES
one
15990/67.41
837/46.09
493
two
1404/ 5.92
170/ 9.36
104
three
2846/12.00
378/20.81
217
not in the lists
3479/14.67
431/23.73
1816
?????
Total
17
23719
814
Results from RANGE Software (2)
18
NO.
BASE ONE FAMILIES
TYFREQ
FAFREQ
F1
1.
THE
1539
1539
1539
2.
OF
862
862
862
3.
AND
587
587
587
12
OR
190
190
190
13
EXPENSE
128
197
197
14
COST
121
165
165
15
NOT
121
124
124
16
STOCK
121
123
123
 Nation actually realized that some technical
vocabulary actually also occurs in the high
frequency words. Therefore, he suggested
comparing the frequency of words in a
specialized text with their frequency in a
general corpus (Nation, 2001: 18) .
19
5. 2. Creating Term Banks using AntConc
 Target corpus: CFA textbook, Study Session 8, Book 3,
Level 1 (23,719 words).
 Reference corpus: the British Academic Written English
(BAWE) corpus (6,506,995 words).
 If the occurrence of a word is outstandingly frequent in a
target corpus than in a reference corpus, it will be
considered a positive key word. It means that a key
word does not always mean a word with high frequency.
 Key word is “a word which occurs with unusual
frequency in a given text” (Scott 1997: 236).
20
21
22
Criticizing the results of the Key Word Analysis
NO.
1
FREQUENCY
534
KEYNESS
4445.302
WORD
cash
2
3
4
336
268
174
2191.156
1459.710
1244.679
income
flow
net
Terms: Only single-word terms or also multi-word terms?
Mostly single-word terms or mostly multi-word terms?
Single-word terms, i.e. cash, income, flow, and net
Multi-word terms, i.e. cash flow and net income.
23
5. 3. Creating Term Banks using TermoStat
http://olst.ling.umontreal.ca/~drouinp/termostat_web/
24
25
Using the Collocates in AntConc
26
Using the N-gram (Word Clusters) in AntConc
27
6. Creating Vocabulary Exercises (1)
 Hot Potatoes (http://hotpot.uvic.ca/)
28
6. Creating a Pop-Up Glossary (2)
1. Open Microsoft Frontpage
2. Click File  New  Blank Page
3. In the Code view, Copy-Paste the following after the
code <body>
<INPUT onclick="alert('The receipts and payments made
by a business')" type=button value="cash flow" </td>
4. See the result in the Design view. See how it works in
the Preview.
5. Back to the Split View. Copy-Paste the button, and edit
as necessary.
29
30
6. Creating a Personal Dictionary (3)
1. Extract and Install the file ‘My Personal Dictionary’
2. Start the Program
31
Adding an Entry
To type the
entry word
To type the
definition
To Add
an
Entry
32
To
clear
the text
boxes
Editing an Entry
Double-click
the entry
Revise the
definition
Update
the
entry
33
Deleting an Entry
Double-click
the Entry
Delete
the
Entry
34
What else can we do with a corpus?
 Many Eyes
(http://manyeyes.alphaworks.ibm.com/manyeyes/)
The heart of the site is a collection of data visualizations.
 Example: Biomonitoring Corpus
http://manyeyes.alphaworks.ibm.com/manyeyes/datasets
/biomonitoring-corpus-from-pubmed-8/versions/1
35
the word tree: ‘examine’
36
Phrase net: ‘* or *’
37
THANK YOU
Deny A. Kwary
www.kwary.net
10 May 2010
38