5.0724 Available: CONC; ICAME on CD; English Lexicon (3/150)

Elaine Brennan & Allen Renear (EDITORS@BROWNVM.BITNET)
Thu, 27 Feb 1992 23:17:02 EST

Humanist Discussion Group, Vol. 5, No. 0724. Thursday, 27 Feb 1992.


(1) Date: Tue, 25 Feb 92 9:35:52 CST (50 lines)
From: txsil!evan@utafll.uta.edu (Evan Antworth)
Subject: concordance program for Macintosh

(2) Date: Wed, 26 Feb 1992 19:52:26 +0100 (53 lines)
From: Knut Hofland <knut@x400.hd.uib.no>
Subject: ICAME text corpora available on CD-ROM

(3) Date: Tue, 25 Feb 92 10:08:59 CST (47 lines)
From: txsil!evan@utafll.uta.edu (Evan Antworth)
Subject: English lexicon available

(1) --------------------------------------------------------------------
Date: Tue, 25 Feb 92 9:35:52 CST
From: txsil!evan@utafll.uta.edu (Evan Antworth)
Subject: concordance program for Macintosh

Conc is a program for the Macintosh that produces keyword in context
concordances. It can handle both ordinary flat text and multiple-line
interlinear text. In the case of interlinear text, it can concord
correspondences between two annotation lines. It can also do letter
concordances to facilitate phonological analysis. Conc permits the
user to limit the concordance to just those words that match a specified
pattern (GREP expression).

Concordances can be saved to disk, printed, and exported to a plain text
file. As for performance, producing a concordance of Moby Dick (1,177KB)
on a Mac IIci takes about 13 minutes and requires about 2,500KB of memory.
Documentation is included on-line in a Microsoft Word file.
Conc was written by John Thomson of SIL.

Conc version 1.70 is a beta test version offered as 'freeware'. If you use
it, we only ask that you send us your comments, complaints, and wishlist.
You can affect the shape of the final product!

Conc is available in either of two way:

1. Conc can be downloaded by anonymous FTP from the Consortium for Lexical
Research at clr.nmsu.edu [128.123.1.11]. In the directory
pub/tools/concordances/conc you will find the file conc170.hqx, which is a
binhexed, Stuffed archive. Send e-mail inquiries to lexical@nmsu.edu.
(While you are connected, I recommend downloading the file catalog-short
from the top directory.)

2. Conc can be ordered on disk from:

International Academic Bookstore
7500 W. Camp Wisdom Road
Dallas, TX 75236
U.S.A.
phone: (214)709-2404

Cost for media and shipping is $4 to North America and $6 overseas.
(Checks *must* be drawn on a U.S. bank. They do not accept credit cards,
but will bill by invoice.)


Evan Antworth | Internet: evan@sil.org
Academic Computing Department | UUCP: ...!uunet!convex!txsil!evan
Summer Institute of Linguistics | phone: 214/709-2418
7500 W. Camp Wisdom Road | fax: 214/709-3387
Dallas, TX 75236 |

(2) --------------------------------------------------------------74----
Date: Wed, 26 Feb 1992 19:52:26 +0100
From: Knut Hofland <knut@x400.hd.uib.no>
Subject: ICAME text corpora available on CD-ROM

The ICAME Collection of English Language Corpora on CD-ROM is now available.

The CD-ROM is ISO 9660 formatted and have directories for MS-DOS,
Macintosh and Unix.

The CD-ROM contains the following text corpora in the original formats:

Brown Corpus, untagged version, 1 million running words
LOB Corpus, tagged and untagged versions, 1 million running words
London-Lund Corpus, 0.5 million words (spoken)
Helsinki Corpus, diacronic part, 1.5 million running words
Kolhapur Corpus, 1 million running words (Indian English)

All the corpara are also indexed with WordCruncher 4.4 for MS-DOS. The
retrieval part of WordCruncher, WCView, is included.

All the corpora, except Kolhapur, are also indexed with TACT for MS-DOS.

Brown, LOB and London-Lund corpora are indexed with "Free Text Browser"
for Macintosh.

The CD-ROM also has information about network resources like discussion
lists, FTP sites, Netnews lists, text projects and archives, on-line
services and contain some linguistic freeware/shareware programs.

The CD-ROM is available to bona fide researchers for non-commercial research,
the buyer has to state this on the order form.

The price of the disc is 3000 NOK (about 470$).

It is possible to see the disc at the ALLC-ACH conference in Oxford. Since
there are no general sessions for demonstrations, this will be more or less
informal, either on our own equipment or on available equipment at the
conference site. Contact Knut Hofland, either before or under the conference.

More information about the CD-ROM can be fetched from our file servers,
either by mail to the automatic mail responder FILESERV@HD.UIB.NO with
the following line in the BODY:

send icame info.cd

or by anonymous FTP to NORA.HD.UIB.NO (129.177.24.42), and retrieving the file
info.cd in the directory pub/icame.

Knut Hofland
E-mail. knut@hd.uib.no / fafkh@nobergen.bitnet / knut@x400.hd.uib.no
Norwegian Computing Centre for the Humanities,
P.O. Box 53 Universitetet, N-5027 Bergen, Norway
Tel. +47 5 212954, Fax. +47 5 322656
(3) --------------------------------------------------------------58----
Date: Tue, 25 Feb 92 10:08:59 CST
From: txsil!evan@utafll.uta.edu (Evan Antworth)
Subject: English lexicon available

Englex is a morphological parsing lexicon of English intended for use with
PC-KIMMO and/or KTEXT. It's 20,000 entries consist of affixes, roots, and
indivisible stems. Both inflectional and derivational morphology are
analyzed. Englex will run under Unix, Macintosh, or MS-DOS (the files are
plain ascii and are identical for all three versions). Because of memory
requirements, to run Englex under MS-DOS you will need a 386 cpu and
the new 386 versions of PC-KIMMO and KTEXT. These 386 versions will use all
available extended/expanded memory and virtual memory. They support
VCPI-compliant memory managers such as DOS 5.0's EMM386 and Quarterdeck's
QEMM. They do not support (or need) Windows.

All of this software can by downloaded by anonymous FTP from the Consortium
for Lexical Research at clr.nmsu.edu [128.123.1.11]. Send e-mail inquiries
to lexical@nmsu.edu. (For a listing of their holdings, get the file
catalog-short in the top directory.) Here are the subdirectories and
file names:

Directory: pub/tools/ling-analysis/englex_pckimmo
englex10.zip Zipped MS-DOS file of englex10
englex10.tar.Z Compressed UNIX tar file of englex10
englex10.hqx Stuffed, binhexed Mac file of englex10

Directory: pub/tools/ling-analysis/morphology/pc-kimmo
pckim108.zip Zipped MS-DOS file of pc-kimmo108 (inc. 386 version)
pckim108.tar.Z Compressed UNIX tar file of pc-kimmo108 sources
pckimmo108.hqx Stuffed, binhexed Mac file of pc-kimmo108

Directory: pub/tools/ling-analysis/morphology/ktext
ktext103.zip Zipped MS-DOS fiel of ktext103 (inc. 386 version
ktext103.tar.Z Compressed UNIX tar file of ktext103 sources
ktext103.hqx Stuffed, binhexed Mac file of ktext103

Englex, PC-KIMMO, and KTEXT are offered as 'freeware' to the academic
community; your feedback is welcomed.

Evan Antworth | Internet: evan@sil.org
Academic Computing Department | UUCP: ...!uunet!convex!txsil!evan
Summer Institute of Linguistics | phone: 214/709-2418
7500 W. Camp Wisdom Road | fax: 214/709-3387
Dallas, TX 75236 |
U.S.A. |