6.0088 Yugoslav Text Corpus Available (1/33)
Elaine Brennan & Allen Renear (EDITORS@BROWNVM.BITNET)
Thu, 18 Jun 1992 18:14:46 EDT
Humanist Discussion Group, Vol. 6, No. 0088. Thursday, 18 Jun 1992.
Date: Thu, 18 Jun 92 13:29:56 +0200
From: Henning Mørk <slavhenn@aau.dk>
Subject: YU-CORPUS
Aarhus, Denmark, June 1992
Dear colleagues,
This message announces the first part of my YU-CORPUS (Yugoslav text
corpus), which consists mainly of contemporary prose fiction in
Serbo-Croatian. The main areas are represented: Serbia, Croatia,
Montenegro, and Bosnia-Hercegovina.
The corpus consists of 15 files that together contain approximately
700 000 words.
These files are available by ftp at aau.dk (129.142.17.240) in the
directory /home/ftp/pub/slav.
First get the two text files: yu-corp.txt, which among other things
describes the chosen ASCII standard, and yu-index.txt, which lists the
available texts by author(s) and size.
The corpus files are zipped and must thus be transferred in binary mode.
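For readers who prefer to script the transfer, the retrieval steps above can be sketched in Python with the standard ftplib module. This is only an illustration under stated assumptions: the host, directory, and the two .txt file names come from this announcement, but anonymous login is an assumption, and the zip file name in the usage comment is hypothetical (the real names are listed in yu-index.txt). The helper names is_binary, fetch, and fetch_all are invented for this sketch.

```python
from ftplib import FTP
from pathlib import Path

HOST = "aau.dk"                                # from the announcement
REMOTE_DIR = "/home/ftp/pub/slav"              # from the announcement
TEXT_FILES = ["yu-corp.txt", "yu-index.txt"]   # from the announcement


def is_binary(filename):
    """Treat everything except .txt files as binary (the corpus files are zipped)."""
    return not filename.lower().endswith(".txt")


def fetch(ftp, filename, dest_dir):
    """Download one file from the current remote directory into dest_dir."""
    target = Path(dest_dir) / filename
    if is_binary(filename):
        # Binary mode: bytes are written unchanged, as zip archives require.
        with open(target, "wb") as out:
            ftp.retrbinary("RETR " + filename, out.write)
    else:
        # Text mode: retrlines hands over each line without its line ending.
        with open(target, "w", encoding="utf-8", newline="\n") as out:
            ftp.retrlines("RETR " + filename, lambda line: out.write(line + "\n"))
    return target


def fetch_all(ftp, filenames, dest_dir="."):
    """Change to the corpus directory and download the named files in order."""
    ftp.cwd(REMOTE_DIR)
    return [fetch(ftp, name, dest_dir) for name in filenames]


# Usage sketch (not executed here; needs network access):
#
# with FTP(HOST) as ftp:
#     ftp.login()  # anonymous login is an assumption
#     fetch_all(ftp, TEXT_FILES + ["example.zip"])  # "example.zip" is hypothetical
```

Fetching the two text files first mirrors the order suggested above, since yu-index.txt is what names the zipped corpus files to request afterwards.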
All comments are welcome.
Henning Moerk
Slavisk Institut
Aarhus Universitet
Ny Munkegade 116
8000 Aarhus C
Denmark
tel: +45 86 13 65 55
fax: +45 86 19 21 55
e-mail: slavhenn@aau.dk