12.0266 new on WWW

Humanist Discussion Group (humanist@kcl.ac.uk)
Fri, 23 Oct 1998 09:31:40 +0100 (BST)

Humanist Discussion Group, Vol. 12, No. 266.
Centre for Computing in the Humanities, King's College London
<http://www.princeton.edu/~mccarty/humanist/>
<http://www.kcl.ac.uk/humanities/cch/humanist/>

[1] From: "Lissa Lord <lissa-lord@uiowa.edu>" (22)
<llord@blue.weeg.uiowa.edu>
Subject: Exhibit On Preservation

[2] From: Ian Butterworth <i.butterworth@ic.ac.uk> (25)
Subject: Conference, 'Electronic Communication and Research in
Europe'

[3] From: "David L. Gants" <dgants@english.uga.edu> (57)
Subject: Prague Dependency Treebank

--[1]------------------------------------------------------------------
Date: Sat, 17 Oct 1998 09:52:43 -0500 (CDT)
From: "Lissa Lord <lissa-lord@uiowa.edu>" <llord@blue.weeg.uiowa.edu>
Subject: Exhibit On Preservation

http://www.lib.uiowa.edu/ref/exhibit

The University of Iowa Libraries announces a virtual exhibition which
focuses on the preservation of scholarly resources in various formats.

This Web Page is the virtual version of the Libraries' exhibition,
"Keeping our Word: Preserving Information Across the Ages." Most
digitized images are from the exhibition as shown in the North
Exhibition Hall of the Main Library on the University of Iowa campus
from October 1998 to January 1999.

The presence of this exhibition on the Internet extends the space
and resources available to the on-site visitor. The space grows
beyond library walls. The resources expand into major preservation
research centers as well as into small, specialized collections the
world over. The virtual exhibit forms a continuing research page
for the study of library preservation.

* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
Lissa Lord lissa-lord@uiowa.edu
Team Leader, Research Services
Information, Research, & Instructional Services (IRIS)
University of Iowa Libraries
100 Main Library
Iowa City, IA 52242 319/335-5403

It's like Deja Vu all over again...Y.Berra
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *

--[2]------------------------------------------------------------------
Date: Sun, 18 Oct 1998 14:57:59 +0100 (BST)
From: Ian Butterworth <i.butterworth@ic.ac.uk>
Subject: Conference, 'Electronic Communication and Research in
Europe'

Colleagues

The Proceedings of the Conference:"Electronic Communication and
Research in Europe",which was organised by the Academia Europaea and took
place in Darmstadt/Seeheim on the 15th to 17th of April 1998 within the
framework of the European Commission's European Science and Technology Forum
has now been published on the Web by the Forum with the help of
the German National Research Centre for IT-Integrated Publications and
Information Systems Institute (GMD-IPSI).

The Proceedings were edited by Jack Meadows and can be found at:
http://academia.darmstadt.gmd.de/seeheim/thebook/index.html

The printed version will be available from the Forum soon.

Regards

Ian Butterworth
------------------------------------------------------------------------
Professor Ian Butterworth CBE FRS MAE

Vice-President, Academia Europaea
The Blackett Laboratory
Imperial College
Prince Consort Road
London SW7 2AZ
UK

Tel: +44 (0)171 594 7851

Fax: +44 (0)171 823 8830

E-Mail: i.butterworth@ic.ac.uk

--[3]------------------------------------------------------------------
Date: Tue, 20 Oct 1998 13:08:30 -0400 (EDT)
From: "David L. Gants" <dgants@english.uga.edu>
Subject: Prague Dependency Treebank

>> From: Daniel Zeman <zeman@ufal.ms.mff.cuni.cz>

The Institute of Formal and Applied Linguistics (UFAL) at the Charles
University, Prague, proudly announces that the first version of the
PRAGUE DEPENDENCY TREEBANK has been made available to the research
community.

The Prague Dependency Treebank (PDT) is a morphologically and
syntactically annotated corpus of Czech as a representative of
inflectionally rich free-word-order languages. (E.g., all the Slavic
languages such as Russian, Polish, Serbo-Croatian and many others
spoken together by more than 350 million people have similar typological
properties as Czech in both morphology and syntax.) The current version
of PDT (0.5) contains 456705 tokens (words+punctuation) in 26610
sentences and 576 files. For keeping results of NLP applications
comparable the data has been divided into a training set (19126
sentences), a development test set (3697 sentences) and a
(cross-)evaluation test data set (3787 sentences).

The Prague Dependency Treebank is - to a certain extent - modelled after
the Penn Treebank but it uses the dependency syntax representation of
sentences. It has three layers:

1.morphological (uses word forms, tags, lemmas)
2.analytical, or surface syntax (uses dependencies and analytical
functions of dependencies)
3.tectogrammatical, which captures linguistic meaning (contains
tectogrammatical functions such as Actor, Patient, Addressee, etc.)

The Prague Dependency Treebank is a long-term project which should end
in the year 2000. At the moment (October 1998) we have at our disposal
roughly half the material (at levels 1 and 2) while the level 3 is still
in the specification phase and rules of transition between the
representations on level 2 and level 3 are being formulated. The current
version is thus preliminary and identified as "PDT version 0.5"
(reflecting
mostly the amount of material currently available).

The text material contains samples from the following sources:

1.Lidove noviny (daily newspapers), 1991, 1994, 1995
2.Mlada fronta Dnes (daily newspapers), 1992
3.Ceskomoravsky Profit (business weekly), 1994
4.Vesmir (scientific magazine), Academia Publishers, 1992, 1993

The electronic source has been provided by the Institute of the Czech
National Corpus, in a format jointly developed by the ICNK and UFAL.

The Treebank has been supported by the following grants and projects:

Grant Agency of the Czech Republic No. 405/96/0198
(Treebank Definition and Procedures Specification)
Grant Agency of the Czech Republic No. 405/96/K214
(Tools and Level 1 Annotation)
Ministry of Education of the Czech Republic Project No. VS96151
(Tools and Structural Annotation on the Level 2)
National Science Foundation grant No. #IIS-9732388
(Version 0.5 Preparation for the Workshop 98)

The documentation of PDT is linked from its main page at UFAL. Go to the
UFAL home page, http://ufal.ms.mff.cuni.cz/, then click on "Projects"
and "Treebank".

The PDT Version 0.5 is freely available for research purposes providing
you fill in and submit a licence agreement. The appropriate form is also
linked from the PDT web page.

-- 
Daniel Zeman, UFAL MFF UK, Praha
zeman@ufal.mff.cuni.cz
http://www.ms.mff.cuni.cz/~zeman/

------------------------------------------------------------------------- Humanist Discussion Group Information at <http://www.kcl.ac.uk/humanities/cch/humanist/> <http://www.princeton.edu/~mccarty/humanist/> =========================================================================