20.089 Workshop on Historical Text Mining

From: Humanist Discussion Group (by way of Willard McCarty willard.mccarty_at_kcl.ac.uk>
Date: Sun, 2 Jul 2006 08:51:57 +0100

                Humanist Discussion Group, Vol. 20, No. 89.
       Centre for Computing in the Humanities, King's College London
                     Submit to: humanist_at_princeton.edu

         Date: Sun, 02 Jul 2006 08:30:19 +0100
         From: "Rayson, Paul" <rayson_at_exchange.lancs.ac.uk>
         Subject: Call for participation: Workshop on Historical Text Mining

Workshop on Historical Text Mining

Thursday 20th and Friday 21st July 2006, Lancaster University, UK.


Organisers: Paul Rayson (Lancaster University) and Dawn Archer
(University of Central Lancashire)

This is one of a series of workshops sponsored by the AHRC ICT Methods

We want to develop a network of scholars interested in 'Historical Text
Mining' via a workshop for experts from the various fields: text mining
and E-Science, corpus development and annotation, historical
linguistics, dialectology and computational linguistics. We believe that
a discussion relating to the effective text mining of historical data is
particularly overdue and much needed, because of the growth in
(historical) digital resources (e.g. Open Content Alliance, Google
Print, Early English Books Online). We particularly want to better
define the relationship between the text mining/E-Science community, who
are often involved in applying basic techniques to large scale datasets,
and the corpus linguistic community, who tend to apply data-driven
linguistic analysis and annotation techniques to relatively small

The 'Historical Text Mining' workshop will seek:

* to raise awareness of the various techniques utilised and/or tools
developed by researchers working within the various fields.
* to make scholars who work with historical data aware of existing text
mining techniques that are applicable to their research needs,
* to familiarise such scholars with the use of these techniques and
tools, by means of a series of tutorial sessions (e.g. GATE, WordSmith,
VARD, VIEW, Wmatrix),
* to investigate the problems of applying some "modern" large-scale
corpus annotation and analysis techniques to historical data, and
* to encourage/enable a roundtable discussion, with the ultimate aim of
determining what needs to be done to improve historical text mining and
(importantly) identify possible future workshops and collaborative

Participation is free but, since places are limited, we request that
potential participants apply in advance, and explain why they wish to
attend and what they expect from the workshop. For further details on
the application procedure and the workshop programme, please see the


Dr Paul Rayson
Computing Department, Infolab21, South Drive, Lancaster University,
Lancaster, LA1 4WA, UK.
Tel: +44 (0)1524 510357
Fax: +44 (0)1524 510492
Email: paul_at_comp.lancs.ac.uk

Dr Dawn Archer
Lecturer in English Language and Linguistics, Department of Humanities,
University of Central Lancashire, Preston, Lancashire, PR1 2HE
Tel: +44 (0)1772 893032
Email: dearcher_at_uclan.ac.uk
Received on Sun Jul 02 2006 - 04:22:50 EDT

This archive was generated by hypermail 2.2.0 : Sun Jul 02 2006 - 04:22:50 EDT