15.121 SOX; HumComp seminar; ichim01; new book; digitizing newspapers

From: Humanist Discussion Group (willard@lists.village.Virginia.EDU)
Date: Sat Jul 07 2001 - 17:40:36 EDT

Next message: Humanist Discussion Group: "Digest of announce from remdigest"

Previous message: Humanist Discussion Group: "15.124 manuscript catalogues and on-line databases (fwd)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

              Humanist Discussion Group, Vol. 15, No. 121.
      Centre for Computing in the Humanities, King's College London
              <http://www.princeton.edu/~mccarty/humanist/>
             <http://www.kcl.ac.uk/humanities/cch/humanist/>

[1] From: "David L. Gants" <dgants@english.uga.edu> (100)
Subject: Secret Oxford Photographt Project

[2] From: "David L. Gants" <dgants@english.uga.edu> (34)
Subject: seminars on humanities computing

[3] From: "David L. Gants" <dgants@english.uga.edu> (74)
Subject: International Cultural Heritage Inforamtics

[4] From: "David L. Gants" <dgants@english.uga.edu> (147)
Subject: New book on text summarisation by Inderjeet Mani (fwd)

[5] From: "David L. Gants" <dgants@english.uga.edu> (35)
Subject: Workshop: Digitising newspapers

--[1]------------------------------------------------------------------
        Date: Sat, 7 Jul 2001 16:10:21 -0500 (EST)
        From: "David L. Gants" <dgants@english.uga.edu>
        Subject: Secret Oxford Photographt Project

From: Michael Fraser <mike.fraser@computing-services.oxford.ac.uk>

SOX: The Secret Oxford Photography Project

Over 5,400 digital images of the parts of Oxford you rarely see!

http://www.etrc.ox.ac.uk/sox/index.htm

-------------------------------------------------------------------------

What is SOX?
'SOX' stands for Secret Oxford and it is an archive of over 5,400
pictures of Oxford all taken on the same day. In short, it is a snap-shot
of everyday life in the City, and provides a unique record of 14 hours in
Oxford (7am to 8pm) on 7th June 2001 - election day.

Why call it SOX?
The idea was to take pictures of people and places that do not normally
get chosen for photographs, i.e. the hidden parts of Oxford that never
appear on Inspector Morse or postcards. Rather then showing another 100
pictures of the Radcliffe Camera, SOX has images of people at work and
rest, contents of rubbish bins, posters, shops, and cars filled with
cement (yes, there is one).

What was the point of the project?
In part it was to create a record of Oxford as noted above, but it was
also an experiment just to see how many photographs it was possible to
take in such a short space of time, covering such a distance. SOX was born
digital. All the images were taken using digital cameras and then
downloaded (on the day) to computers and uploaded to the Web.

How did it work?
A team of about 25 volunteers were assembled, all giving up an hour or two
of their time during the day. Each was given a digital camera, a grid
reference of the City, and approximately one hour to take 100 photos of
secret Oxford. They then returned to the base and handed in their
camera. The images were then downloaded from the cameras onto some
computers, deleted from the camera itself, and then a new volunteer was
sent out to take another 100 images in an entirely different area.

How many photographs did you take?
Over 5,400. That averages out at 385 every hour, or over 6 a minute.

Whose idea was it?
The man behind the project was Peter Robinson, multimedia specialist at
Oxford's Humanities Computing Unit. He thought up the idea,
assembled the volunteers and cameras, and took many of the photographs
himself. He also processed the images and built the web site.

Who can use SOX?
Anyone with an Internet connection can use SOX as long as they comply with
the copyright restrictions.

How can I use SOX
Browse the images by area or by time of day. A page of thumbnail images
links to larger images which, in turn, includes information about the time
the photograph was taken, the photographer, image size, camera etc.
Found interesting ways of using SOX? Let us know!

What are the copyright restrictions?

Users are allowed to access this site for educational purposes. Without
further permission the images or documents may be:

* viewed
* printed on paper
* saved to disk
* used by students or staff registered or employed by an educational
  institution for any non-commercial purpose associated with teaching,
  learning or research. Credit must be given to the copyright
  holder and the URL of the SOX Web site should be stated in any
  materials incorporating images or documents from the SOX Project.

All other use requires prior permission. In particular, images or
documents may NOT be offered for sale or hire or otherwise disposed of on
a commercial basis, or stored in a database or compilation without prior
written permission.

All queries concerning copyright should be addressed to Peter Robinson
at the address below.

Who was involved ?
All the photographers were amateur volunteers who in the majority of cases
had never used a digital camera before. The youngest photographer was aged
six.

How was the website created?
All the photographs were taken as JPEG pictures via digital cameras. They
were transferred into folders sorted by geographic area and batch
processed to add catalogue information into the jpeg using a standard
called IPTC with the program Qpict. They were then renamed to include the
sector name and author initials and the original sequential number. A
program called iView then generated the web page of thumbnail pics and the
final photo pages including all the catalogue information, IPTC
information and the electronic EXIF information.

Is there a picture of the shark in the roof ?
Yes, it's in the Headington section and a montage picture is here
http://www.etrc.ox.ac.uk/sox/images/shark5.jpg

Peter Robinson
Humanities Computing Unit
University of Oxford
13 Banbury Road
Oxford OX2 6NN
United Kingdom
Email: peter.robinson@oucs.ox.ac.uk

27th June 2001
http://www.etrc.ox.ac.uk/sox/index.htm

--[2]------------------------------------------------------------------
        Date: Sat, 7 Jul 2001 16:15:40 -0500 (EST)
        From: "David L. Gants" <dgants@english.uga.edu>
        Subject: seminars on humanities computing

>> From: Frances Condron
>> <frances.condron@computing-services.oxford.ac.uk>

Summer Seminars at Oxford's Humanities Computing Unit
23rd - 27th July 2001
Humanities Computing Unit, University of Oxford
http://www.hcu.ox.ac.uk/summer/

Booking deadline: 6th July 2001

Spaces are still available for some of the seminars on humanities
computing, offered by Oxford University's Humanities Computing Unit:

23rd July
Introduction to Humanities Computing

24th July
Putting your database on the Web (only a few spaces available, but please=
=20
note that this seminar is repeated on the 27th July)
or
Advanced use of the Internet

25th July
Creating and using multimedia resources

26th July
Creating and using digital video

27th July
Putting your database on the Web=20
or
Records to go: cataloguing and using humanities online resources in the
Humbul Humanities Hub

How Much Will It Cost?
Each seminar costs =A365 (=A335 for students). You can book for any
combination of individual seminars.

Interested?
Booking information and further details are available online, at
http://www.hcu.ox.ac.uk/summer/

or contact Jenny Newman, Humanities Computing Unit, OUCS, 13 Banbury
Road, Oxford OX2 6NN. Tel: +44 (0)1865 273221; fax: +44 (0)1865 273275;
email: Jenny.Newman@oucs.ox.ac.uk

--[3]------------------------------------------------------------------
        Date: Sat, 7 Jul 2001 16:17:51 -0500 (EST)
        From: "David L. Gants" <dgants@english.uga.edu>
        Subject: International Cultural Heritage Inforamtics

>> From: "J. Trant" <jtrant@archimuse.com>

Join us for ichim01 !

The International Cultural Heritage Informatics Meeting 2001
"Cultural Heritage and Technologies in the Third Millennium"

Politecnico di Milano, Milan, Italy
3-7 September, 2001

http://www.ichim01.polimi.it/ (Italy) or
http://www.archimuse.com/ichim2001/ (US)

About ichim01
-------------

The International Cultural Heritage Informatics Meeting (ICHIM) is,
traditionally, the best international forum in which to examine the
relationship between technology and cultural heritage. ICHIM has been
held every two years, alternating between North America and Europe.
Following successful meetings at Le Louvre in Paris (97), and
Washington D.C. (99) our host for ichim01 is the Politecnico di
Milano. We expect at least 500 specialists, from museums, cultural
organizations, universities, research institutes, technology
companies and organizations. Please join us!

The Program
-----------
http://www.archimuse.com/ichim2001/overview.html

The ichim01 Program includes a full range of papers, presentations,
panel discussions and tutorials, including:

* keynote addresses by outstanding experts:
        - Maxwell L. Anderson (Director of the New York Whitney
Museum of American Art, USA)
        - Sarah Tyacke (Keeper of the Public Records Office, United Kingdom)
        - Peter Walsh (Chair of the Commonwealth of Massachusetts Art
Commission, USA)

* over 100 papers, by professionals and researchers from 24 countries

* panels featuring leading experts from the US and Europe debating
issues of culture and technology and offering new visions

* over 30 demonstrations of new technologies and applications

* 26 pre-conference tutorials (20 in English and 6 in Italian)
covering a broad spectrum of approaches and state-of- the-art
technologies

* engaging social events, held in the most charming places of Milan
- the City of Art, Fashion, and Design.

Registration
------------
http://www.archimuse.com/ichim2001/registration.html

You can register online, or print out a registration form to return
by mail or fax.

Organization
------------
http://www.archimuse.com/ichim2001/organization.html

ichim01 is organized by Archives & Museum Informatics and the
Politecnico de Milano, with our thanks to the Honorary Committee and
a Program Committee of more than 60 respected professionals from
throughout the world.

Conference Co-Chairs
Paolo Paolini, Politecnico di Milano (Italy)
Jennifer Trant, Archives & Museum Informatics (USA)

Program Co-Chairs
David Bearman, Archives & Museum Informatics (USA)
Franca Garzotto, Politecnico di Milano (Italy)

Sponsors
--------
ichim01 is held under the aegis of the Cultural Heritage Ministry of
Italy, the European Commission (IST Programme), Fondazione CARIPLO,
Camera di Commercio di Milano, Municipality of Milano, the counties
of Genova, Milano, Perugia, Roma, Torino, Venezia, the regions
Abruzzo, Basilicata, Calabria, Emilia Romagna, Lazio, Liguria,
Lombardia, Puglia, Sicilia, Umbria, Val d'Aosta, Veneto, and the
following Milanese museums: Museo alla Scala, Museo Archeologico,
Museo Bagatti Valsecchi, Musei Civici Milanesi, Museo della Scienza e
della Tecnica, Pinacoteca Ambrosiana, Pinacoteca di Brera, Poldi
Pezzoli, Triennale.

-- ________ ichim2001 Milano, Italy Archives & Museum Informatics September 3-7, 2001 2008 Murray Ave, Suite D http://www.archimuse.com/ichim2001/ Pittsburgh, PA 15217 ichim2001@archimuse.com

--[4]------------------------------------------------------------------ Date: Sat, 7 Jul 2001 16:24:38 -0500 (EST) From: "David L. Gants" <dgants@english.uga.edu> Subject: New book on text summarisation by Inderjeet Mani (fwd)

From: Ruslan Mitkov <R.Mitkov@wlv.ac.uk>

****************************************************** BOOK SERIES IN NATURAL LANGUAGE PROCESSING ******************************************************

John Benjamins=92 NLP series (NLP-3) http://www.wlv.ac.uk/~le1825/JB/series.htm

Book series editor Ruslan Mitkov

000000000000000000OOOOOOOOOOOooooooooooooooooooooooOOOOOOOOOOO00000000000= 000 0000

AUTOMATIC SUMMARIZATION

Inderjeet Mani

John Benjamins Pub Co; ISBN: 1588110591 (hardcover), 1588110605 (pape= rback)

=20 000000000000000000OOOOOOOOOOOooooooooooooooooooooooOOOOOOOOOOO00000000000= 000 0000

With the explosion in the quantity of on-line text and multimedia=20 information in recent years, there has been a renewed interest in=20 automatic summarization. This book provides a systematic introduction to=20 the field, explaining basic definitions, the strategies used by human=20 summarizers, and automatic methods that leverage linguistic and=20 statistical knowledge to produce extracts and abstracts. Drawing from a=20 wealth of research in artificial intelligence, natural language=20 processing, and information retrieval, the book also includes detailed=20 assessments of evaluation methods and new topics such as multi-document=20 and multimedia summarization.

Previous automatic summarization books have been either collections of=20 specialized papers, or else authored books with only a chapter or two=20 devoted to the field as a whole. This is the first textbook on the=20 subject, based on teaching materials used in two one-semester=20 courses. To further help the student reader, the book includes detailed=20 case studies, accompanied by end-of-chapter reviews and an extensive=20 glossary.

The book is intended for students and researchers, as well as=20 information technology managers, librarians, and anyone else interested=20 in the subject.

TABLE OF CONTENTS

PREFACE=20

I. PRELIMINARIES 1. Introduction=20 2. Basic Notions for Summarization=20 3. Abstract Architecture for Summarization=20 4. Summarization Approaches=20 5. Current Applications=20 6. Conclusion=20 7. Review=20

II. PROFESSIONAL SUMMARIZING=20 1. Introduction=20 2. The stages of abstracting=20 3. Abstracting Strategies=20 4. Reading for Abstracting=20 5. Revision=20 6. Psychological Experiments=20 7. Structure of Empirical Abstracts=20 8. Conclusion=20 9. Review=20

III. EXTRACTION=20 1. Introduction=20 2. The Edmundsonian Paradigm=20 3. Corpus Based Sentence Extraction=20 3.1 General Considerations=20 3.2 Aspects of Learning Approaches=20 4. Coherence of Extracts=20 5. Conclusion=20 6. Review=20

IV. REVISION=20 1. Introduction=20 2. Shallow Coherence Smoothing=20 3. Full Revision to Improve Informativeness=20 3.1 Case Study: Full Revision=20 3.2 Related Work=20 3.3 Implications=20 4. Text Compaction=20 5. Conclusion=20 6. Review=20

V. DISCOURSE-LEVEL INFORMATION=20 1. Introduction=20 2. Text Cohesion=20 2.1 Introduction=20 2.2 Cohesion Graph Topology=20 2.3 Topic Characterization=20 3. Text Coherence=20 3.1 Introduction=20 3.2 Coherence Relations=20 3.3 Rhetorical Structure Theory (RST)=20 3.4 Rhetorical Structure and Cue Phrases=20 3.5 The Document Scheme, Revisited=20 4. Conclusion=20 5. Review=20

VI. ABSTRACTION=20 1. Introduction=20 2. Abstraction from Templates=20 2.1 Introduction=20 2.2 Case Study: Sketchy Scripts=20 2.3 Modern Information Extraction=20 3. Abstraction by Term Rewriting=20 4. Abstraction using Event Relations=20 5. Abstraction using a Concept Hierarchy=20 5.1. Domain Knowledge Base Activation=20 5.2. Generic Thesaurus Activation=20 6. Synthesis for Abstraction=20 6.1. Pretty printing=20 6.2. Graphical Output=20 6.3. Extraction=20 6.4. Generation for Synthesis=20 7. Conclusion=20 8. Review=20

VII. MULTI-DOCUMENT SUMMARIZATION=20 1. Introduction=20 2. Types of relationships across documents=20 3. MDS methods=20 3.1 Overview=20 3.2 Specific Approaches=20 4. Case Study: Biographical Summarization=20 4.1 Introduction=20 4.2 Example Architecture=20 4.3 Algorithm Steps=20 4.4 Bio Summarizer Components=20 4.5 Assessment=20 5. Conclusion=20 6. Review=20

VIII. MULTIMEDIA SUMMARIZATION=20 1. Introduction=20 2. Dialog Summarization=20 3. Summarization of Video=20 4. Summarization of Diagrams=20 5. Automatic Multimedia Briefing Generation=20 6. Conclusion=20 7. Review=20

IX. EVALUATION=20 1. Introduction=20 2. Intrinsic Methods=20 2.1 Assessing Agreement Between Subjects=20 2.2 Quality=20 2.3 Informativeness=20 2.4 Component-level tests=20 3. Extrinsic Methods=20 3.1 Relevance Assessment=20 3.2 Reading Comprehension=20 3.3 Presentation Strategies=20 3.4 Mature System Evaluation=20 4. Conclusion=20 5. Review=20

X. POSTSCRIPT=20

REFERENCES=20 INDEX

--[5]------------------------------------------------------------------ Date: Sat, 7 Jul 2001 16:26:11 -0500 (EST) From: "David L. Gants" <dgants@english.uga.edu> Subject: Workshop: Digitising newspapers

From: Michael Fraser <mikef@ermine.ox.ac.uk>

ALL THE NEWS THAT'S FIT TO SCAN: INCREASING ACCESS TO HISTORIC NEWSPAPERS

A one-day workshop to be held at Senate House, University of London, 12 July 2001 10.00am-4.00pm, with a wine reception 5.00-7.00pm

The workshop is sponsored by the British Library; the Institute for English Studies; Olive Software; the Office for Humanities Communication, King's College, London; the Refugee Studies Centre, University of Oxford; the Joint Information Systems Committee (JISC). Its aim is to present the results of a unique pilot project in the digitization and delivery of historic newspapers to a wider audience, and to discuss the implications of this remarkable system for the preservation and presentation of similar materials in UK HE, archives and public libraries.

Since January 2001, the British Library Newspaper Library, the Refugee Studies Centre at Oxford University, OCLC, the Malibu Hybrid Library Project at King's College London, and Olive Software have been working together to produce a prototype system for the digitization, indexing, and presentation of historic newspapers from the British Library Newspaper Library collection. 18 reels of microfilm have been scanned, and some 500,000 newspaper articles indexed. Speakers will demonstrate the pilot project and discuss the practical implications of the various technologies.

The workshop is free to all participants, who are also invited to a wine reception afterwards.

See the programme at http://www.uk.olivesoftware.com/conference

Bookings should be made through:

Ms Joanne Nixon Institute of English Studies Room 308 School of Advanced Study University of London Senate House Malet Street London WC1E 7HU +44 (0) 207 862 8675 ies@sas.ac.uk

Next message: Humanist Discussion Group: "Digest of announce from remdigest"
Previous message: Humanist Discussion Group: "15.124 manuscript catalogues and on-line databases (fwd)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b30 : Sat Jul 07 2001 - 17:42:29 EDT