19.568 new publication: Document Analysis Systems

From: Humanist Discussion Group (by way of Willard McCarty <willard.mccarty_at_kcl.ac.uk>)
Date: Mon, 23 Jan 2006 06:23:41 +0000

               Humanist Discussion Group, Vol. 19, No. 568.
       Centre for Computing in the Humanities, King's College London
                   www.kcl.ac.uk/humanities/cch/humanist/
                        www.princeton.edu/humanist/
                     Submit to: humanist_at_princeton.edu

         Date: Mon, 23 Jan 2006 06:12:52 +0000
         From: Willard McCarty <willard.mccarty_at_kcl.ac.uk>
         Subject: Document Analysis Systems VII

Volume 3872/2006 (Document Analysis Systems VII)
of Lecture Notes in Computer Science is now
available on the SpringerLink web site at
http://springerlink.metapress.com.

This volume contains:

Retrieval from Document Image Collections p. 1
A. Balasubramanian, Million Meshesha, C.V. Jawahar
DOI: 10.1007/11669487_1

A Semi-automatic Adaptive OCR for Digital Libraries p. 13
Sachin Rawat, K.S. Sesh Kumar, Million Meshesha,
Indraneel Deb Sikdar, A. Balasubramanian, C.V. Jawahar
DOI: 10.1007/11669487_2

Contribution to the Discrimination of the
Medieval Manuscript Texts: Application in the Palaeography p. 25
Ikram Moalla, Frank LeBourgeois, Hubert Emptoz, Adel M. Alimi
DOI: 10.1007/11669487_3

Restoring Ink Bleed-Through Degraded Document
Images Using a Recursive Unsupervised Classification Technique p. 38
Drira Fadoua, Frank Le Bourgeois, Hubert Emptoz
DOI: 10.1007/11669487_4

Networked Document Imaging with Normalization and Optimization p. 50
Hirobumi Nishida
DOI: 10.1007/11669487_5

Gray-Scale Thinning Algorithm Using Local Min/Max Operations p. 62
Kyoung Min Kim, Buhm Lee, Nam Sup Choi, Gwan Hee
Kang, Joong Jo Park, Ching Y. Suen
DOI: 10.1007/11669487_6

Automated Scoring of Handwritten Essays Based on
Latent Semantic Analysis p. 71
Sargur Srihari, Jim Collins, Rohini Srihari, Pavithra Babu, Harish Srinivasan
DOI: 10.1007/11669487_7

Aligning Transcripts to Automatically Segmented Handwritten Manuscripts p. 84
Jamie Rothfeder, R. Manmatha, Toni M. Rath
DOI: 10.1007/11669487_8

Virtual Example Synthesis Based on PCA for
Off-Line Handwritten Character Recognition p. 96
Hidetoshi Miyao, Minoru Maruyama
DOI: 10.1007/11669487_9

Extraction of Handwritten Text from Carbon Copy Medical Form Images p. 106
Robert Milewski, Venu Govindaraju
DOI: 10.1007/11669487_10

Document Logical Structure Analysis Based on Perceptive Cycles p. 117
Yves Rangoni, Abdel Belaïd
DOI: 10.1007/11669487_11

A System for Converting PDF Documents into Structured XML Format p. 129
Hervé Déjean, Jean-Luc Meunier
DOI: 10.1007/11669487_12

XCDF: A Canonical and Structured Document Format p. 141
Jean-Luc Bloechle, Maurizio Rigamonti, Karim
Hadjar, Denis Lalanne, Rolf Ingold
DOI: 10.1007/11669487_13

Structural Analysis of Mathematical Formulae with
Verification Based on Formula Description Grammar p. 153
Seiichi Toyota, Seiichi Uchida, Masakazu Suzuki
DOI: 10.1007/11669487_14

Notes on Contemporary Table Recognition p. 164
David W. Embley, Daniel Lopresti, George Nagy
DOI: 10.1007/11669487_15

Handwritten Artefact Identification Method for
Table Interpretation with Little Use of Previous Knowledge p. 176
Luiz Antônio Pereira Neves, João Marques de
Carvalho, Jacques Facon, Flávio Bortolozzi, Sérgio Aparecido Ignácio
DOI: 10.1007/11669487_16

Writer Identification for Smart Meeting Room Systems p. 186
Marcus Liwicki, Andreas Schlapbach, Horst Bunke,
Samy Bengio, Johnny Mariéthoz, Jonas Richiardi
DOI: 10.1007/11669487_17

Extraction and Analysis of Document Examiner
Features from Vector Skeletons of Grapheme ‘th’ p. 196
Vladimir Pervouchine, Graham Leedham
DOI: 10.1007/11669487_18

Segmentation of On-Line Handwritten Japanese Text
Using SVM for Improving Text Recognition p. 208
Bilan Zhu, Junko Tokuno, Masaki Nakagawa
DOI: 10.1007/11669487_19

Application of Bi-gram Driven Chinese Handwritten
Character Segmentation for an Address Reading System p. 220
Yan Jiang, Xiaoqing Ding, Qiang Fu, Zheng Ren
DOI: 10.1007/11669487_20

Language Identification in Degraded and Distorted Document Images p. 232
Shijian Lu, Chew Lim Tan, Weihua Huang
DOI: 10.1007/11669487_21

Bangla/English Script Identification Based on
Analysis of Connected Component Profiles p. 243
Lijun Zhou, Yue Lu, Chew Lim Tan
DOI: 10.1007/11669487_22

Script Identification from Indian Documents p. 255
Gopal Datt Joshi, Saurabh Garg, Jayanthi Sivaswamy
DOI: 10.1007/11669487_23

Finding the Best-Fit Bounding-Boxes p. 268
Bo Yuan, Leong Keong Kwoh, Chew Lim Tan
DOI: 10.1007/11669487_24

Towards Versatile Document Analysis Systems p. 280
Henry S. Baird, Matthew R. Casey
DOI: 10.1007/11669487_25

Exploratory Analysis System for Semi-structured Engineering Logs p. 291
Michael Flaster, Bruce Hillyer, Tin Kam Ho
DOI: 10.1007/11669487_26

Ground Truth for Layout Analysis Performance Evaluation p. 302
A. Antonacopoulos, D. Karatzas, D. Bridson
DOI: 10.1007/11669487_27

On Benchmarking of Invoice Analysis Systems p. 312
Bertin Klein, Stefan Agne, Andreas Dengel
DOI: 10.1007/11669487_28

Semi-automatic Ground Truth Generation for Chart Image Recognition p. 324
Li Yang, Weihua Huang, Chew Lim Tan
DOI: 10.1007/11669487_29

Efficient Word Retrieval by Means of SOM Clustering and PCA p. 336
Simone Marinai, Stefano Faini, Emanuele Marino, Giovanni Soda
DOI: 10.1007/11669487_30

The Effects of OCR Error on the Extraction of Private Information p. 348
Kazem Taghva, Russell Beckley, Jeffrey Coombs
DOI: 10.1007/11669487_31

Combining Multiple Classifiers for Faster Optical
Character Recognition p. 358
Kumar Chellapilla, Michael Shilman, Patrice Simard
DOI: 10.1007/11669487_32

Performance Comparison of Six Algorithms for Page Segmentation p. 368
Faisal Shafait, Daniel Keysers, Thomas M. Breuel
DOI: 10.1007/11669487_33

HVS Inspired System for Script Identification in
Indian Multi-script Documents p. 380
Peeta Basa Pati, A.G. Ramakrishnan
DOI: 10.1007/11669487_34

A Shared Fragments Analysis System for Large Collections of Web Pages p. 390
Junchang Ma, Zhimin Gu
DOI: 10.1007/11669487_35

Offline Handwritten Arabic Character Segmentation
with Probabilistic Model p. 402
Pingping Xiu, Liangrui Peng, Xiaoqing Ding, Hua Wang
DOI: 10.1007/11669487_36

Automatic Keyword Extraction from Historical Document Images p. 413
Kengo Terasawa, Takeshi Nagasaki, Toshio Kawashima
DOI: 10.1007/11669487_37

Digitizing a Million Books: Challenges for Document Analysis p. 425
K. Pramod Sankar, Vamshi Ambati, Lakshmi Pratha, C.V. Jawahar
DOI: 10.1007/11669487_38

Toward File Consolidation by Document Categorization p. 437
Abdel Belaïd, André Alusse
DOI: 10.1007/11669487_39

Finding Hidden Semantics of Text Tables p. 449
Saleh A. Alrashed
DOI: 10.1007/11669487_40

Reconstruction of Orthogonal Polygonal Lines p. 462
Alexander Gribov, Eugene Bodansky
DOI: 10.1007/11669487_41

A Multiclass Classification Framework for Document Categorization p. 474
Qi Qiang, Qinming He
DOI: 10.1007/11669487_42

The Restoration of Camera Documents Through Image Segmentation p. 484
Shijian Lu, Chew Lim Tan
DOI: 10.1007/11669487_43

Cut Digits Classification with k-NN Multi-specialist p. 496
Fernando Boto, Andoni Cortés, Clemente Rodríguez
DOI: 10.1007/11669487_44

The Impact of OCR Accuracy and Feature
Transformation on Automatic Text Classification p. 506
Mayo Murata, Lazaro S.P. Busagala, Wataru Ohyama,
Tetsushi Wakabayashi, Fumitaka Kimura
DOI: 10.1007/11669487_45

A Method for Symbol Spotting in Graphical Documents p. 518
Daniel Zuwala, Salvatore Tabbone
DOI: 10.1007/11669487_46

Groove Extraction of Phonographic Records p. 529
Sylvain Stotzer, Ottar Johnsen, Frédéric Bapst, Rolf Ingold
DOI: 10.1007/11669487_47

Use of Affine Invariants in Locally Likely
Arrangement Hashing for Camera-Based Document Image Retrieval p. 541
Tomohiro Nakai, Koichi Kise, Masakazu Iwamura
DOI: 10.1007/11669487_48

Robust Chinese Character Recognition by Selection
of Binary-Based and Grayscale-Based Classifier p. 553
Yoshinobu Hotta, Jun Sun, Yutaka Katsuyama, Satoshi Naoi
DOI: 10.1007/11669487_49

Segmentation-Driven Recognition Applied to
Numerical Field Extraction from Handwritten Incoming Mail Documents p. 564
Clément Chatelain, Laurent Heutte, Thierry Paquet
DOI: 10.1007/11669487_50

Performance Evaluation of Text Detection and Tracking in Video p. 576
Vasant Manohar, Padmanabhan Soundararajan,
Matthew Boonstra, Harish Raju, Dmitry Goldgof,
Rangachar Kasturi, John Garofolo
DOI: 10.1007/11669487_51

Document Analysis System for Automating Workflows p. 588
Steven J. Simske, Jordi Arnabat
DOI: 10.1007/11669487_52

Automatic Assembling of Cadastral Maps Based on
Generalized Hough Transformation p. 593
Fei Liu, Wataru Ohyama, Tetsushi Wakabayashi, Fumitaka Kimura
DOI: 10.1007/11669487_53

A Few Steps Towards On-the-Fly Symbol Recognition
with Relevance Feedback p. 604
Jan Rendek, Bart Lamiroy, Karl Tombre
DOI: 10.1007/11669487_54

The Fuzzy-Spatial Descriptor for the Online
Graphic Recognition: Overlapping Matrix Algorithm p. 616
Noorazrin Zakaria, Jean-Marc Ogier, Josep Llados
DOI: 10.1007/11669487_55

Dr Willard McCarty | Reader in Humanities
Computing | Centre for Computing in the
Humanities | King's College London | Kay House, 7
Arundel Street | London WC2R 3DX | U.K. | +44
(0)20 7848-2784 fax: -2980 ||
willard.mccarty_at_kcl.ac.uk www.kcl.ac.uk/humanities/cch/wlm/
Received on Mon Jan 23 2006 - 01:59:00 EST