Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data

MOCR_AND '11: Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data

September 2011

2011 Proceeding

Conference Chairs:
Lipika Dey
India
,
Venu Govindaraju
USA
,
Daniel Lopresti
USA
,
Prem Natarajan
USA
,
Christoph Ringlstetter
Germany
,
Shourya Roy
India

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

MOCR/AND '11: The Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data Beijing China 17 September 2011

ISBN:

978-1-4503-0685-0

Published:

17 September 2011

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Cited By

Contributors

Lipika Dey
Tata Consultancy Services India
- Publication Years1999 - 2023
- Publication counts67
- Citation count389
- Available for Download33
- Downloads (cumulative)9,905
- Downloads (12 months)422
- Downloads (6 weeks)54
- Average Downloads per Article300
- Average Citation per Article6
View Full Profile
Venu Govindaraju
University at Buffalo, The State University of New York
- Publication Years1989 - 2023
- Publication counts218
- Citation count1,117
- Available for Download20
- Downloads (cumulative)6,804
- Downloads (12 months)390
- Downloads (6 weeks)58
- Average Downloads per Article340
- Average Citation per Article5
View Full Profile
Daniel Lopresti
Lehigh University
- Publication Years1987 - 2023
- Publication counts93
- Citation count398
- Available for Download13
- Downloads (cumulative)4,304
- Downloads (12 months)126
- Downloads (6 weeks)15
- Average Downloads per Article331
- Average Citation per Article4
View Full Profile
Prem Natarajan
Amazon.com, Inc.
- Publication Years2003 - 2023
- Publication counts53
- Citation count321
- Available for Download18
- Downloads (cumulative)5,971
- Downloads (12 months)835
- Downloads (6 weeks)91
- Average Downloads per Article332
- Average Citation per Article6
View Full Profile
Christoph Ringlstetter
Ludwig-Maximilians-University Munich
- Publication Years2003 - 2015
- Publication counts21
- Citation count90
- Available for Download10
- Downloads (cumulative)2,840
- Downloads (12 months)32
- Downloads (6 weeks)9
- Average Downloads per Article284
- Average Citation per Article4
View Full Profile
Shourya Roy
Xerox Research Center India
- Publication Years2007 - 2020
- Publication counts30
- Citation count276
- Available for Download16
- Downloads (cumulative)7,359
- Downloads (12 months)274
- Downloads (6 weeks)26
- Average Downloads per Article460
- Average Citation per Article9
View Full Profile

Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
1. Applied computing

Recommendations

Comments

MOCR-AND

Sections

Proceeding Downloads

New method for the selection of binarization parameters based on noise features of historical documents

A real-world noisy unstructured handwritten notebook corpus for document image analysis research

Acquiring competitive intelligence from social media

Experiments with artificially generated noise for cleansing noisy text

Adapting a WSJ trained part-of-speech tagger to noisy text: preliminary results

Tackling content spamming with a term weighting scheme

Segmenting eBay item descriptions into coherent sections

Recognizing garbage in OCR output on historical documents

Experiences of integration and performance testing of multilingual OCR for printed Indian scripts

Topological features for recognizing printed and handwritten Bangla characters

Script based text identification: a multi-level architecture

Recognition of Tibetan wood block prints with generalized hidden Markov and kernelized modified quadratic distance function

Lampung - a new handwritten character benchmark: database, labeling and recognition

MAST: multi-script annotation toolkit for scenic text

Text level performance evaluation of Indic OCR using split & merge

Unconstrained Bangla online handwriting recognition based on MLP and SVM

Automatic localization of page segmentation errors

Sparsity-based super-resolution for offline handwriting recognition

Cited By

AND '10: Proceedings of the fourth workshop on Analytics for noisy unstructured text data

AND '09: Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data

AND '08: Proceedings of the second workshop on Analytics for noisy unstructured text data

Save to Binder

Sections

Proceeding Downloads

Cited By

Save to Binder

Recommendations

AND '10: Proceedings of the fourth workshop on Analytics for noisy unstructured text data

AND '09: Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data

AND '08: Proceedings of the second workshop on Analytics for noisy unstructured text data