MayoBMI at ImageCLEF 2016 handwritten document retrieval task

Sijia Liu, Yanshan Wang, Saeed Mehrabi, Dingcheng Li, Hongfang Liu

Research output: Contribution to journalConference articlepeer-review


In this working note, we introduce our participation at the ImageCLEF 2016 Handwritten Document Retrieval Task. We mainly focused on hyphenation detection using line images and information retrieval using n-best results. The hyphenation detection step utilizes extracted image features from beginning and end of a line and a binary classifier to determine if a line contains hyphenation. Then the spell correction step is used to eliminate spelling errors from the concatenation of a broken word from the end of a line and the beginning of the next line. The final text retrieval step employs a sufix stripping algorithm to normalize the word tense and form and TF-IDF scheme to rank the retrieved relevant segment results of our submission.

Original languageEnglish (US)
Pages (from-to)347-355
Number of pages9
JournalCEUR Workshop Proceedings
StatePublished - 2016
Event2016 Working Notes of Conference and Labs of the Evaluation Forum, CLEF 2016 - Evora, Portugal
Duration: Sep 5 2016Sep 8 2016


  • Handwriting recognition
  • Hyphenation detection
  • Text retrieval

ASJC Scopus subject areas

  • Computer Science(all)


Dive into the research topics of 'MayoBMI at ImageCLEF 2016 handwritten document retrieval task'. Together they form a unique fingerprint.

Cite this