A generic tool to recognize the logical structure of documents from an OCR stream
Anyone who encodes electronic documents from original source
material and needs to account for the hierarchical
structure represented in the digitized document.
The system recognizes the logical structure of documents from an OCR
stream, guided by the description given in a model (DTD, XML Schema). The result is a hierarchically structured stream. The model captures knowledge of both the macro-structure of the documents and the micro-structure of their content.
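As a rough illustration of the idea, and not the tool's actual implementation, the Python sketch below turns a flat list of OCR lines into a nested XML tree. The `MODEL` table is a hypothetical stand-in for a real DTD or XML Schema, and the element names, line patterns, and the `structure` function are all assumptions made for this example. It covers only the macro-structure (chapters and sections), not the micro-structure of the content.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical stand-in for a DTD/XML Schema:
# (element name, nesting depth, pattern that marks the start of that element).
MODEL = [
    ("chapter", 1, re.compile(r"^CHAPTER\s+\d+")),
    ("section", 2, re.compile(r"^\d+\.\d+\s+")),
]


def structure(lines):
    """Build a hierarchical XML tree from flat OCR text lines."""
    root = ET.Element("document")
    stack = [(0, root)]  # (depth, element) for each currently open container
    for line in lines:
        text = line.strip()
        if not text:
            continue  # skip blank lines in the OCR output
        for name, depth, pattern in MODEL:
            if pattern.match(text):
                # Close every open container at the same or a deeper level.
                while stack[-1][0] >= depth:
                    stack.pop()
                elem = ET.SubElement(stack[-1][1], name)
                ET.SubElement(elem, "title").text = text
                stack.append((depth, elem))
                break
        else:
            # No heading pattern matched: treat the line as body text.
            ET.SubElement(stack[-1][1], "p").text = text
    return root


ocr_lines = [
    "CHAPTER 1 Introduction",
    "Some opening text.",
    "1.1 Scope",
    "Details of the scope.",
    "CHAPTER 2 Method",
    "More text.",
]
tree = structure(ocr_lines)
print(ET.tostring(tree, encoding="unicode"))
```

Here the second chapter heading closes the open section and the first chapter before a new `chapter` element is attached to the root, which is how the flat stream becomes a hierarchy.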
Any POSIX-compliant system