Documentare: auxiliary intelligence for digital content analysis
Documentare is a software library written in Java including unsupervised clustering tools applicable to : content stored in directories, pictures issued from a text detection and a character segmentation process in a digitized document, which can be applied for building OCR reference bases. Technological core of this library is the distance measurement of similarities between […]