Documentare: auxiliary intelligence for digital content analysis

Documentare is a software library written in Java including unsupervised clustering tools applicable to :
  • content stored in directories,
  • pictures issued from a text detection and a character segmentation process in a digitized document, which can be applied for building OCR reference bases.
Technological core of this library is the distance measurement of similarities between two sequences of bytes, regardless the coded information. This is a universal distance which can be applied to a large variety of content, given a relevant alignment of the code. Associated tools consist of a statistical geometry-based method of detection and segmentation of text in a digitized document and the clustering tools themselves, the descriptions of which can be found in README.md on GitLab. The code is published under GNU General Public License v2.0.

Recent posts / Page 10

  1. Documentare is a software library written in Java including unsupervised clustering tools applicable to : content stored in directories, pictures issued from a text detection and a character segmentation process in a digitized document, which can be applied for building OCR reference bases. Technological core of this library is the distance measurement of similarities between […]

  2. Orange Applications for Business presents pcap2c. This library will convert captured network traffic as pcap files (with Wireshark or tcpdump tool, for example) into C structures to be embedded directly into a C/C++ source code. This small tool depends on the libpcap component. pcap2c is now available on Orange-OpenSource GitHub under the BSD 3-Clause license.

  3. Video Call allows you to make video calls with each of your contacts without the need to download the application or to create an account. Video Call. Source: www.primezone.orange-labs.com You no longer need to ask your contact to install a new application, the video call function works with all the recent smartphones. The application offers two […]

  4. We continue to present the applications taking part in this year’s Prime Zone Cup (page in French). This internal challenge, organized since 2012, each year brings to the public new apps designed and created by employees and partners of the Orange Group. This is the last episode of this year’s series. We finish it with […]

  5. We continue to present the applications taking part in this year’s Prime Zone Cup (page in French). This internal challenge, organized since 2012, each year brings to the public new apps designed and created by employees and partners of the Orange Group. Today we want to introduce you to seven mobile apps related to culture […]

  6. Orange presents a KeePass plugin to synchronize passwords with HashiCorp Vault. The KeePass Vault Sync plugin allows a user to get, in a local KeePass file, the secrets he has access to in an HashiCorp Vault. This plugin allows (for now) readonly access. KeePass Vault Sync was developped at Orange Applications for Business under LGPL-2.1. […]

  7. We continue to present the applications taking part in this year’s Prime Zone Cup (page in French). This internal challenge, organized since 2012, each year brings to the public new apps designed and created by employees and partners of the Orange Group. Today we want to introduce you to four mobile games that will make […]

  8. The time has come to reap the fruits of this year’s Prime Zone Cup (page in French). This internal challenge, organized since 2012, each year brings to the public new applications designed and created by employees and partners of the Orange Group. Today we present six apps that can help you with everyday life tasks […]

  9. pyDCOP is a library implementing many Distributed Constraints OPtimization (DCOP) algorithms. Its goal is to foster academic research on DCOP by providing an easy to use library to help researcher studying and benchmarking DCOP algorithms and building new ones. pyDCOP is use-case agnostic : it can be embedded in other application to implement distributed coordination […]

  10. “Aidevig-Bodyguard” app – screenshot. Orange Labs Prime Zone presents “Aidevig Bodyguard” – an app that allows you to alert who you want with a real phone call! You can also say that all is fine thanks to the ‘green alerts’ or simply indicate a problem with an ‘orange alert’. Women, sportsmen, seniors, travellers, workers, technicians […]