Software

CarottAge

The CarottAge system is based on Hidden Markov Models of second order (HMM2) and provides a non supervised temporal clustering algorithm for data mining and a synthetic representation of temporal and spatial data. CarottAge is well adapted to mine the temporal changes in territories.

ARPEnTAge (Analyse de Régularités dans les Paysages : Environnement, Territoires, Agronomie) is a software also based on stochastic models (HMM2 and Markov Field) for analyzing spatio-temporal data-bases. ARPEnTAge is built on top of the CarottAge system to fully take into account the spatial dimension of input sequences. It performs a Time-Space clustering of a landscape based on its time dynamic Land Uses.

CORON

The Coron platform is a KDD toolkit organized around three main components: (1) Coron-base, (2) AssRuleX, and (3) pre- and post-processing modules. The Coron-base component includes a complete collection of data mining algorithms for extracting itemsets (APriori, Close, Pascal, Eclat, Charm, ZART, Snow, Touch, and Talky-G). AssRuleX generates sets of association rules and rule bases. The Coron system supports the whole lifecycle of a data mining task and proposes modules for data cleaning and size reduction.

LatViz: Visualization of Concept Lattices

LatViz is a tool allowing the construction, the display and the exploration of concept lattices. LatViz introduces various functionalities focusing on interaction with experts, such as visualization of pattern structures for dealing with complex non-binary data, AOC-poset which is composed of the core elements of the lattice, concept annotations, filtering based on various criteria and a visualization of implications…

OrphaMine: Data Mining Platform for Orphan Diseases

The OrphaMine platform enables visualization, data integration and in-depth analytics in the domain of “orphan diseases”, where data is extracted from the OrphaData ontology (http://www.orpha.net/consor/cgi-bin/index.php).
At present, we aim at building a true collaborative portal allowing a general visualization of OrphaData, and the integration of data mining algorithms for improving the general knowledge about rare diseases.

Siren: Interactive and Visual Redescription Mining (Esther Galbrun)

Siren is a tool for interactive mining and visualization of redescriptions. Redescription mining aims to find distinct common characterizations of the same objects and, vice versa, to identify sets of objects that admit multiple shared descriptions. The goal is to provide domain experts with a tool allowing them to tackle their research questions using redescription mining.

FixOut: FaIrness through eXplanations and feature dropOut

This system is based on FI explanations and ensemble approaches to mitigate unintended bias in ML models without compromising their performance.

ANNa: Detecting and solving morphological analogies

The objective of the ANNa project is to provide an online platform to detect, solve, and reason with analogies in various domains, for instance, in NLP, biomedical sciences, as well as in industry. Here, illustrated on morphological tasks.