Software

Classification

  • LCIF: Combining Instance and Feature neighbors for Efficient Multi-label Classification (by Len Feremans)

Databases / Query languages

  • Blixem: A LiXQuery engine (by Jeroen Avonts, Pieter Wellens, Wim Le Page)
  • Conqueror: Conjunctive Query Generator (by Wim Le Page)

Data Quality Rules

  • CFD Discovery Algorithms: Implementations for discovering frequent, approximate Conditional Functional Dependencies from csv data (by Joeri Rammelaere)
  • XPlode: The XPlode algorithm discovers a Conditional Functional Dependency based on a given partial repair of a dataset. The returned CFD provides the best explanation for the observed repair (by Joeri Rammelaere)
  • FBIMiner: Forbidden Itemsets are itemsets with a low lift, aiming to capture anomalous co-occurences in data, which in practice are often erroneous. The program further attempts to repair the data, in order to remove all forbidden itemsets (by Joeri Rammelaere)
  • CTane and CFDMiner: Implementations of the CTane and CFDMiner algorithms for discovering Conditional Functional Dependencies.

Frequent Pattern Mining

Interactive Pattern Mining / Efficient Pattern Mining

Pattern Sets / Summarization

Pattern Mining / Sequential Data /Interestingness measures

  • FCI seq: Efficient Discovery of Sets of Co-occurring Items in Event Sequences (by Len Feremans)
  • Mining Closed Strict Episodes and Marbles (by Nikolaj Tatti)
  • SCII: Sequence Classification based on Interesting Itemsets (by Cheng Zhou)
  • SQS: The Long and the Short of It: Summarizing Event Sequences with Serial Episodes (by Nikolaj Tatti, Jilles Vreeken)
  • QCSP: Mining Top-k Quantile-based Cohesive Sequential Patterns (by Len Feremans)