Data Mining, Machine Intelligence and Inductive Programming


Bella-Sanjuán,Antonio email:
Castillo-Andreu, Hèctor email:
Contreras Ochando, Lidia email:
Estruch-Gregori, Vicent email:
Ferri-Ramírez, Cèsar email:
Hernández-Orallo, José email:
Insa-Cabrera, Javier email:
Martínez-Usó, Adolfo email:
Martínez-Plumed, Fernando email:
Ramírez-Quintana, M.José email:

a subgroup of  Extensions of Logic Programming Group.







The DMIP team began in 1997, and consolidated as a group in 2000, with the initial goal of extending ILP to other declarative languages. Since then, the group has been acquiring a broader view of the field, exploring different techniques and applications, focussing on the evaluation of machine learning and machine intelligence systems, so spanning in many areas of machine learning, data mining and artificial intelligence.

Currently, the research areas are:
  • Machine Learning and Data Mining
  • Knowledge Discovery
  • Multi-paradigm Inductive Programming
  • ROC analysis, cost-sensitive learning and model evaluation for decision support
  • Agreement Technologies.
  • Agent Intelligence Evaluation.
  • MML induction and Solomonoff prediction.
  • Probabilistic (inductive) programming.
  • Inductive Debugging.
  • Universal Psychometrics.
Nonetheless, the primary interest is still the learning of comprehensible or declarative models from data and the understanding of system performance.

For a more comprehensive account of the team's activities and projects, you can take a look a this presentation (as for 2009) .

Learning Systems and other software:

We have developed three learning systems:
  • The FLIP system (1998-2001, click here for downloading the software and for more information): implements a framework for the Induction of Functional Logic Programs (IFLP) from facts. This can be seen as an extension to the now consolidated field of Inductive Logic Programming (ILP). Inspired in the inverse resolution operator of ILP, the system is based on the reversal of narrowing, the more usual operational mechanism for Functional Logic Programming. The main advantages of the FLIP system over the most used ILP systems are a natural handling of functions, without the use of mode or determinism declarations, and its power for inducing short recursive programs. Its applications are mainly program synthesis, program debugging and data mining of small highly structured documents.
  • The SMILES system (2001-2002, click here for downloading the software and for more information): a machine learning system that integrates many different features from other machine learning techniques and paradigms and, more importantly, it presents several innovations in almost all of these features. In particular, SMILES extends classical decision tree learners in many ways (new splitting criteria, non-greedy search, new partitions, extraction of several and different solutions), it has an anytime handling of resources, and has a sophisticated and quite effective handling of costs. In this way, SMILES combines and improves the recent interest in hypotheses combination (e.g. boosting) and cost-sensitive learning (a priori and a posteriori class assignments, ROC analysis) outperforming previous systems in many situations. Its applications are basically data-mining and any other machine learning task where decision trees could be useful.
  • The DBDT system (2004-2010, click here for downloading the software and for more information): is a machine learning algorithm that integrates decision tree learning and center splitting. Roughly speaking, the inferred classifer can be viewed as a tree of attribute prototypes (The value distribution of an attribute is represented by a set of prototypes.). An instance is linked to one prototype or other depending on its proximity.
  • A machine intelligence testing framework system (2010-2012), click here for downloading the software. For more information, see the project Anytime Universal Intelligence .
  • A RLGGP system (2011-2012), a system that integrates reinforcement learning and General Game Playing, click here for more information.
  • A MML-LPP-Cost system (2011-2012), a system for coding logic programs with probabilities using MML, click here for more information.
  • Newton Trees, stochastic distance-based decision trees (2010-2012), click here for more information.
  • gErl, a general learning system (2012-), click here for more information.
  • AirVLC: An application for producing real-time urban air pollution forecasts for the city of Valencia in Spain AirVLC (2015-).
  • Multidimensional: With this software, multidimensional data is systematically analysed at multiple granularities by applying aggregate and disaggregate operators (e.g., by the use of OLAP tools). We show how these strategies behave when the resolution context changes, using several machine learning techniques in four application domains (link)(2015)
  • Coverage Graphs : Knowledge acquisition with forgetting: an incremental and developmental rule-based setting (Expermiments & Code in GitHub). (2014-2016)
  • IRT in machine learning : Making sense of IRT in machine learning (Expermiments & Code in GitHub). (2016)

Some projects:

The DMIP group participates in many of the projects of the Extensions of Logic Programming Group.

The most important active project of the group is REFRAME: Rethinking the Essence, Flexibility and Reusability of Advanced Model Exploitation.

In addition, it has participated in some other important projects: Agreement Technologies, Anytime Universal Intelligence.





© 1999-2016 José Hernández Orallo, Cèsar Ferri Ramírez.