Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-14T00:13:15.159Z Has data issue: false hasContentIssue false

Using automated planning for improving data mining processes

Published online by Cambridge University Press:  07 February 2013

Susana Fernández
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
Tomás de la Rosa
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
Fernando Fernández
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
Rubén Suárez
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
Javier Ortiz
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
Daniel Borrajo
Affiliation:
Departamento de Informática, Universidad Carlos III de Madrid, Avd. de la Universidad 30, 28911Leganés (Madrid), Spain; e-mail: susana.fernandez@uc3m.es
David Manzano
Affiliation:
Ericsson España, S.A.U, Madrid R&D Center, Technology & Innovation, C/ Vía de los Poblados 13, 28013Madrid, Spain; e-mail: david.manzano.macho@ericsson.com

Abstract

This paper presents a distributed architecture for automating data mining (DM) processes using standard languages. DM is a difficult task that relies on an exploratory and analytic process of processing large quantities of data in order to discover meaningful patterns. The increasing heterogeneity and complexity of available data requires some expert knowledge on how to combine the multiple and alternative DM tasks to process the data. Here, we describe DM tasks in terms of Automated Planning, which allows us to automate the DM knowledge flow construction. The work is based on the use of standards that have been defined in both DM and automated-planning communities. Thus, we use PMML (Predictive Model Markup Language) to describe DM tasks. From the PMML, a problem description in PDDL (Planning Domain Definition Language) can be generated, so any current planning system can be used to generate a plan. This plan is, again, translated to a DM workflow description, Knowledge Flow for Machine Learning format (Knowledge Flow file for the WEKA (Waikato Environment for Knowledge Analysis) tool), so the plan or DM workflow can be executed in WEKA.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2013 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable