Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-08T06:08:32.690Z Has data issue: false hasContentIssue false

Toward an integrated knowledge discovery and data mining process model

Published online by Cambridge University Press:  01 March 2010

Sumana Sharma*
Affiliation:
Department of Information Systems, the Information Systems Research Institute, Virginia Commonwealth University, Richmond, VA 23284, USA
Kweku-Muata Osei-Bryson*
Affiliation:
Department of Information Systems, the Information Systems Research Institute, Virginia Commonwealth University, Richmond, VA 23284, USA

Abstract

The knowledge discovery and data mining (KDDM) process models describe the various phases (e.g. business understanding, data understanding, data preparation, modeling, evaluation and deployment) of the KDDM process. They act as a roadmap for implementation of the KDDM process by presenting a list of tasks for executing the various phases. The checklist approach of describing the tasks is not adequately supported by appropriate tools, which specify ‘how’ the particular task can be implemented. This may result in tasks not being implemented. Another disadvantage is that the long checklist does not capture or leverage the dependencies that exist among the various tasks of the same and different phases. This not only makes the process cumbersome to implement, but also hinders possibilities for semi-automation of certain tasks. Given that each task in the process model serves an important goal and even affects the execution of related tasks due to the dependencies, these limitations are likely to negatively affect the efficiency and effectiveness of KDDM projects. This paper proposes an improved KDDM process model that overcomes these shortcomings by prescribing tools for supporting each task as well as identifying and leveraging dependencies among tasks for semi-automation of tasks, wherever possible.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2010

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable