Hostname: page-component-77f85d65b8-v2srd Total loading time: 0 Render date: 2026-04-20T05:47:49.021Z Has data issue: false hasContentIssue false

Large-scale data analysis on aviation accident database using different data mining techniques

Published online by Cambridge University Press:  22 November 2016

A.B. Arockia Christopher*
Affiliation:
Department of Information Technology, VSB Engineering College, Karur, Tamil Nadu, India
V. Shunmughavel Vivekanandam*
Affiliation:
Department of Computer Science and Engineering, VSB Engineering College, Karur, Tamil Nadu, India
A.B. Antony Anderson*
Affiliation:
Department of Computer Science and Engineering, PVP College of Technology and Engineering for Women, Dindigul, Tamil Nadu, India
S. Markkandeyan*
Affiliation:
Department of IT, RVS College of Engineering, Dindigul, Tamil Nadu, India
V. Sivakumar*
Affiliation:
Department of Electrical and Electronics Engineering, VSB Engineering College, Karur, Tamil Nadu, India

Abstract

Data mining is an iterative process in which progress is defined by discovery through either automatic or manual methods. A data cleaning procedure is proposed to improve the quality of classification tasks in the knowledge discovery process by taking into account both redundant and conflicting data. The redundancy check is performed on the original dataset and the resultant dataset is preserved. This resultant dataset is then checked for conflicting data and, if any are found, they are corrected and updated on the original aircraft dataset. This updated dataset is then classified using a variety of classifiers such as Bayes, functions, lazy, MISC, rules and decision trees. The performance of the updated datasets on these classifiers is examine, and the result shows a significant improvement in the classification accuracy after redundancy and conflicts are removed. The conflicts after correction are updated in the original dataset, and when the performance of the classifier is evaluated, great improvement is observed. This paper aims to address how data mining techniques can be used to understand complex system accidents in the aviation domain. Decision trees are considered to be the one of the most powerful and popular approaches in knowledge discovery and data mining. The objective is to develop a classification model for aviation risk investigation and reduction using a decision tree induction method that enhances the ability to form decision trees and thereby proves that the classification accuracy of decision trees is greater. Different feature selectors are used in this study in order to reduce the number of initial attributes.

Information

Type
Research Article
Copyright
Copyright © Royal Aeronautical Society 2016 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable