A Preliminary Study on Learning Challenges in Machine Learning-based Flight Delay Prediction

Authors

  • Ismail Babajide Mustapha UTM Big Data Centre, Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia
  • Siti Mariyam Shamsuddin UTM Big Data Centre, Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia
  • Shafaatunnur Hasan UTM Big Data Centre, Universiti Teknologi Malaysia 81310 UTM Johor Bahru, Johor, Malaysia

DOI:

https://doi.org/10.11113/ijic.v9n1.204

Keywords:

Flight delay prediction, class imbalance, machine learning, dimensionality reduction, class overlapping

Abstract

Machine learning based flight delay prediction is one of the numerous real-life application domains where the problem of imbalance in class distribution is reported to affect the performance of learning algorithms. However, the fact that learning algorithms have been reported to perform well on some class imbalance problems posits the possibility of other contributing factors. In this study, we visually explore air traffic data after dimensionality reduction with t-Distributed Stochastic Neighbour Embedding. Our initial findings suggest a high degree of overlapping between the delayed and on-time class instances which can be a greater problem for learning algorithms than class imbalance.

Downloads

Published

2019-05-31

How to Cite

Mustapha, I. B., Shamsuddin, S. M., & Hasan, S. (2019). A Preliminary Study on Learning Challenges in Machine Learning-based Flight Delay Prediction. International Journal of Innovative Computing, 9(1). https://doi.org/10.11113/ijic.v9n1.204

Issue

Section

Computer Science