TY - JOUR
T1 - Driver Distraction Classification Using Deep Convolutional Autoencoder and Ensemble Learning
AU - Muthuswamy, Anirudh
AU - Dewan, M. Ali Akber
AU - Murshed, Mahbub
AU - Parmar, Dharamjit
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - The study of real-time classification for driver distraction provides new insights into the understanding of behavioral and cognitive reasons behind it. Among various approaches, deep learning models show better performance and can be utilized for a real-time classification system. However, deep learning models suffer from generalization performance. Ensemble learning, on the other hand, can combine numerous classification models to improve generalization. Hence, a deep ensemble learning model would be advantageous to improve the overall performance of driver distraction classification. Approaches using ensemble learning are relatively scarce in this research field. This paper proposes an ensemble framework where a novel dynamic ensemble learning is used to classify driver distraction based on autoencoders and a set of popular convolutional neural network (CNN) architectures. The framework has also been tested on two other ensemble techniques, namely average-weighted-ensemble and grid-search-ensemble, for driver distraction classification. Our ensemble framework uses VGG-19, ResNet-50, and DenseNet-121 CNN models with the pre-trained weights from the ImageNet dataset for the ensemble network. Hyperparameter tuning on the classification heads of the baseline models is performed to get the most optimum performance. We used two open-source datasets, the State Farm Driver Distraction Dataset (SF3D) and the Multimodal Multiview and Multispectral Driver Action Dataset (3MDAD), with a combined size of over 80,000 images consisting of 10 categories of driver distractions that allowed increased reliability of the ensemble model in different situations and lighting conditions. The models were fine-tuned using transfer learning techniques. The experimental results showed that the accuracy of the average weighted ensemble, grid search ensemble, and the dynamic ensemble were 88.91%, 89.04%, and 89.13%, respectively, which is higher than the individual baseline models.
AB - The study of real-time classification for driver distraction provides new insights into the understanding of behavioral and cognitive reasons behind it. Among various approaches, deep learning models show better performance and can be utilized for a real-time classification system. However, deep learning models suffer from generalization performance. Ensemble learning, on the other hand, can combine numerous classification models to improve generalization. Hence, a deep ensemble learning model would be advantageous to improve the overall performance of driver distraction classification. Approaches using ensemble learning are relatively scarce in this research field. This paper proposes an ensemble framework where a novel dynamic ensemble learning is used to classify driver distraction based on autoencoders and a set of popular convolutional neural network (CNN) architectures. The framework has also been tested on two other ensemble techniques, namely average-weighted-ensemble and grid-search-ensemble, for driver distraction classification. Our ensemble framework uses VGG-19, ResNet-50, and DenseNet-121 CNN models with the pre-trained weights from the ImageNet dataset for the ensemble network. Hyperparameter tuning on the classification heads of the baseline models is performed to get the most optimum performance. We used two open-source datasets, the State Farm Driver Distraction Dataset (SF3D) and the Multimodal Multiview and Multispectral Driver Action Dataset (3MDAD), with a combined size of over 80,000 images consisting of 10 categories of driver distractions that allowed increased reliability of the ensemble model in different situations and lighting conditions. The models were fine-tuned using transfer learning techniques. The experimental results showed that the accuracy of the average weighted ensemble, grid search ensemble, and the dynamic ensemble were 88.91%, 89.04%, and 89.13%, respectively, which is higher than the individual baseline models.
KW - autoencoders
KW - Behavioral sciences
KW - CNNs
KW - Computational modeling
KW - Deep learning
KW - driver distraction
KW - dynamic ensemble classification
KW - Feature extraction
KW - Roads
KW - Vehicle dynamics
KW - Vehicles
UR - http://www.scopus.com/inward/record.url?scp=85164402153&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2023.3293110
DO - 10.1109/ACCESS.2023.3293110
M3 - Journal Article
AN - SCOPUS:85164402153
VL - 11
SP - 71435
EP - 71448
JO - IEEE Access
JF - IEEE Access
ER -