TY - GEN
T1 - Deep Learning-Driven Automated Data Generation for Enhanced Anomaly Detection in Cybersecurity
AU - Gough, Michael
AU - Ali Akber Dewan, M.
AU - Wang, Harris
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In anomaly detection mechanisms for cybersecurity, researchers heavily relay on accurate and comprehensive datasets. However, researchers face persistent challenges, including incomplete data, outdated datasets, and the absence of datasets tailored to specific, contemporary scenarios. These limitations impede the development of robust anomaly detection models and exacerbate the resource-intensive nature of data preparation and analysis. Widely used datasets, such as KDD Cup, are no longer sufficient due to their age and limited applicability to modern cybersecurity environments. To address these challenges, this paper proposes a novel hybrid approach combining automation and deep learning to automate synthetic data generation. Our methodology uses contemporary testing frameworks to ensure data validity and aligns with enterprise-specific requirements. By combining automated workflows with advanced deep learning models, this framework facilitates the creation of synthetic datasets that offer enhanced domain coverage and accuracy. Furthermore, it significantly reduces computational overhead and preparation time. This research contributes to the modernization of security data generation, offering a scalable and efficient solution to enhance anomaly detection mechanisms within the domain of cybersecurity.
AB - In anomaly detection mechanisms for cybersecurity, researchers heavily relay on accurate and comprehensive datasets. However, researchers face persistent challenges, including incomplete data, outdated datasets, and the absence of datasets tailored to specific, contemporary scenarios. These limitations impede the development of robust anomaly detection models and exacerbate the resource-intensive nature of data preparation and analysis. Widely used datasets, such as KDD Cup, are no longer sufficient due to their age and limited applicability to modern cybersecurity environments. To address these challenges, this paper proposes a novel hybrid approach combining automation and deep learning to automate synthetic data generation. Our methodology uses contemporary testing frameworks to ensure data validity and aligns with enterprise-specific requirements. By combining automated workflows with advanced deep learning models, this framework facilitates the creation of synthetic datasets that offer enhanced domain coverage and accuracy. Furthermore, it significantly reduces computational overhead and preparation time. This research contributes to the modernization of security data generation, offering a scalable and efficient solution to enhance anomaly detection mechanisms within the domain of cybersecurity.
KW - Anomaly detection
KW - application security
KW - deep learning
KW - software
KW - synthetic data generation
KW - systems security
UR - https://www.scopus.com/pages/publications/105016708176
U2 - 10.1109/ICSC65596.2025.11140556
DO - 10.1109/ICSC65596.2025.11140556
M3 - Published Conference contribution
AN - SCOPUS:105016708176
T3 - 2025 5th Intelligent Cybersecurity Conference, ICSC 2025
SP - 137
EP - 146
BT - 2025 5th Intelligent Cybersecurity Conference, ICSC 2025
A2 - Alsmirat, Mohammad
A2 - Alkhabbas, Fahed
A2 - Al-Abdullah, Muhammad
A2 - Jararweh, Yaser
T2 - 5th Intelligent Cybersecurity Conference, ICSC 2025
Y2 - 19 May 2025 through 22 May 2025
ER -