TY - JOUR
T1 - Hack me if you can
T2 - Aggregating autoencoders for countering persistent access threats within highly imbalanced data
AU - Benabderrahmane, Sidahmed
AU - Hoang, Ngoc
AU - Valtchev, Petko
AU - Cheney, James
AU - Rahwan, Talal
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2024/11
Y1 - 2024/11
N2 - Advanced Persistent Threats (APTs) are sophisticated, targeted cyberattacks designed to gain unauthorized access to systems and remain undetected for extended periods. To evade detection, APT cyberattacks deceive defense layers with breaches and exploits, thereby complicating exposure by traditional anomaly detection-based security methods. The challenge of detecting APTs with machine learning is compounded by the rarity of relevant datasets and the significant imbalance in the data, which makes the detection process highly burdensome. We present AE-APT, a deep learning-based tool for APT detection that features a family of AutoEncoder methods ranging from a basic one to a Transformer-based one. We evaluated our tool on a suite of provenance trace databases produced by the DARPA Transparent Computing program, where APT-like attacks constitute as little as 0.004% of the data. The datasets span multiple operating systems, including Android, Linux, BSD, and Windows, and cover two attack scenarios. The outcomes showed that AE-APT has significantly higher detection rates compared to its competitors, indicating superior performance in detecting and ranking anomalies. Data and code: https://github.com/ae-apt/AE-APT.
AB - Advanced Persistent Threats (APTs) are sophisticated, targeted cyberattacks designed to gain unauthorized access to systems and remain undetected for extended periods. To evade detection, APT cyberattacks deceive defense layers with breaches and exploits, thereby complicating exposure by traditional anomaly detection-based security methods. The challenge of detecting APTs with machine learning is compounded by the rarity of relevant datasets and the significant imbalance in the data, which makes the detection process highly burdensome. We present AE-APT, a deep learning-based tool for APT detection that features a family of AutoEncoder methods ranging from a basic one to a Transformer-based one. We evaluated our tool on a suite of provenance trace databases produced by the DARPA Transparent Computing program, where APT-like attacks constitute as little as 0.004% of the data. The datasets span multiple operating systems, including Android, Linux, BSD, and Windows, and cover two attack scenarios. The outcomes showed that AE-APT has significantly higher detection rates compared to its competitors, indicating superior performance in detecting and ranking anomalies. Data and code: https://github.com/ae-apt/AE-APT.
KW - Advanced persistent threats
KW - Anomaly detection
KW - Attention mechanism
KW - Cyber-security
KW - Deep learning
KW - Transformers
UR - http://www.scopus.com/inward/record.url?scp=85197781615&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85197781615&partnerID=8YFLogxK
U2 - 10.1016/j.future.2024.06.050
DO - 10.1016/j.future.2024.06.050
M3 - Article
AN - SCOPUS:85197781615
SN - 0167-739X
VL - 160
SP - 926
EP - 941
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -