Search CORE

1,721,370 research outputs found

Evading botnet detectors based on flows and random forest with adversarial samples

Author: Apruzzese G.
Giovanni Apruzzese
Colajanni M.
Michele Colajanni
Publication venue
Publication date: 01/01/2018
Field of study

Machine learning is increasingly adopted for a wide array of applications, due to its promising results and autonomous capabilities. However, recent research efforts have shown that, especially within the image processing field, these novel techniques are susceptible to adversarial perturbations. In this paper, we present an analysis that highlights and evaluates experimentally the fragility of network intrusion detection systems based on machine learning algorithms against adversarial attacks. In particular, our study involves a random forest classifier that utilizes network flows to distinguish between botnet and benign samples. Our results, derived from experiments performed on a public real dataset of labelled network flows, show that attackers can easily evade such defensive mechanisms by applying slight and targeted modifications to the network activity generated by their controlled bots. These findings pave the way for future techniques that aim to strengthen the performance of machine learning-based network intrusion detection systems

Crossref

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

The Cross-evaluation of Machine Learning-based Network Intrusion Detection Systems

Author: Conti M.
Pajola L.
Apruzzese G.
Publication venue
Publication date: 01/01/2022
Field of study

Archivio istituzionale della ricerca - Università di Padova

Density fluctuations induced by MARFE on FTU

Author: Apruzzese G.
Pucella G.
Zeng L.
Publication venue
Publication date: 01/01/2013
Field of study

High density operations on FTU (Frascati Tokamak Upgade) are affected by strong density oscillations due to the so-called MARFE (Multifaceted Asymmetric Radiation From the Edge). An analysis of these oscillations is reported here. The MARFE on FTU occurs at the value of p = 0.48 (p is ratio between line-averaged density with averaged current density in the plasma); at the MARFE onset, the volume averaged density versus central density is almost constant, corresponding to the value of 0.8. The MARFE is observed to move up and down along the poloidal circumference on FTU. The MARFE movement causes continuous density fluctuations. Amplitude of density fluctuations decreases with edge safety factor and with electron density. The mechanism to cause MARFE movement is discussed. If amplitude exceeds a certain threshold, it may cause poloidal heat flux asymmetry between the edges of the MARFE, which will make the MARFE moving along the poloidal circumference. © 2013 Elsevier B.V. All rights reserved

ENEA-IRIS Open Archive

Beyond the west: Revealing and bridging the gap between Western and Chinese phishing website detection

Author: Conti M.
Yuan Y.
Apruzzese G.
Publication venue
Publication date: 01/01/2025
Field of study

Phishing attacks are on the rise, and phishing websites are everywhere, denoting the brittleness of security mechanisms reliant on blocklists. To cope with this threat, many works proposed to enhance Phishing Website Detectors (PWD) with data-driven techniques powered by Machine Learning (ML). Despite achieving promising results both in research and practice, existing solutions mostly focus "on the West", e.g., they consider websites in English, German, or Italian. In contrast, phishing websites targeting "Eastern"countries, such as China, have been mostly neglected-despite phishing being rampant also in this side of the world. In this paper, we scrutinize whether current PWD can simultaneously work against Western and Chinese phishing websites. First, after highlighting the difficulties of practically testing PWD on Chinese phishing websites, we create ChiPhish-a dataset which enables assessment of PWD on Chinese websites. Then, we evaluate 72 PWD developed by industry practitioners and 10 ML-based PWD proposed in recent research on Western and Chinese websites: our results highlight that existing solutions, despite achieving low false positive rates, exhibit unacceptably low detection rates (sometimes inferior to 1%) on phishing websites of different regions. Next, to bridge the gap we brought to light, we elucidate the differences between Western and Chinese websites, and devise an enhanced feature set that accounts for the unique characteristics of Chinese websites. We empirically demonstrate the effectiveness of our proposed feature set by replicating (and testing) state-ofthe-art ML-PWD: our results show a small but statistically significant improvement over the baselines. Finally, we review all our previous contributions and combine them to develop practical PWD that simultaneously work on Chinese and Western websites, achieving over 0.98 detection rate while maintaining only 0.01 false positive rate in a cross-regional setting. We openly release all our tools, disclose all our benchmark results, and also perform proof-of-concept experiments revealing that the problem tackled by our paper extends to other "Eastern"countries that have been overlooked by prior research on PWD

Archivio istituzionale della ricerca - Università di Padova

Hardening Random Forest Cyber Detectors Against Adversarial Attacks

Author: Andreolini M.
Apruzzese G.
Marchetti M.
Colajanni M.
Publication venue
Publication date: 01/01/2020
Field of study

Machine learning algorithms are effective in several applications, but they are not as much successful when applied to intrusion detection in cyber security. Due to the high sensitivity to their training data, cyber detectors based on machine learning are vulnerable to targeted adversarial attacks that involve the perturbation of initial samples. Existing defenses assume unrealistic scenarios; their results are underwhelming in non-adversarial settings; or they can be applied only to machine learning algorithms that perform poorly for cyber security. We present an original methodology for countering adversarial perturbations targeting intrusion detection systems based on random forests. As a practical application, we integrate the proposed defense method in a cyber detector analyzing network traffic. The experimental results on millions of labelled network flows show that the new detector has a twofold value: it outperforms state-of-the-art detectors that are subject to adversarial attacks; it exhibits robust results both in adversarial and non-adversarial scenarios

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems

Author: Andreolini M.
Apruzzese G.
Venturi A.
Marchetti M.
Colajanni M.
Publication venue
Publication date: 01/01/2021
Field of study

We present the first dataset that aims to serve as a benchmark to validate the resilience of botnet detectors against adversarial attacks. This dataset includes realistic adversarial samples that are generated by leveraging two widely used Deep Reinforcement Learning (DRL) techniques. These adversarial samples are proved to evade state of the art detectors based on Machine- and Deep-Learning algorithms. The initial corpus of malicious samples consists of network flows belonging to different botnet families presented in three public datasets containing real enterprise network traffic. We use these datasets to devise detectors capable of achieving state-of-the-art performance. We then train two DRL agents, based on Double Deep Q-Network and Deep Sarsa, to generate realistic adversarial samples: the goal is achieving misclassifications by performing small modifications to the initial malicious samples. These alterations involve the features that can be more realistically altered by an expert attacker, and do not compromise the underlying malicious logic of the original samples. Our dataset represents an important contribution to the cybersecurity research community as it is the first including thousands of automatically generated adversarial samples that are able to thwart state of the art classifiers with a high evasion rate. The adversarial samples are grouped by malware variant and provided in a CSV file format. Researchers can validate their defensive proposals by testing their detectors against the adversarial samples of the proposed dataset. Moreover, the analysis of these samples can pave the way to a deeper comprehension of adversarial attacks and to some sort of explainability of machine learning defensive algorithms. They can also support the definition of novel effective defensive techniques

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

On the effectiveness of machine and deep learning for cyber security

Author: Ferretti L.
Apruzzese G.
Guido A.
Marchetti M.
Colajanni M.
Publication venue
Publication date: 01/01/2018
Field of study

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Deep Reinforcement Adversarial Learning against Botnet Evasion Attacks

Author: Andreolini M.
Apruzzese G.
Venturi A.
Marchetti M.
Colajanni M.
Publication venue
Publication date: 01/01/2020
Field of study

As cybersecurity detectors increasingly rely on machine learning mechanisms, attacks to these defenses escalate as well. Supervised classifiers are prone to adversarial evasion, and existing countermeasures suffer from many limitations. Most solutions degrade performance in the absence of adversarial perturbations; they are unable to face novel attack variants; they are applicable only to specific machine learning algorithms. We propose the first framework that can protect botnet detectors from adversarial attacks through deep reinforcement learning mechanisms. It automatically generates realistic attack samples that can evade detection, and it uses these samples to produce an augmented training set for producing hardened detectors. In such a way, we obtain more resilient detectors that can work even against unforeseen evasion attacks with the great merit of not penalizing their performance in the absence of specific attacks. We validate our proposal through an extensive experimental campaign that considers multiple machine learning algorithms and public datasets. The results highlight the improvements of the proposed solution over the state-of-the-art. Our method paves the way to novel and more robust cybersecurity detectors based on machine learning applied to network traffic analytics

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Modeling of lithium vapour cloud observed on FTU

Author: IAFRATI MATTEO
Apruzzese G.
Mazzitelli G.
Apicella M. L.
Publication venue
Publication date: 01/01/2016
Field of study

Archivio istituzionale della ricerca - Università di Padova

Attribute Inference Attacks in Online Multiplayer Video Games: A Case Study on DOTA2

Author: Conti M.
Tricomi P. P.
Apruzzese G.
Facciolo L.
Publication venue
Publication date: 01/01/2023
Field of study

Did you know that over 70 million of Dota2 players have their in-game data freely accessible? What if such data is used in malicious ways? This paper is the first to investigate such a problem. Motivated by the widespread popularity of video games, we propose the first threat model for Attribute Inference Attacks (AIA) in the Dota2 context. We explain how (and why) attackers can exploit the abundant public data in the Dota2 ecosystem to infer private information about its players. Due to lack of concrete evidence on the efficacy of our AIA, we empirically prove and assess their impact in reality. By conducting an extensive survey on 500 Dota2 players spanning over 26k matches, we verify whether a correlation exists between a player's Dota2 activity and their real-life. Then, after finding such a link (p < 0.01 and ρ > 0.3), we ethically perform diverse AIA. We leverage the capabilities of machine learning to infer real-life attributes of the respondents of our survey by using their publicly available in-game data. Our results show that, by applyingdomain expertise, some AIA can reach up to 98\% precision and over 90\% accuracy. This paper hence raises the alarm on a subtle, but concrete threat that can potentially affect the entire competitive gaming landscape. We alerted the developers of Dota2

Archivio della ricerca- Università di Roma La Sapienza

Archivio istituzionale della ricerca - Università di Padova