Intelligent IoT Network Malware Classification using Realtime Heterogenous Data
dc.contributor.author | Ahmed, Aqeel | el |
dc.date.accessioned | 2023-06-07T08:40:08Z | |
dc.date.available | 2023-06-07T08:40:08Z | |
dc.date.issued | 2023-06-07 | |
dc.identifier.uri | https://repository.ihu.edu.gr//xmlui/handle/11544/30277 | |
dc.rights | Default License | |
dc.subject | IoT malware | en |
dc.subject | Heterogeneity | el |
dc.subject | IDS | el |
dc.subject | Classification | el |
dc.subject | BotNet Attacks | el |
dc.title | Intelligent IoT Network Malware Classification using Realtime Heterogenous Data | el |
heal.abstract | Due to its wide range of applications, the Internet of Things (IoT) technology is evolving rapidly. One can witness IoT systems in smart cities, smart homes, smart healthcare, smart industry, and smart agriculture. IoT systems usually use low-powered and low-memory devices to sense the data from the environment and transmit it to the destination through wired or wireless communication channels. Although IoT technology is gaining massive attention in every sector of life, the security of these devices is one of the biggest challenges. Due to resource constraints, these devices are often vulnerable to malicious actors. In this work, a machine learning-based intelligent classification of the IoT network attacks using real-time heterogenous data is carried out. Two IoT network malware datasets (Ton-IoT & IoT-23) that include the real-time IoT Botnet attacks are used for the experiments. The data is pre-processed before performing the experimentation. In addition, a information gain based feature selection method is also applied to select the most important features in the dataset. Several classification methods include Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbors (KNN), Naïve Bayes (NB), and eXtreme Gradient Boosting (XGB) are implemented. These models were evaluated using classification metrics; accuracy, precision, recall, and f1-score. It is concluded that the Naïve Bayes and Logistic Regression are not the best methods to perform classification on these datasets. On the other hand, DT, RF, KNN, and XGB provided an accuracy of 99% for binary labels and 98% for multiclass labels for the Ton-IoT dataset. Using the IoT-23 dataset, these models provided accuracy above 90%. It is found that LR and NB are not the best choices for classification using either dataset. In addition, not all the features in these datasets are essential; hence some can be dropped to reduce the complexity of the model and improve the computational capacity. It is further concluded that heterogeneity in the dataset does not necessarily affect the performance of classification algorithms. | el |
heal.academicPublisher | IHU | el |
heal.academicPublisherID | ihu | en_US |
heal.access | free | en_US |
heal.advisorName | Tjortjis, Christos | el |
heal.committeeMemberName | Lim, Theo | el |
heal.dateAvailable | 2023-05-17 | |
heal.language | en | en_US |
heal.license | http://creativecommons.org/licenses/by-nc/4.0 | en_US |
heal.publicationDate | 2023-05-17 | |
heal.recordProvider | School of Science and Technology, MSc in Information & Communication Technology Systems | en_US |
heal.sponsor | Erasmus Mundus Joint Masters in SMACCs by European Union | el |
heal.type | masterThesis | en_US |
Αρχεία
Πρωτότυπος φάκελος/πακέτο
1 - 1 από 1
Δεν υπάρχει διαθέσιμη μικρογραφία
- Ονομα:
- Intelligent IoT Network Malware Classification using Real time Heterogenous Data.pdf
- Μέγεθος:
- 2.27 MB
- Μορφότυπο:
- Adobe Portable Document Format
- Περιγραφή:
Φάκελος/Πακέτο αδειών
1 - 1 από 1
Δεν υπάρχει διαθέσιμη μικρογραφία
- Ονομα:
- license.txt
- Μέγεθος:
- 2.58 KB
- Μορφότυπο:
- Item-specific license agreed upon to submission
- Περιγραφή: