Detecting Data Exfiltration Anomalies in Academic Networks Using the Isolation Forest Algorithm

dc.contributor.authorArusei, Mike K.
dc.contributor.authorDr. Njenga, Stephen
dc.date.accessioned2025-11-26T05:20:44Z
dc.date.issued2025
dc.description.abstractAcademic networks face increased risks of data exfiltration due to sensitive personal information and research data. Traditional supervised detection models rely on labeled datasets which are often unavailable in resource constrained institutions. This study investigates the applicability of the unsupervised Isolation Forest algorithm for detecting anomalous network traffic indicative of data exfiltration. The research utilized the CICIDS2017 dataset focusing on the Thursday-WorkingHours-Afternoon-Infiltration subset. Key features including Flow Duration, Total Fwd Packets, Flow Bytes/s, Flow IAT Mean, and Destination Port were preprocessed and normalized for modeling. The model achieved a precision of 1.00, recall of 0.99 and F1-score of 1.00 for anomalous traffic detection successfully identifying approximately 4.8% of flows as anomalous. Comparative analysis with previous methods, including supervised Random Forest and SVM demonstrated that Isolation Forest offers competitive accuracy with lower computational overhead and does not require labeled data. The findings highlight the algorithm’s suitability for academic network monitoring, providing an effective early warning mechanism while emphasizing the importance of threshold tuning to reduce false positives.
dc.identifier.urihttp://192.168.8.146:4000/handle/123456789/1009
dc.language.isoen
dc.publisherKCA University
dc.subjectAnomaly Detection
dc.subjectData Exfiltration Machine Learning
dc.subjectIsolation Forest
dc.subjectAcademic Networks
dc.titleDetecting Data Exfiltration Anomalies in Academic Networks Using the Isolation Forest Algorithm
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
Arusei, Njenga- Detecting Data Exfiltration Anomalies in Academic Networks Using the Isolation Forest Algorithm.pdf
Size:
448.62 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: