From Comments to Threats: Leveraging NLP to Detect Malicious Browser Extensions

Abstract

With the increasing popularity of browser extensions, there is a growing concern about malicious actors exploiting this software to distribute malware. Existing solutions rely on manual review processes or traditional methods like static, dynamic or hybrid analysis, which are insufficient in detecting complex and evolving threats. In this study, we investigate the potential of Natural Language Processing (NLP) for automatically classifying users' comments in the Chrome Web Store, the public repository where extensions are stored and distributed, to identify malicious extensions. We propose a novel framework called CoTH that leverages NLP techniques to analyse the textual feedback provided by users and detect patterns indicative of malicious activity. We evaluate the accuracy of our model using a dataset of user reviews and demonstrate its effectiveness in identifying malicious extensions. Our findings suggest that NLP-based comment analysis can be a valuable addition to existing security measures, providing an opportunity for more accurate and efficient detection of malware in the Chrome Web Store.

Publication
Journal of Computer Virology and Hacking Techniques