CBDC-Net: Recurrent Bidirectional LSTM Neural Networks Based Cyberbullying Detection with Synonym-Level N-Gram and TSR-SCSOFeatures

Authors

  • P. Padma Research Scholar
  • G. Siva Nageswara Rao

DOI:

https://doi.org/10.22399/ijcesen.623

Keywords:

Cyber bullying detection, Sand Cat Swarm Optimization, N-gram feature selection, Recurrent Neural Network, Text Mining, Machine Learning

Abstract

Social networks Cyber bullying has become another common problem in online social networks (OSNs) which exposes individuals to high risks of their mental health and interacting with others. Previous work in cyber bullying detection is often confronted with limitations in accurately detecting abusive behavior because of the intricacies in cyber space and evolution of cyber bullying practices. A new approach of Cyber bullying detection and classification network (CBDC- Net) for improving the effectiveness of detection of cyber bullying in OSNs based on natural language processing features, feature selection techniques, and deep learning algorithms is also presented in this study. CBDC-Net can overcome these challenges to existing detection methods of cyber bullying using innovative Natural Language Processing (NLP) and Deep Learning approaches. In the data preprocessing step, CBDC-Net filter and normalize the text data that is openly collected from OSNs. After that, CBDC-Net extracts features using a Synonym Level N-Gram (SLNG) approach and it incorporates both the word and character-based information to make the synonyms of text much better than the other method. After that, CSI of CBDC-Net applied Textual Similarity Resilient Sand Cat Swarm Optimization (TSR-SCSO) for feature selection to give an iterative value of their features’ importance level to detect cyber bullying. Last, in CBDC-Net, a Recurrent Bidirectional Long Short-Term Memory (LSTM)Neural Network for classification (RBLNN) is used as classification approach is applied, which recognizes the sequential nature of textual data enabling proper distinction between cyber bullying cases. Last but not the least, the CBDC Net provides a promising solution for solving the mentioned problems of cyber bullying detection in OSNs.

References

Mahmud, Tanjim, Michal Ptaszynski, Juuso Eronen, and Fumito Masui. (2023). Cyberbullying detection for low-resource languages and dialects: Review of the state of the art. Information Processing & Management. 60(5);103454. https://doi.org/10.1016/j.ipm.2023.103454

Alduailaj, Alanoud Mohammed, and Aymen Belghith. (2023). Detecting arabic cyberbullying tweets using machine learning. Machine Learning and Knowledge Extraction. 5(1);29-42. https://doi.org/10.3390/make5010003

Pericherla, Subbaraju, and E. Ilavarasan. (2024). Transformer network-based word embeddings approach for autonomous cyberbullying detection. International Journal of Intelligent Unmanned Systems.12(1);154-166.

Dewani, Amirita, Mohsin Ali Memon, Sania Bhatti, Adel Sulaiman, Mohammed Hamdi, Hani Alshahrani, Abdullah Alghamdi, and Asadullah Shaikh. (2023). Detection of Cyberbullying Patterns in Low Resource Colloquial Roman Urdu Microtext using Natural Language Processing, Machine Learning, and Ensemble Techniques. Applied Sciences. 13(4);2062.

Teng, Teoh Hwai, Kasturi Dewi Varathan, and Fabio Crestani. (2023). A Comprehensive Review of Cyberbullying-related Content Classification In Online Social Media. Expert Systems with Applications. 122644.

Alqahtani, Abdulkarim Faraj, and Mohammad Ilyas. (2024). An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying. Machine Learning and Knowledge Extraction. 6(1);156-170.

Ptaszynski, Michal, Agata Pieciukiewicz, Pawel Dybala, Pawel Skrzek, Kamil Soliwoda, Marcin Fortuna, Gniewosz Leliwa, and Michal Wroczynski. (2023). Expert-annotated dataset to study cyberbullying in Polish language. Data. 9(1);1.

Hasan, Md Tarek, Md Al Emran Hossain, Md Saddam Hossain Mukta, Arifa Akter, Mohiuddin Ahmed, and Salekul Islam. (2023). A Review on Deep-Learning-Based Cyberbullying Detection. Future Internet. 15(5);179.

Gattulli, Vincenzo, Donato Impedovo, Giuseppe Pirlo, and Lucia Sarcinella. (2023). Human Activity Recognition for the Identification of Bullying and Cyberbullying Using Smartphone Sensors. Electronics. 12(2);261.

López-Vizcaíno, Manuel, Francisco J. Nóvoa, Thierry Artieres, and Fidel Cacheda. (2023). Site Agnostic Approach to Early Detection of Cyberbullying on Social Media Networks. Sensors. 23(10);4788.

Fati, Suliman Mohamed, Amgad Muneer, Ayed Alwadain, and Abdullateef O. Balogun. (2023). Cyberbullying Detection on Twitter Using Deep Learning-Based Attention Mechanisms and Continuous Bag of Words Feature Extraction. Mathematics. 11(16);3567.

Alqahtani, Abdulkarim Faraj, and Mohammad Ilyas. (2024). An Ensemble-Based Multi-Classification Machine Learning Classifiers Approach to Detect Multiple Classes of Cyberbullying. Machine Learning and Knowledge Extraction. 6(1);156-170.

Muneer, Amgad, Ayed Alwadain, Mohammed Gamal Ragab, and Alawi Alqushaibi. (2023). Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT. Information. 14(8);467.

Gattulli, Vincenzo, Donato Impedovo, Giuseppe Pirlo, and Lucia Sarcinella. (2023). Human Activity Recognition for the Identification of Bullying and Cyberbullying Using Smartphone Sensors. Electronics. 12(2);261.

Dewani, Amirita, Mohsin Ali Memon, Sania Bhatti, Adel Sulaiman, Mohammed Hamdi, Hani Alshahrani, Abdullah Alghamdi, and Asadullah Shaikh. (2023). Detection of Cyberbullying Patterns in Low Resource Colloquial Roman Urdu Microtext using Natural Language Processing, Machine Learning, and Ensemble Techniques. Applied Sciences. 13(4);2062.

López-Vizcaíno, Manuel, Francisco J. Nóvoa, Thierry Artieres, and Fidel Cacheda. (2023). Site Agnostic Approach to Early Detection of Cyberbullying on Social Media Networks. Sensors. 23(10);4788.

N. A. Samee, U. Khan, S. Khan, M. M. Jamjoom, M. Sharif and D. H. Kim. (2023). Safeguarding Online Spaces: A Powerful Fusion of Federated Learning, Word Embeddings, and Emotional Features for Cyberbullying Detection. in IEEE Access. 11; 124524-24541,doi:10.1109/ACCESS.2023.3329347.

Chen, Shifeng, Jialin Wang, and Ketai He. (2024). Chinese Cyberbullying Detection Using XLNet and Deep Bi-LSTM Hybrid Model. Information. 15(2);93.

Hasan, Md. Tarek, Md. Al Emran Hossain, Md. Saddam Hossain Mukta, Arifa Akter, Mohiuddin Ahmed, and Salekul Islam. (2023). A Review on Deep-Learning-Based Cyberbullying Detection. Future Internet. 15(5);179.

Teoh Hwai Teng, Kasturi Dewi Varathan, Fabio Crestani, (2024). A comprehensive review of cyberbullying-related content classification in online social media, Expert Systems with Applications, 244; 122644

Čepulionytė, Aušra, JevgenijusToldinas, and Borisas Lozinskis. (2023). A Multilayered Preprocessing Approach for Recognition and Classification of Malicious Social Network Messages. Electronics. 12(18);3785.

Samee, Nagwan Abdel, Umair Khan, Salabat Khan, Mona M. Jamjoom, Muhammad Sharif, and Do Hyuen Kim. (2023). Safeguarding Online Spaces: A Powerful Fusion of Federated Learning, Word Embeddings, and Emotional Features for Cyberbullying Detection. IEEE Access. 11,124524-124541. DOI:10.1109/ACCESS.2023.3329347

Miran, Ara Zozan, and Adnan Mohsin Abdulazeez. (2023). Detection of Hate-Speech Tweets Based on Deep Learning: A Review. JISA (JurnalInformatika dan Sains). 6(2);174-188.

Huang, Jingxiu, Ruofei Ding, Yunxiang Zheng, Xiaomin Wu, Shumin Chen, and Xiunan Jin. (2023). Does Part of Speech Have an Influence on Cyberbullying Detection? Analytics. 3(1);1-13.

Dar, Momna, Faiza Iqbal, Rabia Latif, Ayesha Altaf, and Nor Shahida Mohd Jamail. (2023). Policy-Based Spam Detection of Tweets Dataset. Electronics. 12(12);2662.

Khairy, Marwa, Tarek M. Mahmoud, and Tarek Abd-El-Hafeez. (2024). The effect of rebalancing techniques on the classification performance in cyberbullying datasets. Neural Computing and Applications. 36(3);1049-1065.

Gamal, Donia, Marco Alfonse, Salud María Jiménez-Zafra, and Mostafa Aref. (2023). Intelligent Multi-Lingual Cyber-Hate Detection in Online Social Networks: Taxonomy, Approaches, Datasets, and Open Challenges. Big Data and Cognitive Computing. 7(2);58.

Anwar, Aamir, Ikram Ur Rehman, Moustafa M. Nasralla, Sohaib Bin Altaf Khattak, and Nasrullah Khilji. (2023). Emotions Matter: A Systematic Review and Meta-Analysis of the Detection and Classification of Students’ Emotions in STEM during Online Learning. Education Sciences. 13(9);914.

Allouch, Merav, Noa Mansbach, Amos Azaria, and Rina Azoulay. (2023). Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text. Applied Sciences. 13(6);3927.

R. Deepa, V. Jayalakshmi, K. Karpagalakshmi, S. Manikanda Prabhu, & P.Thilakavathy. (2024). Survey on Resume Parsing Models for JOBCONNECT+: Enhancing Recruitment Efficiency using Natural language processing and Machine Learning. International Journal of Computational and Experimental Science and Engineering, 10(4);1394-1403. https://doi.org/10.22399/ijcesen.660

ÖZNACAR, T., & ERGENE, N. (2024). A Machine Learning Approach to Early Detection and Malignancy Prediction in Breast Cancer. International Journal of Computational and Experimental Science and Engineering, 10(4);911-917. https://doi.org/10.22399/ijcesen.516

Nuthakki, praveena, & Pavankumar T. (2024). Comparative Assessment of Machine Learning Algorithms for Effective Diabetes Prediction and Care. International Journal of Computational and Experimental Science and Engineering, 10(4);1337-1343. https://doi.org/10.22399/ijcesen.606

Rama Lakshmi BOYAPATI, & Radhika YALAVARTHI. (2024). RESNET-53 for Extraction of Alzheimer’s Features Using Enhanced Learning Models. International Journal of Computational and Experimental Science and Engineering, 10(4);879-889. https://doi.org/10.22399/ijcesen.519

S.P. Lalitha, & A. Murugan. (2024). Performance Analysis of Priority Generation System for Multimedia Video using ANFIS Classifier. International Journal of Computational and Experimental Science and Engineering, 10(4);1320-1328. https://doi.org/10.22399/ijcesen.707

P., A. M., & R. GUNASUNDARI. (2024). An Interpretable PyCaret Approach for Alzheimer’s Disease Prediction. International Journal of Computational and Experimental Science and Engineering, 10(4);1223-1232. https://doi.org/10.22399/ijcesen.655

ÇOŞGUN, A. (2024). Estimation Of Turkey’s Carbon Dioxide Emission with Machine Learning. International Journal of Computational and Experimental Science and Engineering, 10(1);95-101. https://doi.org/10.22399/ijcesen.302

Türkmen, G., Sezen, A., & Şengül, G. (2024). Comparative Analysis of Programming Languages Utilized in Artificial Intelligence Applications: Features, Performance, and Suitability. International Journal of Computational and Experimental Science and Engineering, 10(3)461-469. https://doi.org/10.22399/ijcesen.342

N. Sriram, Jayalakshmi V., P. Preethi, B. Shoba, & K. Shenbagavalli. (2024). Navigating the Future with YOLOv9 for Advanced Traffic Sign Recognition in Autonomous Vehicles. International Journal of Computational and Experimental Science and Engineering, 10(4)1424-1436. https://doi.org/10.22399/ijcesen.658

Agnihotri, A., & Kohli, N. (2024). A novel lightweight deep learning model based on SqueezeNet architecture for viral lung disease classification in X-ray and CT images. International Journal of Computational and Experimental Science and Engineering, 10(4);592-613. https://doi.org/10.22399/ijcesen.425

M, P., B, J., B, B., G, S., & S, P. (2024). Energy-efficient and location-aware IoT and WSN-based precision agricultural frameworks. International Journal of Computational and Experimental Science and Engineering, 10(4);585-591. https://doi.org/10.22399/ijcesen.480

Guven, M. (2024). A Comprehensive Review of Large Language Models in Cyber Security. International Journal of Computational and Experimental Science and Engineering, 10(3);507-516. https://doi.org/10.22399/ijcesen.469

S. Praseetha, & S. Sasipriya. (2024). Adaptive Dual-Layer Resource Allocation for Maximizing Spectral Efficiency in 5G Using Hybrid NOMA-RSMA Techniques. International Journal of Computational and Experimental Science and Engineering, 10(4);1130-1139. https://doi.org/10.22399/ijcesen.665

Downloads

Published

2024-12-20

How to Cite

P. Padma, & G. Siva Nageswara Rao. (2024). CBDC-Net: Recurrent Bidirectional LSTM Neural Networks Based Cyberbullying Detection with Synonym-Level N-Gram and TSR-SCSOFeatures. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.623

Issue

Section

Research Article