An Efficient Hybrid Improved Feature Vector Manifold Clustering with Neighbour Search Optimization

Authors

  • L. Dhanapriya Research Scholar, Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women, Coimbatore, Tamil Nadu, India
  • S. Preetha Associate Professor, Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women, Coimbatore, Tamil Nadu, India

DOI:

https://doi.org/10.22399/ijcesen.1671

Keywords:

Data Mining, Clustering, Feature Selection, Manifold, Neighbour Search, Outlier

Abstract

In this paper, the IFMCNSO algorithm a novel hybrid Improved Feature Vector Manifold clustering with Neighbour search optimization clustering algorithm —is presented. Many methods for linear or nonlinear manifold clustering have been developed recently. While in many cases they have proven to perform better than classic clustering algorithms, the majority of these approaches have a high complexity. In order to overcome the clustering problem, particularly for high-dimensional datasets, this work provides an effective hybrid method called IFMCNSO. By using this strategy, the domain in which feature vector manifold learning and Neighbor search optimization techniques can be used is greatly expanded, enabling parameterization in real-world data sets. A good or nearly optimal solution is found using the IFMCNSO algorithm in an acceptable amount of time. A comprehensive comparison of the proposed IFMCNSO algorithm with state-of-the-art clustering algorithms, namely DCNaN, RDMN, HFMST, and HFMST-PSO, reveals that IFMCNSO achieves higher Rand Index (RI) and Adjusted Rand Index (ARI) scores, underscoring its exceptional clustering performance and accuracy

References

[1] Edelstein, H. A. (1999). Introduction to data mining and knowledge discovery (3rd ed). Potomac, MD: Two Crows Corp.

[2] Mirkin, B. Clustering: A Data Recovery Approach, Second Edition. Chapman & Hall/CRC Computer Science & Data Analysis.

[3] Alpaydin, E. (2004). Introduction to Machine Learning. Cambridge, the MIT Press.

[4] Tan, P. N., Steinbach, M., & Kumar, V. (2005). Introduction to data mining. Boston, Addison-Wesley.

[5] Webb, A. (2002). Statistical pattern recognition. New Jersey, John Wiley & Sons.

[6] Hathway, R. J., & Bezdek, J. C. (1995). Optimization of clustering criteria by reformulation. IEEE transactions on Fuzzy Systems. 3(2), 241–245. https://doi.org/10.1109/91.388178

[7] Rajendra, P., Jatindra, K. D., & Sukumar, N. (2010). Distance based Fast Outlier Detection Method. Annual IEEE, India Conference (INDICON). 1–4. https://doi.org/10.1109/indcon.2010.5712706

[8] Zhang, J., Yang, L., Zhang, Y., Tang, D., & Liu, T. (2022). Non-parameter clustering algorithm based on saturated neighborhood graph. Applied Soft Computing. 130, 109647. https://doi.org/10.1016/j.asoc.2022.109647

[9] Guan, C., Yuen, K. K. F., & Coenen, F. (2019). Particle swarm Optimized Density-based Clustering and Classification: Supervised and unsupervised learning approaches. Swarm and Evolutionary Computation. 44;876–896. https://doi.org/10.1016/j.swevo.2018.09.008

[10] Ma, Y., Zhao, X., Zhang, C., Zhang, J., & Qin, X. (2021). Outlier detection from multiple data sources. Information Sciences. 580, 819–837. https://doi.org/10.1016/j.ins.2021.09.053

[11] Liu, H., Li, J., Wu, Y., & Fu, Y. (2021). Clustering with outlier removal. IEEE Trans. Knowl. Data Eng. 33(6);2369-2379. https://doi.org/ 10.1109/TKDE.2019.2954317

[12] Yuan, Z., Chen, H., Li, T., Zhang, X., & Sang, B. (2022). Mult granulation Relative Entropy-Based Mixed Attribute Outlier Detection in Neighborhood Systems. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 52(8); 5175–5187. https://doi.org/10.1109/tsmc.2021.3119119

[13] Rameshkumar, et al. (2023). Cloud computing and machine learning for analysis of health care data based on neuro fuzzy logistic regression. Journal of Intelligent & Fuzzy Systems. 44(6);9955-9964. https://doi.org/10.3233/jifs-223280

[14] Ammal, S. G., et al. (2024). Advanced Cloud-Based Prediction Models for Cardiovascular Disease: Integrating Machine Learning and Feature Selection Techniques. SN Computer Science. 5(5);572. https://doi.org/10.1007/s42979-024-02927-w

[15] Ramkumar, M., et al. (2024). Identifying cancer risks using spectral subset feature selection based on multi-layer perception neural network for premature treatment. Computer Methods in Biomechanics and Biomedical Engineering. 27(13);1804-1816. https://doi.org/10.1080/10255842.2023.2262662

[16] Dhanapriya, L., & Preetha, S. (2023). HFMST-PSO: An Efficient Hybrid Fuzzy Based Minimum Spanning Tree (HFMST) With Particle Swarm Optimization Clustering Algorithm. Metszet journal. 528-533. https://doi.org/10.27896/METSZET8.11/46

[17] Pasi, F. et al. (2015). Clustering datasets. Available: http://cs.uef.fi/sipu/datasets/

[18] Yeh, C. C., & Yang, M. S. (2017). Evaluation measures for cluster ensembles based on a fuzzy generalized Rand index. Applied Soft Computing. 57, 225–234. https://doi.org/10.1016/j.asoc.2017.03.030

[19] Xie, J., Xiong, Z.-Y., Zhang, Y.-F., Feng, Y., & Ma, J. (2018). Density corebased clustering algorithm with dynamic scanning radius. Knowledge-Based Systems. 142;58–70. https://doi.org/10.1016/j.knosys.2017.11.025

[20] Mishra, G., & Mohanty, S. K. (2022). RDMN: A Relative Density Measure Based on MST Neighborhood for Clustering Multi-Scale Datasets. IEEE Transactions on Knowledge and Data Engineering. 34(1). https://doi.org/10.1109/tkde.2020.2982400

[21] Halkidi, M., Batistakis, Y., & Vazirgiannis, M. (2001). On clustering validation techniques. Journal of Intelligent Information Systems. 17(2–3);107–145. https://web.itu.edu.tr/sgunduz/courses/verimaden/paper/validity_survey.pdf

[22] ARSLANKAYA, S., & Saltan YAŞLI, G. (2024). Predicting Diabetes Disease Using Data Mining Classification Algorithms and Comparison of Algorithm Performances. International Journal of Computational and Experimental Science and Engineering, 10(2). https://doi.org/10.22399/ijcesen.233

[23] M. Revathy Meenal, & S. Mary Vennila. (2025). Renyi Entropy Predictive Data Mining And Weighted Xavier Deep Neural Classifier For Heart Disease Prediction. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.1000

[24] García, R., Carlos Garzon, & Juan Estrella. (2025). Generative Artificial Intelligence to Optimize Lifting Lugs: Weight Reduction and Sustainability in AISI 304 Steel. International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.22

[25] Arya P. Pillai, & N.V. Chinnasamy. (2025). Prediction of Postpartum Depression With Dataset Using Hybrid Data Mining Classification Technique. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.750

[26] Hafez, I. Y., & El-Mageed, A. A. A. (2025). Enhancing Digital Finance Security: AI-Based Approaches for Credit Card and Cryptocurrency Fraud Detection. International Journal of Applied Sciences and Radiation Research, 2(1). https://doi.org/10.22399/ijasrar.21

[27] Fowowe, O. O., & Agboluaje, R. (2025). Leveraging Predictive Analytics for Customer Churn: A Cross-Industry Approach in the US Market. International Journal of Applied Sciences and Radiation Research, 2(1). https://doi.org/10.22399/ijasrar.20

[28] S. Menaka, & V. Selvam. (2025). Bibliometric Analysis of Artificial Intelligence on Consumer Purchase Intention in E-Retailing. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.1007

[29] Ibeh, C. V., & Adegbola, A. (2025). AI and Machine Learning for Sustainable Energy: Predictive Modelling, Optimization and Socioeconomic Impact In The USA. International Journal of Applied Sciences and Radiation Research, 2(1). https://doi.org/10.22399/ijasrar.19

[30] Taşkaya, S., Wu, D., Kurt, M., Liao, Y., Xu, J., & Liao, W. (2024). Exploring the Application of Building Information Modeling (BIM) in Town Planning: Key Roles in the Relationship Between Buildings and Parcels. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.459

[31] Olola, T. M., & Olatunde, T. I. (2025). Artificial Intelligence in Financial and Supply Chain Optimization: Predictive Analytics for Business Growth and Market Stability in The USA. International Journal of Applied Sciences and Radiation Research, 2(1). https://doi.org/10.22399/ijasrar.18

Downloads

Published

2025-04-29

How to Cite

L. Dhanapriya, & S. Preetha. (2025). An Efficient Hybrid Improved Feature Vector Manifold Clustering with Neighbour Search Optimization. International Journal of Computational and Experimental Science and Engineering, 11(2). https://doi.org/10.22399/ijcesen.1671

Issue

Section

Research Article