Understanding and Analysing Causal Relations through Modelling using Causal Machine Learning

Authors

  • D. Naga Jyothi Ms
  • Uma N. Dulhare

DOI:

https://doi.org/10.22399/ijcesen.1018

Keywords:

Causal Inference, Machine Learning, Directed Acyclic Graphs, DoWhy Library, Causal Discovery Algorithms, Student Placement

Abstract

The study of causal inference has gained significant attention in artificial intelligence (AI) and machine learning (ML), particularly in areas such as explainability, automated diagnostics, reinforcement learning, and transfer learning.. This research applies causal inference techniques to analyze student placement data, aiming to establish cause-and-effect relationships rather than mere correlations. Using the DoWhy Python library, the study follows a structured four-step approach—Modeling, Identification, Estimation, and Refutation—and introduces a novel 3D framework (Data Correlation, Causal Discovery, and Domain Knowledge) to enhance causal modeling reliability. Causal discovery algorithms, including Peter Clark (PC), Greedy Equivalence Search (GES), and Linear Non-Gaussian Acyclic Model (LiNGAM), are applied to construct and validate a robust causal model. Results indicate that internships (0.155) and academic branch selection (0.148) are the most influential factors in student placements, while CGPA (0.042), projects (0.035), and employability skills (0.016) have moderate effects, and extracurricular activities (0.004) and MOOCs courses (0.012) exhibit minimal impact. This research underscores the significance of causal reasoning in higher education analytics and highlights the effectiveness of causal ML techniques in real-world decision-making. Future work may explore larger datasets, integrate additional educational variables, and extend this approach to other academic disciplines for broader applicability.

References

Jyothi, D. N., & Dulhare, U. N. (2023). Inferring causal relationships in student placements' performance using causal machine learning. International Journal of Intelligent Systems and Applications in Engineering, 11(2), 123-130.

Kaur, P., Polyzou, A., & Karypis, G. (2019). Causal inference in higher education: Building better curriculums. arXiv preprint arXiv:1906.04698.

Akshaya Kumar Mandal, Pedro Machado, & Eneko Osaba. (2025). Applying Coral Reef Restoration Algorithm for Quantum Computing in Genomic Data Analysis. International Journal of Computer Engineering in Research Trends, 12(1), 20–28

Qin Yang, Claudio Castelnovo, & Florian Mölein. (2025). Utilizing Leaf Venation Network Model for Ethical AI Decision-Making in Financial Technologies. International Journal of Computer Engineering in Research Trends, 12(1), 39–48.

Naresh Kumar Bhagavatham, Bandi Rambabu, Jaibir Singh, Dileep P, T. Aditya Sai Srinivas, M. Bhavsingh, & P. Hussain Basha. (2024). Autonomic Resilience in Cybersecurity: Designing the Self-Healing Network Protocol for Next-Generation Software-Defined Networking. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.640 DOI: https://doi.org/10.22399/ijcesen.640

Akshaya Kumar Mandal, Pedro Machado, & Eneko Osaba. (2025). Applying Coral Reef Restoration Algorithm for Quantum Computing in Genomic Data Analysis. International Journal of Computer Engineering in Research Trends, 12(1), 20–28.

SumanPrakash, P., Ramana, K. S., CosmePecho, R. D., Janardhan, M., Churampi Arellano, M. T., Mahalakshmi, J., Bhavsingh, M., & Samunnisa, K. (2024). Learning-driven continuous diagnostics and mitigation program for secure edge management through zero-trust architecture. Computer Communications, 94–107. https://doi.org/10.1016/j.comcom.2024.04.007 DOI: https://doi.org/10.1016/j.comcom.2024.04.007

Krishna, U. V., Rao, G. S., Addepalli, L., Bhavsingh, M., S. D., V. S., & Jaime, L. M. (2024). Enhancing airway assessment with a secure hybrid network-blockchain system for CT & CBCT image evaluation. International Research Journal of Multidisciplinary Technovation, 6(2), 45–60. https://doi.org/10.54392/irjmt2425 DOI: https://doi.org/10.54392/irjmt2425

Onesimus John Waino, & Steven David. (2025). An Artificial Intelligence Model for Predicting Flooding and Drought in Bali Local Government Area of Taraba State, Nigeria. International Journal of Computer Engineering in Research Trends, 12(1), 1–19.

Divyansh Awasthi, Zeinab Elngar, & Jeyarani Selvarajan. (2025). Implementing Bioluminescent Swarm Optimization to Enhance Blockchain Security in IoT Healthcare Systems. International Journal of Computer Engineering in Research Trends, 12(1), 29–38.

Saranya, V. S., Subbarao, G., Balakotaiah, D., & Bhavsingh, M. (2024). Real-time traffic flow optimization using adaptive IoT and data analytics: A novel DeepStreamNet model. International Journal of Advanced Research in Computer Science, 15(10), 45–52

Prakash, P. S., Janardhan, M., Sreenivasulu, K., Saheb, S. I., Neeha, S., & Bhavsingh, M. (2022). Mixed Linear Programming for Charging Vehicle Scheduling in Large-Scale Rechargeable WSNs. Journal of Sensors, 2022, 1–13. https://doi.org/10.1155/2022/8373343 DOI: https://doi.org/10.1155/2022/8373343

Lakshmi, M. S., Ramana, K. S., Pasha, M. J., Lakshmi, K., Parashuram, N., & Bhavsingh, M. (2022). Minimizing the Localization Error in Wireless Sensor Networks Using Multi-Objective Optimization Techniques. International Journal on Recent and Innovation Trends in Computing and Communication, 10(2s), 306–312. https://doi.org/10.17762/ijritcc.v10i2s.5948 DOI: https://doi.org/10.17762/ijritcc.v10i2s.5948

Qin Yang, Claudio Castelnovo, and Florian Mölein, (2025). Utilizing Leaf Venation Network Model for Ethical AI Decision-Making in Financial Technologies, Int. J. Comput. Eng. Res. Trends, 12(1);39–48.

Burnett, J. W., & Blackwell, C. (2023). Graphical causal modelling: An application to identify and estimate cause-and-effect relationships. Applied Economics, 55(1), 1–15.

Kaur, P., Polyzou, A., & Karypis, G. (2019). Causal inference in higher education: Building better curriculums. In Proceedings of the 6th ACM Conference on Learning at Scale (pp. 1–4).

Ouaadi, I., & Ibourk, A. (2023). Causal discovery and features importance analysis: What can be inferred about at-risk students? In Proceedings of the International Conference on Business Intelligence (pp. 134–145). Springer.

Sharma, A., & Kiciman, E. (2020). DoWhy: An end-to-end library for causal inference. arXiv Preprint, arXiv:2011.04216.

Shohei, S., Shimizu, S., & Bollen, K. (2011). DirectLiNGAM: A direct method for learning a linear non-Gaussian structural equation model. Journal of Machine Learning Research, 12, 1225–1248.

Aragam, B. (2024). Greedy equivalence search for nonparametric graphical models. arXiv Preprint, arXiv:2406.17228.

Forney, A., & Mueller, S. (2022). Causal inference in AI education: A primer. Journal of Causal Inference, 10(1), 141–173.

Tadayon, M., & Pottie, G. (2021). Causal inference in educational systems: A graphical modeling approach. arXiv Preprint, arXiv:2108.00654.

de Carvalho, W. F., & Zarate, L. E. (2019). Causality relationship among attributes applied in an educational data set. In Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing (pp. 1271–1277).

Sharma, A., & Kiciman, E. (2020). DoWhy: An end-to-end library for causal inference. arXiv preprint arXiv:2011.04216.

Burnett, J. W., & Blackwell, C. (2023). Graphical causal modelling: An application to identify and estimate cause-and-effect relationships. Applied Economics, 55(1), 1–15.

Shohei, S., Shimizu, S., & Bollen, K. (2011). DirectLiNGAM: A direct method for learning a linear non-Gaussian structural equation model. Journal of Machine Learning Research, 12, 1225.

Kaur, P., Polyzou, A., & Karypis, G. (2019). Causal inference in higher education: Building better curriculums. In Proceedings of the 6th ACM Conference on Learning at Scale (pp. 1–4). DOI: https://doi.org/10.1145/3330430.3333663

Tadayon, M., & Pottie, G. (2021). Causal inference in educational systems: A graphical modeling approach. arXiv preprint arXiv:2108.00654.

Aragam, B. (2024). Greedy equivalence search for nonparametric graphical models. arXiv preprint arXiv:2406.17228.

Ouaadi, I., & Ibourk, A. (2023). Causal discovery and features importance analysis: What can be inferred about at-risk students? In Proceedings of the International Conference on Business Intelligence (pp. 134–145). Springer. DOI: https://doi.org/10.1007/978-3-031-37872-0_10

Spirtes, P., Glymour, C., & Scheines, R. (2000). Causation, prediction, and search (2nd ed.). MIT Press. DOI: https://doi.org/10.7551/mitpress/1754.001.0001

Heckerman, D. (1998). A tutorial on learning with Bayesian networks. In M. I. Jordan (Ed.), Learning in Graphical Models (pp. 301–354). Springer. DOI: https://doi.org/10.1007/978-94-011-5014-9_11

Shimizu, S., Hoyer, P. O., Hyvärinen, A., & Kerminen, A. J. (2006). A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7, 2003–2030.

de Carvalho, W. F., & Zarate, L. E. (2019). Causality relationship among attributes applied in an educational data set. In Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing (pp. 1271–1277). DOI: https://doi.org/10.1145/3297280.3297406

Forney, A., & Mueller, S. (2022). Causal inference in AI education: A primer. Journal of Causal Inference, 10(1), 141–173. DOI: https://doi.org/10.1515/jci-2021-0048

Dulhare, U. N., Jyothi, D. N., Balimidi, B., & Kesaraju, R. R. (2023). Classification models in education domain using PSO, ABC, and A2BC metaheuristic algorithm-based feature selection and optimization. In M. A. Jabbar, P. K. Reddy, & B. B. Chaudhuri (Eds.), Machine Learning and Metaheuristics: Methods and Analysis (pp. 255–270). Springer. DOI: https://doi.org/10.1007/978-981-99-6645-5_12

Pearl, J. (2009). Causality: Models, reasoning, and inference (2nd ed.). Cambridge University Press. DOI: https://doi.org/10.1017/CBO9780511803161

Kohavi, R., & Provost, F. (1998). Glossary of terms for AI & data mining. Machine Learning, 30(2–3), 271–274. DOI: https://doi.org/10.1023/A:1017181826899

Weidlich, W., Hicks, B., & Drachsler, H. (2023). Causal reasoning with causal graphs in educational technology research. Educational Technology Research and Development, 71, 1–19.

Imbens, G., & Rubin, D. (2015). Causal inference for statistics, social, and biomedical sciences: An introduction. Cambridge University Press. DOI: https://doi.org/10.1017/CBO9781139025751

Soo, H. N., Rahman, M. S., & Majumder, A. (2022). Machine learning-based student performance prediction: A systematic review. In Proceedings of the IEEE International Conference on AI & Big Data (AIBD) (pp. 1–6).

Guo, R., Cheng, L., Li, J., Hahn, P., & Liu, H. (2020). A survey of learning causality with data: Problems and methods. ACM Computing Surveys, 53(4), 1–37. DOI: https://doi.org/10.1145/3397269

A. F. Wise, (2018). Learning Analytics: Using Data-Informed Decision-Making to Improve Teaching and Learning, Contemporary Technologies in Education, pp. 119–143, doi: 10.1007/978-3-319-89680-9_7. DOI: https://doi.org/10.1007/978-3-319-89680-9_7

Dietterich, T. G. (2002). Machine learning for sequential data: A review. In Proceedings of the International Conference on AI & Statistics (pp. 15–23). DOI: https://doi.org/10.1007/3-540-70659-3_2

Z. Zhang, (2019). Distinguishing between mediators and confounders is important for the causal inference in observational studies, AME Medical Journal, 4;35–35, doi: 10.21037/amj.2019.09.03. DOI: https://doi.org/10.21037/amj.2019.09.03

Y. Fang, (2024). Randomized Controlled Clinical Trials, Causal Inference in Pharmaceutical Statistics, pp. 19–39, doi: 10.1201/9781003433378-2. DOI: https://doi.org/10.1201/9781003433378-2

S. Shimizu, (2022). Correction to: Statistical Causal Discovery: LiNGAM Approach, pp. C1–C1, 2022, doi: 10.1007/978-4-431-55784-5_7. DOI: https://doi.org/10.1007/978-4-431-55784-5_7

ZHANG, J. (2025). Artificial intelligence contributes to the creative transformation and innovative development of traditional Chinese culture. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.860 DOI: https://doi.org/10.22399/ijcesen.860

M. Shanthalakshmi, & R.S. Ponmagal. (2025). An Intelligent Intrusion Detection System for VANETs Using Adaptive Fusion Models. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.935 DOI: https://doi.org/10.22399/ijcesen.935

K.S. Praveenkumar, & R. Gunasundari. (2025). Optimizing Type II Diabetes Prediction Through Hybrid Big Data Analytics and H-SMOTE Tree Methodology. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.727 DOI: https://doi.org/10.22399/ijcesen.727

Vijayadeep GUMMADI, & Naga Malleswara Rao NALLAMOTHU. (2025). Optimizing 3D Brain Tumor Detection with Hybrid Mean Clustering and Ensemble Classifiers. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.719 DOI: https://doi.org/10.22399/ijcesen.719

Amjan Shaik, Bhuvan Unhelkar, & Prasun Chakrabarti. (2025). Exploring Artificial Intelligence and Data Science-Based Security and its Scope in IoT Use Cases. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.869 DOI: https://doi.org/10.22399/ijcesen.869

S. Shankar, N. Padmashri, N. Shanmugapriya, S. Ramasamy, & P.S. Sruthi. (2025). IntelliFuzz: An Advanced Fuzzy Logic Framework for Dynamic Evaluation of Student Performance in Open-Ended Learning Tasks. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.911 DOI: https://doi.org/10.22399/ijcesen.911

Sagiraju, S., Mohanty, J. R., & Naik, A. (2025). Hyperparameter Tuning of Random Forest using Social Group Optimization Algorithm for Credit Card Fraud Detection in Banking Data. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.777 DOI: https://doi.org/10.22399/ijcesen.777

Badugu Sobhanbabu, & K.F. Bharati. (2025). Towards Precision Medicine with Genomics using Big Data Analytics. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.906 DOI: https://doi.org/10.22399/ijcesen.906

I. Prathibha, & D. Leela Rani. (2025). Rainfall Forecasting in India Using Combined Machine Learning Approach and Soft Computing Techniques : A HYBRID MODEL. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.785 DOI: https://doi.org/10.22399/ijcesen.785

Abu-Shaikha, M., & Nasereddin, S. (2025). Predicting Media Impact: A Machine Learning Framework for Optimizing Corporate Communication Strategies in Architectural Practices. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.1032 DOI: https://doi.org/10.22399/ijcesen.1032

Downloads

Published

2025-02-11

How to Cite

D. Naga Jyothi, & Uma N. Dulhare. (2025). Understanding and Analysing Causal Relations through Modelling using Causal Machine Learning. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.1018

Issue

Section

Research Article