Prediction and Classification of Low Birth Weight Data Using Machine Learning Techniques
Keywords:
Machine learning, Binary logistic regression, Random forest, Low birth weightAbstract
Machine learning (ML) is a subject that focuses on the data analysis using various statistical tools and learning processes in order to gain more knowledge from the data. The objective of this research was to apply one of the ML techniques on the low birth weight (LBW) data in Indonesia. This research conducts two ML tasks; including prediction and classification. The binary logistic regression model was firstly employed on the train and the test data. Then; the random approach was also applied to the data set. The results showed that the binary logistic regression had a good performance for prediction; but it was a poor approach for classification. On the other hand; random forest approach has a very good performance for both prediction and classification of the LBW data set.
Downloads
References
Alpaydin, E. (2010). Introduction to machine learning (2nd ed.). The MIT Press.
Austin, M. P. (2002). Spatial prediction of species distribution: an interface between ecological
theory and statistical modelling. Ecological modelling, 157(2-3), 101-118.
Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32.
Chen, R., and Herskovits, E.H. (2010). Machine-learning techniques for building a diagnostic
model for very mild dementia. Euroimage, 52(1), 234–244.
Dahlui, M., Azahar, N., Oche, O. C., and Aziz, N. A. (2016). Risk factors for low birth weight in
nigeria: evidence from the 2013 Nigeria demographic and health survey. Global Health
Action, 9, 28822.
Firdaus, C., Wahyudin, W., & Nugroho, E. P. (2017). Monitoring System with Two Central
Facilities Protocol. Indonesian Journal of Science and Technology, 2(1), 8-25.
Gunawan, A. A. S., Falah, A.N., Faruk, A., Lutero, D.S., Ruchjana, B.N., and Abdullah, A. S. (2016).
Spatial data mining for predicting of unobserved zinc pollutant using ordinary point
kriging. Proceedings of International Workshop on Big Data and Information Security
(IWBIS), 2016, 83-88.
Kleinbaum, D.G, and Klein, M. (2010) Logistic regresion a self-learning text (3rd ed). Springer.
Last, M., Kandel, A., and Bunke, H. (2004). Data mining in time series databases. World
Scientific Publishing Co. Pte. Ltd.
Liaw, A., and Wiener, M. (2002). Classification and regression by random forest. R News, 2(3),
-22.
Makhabel, B. (2015). Learning data mining with R. Packt Publishing Ltd.
Riza, L. S., Nasrulloh, I. F., Junaeti, E., Zain, R., and Nandiyanto, A. B. D. (2016). gradDescentR:
An R package implementing gradient descent and its variants for regression tasks.
Proceedings of Information Technology, Information Systems and Electrical Engineering
(ICITISEE), 2016, 125-129.
Tampah-Naah, A.M., Anzagra, L., and Yendaw, E. (2016). Factors correlated with low birth
weight in Ghana. British Journal of Medicine and Medical Research, 16(4), 1-8.
Downloads
Published
Issue
Section
License
Copyright (c) 2018 Universitas Pendidikan Indonesia

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.