Analyzing Factors Affecting Cholesterol Levels: A Data-Driven Study Using Statistical Models and Machine Learning

  • Zahraa Tariq Mohammed Taher Medicine College- Family and Community Medicine/ Ninevah University- Mosul-Iraq
Keywords: Cholesterol, Predictive Models, Feature Importance, Classification, Risk Factors, Data Analysis, Framingham Heart Study

Abstract

High cholesterol levels are associated with various health complications, particularly cardiovascular diseases. Predicting an individual's cholesterol levels can be crucial in healthcare analytics to prevent and manage these conditions, leading to long-term health benefits and potential economic savings in healthcare systems. This paper focuses on analyzing the factors influencing blood cholesterol levels and developing predictive models for cholesterol levels using statistical and machine learning techniques. The study conducted an extensive analysis of determinants governing cholesterol levels using a feature importance ratio and leveraged the Framingham Heart Study (FHS) dataset with machine learning techniques to predict cholesterol levels. The study found that age, BMI, and glucose levels consistently influenced cholesterol levels, whether classified into three levels or two levels. Machine learning models exhibited varying performance, with models like Random Forest and Gradient Boosting excelling in precision, recall, and F1-score in specific cholesterol categories. The results emphasize the importance of addressing age, BMI, and glucose levels in healthcare strategies for cholesterol management. They also highlight the need for continuous model refinement and fine-tuning to improve predictive accuracy in different cholesterol classification scenarios.

References

Huff, Trevor & Brandon Boyd & Ishwarlal Jialal. (2023). Physiology, Cholesterol. In StatPearls. StatPearls Publishing. https://www.ncbi.nlm.nih.gov/books/NBK470561/

Duan Y, Gong K, Xu S, Zhang F, Meng X, Han J. Regulation of cholesterol homeostasis in health and diseases: from mechanisms to targeted therapeutics. Signal Transduct Target Ther. 2022 Aug 2;7(1):265. doi: 10.1038/s41392-022-01125-5. PMID: 35918332; PMCID: PMC9344793.

Dybiec, Jill & Baran, Wiktoria & Dąbek, Bartłomiej & Fularski, Piotr & Młynarska, Ewelina & Radzioch, Ewa & Rysz, Jacek & Franczyk, Beata. (2023). Advances in Treatment of Dyslipidemia. International Journal of Molecular Sciences. 24. 13288. 10.3390/ijms241713288.

Sureshbabu, Jayanthi. (2023). Importance and Need of Medical Entomology and Medical Entomologist in Public Health. International Journal of Medical Sciences and Nursing Research. 3. 1-2. 10.55349/ijmsnr.20233312.

Hidayat, Anas. (2023). MANAGEMENT OF INCREASING PUBLIC KNOWLEDGE ABOUT THE IMPORTANCE OF MEDICAL RECORDS IN HEALTH CARE FACILITIES. Jurnal Pengabdian Masyarakat Permata Indonesia. 3. 7-11. 10.59737/jpmpi.v3i1.218.

Dickens, Brian & Sassanpour, Mana & Bischoff, Evan. (2023). The Effect of Chia Seeds on High-Density Lipoprotein (HDL) Cholesterol. Cureus. 15. 10.7759/cureus.40360.

Zhu, Chen & Wu, Juan & Wu, Yixian & Guo, Wen & Lu, Jing & Zhu, Wenfang & Li, Xiaona & Xu, Nianzhen & Zhang, Qun. (2022). Triglyceride to high-density lipoprotein cholesterol ratio and total cholesterol to high-density lipoprotein cholesterol ratio and risk of benign prostatic hyperplasia in Chinese male subjects. Frontiers in Nutrition. 9. 10.3389/fnut.2022.999995.

Katahira, Masahito & Imai, Shu & Ono, Satoko & Moriura, Shigeaki. (2023). Estimating Triglyceride Levels Using Total Cholesterol, Low-Density Lipoprotein Cholesterol, and High-Density Lipoprotein Cholesterol Levels: A Cross-Sectional Study. Metabolic syndrome and related disorders. 21. 10.1089/met.2023.0045.

Wu, Shouling & Su, Xin & Zuo, Yingting & Chen, Shuohua & Tian, Xue & Xu, Qin & Zhang, Yijun & Zhang, Xiaoli & Wang, Penglian & He, Yan & Wang, Anxin. (2023). Discordance between remnant cholesterol and low-density lipoprotein cholesterol predicts arterial stiffness progression. Hellenic Journal of Cardiology. 10.1016/j.hjc.2023.05.008.

Siddharth, Saurav & Farooq, Bilkisa & Kumar, Nirnay & Burhan, Mirza. (2023). Effect of Lifestyle in Female Infertility: A Review Based Study. International Journal for Research in Applied Science and Engineering Technology. 11. 1777-1783. 10.22214/ijraset.2023.56307.

Hernández-Arango, Alejandro & Arias, María & Pérez, Viviana & Chavarría, Luis & Jaimes, Fabian & Mater, Alma. (2023). Prediction of the risk of adverse clinical outcomes with machine learning techniques in patients with chronic no communicable diseases.

Lukyanenko, Roman & Maass, Wolfgang & Storey, Veda. (2022). Trust in artificial intelligence: From a Foundational Trust Framework to emerging research opportunities. Electronic Markets. 32. 3. 10.1007/s12525-022-00605-4.

Ahuja, Abhimanyu. (2019). The impact of artificial intelligence in medicine on the future role of the physician. PeerJ. 7. e7702. 10.7717/peerj.7702.

Lokpo, Sylvester & Laryea, Roger & Osei-Yeboah, James & Owiredu, William & Ephraim, Richard & Adejumo, Esther & Ametepe, Samuel & Appiah, Michael & Nogo, Peter & Affrim, Patrick & Precious Kwablah, Kwadzokpui & Abeka, Ohene Kweku. (2022). The pattern of dyslipidaemia and factors associated with elevated levels of non-HDL-cholesterol among patients with type 2 diabetes mellitus in the Ho municipality: A cross sectional study. Heliyon. 8. e10279. 10.1016/j.heliyon.2022.e10279.

Verbeek, Rutger & Hoogeveen, Renate & Langsted, Anne & Stiekema, Lotte & Verweij, Simone & Hovingh, G. Kees & Wareham, Nicholas & Khaw, Kay-Tee & Boekholdt, S & Nordestgaard, Børge & Stroes, Erik. (2018). Cardiovascular disease risk associated with elevated lipoprotein(a) attenuates at low low-density lipoprotein cholesterol levels in a primary prevention setting. European heart journal. 39. 10.1093/eurheartj/ehy334.

Shao, Zeguo & Xiang, Yuhong & Zhu, Yingchao & Fan, Aiqin & Zhang, Peng. (2020). Influences of Daily Life Habits on Risk Factors of Stroke Based on Decision Tree and Correlation Matrix. Computational and Mathematical Methods in Medicine. 2020. 1-12. 10.1155/2020/3217356.

Schmidt, Gilda & Schneider, Christina & Gerlinger, Christoph & Endrikat, Jan & Gabriel, Lena & Ströder, Russalina & Müller, Carolin & Juhasz-Böss, Ingolf & Solomayer, Erich-Franz. (2020). Impact of body mass index, smoking habit, alcohol consumption, physical activity and parity on disease course of women with triple-negative breast cancer. Archives of Gynecology and Obstetrics. 301. 10.1007/s00404-019-05413-4.

Chua, Shiao & Yovich, Steven & Hinchliffe, Peter & Yovich, John. (2023). Male Clinical Parameters (Age, Stature, Weight, Body Mass Index, Smoking History, Alcohol Consumption) Bear Minimal Relationship to the Level of Sperm DNA Fragmentation. Journal of Personalized Medicine. 13. 759. 10.3390/jpm13050759.

Kuan, Valerie & Warwick, Alasdair & Hingorani, Aroon & Tufail, Adnan & Cipriani, Valentina & Burgess, Stephen & Sofat, Reecha & Fritsche, Lars & Igl, Wilmar & Cooke Bailey, Jessica & Grassmann, Felix & Sengupta, Sebanti & Bragg-Gresham, Jennifer & Burdon, Kathryn & Hebbring, Scott & Wen, Cindy & Gorski, Mathias & Kim, Ivana & Cho, David & Heid, Iris. (2021). Association of Smoking, Alcohol Consumption, Blood Pressure, Body Mass Index, and Glycemic Risk Factors With Age-Related Macular Degeneration: A Mendelian Randomization Study. JAMA Ophthalmology. 139. 10.1001/jamaophthalmol.2021.4601.

Oh, Gyu & Ko, Taehoon & Kim, Jin-Hyu & Lee, Min & Choi, Sae & Bae, Ye & Kim, Kyung & Lee, Hae-Young. (2022). Estimation of low-density lipoprotein cholesterol levels using machine learning. International Journal of Cardiology. 352. 10.1016/j.ijcard.2022.01.029.

Garcia-D'Urso, Nahuel & Climent i Pérez, Pau & Sanchez, Miriam & Martí, Ana & Guilló, Andrés & Azorin-Lopez, Jorge. (2022). A Non-Invasive Approach for Total Cholesterol Level Prediction Using Machine Learning. 10.1109/ACCESS.2022.3178419.

Liao, Pen-Chih & Chen, Ming-Shu & Jhou, Mao-Jhen & Chen, Tsan-Chi & Yang, Chih-Te & Lu, Chi-Jie. (2022). Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol. Diagnostics. 12. 1965. 10.3390/diagnostics12081965.

Krentz, Andrew & Haddon-Hill, Gabe & Zou, Xiaoyan & Pankova, Natalie & Jaun, André. (2023). Machine Learning Applied to Cholesterol-Lowering Pharmacotherapy: Proof-of-Concept in High-Risk Patients Treated in Primary Care. Metabolic syndrome and related disorders
Published
2024-07-29
How to Cite
Zahraa Tariq Mohammed Taher. (2024). Analyzing Factors Affecting Cholesterol Levels: A Data-Driven Study Using Statistical Models and Machine Learning. CENTRAL ASIAN JOURNAL OF MATHEMATICAL THEORY AND COMPUTER SCIENCES, 5(3), 220 - 230. Retrieved from https://cajmtcs.centralasianstudies.org/index.php/CAJMTCS/article/view/648
Section
Articles