<?xml version="1.0" encoding="UTF-8"?>
<doi_batch xmlns="http://www.crossref.org/schema/4.4.2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:jats="http://www.ncbi.nlm.nih.gov/JATS1" version="4.4.2" xsi:schemaLocation="http://www.crossref.org/schema/4.4.2 http://www.crossref.org/schema/deposit/crossref4.4.2.xsd">
  <head>
    <doi_batch_id>16-LQP-ARJCS</doi_batch_id>
    <timestamp>20251231000000</timestamp>
    <depositor>
      <depositor_name>Lumina Quest Publishing</depositor_name>
      <email_address>m.arslansohail@gmail.com</email_address>
    </depositor>
    <registrant>Lumina Quest Publishing</registrant>
  </head>
  <body>
    <journal>
      <journal_metadata>
        <full_title>Advanced Research Journal of Computer Science</full_title>
        <abbrev_title>Adv. Res. J. Comput. Sci.</abbrev_title>
        <issn media_type="electronic">3134-884X</issn>
        <doi_data>
          <doi>10.XXXXX/arjcs</doi>
          <resource>https://lquestpub.com/archives.php?journal=advanced-research-journal-of-computer-science</resource>
        </doi_data>
      </journal_metadata>
      <journal_issue>
        <publication_date media_type="print">
          <month>12</month>
          <day>31</day>
          <year>2025</year>
        </publication_date>
        <publication_date media_type="online">
          <month>12</month>
          <day>31</day>
          <year>2025</year>
        </publication_date>
        <journal_volume>
          <volume>2</volume>
        </journal_volume>
        <issue>2</issue>
        <doi_data>
          <doi>10.XXXXX/arjcs20250102</doi>
          <resource>https://lquestpub.com/articles-list.php?journal=advanced-research-journal-of-computer-science&amp;volume=2&amp;issue=2</resource>
        </doi_data>
      </journal_issue>
      <journal_article publication_type="full_text">
        <titles>
          <title>Predictive Modeling for Diabetes Risk Assessment Using the Healthcare Diabetes Dataset: A Machine Learning Approach</title>
          <original_language_title>Predictive Modeling for Diabetes Risk Assessment Using the Healthcare Diabetes Dataset: A Machine Learning Approach</original_language_title>
        </titles>
        <contributors>
          <person_name sequence="first" contributor_role="author">
            <given_name>Maham</given_name>
            <surname>Nasir</surname>
          </person_name>
          <person_name sequence="additional" contributor_role="author">
            <given_name>Safina</given_name>
            <surname>Shahzadi</surname>
          </person_name>
          <person_name sequence="additional" contributor_role="author">
            <given_name>Muhammad</given_name>
            <surname>Usman</surname>
          </person_name>
          <person_name sequence="additional" contributor_role="author">
            <given_name>Muhammad Tehseen</given_name>
            <surname>Qureshi</surname>
          </person_name>
        </contributors>
        <jats:abstract xml:lang="en">
          <jats:p>Diabetes mellitus remains a major global health challenge, with early prediction critical for intervention. This study leverages the Healthcare Diabetes Dataset [1] - comprising 768 records of female patients of Pima Indian heritage - to develop a binary classification model for diabetes diagnosis (Outcome: 0 = No, 1 = Yes). Key attributes include Pregnancies, Glucose, Blood Pressure, Skin Thickness, Insulin, BMI, Diabetes Pedigree Function and Age. After addressing disguised missing values (zeros in clinically impossible fields), Exploratory Data Analysis (EDA), feature scaling and an 80/20 train-test split, a Logistic Regression model was trained and evaluated. The model achieved 77.26% accuracy, with strong performance on the majority class (non-diabetic: Precision 0.79, recall 0.90) but moderate recall on the diabetic class (0.52), reflecting typical challenges with class imbalance. Results align with recent literature (70&amp;ndash;80% range for Logistic Regression on this dataset). This paper provides a complete, reproducible pipeline, discusses ethical considerations, compares with advanced methods and suggests improvements. It serves as an educational benchmark for healthcare predictive analytics.</jats:p>
        </jats:abstract>
        <publication_date media_type="online">
          <month>12</month>
          <day>31</day>
          <year>2025</year>
        </publication_date>
        <publication_date media_type="print">
          <month>12</month>
          <day>31</day>
          <year>2025</year>
        </publication_date>
        <pages>
          <first_page>35</first_page>
          <last_page>38</last_page>
        </pages>
        <doi_data>
          <doi>10.XXXXX/arjcs2025010201</doi>
          <resource>https://lquestpub.com/article/10.XXXXX/arjcs2025010201</resource>
        </doi_data>
      </journal_article>
    </journal>
  </body>
</doi_batch>
