Diabetes mellitus is a chronic disease characterized by hyperglycemia. It may cause many complications. According to the growing morbidity in recent years, in 2040, the world's diabetic patients will reach 642 million, which means that one of the ten adults in the future is suffering from diabetes. There is no doubt that this alarming figure needs great attention. With the rapid development of machine learning, machine learning has been applied to many aspects of medical health. In this study, we used decision tree, random forest and neural network to predict diabetes mellitus.
In this study, we used Pima Indians Dataset to predict the diabetes. And we get a better result. This web page shows the Pima Indian Dataset, which one of the datasets which we used in our study. |
  Description |
        There are 768 samples with 9 variables. The variables are: |
         1. pregnant, number of times pregnant
|
         2. glucose, plasma glucose concentration (glucose tolerance test) |
         3. pressure, diastolic blood pressure (mm/Hg) |
         4. triceps,triceps skin fold thickness (mm) |
         5. insulin, 2-Hour serum insulin (mu U/ml) |
         6. BMI, body mass index (weight in kg/(height in m)\^2) |
         7. pedigree, diabetes pedigree function |
         8. age, age (years) |
         9. diabetes, class variable (test for diabetes)
|
  Source |
        Original owners: National Institute of Diabetes and Digestive and Kidney Diseases |
        Donor of database: Vincent Sigillito (vgs@aplcen.apl.jhu.edu) |
  Result |
         |
  Download |
        Pima Indians Data |