Multivariate Logistic Regression for predicting probability of Heart Attacks #813
j-craggy
announced in
2024 Plotnine Contest
Replies: 3 comments
-
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Ahoy @j-craggy , |
Beta Was this translation helpful? Give feedback.
0 replies
-
@j-craggy, The winning submission has been announced here. Thank you again for taking part in the contest. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Authors
Jeff Craggy
Links
https://www.jeffcraggy.com
Full description
DATA:
Data is found in Data Mining for the Masses, 3rd Edition
We have access to the company’s medical claims database. With this access, she is able to
generate two data sets for us. This first is a list of people who have suffered heart attacks, with an
attribute indicating whether or not they have had more than one; and the second is a list of those
who have had a first heart attack, but not a second. The former data set, comprised of 138
observations, will serve as our training data; while the latter, comprised of 690 peoples’ data, will be
for scoring. This data is used to help this latter group of people avoid becoming second heart attack
victims. In compiling the two data sets we have defined the following attributes:
Exploratory Data Analysis
Looking at the training data set, we can see that some statistically significant predictors of a heart attack are:
A second run narrowed by these three indicators show an even greater significance.
References
North, A. Matthew (2012). Data Mining for the Masses. Global Text Project Book. Creative Commons Attribution 3.0 Licence
Code repository
Github: https://github.com/JBC-Inc/jbcPlotnine2
Plot: https://kraggle.shinyapps.io/jbcplotnine221/
Beta Was this translation helpful? Give feedback.
All reactions