Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the twentytwentyone domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /home1/moderna7/public_html/wp-includes/functions.php on line 6131

Warning: Cannot modify header information - headers already sent by (output started at /home1/moderna7/public_html/wp-includes/functions.php:6131) in /home1/moderna7/public_html/wp-includes/feed-rss2-comments.php on line 8
Comments on: Data Science Practice – Classifying Heart Disease https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/ Documenting my path from "SQL Data Analyst pursuing an Engineering Master's Degree" to "Data Scientist" Mon, 08 May 2017 17:56:33 +0000 hourly 1 https://wordpress.org/?v=6.9.4 By: Kyle Hooks https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-3273 Mon, 08 May 2017 17:56:33 +0000 https://www.becomingadatascientist.com/?p=385#comment-3273 I found if you prune some less useful features you can get up to 78%. I haven’t tried testing various initialization values for a classifier yet, but viewing it as regression got me to a stdev of 0.22 after testing many parameter initialization values. I’m not sure how to test the “accuracy” here without some threshold value, but I’d say it’s 90% accurate by eyeballing it (if you round to nearest integer values).

]]>
By: dibya https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-3235 Tue, 11 Apr 2017 06:19:00 +0000 https://www.becomingadatascientist.com/?p=385#comment-3235 hello very nice project for beginners. can you guide me how to proceed further? how to improve the accuracy??

]]>
By: AND https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-2764 Sat, 29 Oct 2016 19:40:40 +0000 https://www.becomingadatascientist.com/?p=385#comment-2764 heart_test_results[‘correct’] = heart_test_results[‘predict’] == goal_test_df[‘actual’]
File “/home/admin-pc/anaconda3/lib/python3.5/site-packages/pandas/core/ops.py”, line 735, in wrapper
raise ValueError(‘Series lengths must match to compare’)

I am unable to figure this error :/

]]>
By: Renee https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-2699 Mon, 22 Aug 2016 05:31:38 +0000 https://www.becomingadatascientist.com/?p=385#comment-2699 In reply to aziz.

It’s not my dataset. If you’re having trouble accessing it via the UCI respository, you should probably let them know. Sorry, but I don’t have time to re-download and email datasets to individuals when they’re available online.

]]>
By: aziz https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-2698 Sun, 21 Aug 2016 04:53:22 +0000 https://www.becomingadatascientist.com/?p=385#comment-2698 Hello,
I couldn’t download the datase. can you mail me? [admin removed email]

]]>
By: Renee https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-2682 Mon, 08 Aug 2016 14:24:30 +0000 https://www.becomingadatascientist.com/?p=385#comment-2682 In reply to Manasi.

Hm, I used the one that is here called processed.Cleveland.data, I think
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/

]]>
By: Manasi https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-2678 Mon, 01 Aug 2016 09:13:49 +0000 https://www.becomingadatascientist.com/?p=385#comment-2678 Hello,
I couldn’t download the dataset. I couldnt find the one with 76 attributes. Could you please pass over that particular dataset?

Manasi

]]>
By: Renee https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-1733 Mon, 16 Mar 2015 04:07:28 +0000 https://www.becomingadatascientist.com/?p=385#comment-1733 In reply to Felipe Bormann.

Hi Felipe,

Great, I’m glad to have given you courage to post about your data science adventures! Post a link here when you get that started! You can also follow me on twitter at @becomingdatasci.

Renee

]]>
By: Felipe Bormann https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-1730 Thu, 12 Mar 2015 22:18:32 +0000 https://www.becomingadatascientist.com/?p=385#comment-1730 Congratulations, I’m starting to learn data science as well, like, I’ve just started computer science but I have a feeling it is the right field for me, I’ll start posting my adventures really soon, you have gave me the courage, please, keep up the good work, is there somewhere else I can follow you?. Have a great day.

Oh, I’m from Recife, Brazil o/!

]]>
By: Renee https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-1712 Mon, 23 Feb 2015 02:44:24 +0000 https://www.becomingadatascientist.com/?p=385#comment-1712 In reply to Natalie.

Hi, thanks Natalie! I haven’t used dplyr (my R experience is very limited), and I was comparing to the “manual” (without a package to help) data importing I had done in past projects when I first learned Python, where I had to manipulate the columns using indexes to get what I wanted into the final set, and where the importing was less straightforward. You can see some of that here: https://www.becomingadatascientist.com/2014/04/10/ml-project-2-post-2/
I enjoyed being able to work with the columns by referencing their names. I know I just touched on the surface of what Pandas can do, and yes I am planning to do more practice projects like this soon!

]]>
By: Natalie https://www.becomingadatascientist.com/2015/01/19/data-science-practice-classifying-heart-disease/#comment-1709 Wed, 18 Feb 2015 08:19:39 +0000 https://www.becomingadatascientist.com/?p=385#comment-1709 Very interesting! Did you feel pandas was better with import and manipulation? Compared with, say, using the dplyr package for R? Or if you don’t use that, what are you comparing with? If you’ve got more exploratory posts like this in the pipeline I hope you post them soon, because I enjoyed reading this :)

]]>