{"id":3190,"date":"2025-01-20T12:24:01","date_gmt":"2025-01-20T12:24:01","guid":{"rendered":"https:\/\/muhammadsoliman.com\/?p=3190"},"modified":"2025-01-20T12:24:02","modified_gmt":"2025-01-20T12:24:02","slug":"beginner-friendly-project-that-builds-on-your-foundational-data-science-skills-on-kaggle","status":"publish","type":"post","link":"https:\/\/muhammadsoliman.com\/index.php\/2025\/01\/20\/beginner-friendly-project-that-builds-on-your-foundational-data-science-skills-on-kaggle\/","title":{"rendered":"beginner-friendly project that builds on your foundational data science skills on Kaggle"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">After completing the <strong>Titanic Kaggle project<\/strong>, it&#8217;s a good idea to choose another beginner-friendly project that builds on your foundational data science skills. Here are some great second projects to consider on Kaggle:<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">1. <strong>House Prices: Advanced Regression Techniques<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Introduces regression techniques.<\/li>\n\n\n\n<li>Helps you practice feature engineering, handling missing data, and data transformations.<\/li>\n\n\n\n<li>Explores advanced topics like log transformation and interaction terms.<\/li>\n\n\n\n<li>Excellent for learning how to predict continuous variables.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Regression modeling.<\/li>\n\n\n\n<li>Feature selection.<\/li>\n\n\n\n<li>Handling skewed data and outliers.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/c\/house-prices-advanced-regression-techniques\" target=\"_blank\" rel=\"noopener\">House Prices Competition<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">2. <strong>Zillow&#8217;s Home Value Prediction<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Continuation of working with real estate data.<\/li>\n\n\n\n<li>Focuses on handling larger datasets and understanding real-world prediction challenges.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Advanced regression.<\/li>\n\n\n\n<li>Working with time-series-like data.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">3. <strong>Digit Recognizer (MNIST Data)<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Introduces image recognition and machine learning techniques.<\/li>\n\n\n\n<li>Explores classification problems with clear goals.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Working with image datasets.<\/li>\n\n\n\n<li>Neural networks and deep learning basics (if you use TensorFlow\/Keras).<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/c\/digit-recognizer\" target=\"_blank\" rel=\"noopener\">Digit Recognizer<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">4. <strong>Predicting Loan Default (Loan Prediction Dataset)<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Focuses on classification and business analytics.<\/li>\n\n\n\n<li>Helps you learn to interpret results in business contexts.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Logistic regression, decision trees, and ensemble techniques.<\/li>\n\n\n\n<li>Understanding categorical variables and their impact.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/altruistdelhite04\/loan-prediction-problem-dataset\" target=\"_blank\" rel=\"noopener\">Loan Prediction<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">5. <strong>Heart Disease Prediction<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Provides medical data to analyze, which is becoming increasingly popular in data science.<\/li>\n\n\n\n<li>Encourages exploratory data analysis (EDA) and feature importance.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Classification problems.<\/li>\n\n\n\n<li>Feature selection and understanding correlations.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/ronitf\/heart-disease-uci\" target=\"_blank\" rel=\"noopener\">Heart Disease Dataset<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">6. <strong>Pima Indians Diabetes Dataset<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Small dataset focused on binary classification.<\/li>\n\n\n\n<li>Ideal for practicing machine learning pipelines.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Binary classification using logistic regression or decision trees.<\/li>\n\n\n\n<li>Model evaluation with precision, recall, and F1-score.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/uciml\/pima-indians-diabetes-database\" target=\"_blank\" rel=\"noopener\">Pima Indians Diabetes Dataset<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">7. <strong>Mall Customers Segmentation Dataset<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Why it\u2019s good for you<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Introduces unsupervised learning concepts like clustering.<\/li>\n\n\n\n<li>Helps you practice visualizations and interpret group behavior.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Skills you&#8217;ll develop<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>K-means clustering.<\/li>\n\n\n\n<li>Data visualization and grouping analysis.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Link<\/strong>: <a href=\"https:\/\/www.kaggle.com\/vjchoudhary7\/customer-segmentation-tutorial-in-python\" target=\"_blank\" rel=\"noopener\">Mall Customer Segmentation<\/a><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><\/h3>\n","protected":false},"excerpt":{"rendered":"<p>After completing the Titanic Kaggle project, it&#8217;s a good idea to choose another beginner-friendly project that&#8230;<\/p>\n","protected":false},"author":1,"featured_media":3191,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kad_blocks_custom_css":"","_kad_blocks_head_custom_js":"","_kad_blocks_body_custom_js":"","_kad_blocks_footer_custom_js":"","_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"_kad_post_classname":"","footnotes":""},"categories":[22,23],"tags":[],"class_list":["post-3190","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-science","category-machine-learning"],"taxonomy_info":{"category":[{"value":22,"label":"Data Science"},{"value":23,"label":"Machine learning"}]},"featured_image_src_large":["https:\/\/muhammadsoliman.com\/wp-content\/uploads\/2025\/01\/DALL\u00b7E-2025-01-20-06.23.25-A-futuristic-scene-of-a-Kaggle-competition-event-where-diverse-students-are-actively-engaged-in-learning-artificial-intelligence-machine-learning-an-1024x585.webp",1024,585,true],"author_info":{"display_name":"Muhammad Soliman","author_link":"https:\/\/muhammadsoliman.com\/author\/muhmmad-soliman\/"},"comment_info":0,"category_info":[{"term_id":22,"name":"Data Science","slug":"data-science","term_group":0,"term_taxonomy_id":22,"taxonomy":"category","description":"","parent":0,"count":3,"filter":"raw","cat_ID":22,"category_count":3,"category_description":"","cat_name":"Data Science","category_nicename":"data-science","category_parent":0},{"term_id":23,"name":"Machine learning","slug":"machine-learning","term_group":0,"term_taxonomy_id":23,"taxonomy":"category","description":"","parent":0,"count":3,"filter":"raw","cat_ID":23,"category_count":3,"category_description":"","cat_name":"Machine learning","category_nicename":"machine-learning","category_parent":0}],"tag_info":false,"_links":{"self":[{"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/posts\/3190","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/comments?post=3190"}],"version-history":[{"count":0,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/posts\/3190\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/media\/3191"}],"wp:attachment":[{"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/media?parent=3190"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/categories?post=3190"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/muhammadsoliman.com\/index.php\/wp-json\/wp\/v2\/tags?post=3190"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}