Skip to content
#

sklearn-library

Here are 21 public repositories matching this topic...

This is a tutorial in an IPython Notebook for the Kaggle competition, Titanic Machine Learning From Disaster. The goal of this repository is to provide an example of a competitive analysis for those interested in getting into the field of data analytics or using python for Kaggle's Data Science competitions .

  • Updated Jul 1, 2022
  • Jupyter Notebook

This notebook is a study on the sales of newspapers of a local stand, with intention to predict the newspaper sales performance based on the different features available. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.

  • Updated Oct 15, 2024
  • Jupyter Notebook

This notebook explores and analyzes the Heart Disease UCI dataset using Python libraries such as NumPy, Pandas, Matplotlib, Seaborn, and scikit-learn. It includes data visualization, feature engineering, model building using Random Forest Classifier, and evaluation of the model's performance in predicting the presence or absence of heart disease.

  • Updated Apr 9, 2024
  • Jupyter Notebook

This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science

  • Updated Apr 29, 2024
  • Jupyter Notebook

Titanic challenge part 1 In this notebook, we will be covering all of the steps required to wrangle the Titanic data set into a format that is suitable for machine learning. We will do each of the following: impute missing values create new features (feature engineering) Part 2 of this challenge involves fitting and tuning a random forest to mak…

  • Updated Apr 6, 2021
  • Jupyter Notebook

This notebook is a study of the application of sklearn Logistic Regression model and analysis of metric quality with a focus on the impact of imbalanced data. The problem presented is the analysis of sales of newspapers of a local stand in order to classify the probability of the newspaper being Sold Out or Not, given a set of features.

  • Updated Nov 2, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the sklearn-library topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sklearn-library topic, visit your repo's landing page and select "manage topics."

Learn more