train-test-split
Here are 196 public repositories matching this topic...
GraphPart, a data partitioning method for ML on biological sequences
-
Updated
Oct 26, 2023 - Jupyter Notebook
This library allows reading and converting bounding box annotations in many popular formats
-
Updated
Jun 9, 2023 - Python
🩺 Machine Learning diabetes prediction model using Support Vector Machine (SVM) classifier. Analyzes 8 medical features (glucose, BMI, age, etc.) from Pima Indian dataset to predict diabetes risk with 75-80% accuracy. Built with Python, scikit-learn, pandas. Includes data preprocessing, model training, and prediction system for diabetes..
-
Updated
Jul 25, 2025 - Jupyter Notebook
🍾 A comprehensive machine learning project using Random Forest algorithm to predict wine quality based on physicochemical properties. Features EDA, model training, hyperparameter tuning, feature importance analysis, and detailed documentation.
-
Updated
Jul 26, 2025 - Jupyter Notebook
🫀 A machine learning project using logistic regression to predict heart disease risk from clinical data. Built with Python, scikit-learn, and Jupyter notebooks. Achieves 85%+ accuracy on 303-patient dataset with 13 medical features. Complete ML pipeline from data exploration to model evaluation.
-
Updated
Jul 25, 2025 - Jupyter Notebook
🪨 Machine learning project using logistic regression to classify sonar signals as either rocks or mines. Uses scikit-learn to train a binary classifier on sonar dataset with 60 numerical features for accurate underwater object detection.
-
Updated
Jul 25, 2025 - Jupyter Notebook
This is an algorithm for evenly partitioning.
-
Updated
Aug 28, 2025 - Python
A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.
-
Updated
Mar 29, 2025 - Jupyter Notebook
📁 Repo for python_splitter Python package. This package can split Images into Train, Test, Validation folders automatically by shuffling media/images for machine learning.
-
Updated
Jan 1, 2024 - Python
protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.
-
Updated
Oct 8, 2025 - Python
This repository contains the code and resources for our participation (team of 4 members) in the IIMS Hackathon 2024, where our team developed an award-winning image segmentation model for autonomous vehicles. Our model was recognized for its accuracy and innovation, earning us the Best Model award.
-
Updated
Aug 12, 2025 - Jupyter Notebook
To create a Decision Tree classifier and visualize it graphically, the purpose is if we feed any new data to this classifier, it would be able to predict the right class accordingly.
-
Updated
Sep 26, 2022 - Jupyter Notebook
This repository contains introductory notebook for logistic regression
-
Updated
Nov 17, 2022 - Jupyter Notebook
Trained and evaluated two supervised machine learning models using original and resampled data to identify 'healthy loan' and 'high risk loan' applicants from financial disclosures.
-
Updated
Jun 7, 2023 - Jupyter Notebook
This project is designed to extract sales data from a PostgreSQL database, process it, and use a Random Forest model to predict sales quantities. It also visualizes real and predicted sales for better understanding.
-
Updated
Jan 6, 2025 - Python
This project applies a Random Forest Classifier to predict whether a student will Pass or Fail based on their features
-
Updated
Aug 20, 2025 - Jupyter Notebook
Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).
-
Updated
Dec 14, 2024 - Python
Supervised-ML-Decision-Tree-C5.0-Entropy-Iris-Flower-Using Entropy Criteria - Classification Model. Import Libraries and data set, EDA, Apply Label Encoding, Model Building - Building/Training Decision Tree Classifier (C5.0) using Entropy Criteria. Validation and Testing Decision Tree Classifier (C5.0) Model
-
Updated
Nov 9, 2021 - Jupyter Notebook
A Python module for time series cross-validation using Combinatorial Purged Cross-Validation (CPCV) with embargo to prevent data leakage.
-
Updated
Sep 4, 2025 - Python
This creates an AWS Chatbot to give users their investment portfolio based on their risk tolerance level i.e. conservative, moderate, or aggressive. With the use of machine learning, the tool will be created to different portfolios based off that.
-
Updated
Nov 5, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to the train-test-split topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the train-test-split topic, visit your repo's landing page and select "manage topics."