Skip to content

This notebook implements a structured machine learning pipeline to classify cosmic data using the CatBoost Classifier, known for its efficiency with categorical features and minimal preprocessing requirements.

Notifications You must be signed in to change notification settings

ayushsaksena30/Cosmic-Classifier

Repository files navigation

Cosmic Classifier

This project is a machine learning-based classifier designed to analyze and categorize cosmic data. It leverages the CatBoost classifier along with various preprocessing techniques to ensure robust and accurate classification.

Team Members

  1. Ayush Saksena (cogni2047191)

  2. Prince Raj (cogni2047190)

  3. Tanishka Nibariya (cogni2047075)

  4. Ratan Jyoti Jaiswal (cogni2047342)

Features

  • Data preprocessing using pandas, numpy, sklearn

  • Robust scaling and imputation techniques

  • CatBoost for efficient and high-performance classification

  • Performance evaluation using accuracy metrics

Installation

All necessary libraries are already mentioned in the Python Notebook. No Need to install additional dependencies.

Usage

  1. Open Google Colab and upload the Python Notebook.

    Goto File  ->  Upload Notebook  ->  Upload.
    
  2. Upload Train and Test dataset from the Github Repository to the Session Storage manually.

    Goto File on sidebar  ->  Upload  ->  Upload Train and Test dataset files from Github Repository.
    
  3. Run through the cells to train and evaluate the model.

Dataset

The dataset files must be uploaded manually each session via the file upload dialog.

Ensure the files are named exactly as in the github repository before loading them into pandas.

Results

The model's performance is evaluated based on accuracy and other metrics, with visualizations powered by matplotlib and seaborn.

The Google Collab Notebook will prompt to download Submissions.csv file towards the end of the execution. The same csv file is also provived in the repository.

Contributions

Feel free to fork the repository and submit pull requests to improve the classifier.

About

This notebook implements a structured machine learning pipeline to classify cosmic data using the CatBoost Classifier, known for its efficiency with categorical features and minimal preprocessing requirements.

Topics

Resources

Stars

Watchers

Forks