Skip to content
View ved1beta's full-sized avatar
πŸŒ‹
back to work
πŸŒ‹
back to work

Block or report ved1beta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ved1beta/README.md

image

Things I Do: )

  • Triton: making custom triton kernels for better optimizations, working on some big kernel projects
  • Cuda: cuda architecture for better understanding of kernels and triton
  • Deep Learning: comp vision, NLP etc. : )

Technical Skills πŸ› οΈ

  • Languages: Python, CUDA, C++
  • Frameworks & Libraries: Pytorch, Pandas, Matplotlib, triton, Mpi4py
  • Tools & Platforms: GitHub, Docker, Vercel, Neovim, Vscode, Jupyter Notebook, Aws
  • Machine Learning Specialist: Proficient in statistical analysis, predictive modeling (Regression, Decision Trees, Random Forest), and advanced algorithms (CatBoost, SGD) with strong focus on optimization and accuracy.

Key Projects πŸ“š

CUDA

  • GPU Sanghathan: Small scale distributed training of sequential deep learning models, built on Numpy and MPI.
  • Cuda writer: writing cuda kernels from scratch vec_add to flash_attention and model implementation from scratch.
  • Flash attention: Implementation of flash attention in tritonutilization

Machine learning

  • Paligemma-Google: Implemented paligemma vision language model by google from scratch paper

  • Transformer: Implemented Transformer language model by Google from scratch paper

  • Mixture of Experts: Mixture of Experts (MoE) model with a focus on efficient routing and expert

  • Triton/CUDA kernels in my free time : )

Connect with Me πŸ“¬

  • 🐦 Twitter
  • πŸ“« Email
  • πŸ”— LinkedIn I'm looking forward to collaborating on projects that are at the intersection of technology and social good. Let's connect! 🌍

Pinned Loading

  1. axolotl-ai-cloud/axolotl axolotl-ai-cloud/axolotl Public

    Go ahead and axolotl questions

    Python 10.7k 1.2k

  2. bitsandbytes-foundation/bitsandbytes bitsandbytes-foundation/bitsandbytes Public

    Accessible large language models via k-bit quantization for PyTorch.

    Python 7.7k 791

  3. tinygrad/tinygrad tinygrad/tinygrad Public

    You like pytorch? You like micrograd? You love tinygrad! ❀️

    Python 30.4k 3.7k

  4. huggingface/transformers huggingface/transformers Public

    πŸ€— Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python 152k 30.9k

  5. GPU-sanghathan GPU-sanghathan Public

    Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

    Python 3

  6. Quanta Quanta Public

    "Efficient and scalable solutions for PyTorch, enabling large language model quantization with k-bit precision for enhanced accessibility.

    Python 1 2