Organizations

@axolotl-ai-cloud @dylo-oss @sanshins

A lightweight inference engine supporting speculative speculative decoding (SSD).

Python 962 74 Updated May 10, 2026

ARC Relay — WebSocket relay server for agent remote control by Axolotl AI

Python 47 6 Updated Mar 30, 2026

Visual Causal Flow

Python 3,082 271 Updated Feb 3, 2026

High-performance Rust extensions for Axolotl (no OOM for large datasets) - drop-in acceleration for existing installations.

Python 3 Updated Jun 5, 2026

Nano vLLM

Python 14,243 2,265 Updated Apr 26, 2026

Python 1 Updated Oct 26, 2025

Disposable Temp mail service

Go 43 Updated Jun 17, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,578 316 Updated Jul 17, 2025

A safetensors extension to efficiently store sparse quantized tensors on disk

Python 1 Updated Jul 28, 2025

inference engine for LLMs

Python 2 1 Updated Nov 21, 2025

The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers

Python 1 Updated Dec 3, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 1 Updated May 31, 2026

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 1 Updated Jun 7, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 1 Updated Dec 15, 2025

Utils for Unsloth

Python 1 Updated Aug 16, 2025

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 1 Updated Dec 2, 2025

Efficient implementations of state-of-the-art sequence modeling architectures—using PyTorch and Triton.

Python 2 Updated Sep 22, 2025

Small-scale distributed training of sequential deep learning models, built on NumPy and MPI.

Python 3 Updated Apr 25, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,875 1,094 Updated Jun 30, 2026

PyTorch native quantization and sparsity for training and inference

Python 1 Updated Feb 1, 2026

Python 1 1 Updated Jun 6, 2025

A community curated list of Rust Language streamers

746 40 Updated Jan 7, 2024

Extremely fast Query Engine for DataFrames, written in Rust

Rust 38,892 2,923 Updated Jun 30, 2026

Node.js dependency tracing utility

JavaScript 1,650 176 Updated Jun 22, 2026

[Hackintosh] Configuration for Lenovo Thinkpad P50.

ASL 49 12 Updated Sep 14, 2023

Autonomous coding agent as an SDK, IDE extension, or CLI assistant.

TypeScript 64,101 6,800 Updated Jun 30, 2026

HelixDB is an OLTP graph-vector database built in Rust.

Rust 5,529 306 Updated Jun 25, 2026

Rust full node implementation of the Fuel v2 protocol.

Rust 56,984 2,861 Updated Jun 28, 2026

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 86,212 9,323 Updated Jun 30, 2026