dgx-spark

Star

Here are 175 public repositories matching this topic...

Avarok-Cybersecurity / atlas

Star

Pure Rust Inference Engine

rust cuda transformers ssm mamba dgx openai-api llm-inference speculative-decoding gb10 nvfp4 dgx-spark

Updated Jun 30, 2026
Rust

AEON-7 / Qwen3.6-27B-AEON-Ultimate-Uncensored-DFlash

Star

Fully uncensored, capability-enhanced abliteration of Qwen3.6-27B. NVFP4 + z-lab DFlash speculative decoding (n=12) on the unified ghcr.io/aeon-7/aeon-vllm-ultimate:latest container, tuned for long-context draft acceptance on DGX Spark. 6 HF variants (BF16/NVFP4/MTP/MTP-XS), docker-compose, and QuickStart.

quantization uncensored blackwell llm vllm qwen speculative-decoding abliteration qwen3 nvfp4 dgx-spark dflash

Updated Jun 28, 2026
Python

spark-arena / sparkrun

Star

sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems

inference llama-cpp vllm sglang dgx-spark

Updated Jun 29, 2026
Python

albond / DGX_Spark_Qwen3.5-122B-A10B-AR-INT4

Star

Qwen3.5-122B-A10B on DGX Spark: 28.3 → 51 tok/s (+80%)

cuda lossless mtp speedup performance-optimization vllm autoround dgx-spark qwen3-5 sm121 qwen3-5-122b-a10b

Updated Jun 2, 2026
Python

eelbaz / dgx-spark-vllm-setup

Star

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

machine-learning ai deep-learning gpu cuda pytorch nvidia arm64 blackwell llm vllm llm-inference gb10 dgx-spark

Updated Oct 28, 2025
Shell

eelbaz / dgx-spark-headless-sunshine

Star

Headless remote desktop setup for NVIDIA DGX SPARK using Sunshine streaming

remote-desktop dgx-spark

Updated Oct 25, 2025
Shell

joeynyc / spark-doctor

Star

Local diagnostic CLI for NVIDIA DGX Spark (GB10). Detects power caps, unified memory pressure, thermal risk, Docker/runtime issues, and validates vLLM/Ollama/llama.cpp/SGLang recipes.

cli nvidia diagnostics dgx llama-cpp vllm local-llm ollama sglang gb10 dgx-spark grace-blackwell nvidia-dgx-spark

Updated May 15, 2026
Python

AEON-7 / comfyui-aeon-spark

Star

Bleeding-edge ComfyUI for NVIDIA DGX Spark (GB10/Blackwell/sm_121a). CUDA 13 + SageAttention v3 (sm_121a) + NVFP4 + 14 custom-node packs + Flux 2 Dev / LTX 2.3 22B / ACE-Step v1.5 XL Turbo pre-bundled with abliterated text-encoder paths.

docker flux blackwell comfyui sageattention ltx-video ace-step nvfp4 dgx-spark sm-121a

Updated Jun 28, 2026
Shell

AEON-7 / vllm-dflash

Star

DFlash vLLM for DGX Spark — Plug & Play Block-Diffusion Speculative Decoding

docker inference nvidia blackwell llm vllm qwen speculative-decoding block-diffusion nvfp4 dgx-spark dflash

Updated Jun 28, 2026
Python

bjk110 / spark_vllm_docker

Star

DGX Spark / GB10 vLLM Docker stack for large-model serving, presets, patches, and validation notes.

docker docker-compose cuda llm-serving vllm qwen deepseek gb10 dgx-spark

Updated Jun 29, 2026
Python

jdaln / dgx-spark-inference-stack

Star

Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace Blackwell AI supercomputer on your desk. Mostly vLLM based for now and single-spark. For the not-so-rich buddies. If you want latest/in-testing, look at the branches

docker docker-compose cuda inference self-hosted llama model-serving mlops dgx generative-ai local-llm gb10 dgx-spark

Updated Jun 23, 2026
Shell

seanGSISG / dgx-spark-sunshine-setup

Star

Headless 4K remote desktop for the NVIDIA DGX Spark (GB10): one-command installer for Sunshine + Moonlight low-latency game streaming with NVENC hardware encoding, a software virtual display (no HDMI dummy plug), GDM autologin, and optional Tailscale.

Updated Jun 3, 2026
Shell

bjk110 / SPARK_Qwen3.5-122B-A10B-NVFP4

Star

vLLM + Qwen3.5-122B-A10B-NVFP4 on NVIDIA DGX Spark (GB10/SM121) — single-GPU NVFP4 W4A4 with MTP speculative decoding, self-contained Docker build

docker-compose dgx-spark vllm-server

Updated Mar 12, 2026
Python

DanTup / spark-evals

Sponsor

Star

Some benchmark results of small models and quants that fit on DGX Spark

ai benchmarks llms dgx-spark

Updated Jun 21, 2026
Python

botAGI / AGmind

Star

Private LLM/RAG platform in one command for NVIDIA DGX Spark / GB10 (arm64). Validated on real hardware.

docker ai docker-compose gpu self-hosted nvidia arm64 ai-agents dify rag ai-tools vllm open-webui rag-pipeline ragflow ai-stack dgx-spark agmind

Updated Jun 29, 2026
Shell

Entrpi / ds4-on-spark

Star

antirez/ds4 (DwarfStar 4) on NVIDIA DGX Spark — install, benchmarks, and roofline analysis. Steady-state decode at ~95% of bandwidth ceiling; MTP and concurrency analyzed.

benchmark cuda inference moe llm gguf gb10 dgx-spark deepseek-v4-flash

Updated Jun 4, 2026
Shell

theshiphq / claw-spark

Star

One-click AI agent setup for NVIDIA DGX Spark, Jetson, and RTX hardware. OpenClaw + Ollama, fully local.

amd gpu nvidia hetzner claw dgx-spark clawdbot moltbot openclaw nemoclaw

Updated Apr 16, 2026
Shell

Mekopa / whisperx-blackwell

Star

GPU-accelerated WhisperX on NVIDIA Blackwell (SM_121) - DGX Spark compatible

audio docker machine-learning deep-learning gpu cuda pytorch nvidia speech-recognition transcription asr speaker-diarization dgx blackwell pyannote whisperx dgx-spark sm-121

Updated Apr 23, 2026
Python

calico88x / DGX-Model-Manager

Star

Single-file web UI for NVIDIA DGX Spark — pull Ollama models, browse and download from HuggingFace, manage LiteLLM routing, and control SGLang, vLLM, llama.cpp, LocalAI, and ComfyUI. All from one browser tab.

web ai nvidia model-deployment fastapi ai-tools llm llm-tools gb10 dgx-spark dgxspark

Updated May 19, 2026
Python

CoconutMacaroon / blender-arm64

Star

Blender for ARM64 Linux with CUDA/OptiX/Vulkan support

blender nvidia dgx-spark

Updated Mar 26, 2026
Shell

Improve this page

Add a description, image, and links to the dgx-spark topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dgx-spark topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dgx-spark

Here are 175 public repositories matching this topic...

Avarok-Cybersecurity / atlas

AEON-7 / Qwen3.6-27B-AEON-Ultimate-Uncensored-DFlash

spark-arena / sparkrun

albond / DGX_Spark_Qwen3.5-122B-A10B-AR-INT4

eelbaz / dgx-spark-vllm-setup

eelbaz / dgx-spark-headless-sunshine

joeynyc / spark-doctor

AEON-7 / comfyui-aeon-spark

AEON-7 / vllm-dflash

bjk110 / spark_vllm_docker

jdaln / dgx-spark-inference-stack

seanGSISG / dgx-spark-sunshine-setup

bjk110 / SPARK_Qwen3.5-122B-A10B-NVFP4

DanTup / spark-evals

botAGI / AGmind

Entrpi / ds4-on-spark

theshiphq / claw-spark

Mekopa / whisperx-blackwell

calico88x / DGX-Model-Manager

CoconutMacaroon / blender-arm64

Improve this page

Add this topic to your repo