Skip to content
#

moe

Here are 8 public repositories matching this topic...

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

  • Updated Jun 29, 2026
  • Go

Single-kernel Codex runtime that distills skills, agents, plugins, and workflows into atomic capabilities and routes them through one main chain. / 面向 Codex 的单核运行时:将技能、Agent、插件与工作流蒸馏为原子能力,并由唯一主链统一调度。

  • Updated Jun 20, 2026
  • Go

Improve this page

Add a description, image, and links to the moe topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the moe topic, visit your repo's landing page and select "manage topics."

Learn more