TokenSpeed-Kernel: Portable APIs and High-Performance Kernels for Multi-Silicon LLM Inference
TokenSpeed-kernel is a standalone open-source subsystem for LLM inference kernels, using layered APIs and registry-based dispatch to separate runtime logic from hardware-specific backend implementations.