- Paris
Lists (1)
Sort Name ascending (A-Z)
- All languages
- ActionScript
- Ada
- Agda
- AngelScript
- Assembly
- BASIC
- Batchfile
- Bikeshed
- BlitzBasic
- C
- C#
- C++
- CMake
- COBOL
- CSS
- Cairo
- Chapel
- Circom
- Clojure
- Common Lisp
- Coq
- Crystal
- Cuda
- D
- Dafny
- Dart
- Dhall
- Dockerfile
- EJS
- Elixir
- Elm
- Erlang
- F#
- F*
- Fortran
- Futhark
- GLSL
- Go
- HCL
- HLSL
- HTML
- Hack
- Haskell
- Haxe
- Isabelle
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Just
- KCL
- Koka
- Kotlin
- LLVM
- Lean
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Max
- Metal
- Mojo
- Move
- Nim
- Nix
- Noir
- Nushell
- OCaml
- Objective-C
- Odin
- OpenEdge ABL
- PHP
- Pascal
- Perl
- Pony
- PowerShell
- PureBasic
- Python
- Q#
- QML
- R
- Racket
- Reason
- Rocq Prover
- Roff
- Ruby
- Rust
- SCSS
- SMT
- SRecode Template
- Sage
- Sail
- Sass
- Scala
- Scheme
- Scilab
- Shell
- Solidity
- SourcePawn
- Stan
- Standard ML
- Svelte
- Swift
- SystemVerilog
- TLA
- TeX
- Twig
- TypeScript
- VHDL
- Verilog
- Vim Script
- Vue
- WGSL
- WebAssembly
- Wikitext
- Wren
- YAML
- YARA
- Yacc
- Zig
- q
Starred repositories
Free open-source community edition of DevCleaner - a fast development cache cleaner for macOS
Tile-Based Runtime for Ultra-Low-Latency LLM Inference
Systematic benchmark study of DeepSeek-V4-Flash inference on 4× NVIDIA RTX PRO 6000 Blackwell (TP=4, FP8 KV, MTP=2, 1M context). Sustained decode matrix + Estonia long-context profile.
Reverse engineering notes. Personal reference only. Everything here is a best-guess reconstruction.
A collection of tricks and tools to speed up transformer models
local-inference-lab / vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Nim CPS runtime with http1.1, http2, http3, ws, sse, webtransport, irc, dns and a React-like DSL, http server DSL, and wasm compilation
(at least a useful portion of) Temporal Logic of Actions, a.k.a. TLA in Lean 4
Dynamic Memory Management for Serving LLMs without PagedAttention
A SillyTavern fork with Bun as the backend, along with UI/UX improvements
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
A pure-Python implementation of the Nvidia CuTe layout algebra intended to be approachable and easy to learn.
High-performance safetensors model loader
Hub for ongoing Qwen inference benchmarks on NVIDIA Blackwell. Indexes all studies, hosts the rolling SOTA leaderboard, points to the toolchain.
Docker images for LLM inference (SGLang + vLLM) on NVIDIA Blackwell GPUs (SM120, CUDA 13.2)
Remove large amounts of unwanted applications quickly.
A modern system cleaner built in Go with a TUI and CLI.
A high performance Python graph library implemented in Rust.