HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,590 153 Updated Jun 17, 2026

Stable Looped Models and their Scaling Laws

Python 167 11 Updated May 17, 2026

An alignment auditing agent capable of quickly exploring alignment hypotheses

Python 1,244 201 Updated Jun 26, 2026

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

C++ 5,743 606 Updated Jun 30, 2026

Fully automatic censorship removal for language models

Python 25,624 2,765 Updated Jun 27, 2026

Research code base for Automatic Textbook Formalization

Python 155 11 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 7,156 1,014 Updated Jun 30, 2026

An interface library for RL post training with environments.

Python 2,373 403 Updated Jun 30, 2026

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.

Jupyter Notebook 94 10 Updated Jan 13, 2026

Super basic implementation (gist-like) of RLMs with REPL environments.

Python 812 134 Updated Jan 7, 2026

Like catnip, a highly addictive agentic coding tool

Go 489 41 Updated Jun 30, 2026

Code for tuning Smart Tab Grouping models for Firefox

Jupyter Notebook 21 4 Updated Dec 4, 2025

Clean, reusable paper implementations for trending papers on alphaXiv

Python 188 19 Updated Mar 17, 2026
Python 85 17 Updated Feb 18, 2026

OpenTelemetry Instrumentation for AI Observability

Python 1,061 264 Updated Jun 30, 2026

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,464 496 Updated Jun 29, 2026

OpenAI Guardrails - Python

Python 216 35 Updated Mar 28, 2026

Pretraining data reconstruction scripts for Apertus

Python 143 14 Updated Oct 27, 2025

Tech Report of the Apertus LLM

137 5 Updated Mar 9, 2026

Response format to be used with Apertus

Python 14 1 Updated Dec 3, 2025

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Python 5,046 354 Updated May 25, 2026

Open-source benchmark evaluating the performance of web operators/agents

Python 48 7 Updated Apr 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,199 2,101 Updated Jun 9, 2026

A library for making RepE control vectors

Jupyter Notebook 738 64 Updated Sep 24, 2025

Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 148 7 Updated Sep 20, 2024

Democratizing Reinforcement Learning for LLMs

Python 5,667 581 Updated Jun 30, 2026

Temporal Python SDK

Python 1,118 201 Updated Jun 29, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python 10,213 912 Updated Jun 29, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 2,045 369 Updated Jun 30, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 5,038 450 Updated Nov 13, 2025