The GitHub repo for our survey paper: "Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models"

102 9 Updated Mar 28, 2026

Computer-use-agents / dart-gui

DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

Python 82 4 Updated Feb 26, 2026

SWE-Lego / SWE-Lego

SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving

Python 60 1 Updated Feb 28, 2026

LLM-MI-Research / Actionable-MI

14 Updated Jan 20, 2026

terminal-agent / reptile

💻 Terminal-Agent with Human-in-the-Loop Learning

Python 39 2 Updated Jan 16, 2026

NVlabs / COAT

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 262 25 Updated Aug 9, 2025

Aider-AI / polyglot-benchmark

Coding problems used in aider's polyglot benchmark

C++ 208 27 Updated Dec 22, 2024

Heng-xiu / smol_training_playbook_zh_tw

HTML 7 4 Updated Nov 9, 2025

inclusionAI / PromptCoT

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 134 14 Updated Jan 31, 2026

bytedance / Repo2Run

Repo2Run is an LLM-based agent that automates environment configuration by generating error-free Dockerfiles for Python repositories.

Python 169 28 Updated Nov 18, 2025

mmsearch-plus / MMSearch-Plus

[ICLR 2026] MMSearch-Plus: Benchmarking Provenance-Aware Search For Multimodal Browsing Agents

Python 7 Updated Feb 6, 2026

peng-weihan / SWE-QA-Bench

Python 47 8 Updated Jan 21, 2026

LongEmotion / LongEmotion

Python 11 1 Updated Jan 17, 2026

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,556 1,431 Updated Feb 27, 2026

Hambaobao / SWE-Flow

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Python 37 5 Updated Jun 29, 2025

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 3,566 491 Updated Mar 24, 2026

virattt / ai-hedge-fund

An AI Hedge Fund Team

Python 49,726 8,637 Updated Mar 28, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,300 530 Updated Mar 29, 2026

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python 70,102 8,783 Updated Mar 29, 2026

JierunChen / SFT-RL-SynergyDilemma

15 Updated Jan 14, 2026

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 606 115 Updated Mar 23, 2026

ganler / code-r1

Reproducing R1 for Code with Reliable Rewards

Python 301 18 Updated May 5, 2025

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,875 2,040 Updated Mar 24, 2026

Chaofan Tao ChaofanTao

Stars

bytedance / trae-agent

harbor-framework / harbor

abundant-ai / SWE-gen

GAIR-NLP / daVinci-Dev

menik1126 / Swing-Bench

BIT-DataLab / Edit-Banana

DeepSoftwareAnalytics / Awesome-Issue-Resolution

rattlesnakey / Awesome-Actionable-MI-Survey