    Repositories list

    • slime

      Public
      slime is an LLM post-training framework for RL Scaling.
      Python
      Apache License 2.0
      7315.4k184100 Updated Apr 18, 2026
    • CaRR

      Public
      This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric…
      Python
      MIT License
      76100 Updated Apr 8, 2026
    • IndexCache

      Public
      IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
      Other
      88531 Updated Mar 14, 2026
    • AgentBench

      Public
      A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
      Python
      Apache License 2.0
      2463.3k6110 Updated Feb 8, 2026
    • DataSciBench: An LLM Agent Benchmark for Data Science
      Python
      85500 Updated Jan 21, 2026
    • AgentRL

      Public
      Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
      Python
      MIT License
      2027280 Updated Jan 17, 2026
    • MobileRL

      Public
      Python
      MIT License
      87930 Updated Dec 23, 2025
    • Python
      Apache License 2.0
      62640 Updated Nov 7, 2025
    • PETra

      Public
      Python
      0200 Updated Nov 5, 2025
    • AlignBench

      Public
      A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)
      Python
      29425150 Updated Oct 25, 2025
    • Python
      21220 Updated Oct 15, 2025
    • DeepDive

      Public
      DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
      Python
      2929720 Updated Oct 2, 2025
    • TDRM

      Public
      Python
      Apache License 2.0
      1900 Updated Sep 25, 2025
    • ReST-RL

      Public
      Reinforcing LLM Reasoning through Self-Training and Value-Guided Decoding
      Python
      MIT License
      01400 Updated Sep 18, 2025
    • INFTY

      Public
      INFTY Engine: An Optimization Toolkit to Support Continual AI
      Python
      MIT License
      956800 Updated Sep 13, 2025
    • Python
      MIT License
      2231220 Updated Aug 18, 2025
    • SWE-Dev

      Public
      [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
      Python
      MIT License
      05910 Updated Jul 21, 2025
    • TypeScript SDK for Z.ai - Not yet released.
      TypeScript
      MIT License
      2810 Updated Jul 17, 2025
    • BiPro

      Public
      Code and data for the paper "BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework" (ACL 2025 main)
      Python
      0600 Updated Jun 28, 2025
    • [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
      Python
      Apache License 2.0
      1851.9k282 Updated Jun 24, 2025
    • TreeRL

      Public
      TreeRL: LLM Reinforcement Learning with On-Policy Tree Search (ACL'25)
      Python
      Apache License 2.0
      89040 Updated Jun 16, 2025
    • WebRL

      Public
      Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
      Python
      3751700 Updated Jun 6, 2025
    • Python
      Apache License 2.0
      11310 Updated May 29, 2025
    • Code, data, and model for the paper "AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models" (ACL'25 main)
      Python
      2510 Updated May 20, 2025
    • CogKit

      Public
      Finetuning and inference tools for the CogView4 and CogVideoX model series.
      Python
      Apache License 2.0
      17124181 Updated May 14, 2025
    • Towards Large Multimodal Models as Visual Foundation Agents
      Python
      Apache License 2.0
      11263160 Updated Apr 24, 2025
    • Source code for the paper "A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models" (ICML 2025)
      Python
      23900 Updated Apr 2, 2025
    • Parameter-Efficient Fine-Tuning for Foundation Models
      411300 Updated Mar 31, 2025
    • WebGLM

      Public
      WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
      Python
      Apache License 2.0
      1351.6k511 Updated Mar 25, 2025
    • WhoIsWho

      Public
      KDD'23 Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit
      Python
      175060 Updated Mar 19, 2025