PKUFlyingPig
🎯 Focusing


Fully open reproduction of DeepSeek-R1

Python 22,779 2,046 Updated Mar 14, 2025
Python 3,949 316 Updated Mar 12, 2025

Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing

Jupyter Notebook 33 2 Updated Jan 8, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,680 494 Updated Mar 7, 2025

Let your Claude think

TypeScript 14,708 1,709 Updated Mar 10, 2025

Annotated version of the Mamba paper

Jupyter Notebook 474 18 Updated Feb 27, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,719 129 Updated Jan 17, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 19,266 2,055 Updated Mar 11, 2025

O1 Replication Journey

1,972 65 Updated Jan 14, 2025

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

C++ 2,870 256 Updated Mar 11, 2025

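AISystem mainly refers to AI systems, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks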

Jupyter Notebook 12,857 1,854 Updated Mar 1, 2025

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,705 280 Updated Aug 10, 2024

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 237 15 Updated Jan 13, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 492 52 Updated Aug 19, 2024

Kernel-Bypass LibOS Architecture

Rust 1,100 126 Updated Mar 14, 2025

This repo demonstrates the concept of lossless compression with Transformers as the encoder and decoder.

Python 14 Updated May 2, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,250 557 Updated Oct 19, 2024

A PyTorch Native LLM Training Framework

Python 752 40 Updated Dec 27, 2024

Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"

Python 453 31 Updated Mar 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,482 899 Updated Jul 1, 2024

Official inference library for Mistral models

Jupyter Notebook 10,086 902 Updated Nov 12, 2024

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 674 72 Updated Oct 30, 2024

Automatic resource configuration for serverless workflows.

Python 20 2 Updated Mar 24, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 11,910 1,241 Updated Mar 14, 2025

A resilient distributed training framework

Python 89 5 Updated Apr 11, 2024

Learning material for CMU 10-714: Deep Learning Systems

Jupyter Notebook 239 37 Updated May 12, 2024

A simple bash script for switching between installed versions of CUDA.

Shell 621 142 Updated Dec 19, 2018

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 173,367 45,328 Updated Mar 14, 2025