YoctoHan

Follow

❤️

YoctoHan YoctoHan

❤️

Follow

4 followers · 5 following

Beijing
03:00 (UTC +08:00)

Achievements

Achievements

Popular repositories Loading

FasterTransformer FasterTransformer Public

Forked from NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

C++
lmdeploy lmdeploy Public

Forked from InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

C++
aix_infer_trt aix_infer_trt Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++