Skip to content
Change the repository type filter

All

    Repositories list

    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Python
      MIT License
      5985.4k121Updated May 20, 2025May 20, 2025
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      7697.7k630Updated May 19, 2025May 19, 2025
    • Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      2777.8k00Updated May 15, 2025May 15, 2025
    • Integrate the DeepSeek API into popular softwares
      Creative Commons Zero v1.0 Universal
      3.6k32k8242Updated May 13, 2025May 13, 2025
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      8858.9k8620Updated May 7, 2025May 7, 2025
    • Other
      761.1k92Updated Apr 30, 2025Apr 30, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient MLA decoding kernels
      Cuda
      MIT License
      83312k410Updated Apr 29, 2025Apr 29, 2025
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3492.9k340Updated Apr 22, 2025Apr 22, 2025
    • MIT License
      12k89k16225Updated Apr 9, 2025Apr 9, 2025
    • Python
      MIT License
      16k97k4932Updated Apr 9, 2025Apr 9, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      1901.2k51Updated Mar 24, 2025Mar 24, 2025
    • Analyze computation-communication overlap in V3/R1.
      1421k100Updated Mar 21, 2025Mar 21, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      MIT License
      2952.8k40Updated Mar 10, 2025Mar 10, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      MIT License
      4144.6k226Updated Mar 5, 2025Mar 5, 2025
    • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.7k4.8k9316Updated Feb 26, 2025Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k17k15125Updated Feb 1, 2025Feb 1, 2025
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      5174.9k773Updated Sep 25, 2024Sep 25, 2024
    • DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      8795.8k503Updated Sep 24, 2024Sep 24, 2024
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      249611100Updated Sep 22, 2024Sep 22, 2024
    • Python
      MIT License
      23452080Updated Aug 16, 2024Aug 16, 2024
    • DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.5k22k10120Updated May 21, 2024May 21, 2024
    • DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      MIT License
      5663.8k402Updated Apr 24, 2024Apr 24, 2024
    • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      MIT License
      5122.7k321Updated Apr 15, 2024Apr 15, 2024
    • A curated list of open-source projects related to DeepSeek Coder
      20069700Updated Apr 3, 2024Apr 3, 2024
    • DeepSeek LLM: Let there be answers
      Makefile
      MIT License
      9946.4k262Updated Feb 4, 2024Feb 4, 2024
    • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      MIT License
      2791.7k163Updated Jan 16, 2024Jan 16, 2024