- 👋 Hi, I’m @DerrickYLJ
- 👀 I’m interested in ...
- 🌱 I’m currently learning ...
- 💞️ I’m looking to collaborate on ...
- 📫 How to reach me ...
Pinned
- TidalDecode (Public): TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
- flexflow/flexflow-train (Public): Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
- Blocking_Waived_Estimation (Public): This repo aims to solve the worst-case delay of relatively complicated network architectures with [1] Trajectory Approach; [2] Network Calculus; [3] Compositional Performance Analysis (CPA); and [4] Flo…
Python 2
- mit-han-lab/TinyChatEngine (Public): TinyChatEngine: On-Device LLM Inference Library