Intro Use C++ CUDA and Python programming, for: LLM inference system Web Server building (software system) I am currently interning in the Paddle R&D team at Baidu. Contact E-mail: d31409163@163.com