Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: tiktoken integration #60

Merged
merged 38 commits into from
Jul 24, 2024
Merged

Conversation

nguyenhoangthuan99
Copy link

@nguyenhoangthuan99 nguyenhoangthuan99 commented Jul 16, 2024

Issue: #49

@vansangpfiev vansangpfiev changed the title Feat tiktoken integration feat: tiktoken integration Jul 24, 2024
@vansangpfiev vansangpfiev marked this pull request as ready for review July 24, 2024 23:59
@vansangpfiev vansangpfiev merged commit 221c653 into sync-0.10.0 Jul 24, 2024
3 checks passed
vansangpfiev added a commit that referenced this pull request Jul 25, 2024
* TensorRT-LLM v0.10 update

* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>

* Update Makefile and CICD for rel 0.10.0

* Update breaking change for 0.10.0

* fix: use tensorrt 10.1.0

* fix: revert tensorrt version windows

* fix: synk report fixup (#56)

Co-authored-by: vansangpfiev <sang@jan.ai>

* Fix error read timeout minio

* feat: tiktoken integration (#60)

* cpp-tiktoken integration init

* remove cpp-tiktoken folder

* tiktoken integration init

* move fmt to third-party dependency

* remove unnecessary option in cortex CMakeLists

* remove unnecessary submodule

* make pcre2 as third-party lib

* fix CI bug build in window

* fix CI bug in linux

* link directory pcre2

* Fix CI bug in linux 'pcre2.h' not found

* Fix bug build static third party lib

* Integate Llama3 successfully

* refactor with model type

* remove unnecessary comment

* format and code convention

* fix logic of mistral when preparing input

* Add remove end_of_text and end_of_turn token feature for llama3

* fix: patch pcre2 CMakeLists.txt file

* fix: do not add fPIC to cpp-tiktoken cmake file

* fix: suppress warnings

* fix: add CMAKE_C_FLAGS

* fix: CI bug build for window in cpp-tiktoken

* fix: CI bug for window fmt build fail without utf-8 flag

* fix: CI bug for window stop using fmt

* fix: build bugs for window - using latest pcre2 third party

* fix: build bugs for window - pcre2 static runtime on

* fix CI bug for window build pcre2

* rename link lib to pcre2-8-static in windows

* fix: use pcre2 dynamic lib for windows

* fix: pack tensorrt_llm_nvrtc_wrapper.dll

* test: only windows

* fix: disable setLoggerFinder for msvc

* fix: cleanup cmake files

* fix: remove fmt

* fix: pack libtensorrt_llm_nvrtc_wrapper

* fix: patches windows

* fix: rm fmt

---------

Co-authored-by: sangjanai <sang@jan.ai>

---------

Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Hien To <tominhhien97@gmail.com>
Co-authored-by: sangjanai <sang@jan.ai>
Co-authored-by: vansangpfiev <vansangpfiev@gmail.com>
Co-authored-by: nguyenhoangthuan99 <35255081+nguyenhoangthuan99@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants