Replies: 6 comments 1 reply
-
Yes, this would be helpful. AutoTrain seems appealing, but the lack of documentation, how-to videos, and basic guides blows my mind.
-
Does this help: https://huggingface.co/blog/abhishek/phi3-finetune-macbook?
-
Yeah, not really. Same old, same old. Even at the bottom, where it says "extensive" documentation can be found here (link to the Hugging Face documentation), it really is just a couple of pages explaining what AutoTrain is and how much it costs. I wouldn't exactly call that "extensive". A Colab dataset validator with some code to detect issues, and possibly offer solutions for correcting them, would be fantastic. I've looked, but can't find anything. So, as it stands, the process is:
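For what it's worth, the dataset validator described above could start very small. Here is a minimal sketch in pure Python (no AutoTrain dependency; the `messages`/`role`/`content` schema is an assumption for illustration, not AutoTrain's actual column layout, so adjust the keys to your dataset):

```python
import json

# Hypothetical validator for a chat-style JSONL dataset.
# Assumes each line is a JSON object with a "messages" list of
# {"role": ..., "content": ...} dicts -- adjust to your schema.
VALID_ROLES = {"system", "user", "assistant"}

def validate_record(line_no, record):
    """Return a list of human-readable problems for one record."""
    problems = []
    messages = record.get("messages")
    if not isinstance(messages, list) or not messages:
        return [f"line {line_no}: missing or empty 'messages' list"]
    for i, msg in enumerate(messages):
        role = msg.get("role")
        if role not in VALID_ROLES:
            problems.append(f"line {line_no}, message {i}: bad role {role!r}")
        content = msg.get("content")
        if not isinstance(content, str) or not content.strip():
            problems.append(f"line {line_no}, message {i}: empty content")
    return problems

def validate_jsonl(path):
    """Validate a whole .jsonl file; returns (num_records, problems)."""
    problems, count = [], 0
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            if not line.strip():
                continue
            count += 1
            try:
                record = json.loads(line)
            except json.JSONDecodeError as e:
                problems.append(f"line {line_no}: invalid JSON ({e.msg})")
                continue
            problems.extend(validate_record(line_no, record))
    return count, problems
```

Something like this, dropped into a Colab cell, would at least surface malformed rows with line numbers before a training job burns credits on them.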
-
@splanker I read your rant :D Thanks! That means more things need to be improved.
-
@abhishekkrthakur -- Appreciate your efforts to create open-source tools. AutoTrain makes fine-tuning of LLMs a bit less intimidating. I have 2 questions:

Question 1: Could you provide a reference table along these lines?

chat-template parameter value | Example dataset | Training method | Applicable open LLMs

Question 2: When chat-template = "tokenizer", is the functionality similar to using tokenizer.apply_chat_template(.., .., tokenize=False) to convert the prompt to an appropriate format for the LLM?

Thanks in advance.
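On Question 2: I can't speak for AutoTrain's internals, but conceptually `apply_chat_template(..., tokenize=False)` just renders the messages list into the model's prompt string instead of token IDs. A pure-Python sketch of roughly what a ChatML-style template expands to (the function name is made up for illustration, and the exact string depends on the model's template):

```python
def render_chatml(messages, add_generation_prompt=False):
    """Roughly what a ChatML chat template renders a messages list into."""
    text = ""
    for msg in messages:
        text += f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Leave the prompt open for the assistant's reply.
        text += "<|im_start|>assistant\n"
    return text

messages = [{"role": "user", "content": "What is AutoTrain?"}]
print(render_chatml(messages, add_generation_prompt=True))
```

So yes, as I understand it, both paths end at the same kind of formatted string; with `tokenize=False` you simply get that string back for inspection.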
-
@abhishekkrthakur - Can you please provide feedback on the above questions?
-
Can you please explain when to use each of the following chat_template parameter values: tokenizer, chatml, zephyr, None?
I am working to prepare the dataset for fine-tuning Llama-3-8B-Instruct.
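One way to see the difference between the options: `chatml` and `zephyr` apply a fixed generic template, while `tokenizer` uses whatever chat template ships with the model's own tokenizer config, which for Llama-3-8B-Instruct is its header/eot format, not ChatML. A rough side-by-side sketch (the rendered strings below are my understanding of the two formats, so double-check them against `tokenizer.apply_chat_template` on the actual tokenizer):

```python
def render_chatml(messages):
    # Generic ChatML template (roughly what chat_template="chatml" targets).
    return "".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    )

def render_llama3(messages):
    # Llama-3 Instruct's native format (roughly what chat_template="tokenizer"
    # would pick up from the Llama-3-8B-Instruct tokenizer).
    text = "<|begin_of_text|>"
    for m in messages:
        text += (
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n"
            f"{m['content']}<|eot_id|>"
        )
    return text

msgs = [{"role": "user", "content": "Hello"}]
print(render_chatml(msgs))
print(render_llama3(msgs))
```

On that reading, `None` would presumably mean your text column is already fully formatted and should be used verbatim; for an instruct model like Llama-3-8B-Instruct, `tokenizer` seems the safest choice since it matches the format the model was trained on.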