-
Notifications
You must be signed in to change notification settings - Fork 3.5k
Open
Description
Hi, there,
We want to SFT a llama3 model with a dataset, with following format,
[
{
"question": "the content of question 1 ...",
"answer": "the content of answer 1 ...",
"label": either "good" or "bad", to evaluate the answer.
},
...
]
Our questions are,
-
How to convert this dataset's format into the format that is acceptable by Llama3?
-
Out of curiosity, what will happen inside Llama3 during training, if we convert the dateset into the following format?
<|begin_of_text|> <|start_header_id|>question<|end_header_id|> Here is my first question ... <|start_header_id|>answer<|end_header_id|> Here is the LLM's answer to the first question ... <|start_header_id|>system<|end_header_id|> good <|eot_id|> ... <|end_of_text|>As SFT, for each message, will Llama3 take the
questionas prompt, and start Llama3's prediction fromanswer? If so, what will happen toevalthat is either "good" or "bad"?
Many thanks,
Kan
Metadata
Metadata
Assignees
Labels
No labels