
Question about the continued instruction-tuning phase #6

Open
Jiaxin-Wen opened this issue Jun 1, 2024 · 5 comments

Comments

@Jiaxin-Wen

In Section 2.5, the models undergo continued fine-tuning on several open-source instruction-tuning datasets, including the training sets of GSM8K and MATH.

I'm wondering whether, after continued fine-tuning, the models are still evaluated with few-shot prompting or with zero-shot prompting.
For example, suppose the model is fine-tuned on GSM8K with the following data format:
`Question:\n{question}\nAnswer:\n{answer}`
At inference time, do you still prepend multiple Q-A pairs to the input, or provide just the question (which would be consistent with the continued fine-tuning stage)? A sketch of the two formats follows below.
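For concreteness, here is a minimal sketch (my own illustration, not code from the paper) of the two evaluation formats, reusing the data format quoted above; `demos` stands in for whatever in-context Q-A pairs the harness would use:

```python
# Minimal sketch of zero-shot vs. few-shot prompt construction,
# reusing the fine-tuning data format quoted above. Illustrative only.
TEMPLATE = "Question:\n{question}\nAnswer:\n{answer}"

def zero_shot_prompt(question: str) -> str:
    # Matches the continued fine-tuning format: a single question, no demos.
    return f"Question:\n{question}\nAnswer:\n"

def few_shot_prompt(question: str, demos: list[tuple[str, str]]) -> str:
    # Prepends k solved Q-A pairs before the test question.
    context = "\n\n".join(TEMPLATE.format(question=q, answer=a) for q, a in demos)
    return f"{context}\n\nQuestion:\n{question}\nAnswer:\n"
```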

@Jiaxin-Wen
Author

Moreover, I find that simply fine-tuning a SOTA LM (e.g., Llama-3-8B) on the original GSM8K training set yields essentially no improvement over its few-shot performance:

| Llama-3-8B | GSM8K accuracy (%) |
| --- | --- |
| few-shot prompting | 55.57 |
| fine-tuning | 55.79 |

I would like to know whether this matches your experimental results. If so, could you also share the data you used for continued instruction-tuning? That would be very helpful for reproducing the results in the paper.

@xiangyue9607
Collaborator

Thanks! We used few-shot prompting for the evaluation. You can find more details in our evaluation code and in the implementation details in the paper.

@Jiaxin-Wen
Author

Do you add an EOS token during pre-training or continued fine-tuning?

@Jiaxin-Wen
Author

Since all of the training data is in a one-shot format, I'm wondering whether I should remove the EOS token during pre-training or fine-tuning to adapt the model to few-shot evaluation; the sketch below illustrates the mismatch I have in mind.
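To make the concern concrete, here is a minimal sketch of the two tokenization choices (my own, with an assumed tokenizer checkpoint, not necessarily the authors' setup):

```python
# Illustrative sketch of the EOS question above; the checkpoint name is an
# assumption, not necessarily what the paper used.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B")

example = "Question:\nWhat is 2 + 3?\nAnswer:\n2 + 3 = 5. The answer is 5."

# Variant A: append EOS after each one-shot example. The model learns to
# emit EOS right after an answer, so greedy decoding stops cleanly -- but
# the few-shot demonstrations seen at evaluation time contain no EOS,
# creating a train/inference mismatch.
with_eos = tokenizer(example + tokenizer.eos_token, add_special_tokens=False)

# Variant B: drop EOS between examples. This matches the few-shot prompt
# format, but the model may keep generating a new "Question:" block instead
# of stopping, so the harness has to truncate the output itself.
without_eos = tokenizer(example, add_special_tokens=False)
```

(In practice, many few-shot evaluation harnesses side-step this by cutting the generation at a stop string such as `\nQuestion:`, regardless of how EOS was handled in training.)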

@Jiaxin-Wen
Author

Or is there any other trick that you used to adapt to few-shot evaluation?
