[SUBMISSION] December 2024 - Module 1: Instruction tuning #78

Open · wants to merge 1 commit into base: december-2024
Conversation

@jasoriya (Collaborator) commented Dec 10, 2024

December 2024 Student Submission

Module Completed

  • [x] Module 1: Instruction Tuning
  • [ ] Module 2: Preference Alignment
  • [ ] Module 3: Parameter-efficient Fine-tuning
  • [ ] Module 4: Evaluation
  • [ ] Module 5: Vision-language Models
  • [ ] Module 6: Synthetic Datasets
  • [ ] Module 7: Inference
  • [ ] Module 8: Deployment

Changes Made

Describe what you've done in this PR:

  1. What concepts did you learn?
  2. What changes or additions did you make?
  3. Any challenges you faced?

Notebooks Added/Modified

List any notebooks you've added or modified:

  • Added new example in module_name/students/my_example.ipynb
  • Modified existing notebook with additional examples
  • Added documentation or comments

Checklist

  • I have read the module materials
  • My code runs without errors
  • I have pushed models and datasets to the huggingface hub
  • My PR is based on the december-2024 branch

Questions or Discussion Points

Add any questions you have or points you'd like to discuss:
1.
2.

Additional Notes

Any other information that might be helpful for reviewers:
I tried to see how a challenging task like chain-of-thought prompting could be handled by fine-tuning a small LLM. As the example output in the notebook shows, the fine-tuned model begins the reasoning chain but soon starts hallucinating.
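For reviewers skimming without the notebook open: the general shape of this kind of instruction tuning is to flatten each chain-of-thought example into a single prompt/completion string before supervised fine-tuning. The sketch below is illustrative only; the field names ("question", "reasoning", "answer") and the chat markers are assumptions, not the notebook's exact dataset schema or template.

```python
# Hypothetical sketch: flatten one chain-of-thought record into a single
# training string for supervised fine-tuning. Field names and chat markers
# are assumptions for illustration, not the notebook's exact schema.

def format_cot_example(example: dict) -> str:
    """Join question, reasoning chain, and final answer into one SFT sample."""
    prompt = f"<|user|>\n{example['question']}\n<|assistant|>\n"
    completion = f"{example['reasoning']}\nAnswer: {example['answer']}"
    return prompt + completion

sample = {
    "question": "If a train travels 60 km in 1.5 hours, what is its speed?",
    "reasoning": "Speed = distance / time = 60 km / 1.5 h = 40 km/h.",
    "answer": "40 km/h",
}
print(format_cot_example(sample))
```

Training on strings like this teaches the model to emit the reasoning chain before the final answer; with a small model, the chain often starts plausibly and then drifts, which matches the hallucination behaviour described above.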

@jasoriya jasoriya changed the title Complete exercise with finetuning on COT dataset [SUBMISSION] December 2024 - Module 1: Instruction tuning Dec 10, 2024
@burtenshaw (Collaborator)

> Any other information that might be helpful for reviewers:
> I tried to see how a challenging task like chain-of-thought prompting could be handled by fine-tuning a small LLM. As the example output in the notebook shows, the fine-tuned model begins the reasoning chain but soon starts hallucinating.

This is really cool!

Start a thread on Discord if you want to discuss! https://discord.com/channels/879548962464493619/1313889336907010110

@burtenshaw (Collaborator)

Could you exchange a review with one of the other students' PRs? [SUBMISSION]

@jasoriya (Collaborator, Author)

@burtenshaw I'd be glad to exchange a review. It seems that we don't have the permission to add ourselves as a reviewer. Would you be adding us as a reviewer?

@burtenshaw (Collaborator)

> @burtenshaw I'd be glad to exchange a review. It seems that we don't have the permission to add ourselves as a reviewer. Would you be adding us as a reviewer?

Thanks. Just comment and mention me. Then I'll assign.

@bhautik-pithadiya (Collaborator)

@burtenshaw add me as a reviewer for this PR.

@burtenshaw (Collaborator)

@bhautik-pithadiya if you review, I'll merge.
