LLM fine-tuning 관련 정리

작은 LLM의 경우 잘 작동하지 않는 문제가 있음

일반적으로 prompt-response 쌍이 특정 작업을 잘 완료하는경우가 많음

https://github.com/reasoning-survey/Awesome-Reasoning-Foundation-Models