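This is the final assignment for the Geoelectromagnetics course in the 2023 spring-summer semester at ZJU (written by Meng). Thanks to Prof. Yang Bo for attentive guidance, and to the senior students whose assignments gave me ideas ...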
Learning Supervised Finetuning and Reinforcement Learning on Math LLMs / Supervised Fine-Tuning and Reinforcement Learning in Practice for Math LLMs. A collection of practical scripts and implementations for Supervised Fine-Tuning (SFT) and Reinforcement ...