A from-scratch PEFT learning experiment. Fine-tune a 0.5B-parameter model with LoRA to output correct tool-calling JSON from Chinese instructions. Core question: Full fine-tuning needs dozens of GB of ...
This command-line tool is a software development toolkit to help instructional teams author asynchronous graders for Coursera (typically programming assignments). Coursera's asynchronous grading ...