14 11 月, 2025

Pre-train Alignment

This is a review of the course…

This is a review of the course: 生成式AI時代下的機器學習(2025)】第五講：大型語言模型訓練方法「預訓練–對齊」(Pretrain-Alignment) 的強大與極限 from professor Hung-Yi Lee from NTU.
Link to the course video & slides.

Pre-training: Given large input data available, train the model
Supervised Fine Tuning (SFT, a.k.a Instruction Tuning): Given a smaller set of curated supervised data (input, ground-truth answer), fine-tune the pre-trained model.
RLHF: Reinforcement Learning with Human Feedback

Alignment means stage 2 & 3 where human interactions are involved. We hope the machine output aligns with humans’ expectations.

The performance gap is significant before and after the “Alignment” process.
You do NOT need large-scale data for alignment. (e.g.
Quality is the key! LIMA: Less Is More for Alignment