Supervised Fine Tuning on Curated Data Is Reinforcement Learning (independentresearch.ai)
3 points by saijajin 9 hours ago