Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

Supervised Fine Tuning on Curated Data Is Reinforcement Learning

independentresearch.ai

3 points by saijajin 9 hours ago