3 comments

  • sillysaurusx43 minutes ago
    It’s been said that RL is the worst way to train a model, except for all the others. Many prominent scientists seem to doubt that this is how we’ll be training cutting edge models in a decade. I agree, and I encourage you to try to think of alternative paradigms as you go through this course.<p>If that seems unlikely, remember that image generation didn’t take off till diffusion models, and GPTs didn’t take off till RLHF. If you’ve been around long enough it’ll seem obvious that this isn’t the final step. The challenge for you is, find the one that’s better.
  • kgarten16 minutes ago
    Are the videos available somewhere?<p>spring course is on YouTube <a href="https:&#x2F;&#x2F;m.youtube.com&#x2F;playlist?list=PLoROMvodv4rN4wG6Nk6sNpTEbuOSosZdX" rel="nofollow">https:&#x2F;&#x2F;m.youtube.com&#x2F;playlist?list=PLoROMvodv4rN4wG6Nk6sNpT...</a>
  • zerosizedweasle46 minutes ago
    Given Ilya&#x27;s podcast this is an interesting title.