2 comments

  • hodgehog111 hour ago
    Obviously it&#x27;s great that those who are only aware of JEPA should be educated about CCA. If you don&#x27;t know CCA, you should not be working in unsupervised learning.<p>However, it&#x27;s pretty obvious that they are related since CCA is (or should be) well-known to be among the original unsupervised learning algorithms. It&#x27;s the progenitor of the field. It works, it always did. Just like logistic regression for classification. Deep learning is about putting in huge computational effort for the extra few percent.<p>This is like saying that Gauss deserves the credit for LLMs because he came up with least-squares regression, which was the progenitor of supervised learning. Yes, there is a chain of discoveries leading back, but when you give the credit that far back, it&#x27;s just insulting to the hard work that came inbetween.<p>Gauss and Hotelling are famous enough as it is.<p>(Before anyone asks, I&#x27;m not shilling for JEPA, I just think this argument is reductive for all of unsupervised and semi-supervised learning.)
    • jdw641 hour ago
      I want to make something in this area(LLM). Can you recommend any books?
      • hodgehog1158 minutes ago
        Books? No, not really. Maybe others will have better suggestions for newcomers, sorry. Are you talking research novelty or just applying current methods to a given task?<p>The latter is covered well by Andrej Karpathy&#x27;s videos and by just playing around with current models and other tutorials in a small test environment. You don&#x27;t need to know very much, there&#x27;s a lot of low-hanging fruit.<p>For the former, the field is moving rapidly and most of the innovations are coming from papers. Any book that claims to cover deep learning is almost inevitably outdated. Find a university or institution near you and see if they have an undergraduate reading group on deep learning that is open to the public to attend. Mine does, and it&#x27;s really helpful for staying up to date with the latest ideas. &quot;Probabilistic Machine Learning&quot; by Murphy contains the material that I would consider prerequisite if you want to understand the ideas which underpin modern deep learning (even if it contains virtually no deep learning in it), and I would hope that any student or colleague of mine would be familiar with most of it. But I&#x27;m not sure it&#x27;s good to learn from, and picking all that up takes a while to be honest.
        • nextos20 minutes ago
          &gt; &quot;Probabilistic Machine Learning&quot; by Murphy [...] even if it contains virtually no deep learning in it<p>This is confusing. Are you referring to the old 2012 version?<p>Volumes 1 &amp; 2 (2022-3) contain a substantial amount of deep learning [1], including relatively recent developments.<p>There&#x27;s also a new RL volume getting written, with some drafts deposited in arXiv [2].<p>[1] <a href="https:&#x2F;&#x2F;probml.github.io&#x2F;pml-book" rel="nofollow">https:&#x2F;&#x2F;probml.github.io&#x2F;pml-book</a><p>[2] <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;pdf&#x2F;2412.05265" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;pdf&#x2F;2412.05265</a>
          • hodgehog117 minutes ago
            I was mostly referring to Volume 1 (not advanced topics). You have a point that Volume 2 definitely contains more. To be honest, I was mostly covering myself from a &quot;that&#x27;s not real deep learning&quot; criticism; &quot;relatively recent developments&quot; is pretty generous if you&#x27;re active in the field. Given its rapidity, anything over a few years old is essentially considered classical. It&#x27;s almost impossible to have a book that is up-to-date with the state of the art here.<p>These are very nice volumes though (RL one is good too), and Murphy should be commended for the amount of work in here. It&#x27;s probably as good a compendium as one can expect.
        • jdw6447 minutes ago
          I&#x27;ve read the books you mentioned(Probabilistic Machine Learning). I guess there&#x27;s nothing left but papers, right? Thanks for the advice.
  • leecommamichael1 hour ago
    Interesting. So even more of the means to create this wave of AI existed sooner than we knew, at least in theory. Fun to think of a version of events where these models came up alongside GPUs; as if real-time graphics wasn&#x27;t demanding enough on the supply-chain, hah.