11 comments

  • muragekibicho3 days ago
    Introduction : Finite Field Assembly is a programming language that lets you emulate GPUs on CPUs<p>It&#x27;s a CUDA alternative that uses finite field theory to convert GPU kernels to prime number fields.<p>Finite Field is the primary data structure : FF-asm is a CUDA alternative designed for computations over finite fields.<p>Recursive computing support : not cache-aware vectorization, not parallelization, but performing a calculation inside a calculation inside another calculation.<p>Extension of C89 - runs everywhere gcc is available. Context : I&#x27;m getting my math PhD and I built this language around my area of expertise, Number Theory and Finite Fields.
    • zeroq1 hour ago
      I&#x27;ve read this and I&#x27;ve seen the site, and I still have no idea what it is, what&#x27;s the application and why should I be interested.<p>Additionally I&#x27;ve tried earlier chapters and they are behind a paywall.<p>You need a better introduction.
      • pizza1 hour ago
        This is phrased in a kind of demanding way to an author who has been kind enough to share their novel work with us. Are you sure you spent enough time trying to understand?
        • Conscat52 minutes ago
          It seems that pretty much everybody here is confused by this article. One user even accused it of LLM plagiarism, which is pretty telling in my opinion.<p>I for one have no clue what anything I read in there is supposed to mean. Emulating a GPU&#x27;s semantics on a CPU is a topic which I thought I had a decent grasp on, but everything from the stated goals at the top of this article to the example code makes no sense to me.
          • pizza30 minutes ago
            It just seems like residue numbering systems computation, which I&#x27;m already working with.
    • almostgotcaught52 minutes ago
      &gt; I&#x27;m getting my math PhD and I built this language around my area of expertise, Number Theory and Finite Fields.<p>Your LinkedIn says you&#x27;re an undergrad that took a gap year 10 months ago (before completing your senior year) to do sales for a real estate company.
      • pizza25 minutes ago
        Why bother doing a witch hunt and leaving out that they did Stats at Yale..
        • almostgotcaught19 minutes ago
          Because why does it matter? Are you suggesting undergrad stats at Yale is comparable to a PhD in number theory?
      • saghm30 minutes ago
        Depending on what properties they sold, they certainly could have gotten valuable real-world expertise with finite fields. It&#x27;s certainly easier to sell them than infinite ones!
      • saagarjha49 minutes ago
        Are you sure that’s their LinkedIn?
        • almostgotcaught44 minutes ago
          Why wouldn&#x27;t it be? All of the pics, names and details line up between GitHub, here, Reddit, and substack.
  • adamvenis1 hour ago
    I think I get it. You&#x27;re using the Ring isomorphism from the Chinese Remainder Theorem to do &quot;parallel computation&quot;. This is the same principle as how boolean algebra on binary strings computes the pairwise results of each bit in parallel. Unfortunately, there&#x27;s no free lunch - if you want to perform K operations on N-bit integers in parallel, you still need to work with (K * N)-bit-wide vectors, which is essentially what SIMD does anyway.
    • almostgotcaught58 minutes ago
      Yup that&#x27;s exactly what this is and thus, notably, it is not actually about finite fields.
  • vimarsh673948 minutes ago
    One of the more subtle aspects of retargeting GPU code to run on the CPU is the presence of fine grained(read - block level and warp level) explicit synchronization mechanisms being available in the GPU. However, this is not the same in CPU land, so additional care has to be taken to handle this. One example of work which tries this is <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;pdf&#x2F;2207.00257" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;pdf&#x2F;2207.00257</a> .<p>Interestingly, in the same work, contrary to what you’d expect, transpiling GPU code to run on CPU gives ~76% speedups in HPC workloads compared to a hand optimized multi-core CPU implementation on Fugaku(a CPU only supercomputer), after accounting for these differences in synchronization.
  • pwdisswordfishz17 minutes ago
    &gt; Field order (the number of elements your field can hold). i.e you can store 8 * 9 * 11 elements in the field<p>I thought a finite field&#x27;s order has to be a prime power.
  • hashxyz1 hour ago
    Pretty sure this is just vectorization. You can pack some 8bit ints into a machine-length 32bit int and add them together, that is vectorization.
    • Conscat58 minutes ago
      I don&#x27;t think that&#x27;s true when the add overflows. You wouldn&#x27;t want a lane&#x27;s overflow to carry into an adjacent lane.
  • foota1 hour ago
    It&#x27;s a bit hard for me to tell the intention here. Is the idea that finite fields can take better advantage of CPU architecture than something like SIMD for parallel computation? Or is this just for experimentation?<p>Edit: this tickles my brain about some similar seeming sort of programming language experiment, where they were also trying to express concurrency (not inherently the same as parallelism) using some fancy math. I can&#x27;t remember what it was though?
  • catapart1 hour ago
    If matrix multiplication does get added to this, I imagine that there is some utility for game development. At that point, I&#x27;d be curious what the comparison would be from CPU to GPU. Like, given a clock speed of x, what would a comparable GPU (or set of GPU features) look like?<p>I know that&#x27;s pretty abstract, but without that kind of &quot;apples to apples&quot; comparison, I have trouble contextualizing what kind of output is bring targeted with this kind of work.
  • tooltechgeek1 hour ago
    What are some problems where this approach has advantages?
  • imbusy1111 hour ago
    I suspect this is just AI slop.
  • almostgotcaught59 minutes ago
    It&#x27;s hilarious how gullible hn is. All you gotta do is put GPU and math buzzwords in your README and you&#x27;ll automatically be upvoted.<p>This was discussed on Reddit - this is not actually finite field arithmetic.<p>Also you can go to this dudes GitHub and see exactly how serious this project is.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;LeetArxiv&#x2F;Finite-Field-Assembly">https:&#x2F;&#x2F;github.com&#x2F;LeetArxiv&#x2F;Finite-Field-Assembly</a><p>Lol