4 comments

  • fodkodrasz4 minutes ago
    So DuckDB was developed to allow queries for bigish data finally without the need for a cluster to simplify data analysis... and we now put it to a cluster?<p>I think there are solutions for that scale of data already, and simplicity is the best feature of DuckDB (at lest for me).
  • mgaunard42 minutes ago
    In my experience ray clusters don&#x27;t scale well and end up costing you more money. You need to run permanent per-user instances etc.<p>What you need is a multi-tenancy shared infrastructure that is elastic.
  • dogman12349 minutes ago
    neat. i&#x27;m pretty novice in the guts of this kind of stuff, but how does this work under the hood for blocking operators where they &quot;cannot output a single row until the last row of their input has been seen&quot;?<p>i think this is where spark shuffling comes in? but how does it work here.<p><a href="https:&#x2F;&#x2F;duckdb.org&#x2F;docs&#x2F;stable&#x2F;guides&#x2F;performance&#x2F;how_to_tune_workloads#blocking-operators" rel="nofollow">https:&#x2F;&#x2F;duckdb.org&#x2F;docs&#x2F;stable&#x2F;guides&#x2F;performance&#x2F;how_to_tun...</a>
  • nevalainen33 minutes ago
    feels like a missed opportunity to call it cluster-quack xD