2 comments
The simplicity of Gorilla is attractive, but for better compression ratios without too much extra compute I'd instead recommend Sprintz: <a href="https://github.com/dblalock/sprintz" rel="nofollow">https://github.com/dblalock/sprintz</a>.<p>The downsides are that (a) Sprintz requires the data to be quantised to fixed-point integers, which is usually fine if the data is coming out of a sensor of some sort, and (b) the Huffman coding step of Sprintz requires dynamic memory allocation, whilst Gorilla is almost trivially implemented without it.<p>Also see Chimp, which proposes some small tweaks to Gorilla to improve its performance: <a href="https://dl.acm.org/doi/abs/10.14778/3551793.3551852" rel="nofollow">https://dl.acm.org/doi/abs/10.14778/3551793.3551852</a>
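To make the quantisation point concrete, here is a rough sketch of what "quantised to fixed-point integers" could look like before handing data to an integer compressor. This is not Sprintz's API: the function names `quantise` and `dequantise`, the `scale` parameter, and the 16-bit default are all assumptions made up for illustration.

```python
def quantise(samples, scale, bits=16):
    """Map floats to signed fixed-point integers.

    Hypothetical helper, not part of Sprintz: each value becomes
    round(x * scale), clamped to the signed range of `bits` bits.
    """
    lo = -(1 << (bits - 1))
    hi = (1 << (bits - 1)) - 1
    return [max(lo, min(hi, round(x * scale))) for x in samples]


def dequantise(ints, scale):
    """Inverse mapping; exact only if the input was on the 1/scale grid."""
    return [q / scale for q in ints]


# Assumed example: a sensor reporting in steps of 0.25, so scale=4
# (two fractional bits) loses nothing.
readings = [20.25, 20.5, 20.75, 21.0]
fixed = quantise(readings, scale=4)
restored = dequantise(fixed, scale=4)
```

If the sensor's native resolution is coarser than `1/scale`, the round trip is lossless; otherwise the quantisation step is where precision is given up, which is the trade-off against Gorilla's lossless handling of raw floats.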
How does it compare to DuckDB?