Hi team,
We recently proposed adding FSST support to Parquet. There are two main
options for taking on the FSST dependency:

1. CMake dependency on the FSST GitHub repository:

   - Pull FSST as an external dependency via CMake
   - Adds an external dependency to the build

2. Vendor the code:

   - Copy the 3-4 required source files directly into the repo
   - No external dependency


There was a brief initial discussion on the PR, and I wanted to start a
thread to discuss this further.
Overall, this is a lightweight dependency with only a couple of upstream
commits every few months, so vendoring looks like a safe option. It would
avoid an external dependency while keeping maintenance overhead low.
However, we may have to pull in any major upstream changes in the future.

I would love to know what folks think. Are there any concerns with either
approach, or is there a preference based on how we have handled similar
situations in the past?


Thanks and Regards,
Arnav
