Hi Dianjin, Max, and all,

I believe bringing the Apache Cloudberry (Incubating) MADlib work into the open by contributing directly to Apache MADlib is an excellent step toward reviving the project. The project would very much welcome Pull Requests introducing Cloudberry support in MADlib. Tentatively, we can plan for this work to be included in an upcoming *Apache MADlib 2.2.0* release.

I recommend basing the work on the *madlib2-master* branch (https://github.com/apache/madlib/tree/madlib2-master), which introduces full Python 3 support and reflects the current direction of the MADlib 2 code base. The older *master* branch (https://github.com/apache/madlib/tree/master) retains Python 2 support and is considered deprecated.

Starting from the madlib2-master branch would ensure long-term maintainability and alignment with the modern Python ecosystem. It also provides a solid foundation for integrating Cloudberry-specific enhancements while remaining compatible with future Apache MADlib releases.

Ideally, with this and future contributions, you and others may also want to consider becoming members of the Apache MADlib PMC to help guide the project into the future.

Best regards,
-=e

--
Ed Espino
Apache Cloudberry (Incubating) & MADlib

On Mon, Oct 20, 2025 at 4:50 AM Dianjin Wang <[email protected]> wrote:

> Hi all,
>
> Happy Monday!
>
> I want to move this discussion forward and share some recent progress
> from our side.
>
> Our team has implemented support for Apache Cloudberry based on the
> latest madlib-master branch in the internal repository. After a
> discussion with Max, we believe that contributing this Cloudberry
> support back to the MADlib upstream would be a better and more
> sustainable approach.
>
> We see this as a great opportunity to help bring new vitality to the
> Apache MADlib project, while also enabling Cloudberry to benefit from
> broader ecosystem integration, a potential win-win for both
> communities. We are now interested in taking concrete steps to move
> this forward and contribute our work upstream.
>
> Before proceeding, we’d love to hear the community’s thoughts.
>
> If the community agrees to accept Cloudberry support upstream, would
> submitting a Pull Request be the appropriate way to start? We’d also
> appreciate feedback from Ed and other community members once the PR is
> ready.
>
> Best,
> Dianjin Wang
>
> On Thu, Sep 25, 2025 at 3:53 PM Dianjin Wang <[email protected]> wrote:
> >
> > You can count me in. I'm interested in exploring the opportunity to
> > revive the MADlib project as an individual on the marketing and
> > community side. I can provide some ideas.
> >
> > Best,
> > Dianjin Wang
> >
> > On Thu, Sep 25, 2025 at 3:39 PM Ed Espino <[email protected]> wrote:
> > >
> > > Hello Max,
> > >
> > > Thanks for reaching out. The latest Apache MADlib 2 code base is in the
> > > madlib2-master branch. The project team made the decision to use this
> > > branch due to the significant rewrite needed to move from Python 2 to
> > > Python 3.
> > >
> > > I believe your interest may be in bringing the latest MADlib support to
> > > Apache Cloudberry (Incubating). With the Greenplum DB project having
> > > gone closed source, there are currently no active downstream users of
> > > MADlib. If there is interest in reviving the Apache MADlib community,
> > > it would be great to start that discussion. There may also be
> > > opportunities for synergy with the Cloudberry project that we can
> > > explore together.
> > >
> > > Additionally, as you may know, I also serve on the Cloudberry PPMC. I
> > > haven’t had the chance to bring these potential collaborative efforts
> > > to the wider Cloudberry community consistently, and this may be a good
> > > opportunity to do so.
> > >
> > > I’d be glad to provide some guidance if you’d like to pursue this
> > > further.
> > >
> > > Best regards,
> > > -=e
> > >
> > > --
> > > Ed Espino
> > > Apache Cloudberry (Incubating) & MADlib
> > >
> > >
> > > On Thu, Sep 25, 2025 at 12:13 AM Max Yang <[email protected]> wrote:
> > > >
> > > > Hi there,
> > > >
> > > > Do you know which MADlib branch is used by gp7?
> > > > Is this the branch?
> > > > https://github.com/apache/madlib/commits/madlib2-master
> > > > Or is it the master branch?
> > > > I see that the master branch is older than madlib2-master.
> > > >
> > > > Thanks in advance.
> > > >
> > > > Best regards,
> > > > Max Yang
> > > >
