+1 for automating. The more automation the fewer mistakes we can make :-) On Tue, Feb 7, 2017 at 11:13 AM, Robert Metzger <rmetz...@apache.org> wrote:
> +1 to automate this and describe the config parameters in the code. > > That's exactly the approach Apache Kafka is taking as well for their > config. > > On Tue, Feb 7, 2017 at 9:04 AM, Ufuk Celebi <u...@apache.org> wrote: > > > I fully agree with you Greg. > > > > Since this is doomed to get out of sync again very shortly after clean > up, > > I vote to automate this. Stephan introduced the ConfigOption type, which > > makes it easy to define the options. It's already planned to migrate all > > configuration options from ConfigConstants to this approach. > > > > For an example see here: https://github.com/apache/flink/blob/master/ > > flink-core/src/main/java/org/apache/flink/configuration/ > > HighAvailabilityOptions.java > > > > I think that it is possible to build the configuration docs page from > this > > with reasonable effort. > > > > This would translate the task to: > > 1) Automate ConfigOption to HTML/Markdown generation > > 2) Extend ConfigOption with description fields > > 3) Migrate ConfigConstants to ConfigOptions > > > > I would also volunteer to take a first stab at this. > > > > Regarding the network buffers: +1 to your suggestion. Nico (cc'd) is > > starting to work on automating the network buffer configuration in order > to > > get rid of any manual tuning for most users (because of the issues you > > described + streaming and batch jobs require different tuning, which > > complicates things even more). > > > > – Ufuk > > > > On 6 February 2017 at 19:21:28, Greg Hogan (c...@greghogan.com) wrote: > > > > Hi devs, > > > > > > Flink's Configuration page [1] has grown intimidatingly long > > > and complex. > > > Options are described across three main sections: common options > > > (single > > > section), advanced options (multiple sections), and full reference. > > > The > > > trailing "background" section further describes the most impactful > > > options > > > in much greater detail. > > > > > > Several recent tickets, and a few outstanding, have added missing > > > options > > > to the configuration documentation. I'd like to propose a goal > > > of > > > organizing all options in the full reference into alphabetized, > > > tabular > > > form (one table per section), much like the system metrics [2]. > > > Columns > > > would be option name, description, and default value. > > > > > > The common and advanced sections could also be converted to tabular > > > form > > > with the exception of Kerberos-based Security. Missing options > > > would be > > > added to the full reference. > > > > > > Lastly, the simple heuristic for configuring network buffers > > > has prompted > > > many questions on the mailing list. With the 1.3 release the total > > > and > > > number of available buffers is reported through metrics and > > > in the web > > > dashboard. My experience has been that the number of required > > > buffers is > > > highly dependent on job topology and cluster performance. I > > > propose keeping > > > the simple heuristic and description while directing users > > > to monitor the > > > balance of available buffers. > > > > > > Greg > > > > > > [1] https://ci.apache.org/projects/flink/flink-docs- > > master/setup/config.html > > > [2] > > > https://ci.apache.org/projects/flink/flink-docs- > > master/monitoring/metrics.html#system-metrics > > > [3] > > > https://ci.apache.org/projects/flink/flink-docs- > > master/setup/config.html#configuring-the-network-buffers > > > > >