[heka] The path to 1.0

Rob Miller Wed, 15 Oct 2014 11:59:55 -0700

Hi all,

As I alluded to in a different thread earlier today, after over 2 yearsof development, Heka is finally closing in on what we're going to call a1.0 release. I thought now would be a good time to explain what thatmeans, exactly, and to point out a few important items that we have onthe roadmap between now and then. Apologies in advance for the wall oftext, I'm erring on the side of completeness here.

First, what does 1.0 mean? Things won't radically change. Heka willstill see updates, bug fixes, and improvements, although those of us onthe core Heka team will probably start to spend more of our time *using*Heka, and a bit less of our time developing it. The biggest change hasto do with our backwards compatibility guarantees.

So far, we've been using a modified semantic versioning scheme. Forpatch versions (e.g. from 0.7 to 0.7.1 to 0.7.2, etc) we only do bugfixes. We don't introduce new features, much less breaking changes. Butfor minor versions (0.7 to 0.8, for instance), we've reserved the rightto introduce backwards incompatibilities. These could be small thingslike changing the name of certain config settings, or bigger issues likechanging APIs such that plugin code needs to be updated to continue working.

Once we hit 1.0, we're going to put the brakes on our backwardsincompatible changes. Patch versions will still only contain bug fixes.Minor versions will contain new features, but will not break anyexisting features. Breakage will only happen when we bump major versions(e.g. 1.x.x to 2.0), and we will make a point of deprecating settingsand/or features, so there's at least one release of overlap between anolder and newer way of doing things, to give users time to adjust to anychanges that are introduced.

Now that the preliminaries are out of the way, we can get to the *real*reason I'm bringing all of this up now. Because we want to slow down therate-of-breaking-changes once we hit 1.0, that means we want to get allof the breaking changes that we already have on our radar out of the way*before* that happens. I want to let you all know what we have in mind,so you can know what to expect, and also to provide feedback. Currentlythere are 4 significant changes we want to make, each with an openissue, conveniently tagged as "breaking change":


https://github.com/mozilla-services/heka/labels/breaking%20change

Here's an overview of what each one means, and what impact it will have:

#424, Abstract out parser registry (aka Introduce "Splitter" plugins)

When we first released Heka, there were 4 plugin types: inputs,decoders, filters, and outputs. After a while we decided we needed aninverse to decoders, and encoders became the 5th. For quite some timenow, we've known we want to introduce a 6th plugin type, called"splitters". Splitters, like decoders, will be tightly coupled withspecific input plugins, and they will be responsible for looking at theraw data in an input stream, finding the record boundaries of thatstream, and extracting a single record's bytes to be passed on to thedecoder for more thorough parsing. Splitters actually sort of alreadyexist... many of our input plugins support config options called`parser_type`, `delimiter`, and `delimiter_location`, which perform thisfunction. But currently each input has to implement this separately,there's a lot of code duplication, and introducing new ways to findrecord boundaries is a lot harder than it should be. By abstracting themout as their own plugin type, it will be much easier to make themautomatically available to every new input. It will also be possible toimplement new message framing schemes and make them immediatelyavailable to everybody. The first splitters we introduce will exactlymatch the current parser_type options. For most of you, this will justmean updating your config to use splitters instead of parser types, butfor anyone who may have written their own input plugins there may besome small changes you need to make to play well with the new behaviour.


#918, Reimplement reporting infrastructure

Currently Heka provides some system wide operational metrics, and itprovides a way for each individual plugin to provide a custom set ofoperational metrics. All of this generated data is made visible to theuser in the DashboardOutput's HTML UI. One problem, though, is thatthere are certain data points we want *every* plugin to provide, such as# of messages processed, # of processing failures, sampled averagemessage processing time, etc. Right now every plugin has to implementthis by hand and explicitly include the data in its custom reportoutput. Some plugins do this, but many others don't, which is why the"messages processed" value in the dashboard is empty for many plugins,even when messages are flowing. Clearly this isn't ideal, Heka shouldhandle as much of this as possible automatically. Getting to this pointwill require changing some of how the reporting works, so less of it ishandled by the plugins themselves and more of it is handled by theplugin runners that Heka provides.This won't change the config format at all, and most plugins willcontinue to work unmodified. Any plugins that you have that arecurrently providing their own custom report output will need to bechanged to adjust to the new reporting APIs we write. Also, whilecounting the messages processed will come for free, counting processingerrors and sampling average processing time may still require a smallamount of cooperation from the plugins themselves, so there may beslight changes required to get the most out of the new reporting structure.


#930, Simplify Output plugins to only deal w/ output transport

This is the biggest of the changes. Originally, outputs wereresponsible for serializing their data themselves. Then we introducedencoders to handle that. *Then* we realized that, even though encodersserialize a single message, the output should be the one to specifywhether or not framing happens, so we now recommend that outputs call`OutputRunner.Encode()`, which first delegates to the encoder and thenapplies any desired framing.Now we've realized that, since the OutputRunner is doing the encodingwork anyway, it might make sense for this to happen automatically,before the output plugin is even invoked. What if output plugins didn'treceive message objects, but instead received bytes data that hadalready been framed (if necessary) and serialized. This reduces theburden of responsibility for each output plugin, b/c it no longer has toconcern itself w/ the details of encoding, Heka will take care of thatautomatically based on the config.This provides some additional benefits. Currently, the TcpInput usesa disk queue to make sure it doesn't lose data if the connection drops.But ideally *any* output plugin would be able to use a disk queue, andthe cursor wouldn't advance in the queue until the data was confirmed asdelivered. With this change in the design, implementing this would bemuch easier, any output could automatically support a `use_buffering`option. If true, data would be routed through the disk queue before iteven got to the output, and the output would just have to report backre: whether delivery succeeded (so we can advance in the queue) or not(so we can retry the last one again).Clearly this change significantly impacts all output plugins, andthere are still a few rough edges to work out, but ultimately we thinkthe wins are worth it. I'm curious what others think.


#1116, Improve Decoder config API

This is the last one, and it's much smaller in scope than the outputchanges. Right now, when an output specifies an encoder, Hekaautomatically notices this, creates the encoder plugin, and makes itavailable to the output plugin. All the output has to do is callOutputRunner.Encode(), and if we implement #930, then soon it won't haveto do even that.For inputs, the story isn't as good. Inputs have to explicitlyinclude `decoder` as a config option, which they have to parse andvalidate, and then they have to bootstrap the decoder by hand. This isstuff that Heka should be doing for you. The impact here is that anyinput that uses decoders (which is most of them) would need to changeslightly, to remove boilerplate code.

And there you have it. If you made it this far, I salute you. Hopefullyyou found it useful. If you have questions or comments on any of theseideas, please respond on the list and we'll be happy to discuss.


Thanks!

-r
_______________________________________________
Heka mailing list
Heka@mozilla.org
https://mail.mozilla.org/listinfo/heka

[heka] The path to 1.0

Reply via email to