No, thanks. I got past the issue with the Mophline stuff for this release. We 
will have to discuss what to do about it long term.

I have been fighting Avro/Netty for the last week upgrading those dependencies. 
I just have one more test to figure out and then that too will be done.

Then I have to look at the hadoop family dependencies and determine what kinds 
of upgrades they may need.

Waiting years to do releases really sucks.

Ralph

> On Jan 23, 2022, at 10:57 AM, Tristan Stevens <tris...@apache.org> wrote:
> 
> Hi Ralph,
> Here is the data.
> 
> MorphlineSolrSink and MorphlineInterceptor appear 13-14% of the time in this 
> sample set.
> 
> org.apache.flume.channel.kafka.KafkaChannel                64%  
> hdfs                                                       38%  
> org.apache.flume.source.kafka.KafkaSource                  33%  
> memory                                                     29%  
> file                                                       28%  
> spooldir                                                   26%  
> null                                                       25%  
> org.apache.flume.sink.kafka.KafkaSink                      25%  
> Custom JMSSource 18%  
> jms                                                        17%  
> static                                                     14%  
> org.apache.flume.sink.solr.morphline.MorphlineInterceptor  14%  
> ElasticSearchSink   13%  
> org.apache.flume.sink.solr.morphline.MorphlineSolrSink     13%  
> host                                                       13%  
> timestamp                                                  13%  
> avro                                                       11%  
> hbase                                                      11%
> 
> Let me know if you’d like to drill down further.
> 
> Tristan  
> 
> From: Ralph Goers <ralph.go...@dslextreme.com>
> Reply: dev@flume.apache.org <dev@flume.apache.org>
> Date: 15 January 2022 at 06:48:43
> To: dev@flume.apache.org <dev@flume.apache.org>
> Subject:  Re: Morphlines-solr-sink  
> 
> I would like to see the data on the usage. I’m not sure how you would know 
> since Cloudera doesn’t seem to include Flume in its products any more from 
> what I can tell.  
> 
> The kite-morphines project consists of 18 sub-modules plug 4 aggregation 
> modules. That is a heck of a lot of stuff to try to drag in. I would prefer 
> to fork the parts of kite we would need to a new flume-kite repo.  
> 
> It seems that the CVE the reporter mentioned does have a fix. It is available 
> in parquet-avro 1.11.2 and 1.12.2. I was able to swap the new version for the 
> old one even though the groupId has changed. That said, the kite-sdk 
> dependency that includes it is marked as optional, so parquet-avro would be 
> optional as well. So I have no idea if it is even used.  
> 
> In any case, the unit tests all pass with the updated dependency.  
> 
> Ralph  
> 
> 
> 
>> On Jan 14, 2022, at 3:33 PM, Tristan Stevens <tris...@apache.org> wrote:  
>> 
>> -1 from me.  
>> 
>> First wee can’t do that in a patch release, but that’s semantics.  
>> 
>> Both the Morphlines interceptor and the Morphlines-Solr-Sink are components 
>> that are widely used amongst the community. I did some analysis last year 
>> that I’ll dig out and share, but they are two of the most used components 
>> after HDFS sink, Kafka and JMS.  
>> 
>> Whilst I agree it’s sucky that Cloudera aren’t supporting Kite anymore, I 
>> wonder whether we can find a way to bring Morphlines into here, or otherwise 
>> get upstream and fix the bits that need fixing.  
>> 
>> Tristan  
>> 
>> 
>> From: Ralph Goers <ralph.go...@dslextreme.com> 
>> <mailto:ralph.go...@dslextreme.com>  
>> Reply: dev@flume.apache.org <mailto:dev@flume.apache.org> 
>> <dev@flume.apache.org> <mailto:dev@flume.apache.org>  
>> Date: 13 January 2022 at 15:26:12  
>> To: dev@flume.apache.org <dev@flume.apache.org> 
>> <mailto:dev@flume.apache.org>  
>> Subject: Morphlines-solr-sink  
>> 
>>> While I am not having any trouble building the morphline-solr-sink 
>>> component, it is dependent on the abandoned kite-sdk, which makes its life 
>>> very limited.  
>>> 
>>> In addition, the kite-sdk has a dependency on parquet-avro which, according 
>>> to https://issues.apache.org/jira/browse/FLUME-3403, has vulnerabilities in 
>>> every available release.  
>>> 
>>> Due to these factors I am going to remove the morphline-solr-sink module 
>>> from Flume for the 1.10.0 release.  
>>> 
>>> Ralph  
> 

Reply via email to