New UDF for processing Twitter text

2018-07-22 Thread Bob Rudis
This post -- https://rud.is/b/2018/07/22/new-apache-drill-udf-for-processing-twitter-tweet-text/ -- introduces a UDF package for Drill with 5 functions for extracting [meta]data from Twitter tweet text, including: - hashtag extraction - URL extraction - @-mentions extraction - reply-to (if it's

Re: New UDF for processing Twitter text

2018-07-22 Thread Charles Givre
Hey Bob, This looks pretty cool. Have you thought about submitting this as a PR for Drill? I’d be happy to help with that. — C > On Jul 22, 2018, at 17:36, Bob Rudis wrote: > > This post -- > https://rud.is/b/2018/07/22/new-apache-drill-udf-for-processing-twitter-tweet-text/ > -- introduce

Re: New UDF for processing Twitter text

2018-07-22 Thread Bob Rudis
Sir Givre! I confess I hadn't (thought it might be a bit niche) but I can drop a Jira q abt it to see if the team is interested in it. We (i.e. Rapid7) have a few more UDFs coming over the next few weeks (hoping for a Drill 1.14.0 release with an updated/non-acient guava JAR before doing so) as

Re: New UDF for processing Twitter text

2018-07-22 Thread Kunal Khatua
Saw this on Twitter... looked interesting. We've been thinking of revamping the site a bit to help around folks with leveraging Drill in different applications... this could sit nicely in a Cyber-related domain.  As for the Ctrl+Enter / Meta-Enter  ... let's keep that as a separate PR. Always e