Re: Regards /tmp/drill

2020-08-19 Thread Bob Rudis
in drill-override.com, something like: drill.exec: { cluster-id: "drillbits1", zk.connect: "localhost:2181", spill: { directories : [ "/some/other/place/for/spill" ], fs : "file:///" }, tmp: { directories: ["/some/other/place/for/tmp"],

Re: Drill's Future

2020-05-15 Thread Bob Rudis
I can carve out work-time to review PRs. -boB On Fri, May 15, 2020 at 11:25 AM Prabhakar Bhosaale wrote: > > HI Charles, > I have never contributed to any open source community before this. Also I > am not expert on JAVA but willing to contribute for sure. Please let me > know in what capacity

Re: java version for Drill JDBC driver

2020-04-09 Thread Bob Rudis
I use the JDBC driver via an RJDBC wrapper I wrote and the rJava it runs in is built with JDK 11, so it definitely is working in 11 for me. > On Apr 9, 2020, at 11:38, Jaimes, Rafael - 0993 - MITLL > wrote: > > On the topic of java versions, I am trying to load the Drill JDBC driver in a >

1.17.0 updated protobuf but mapr ODBC drivers have not been updated

2020-01-13 Thread Bob Rudis
HNY folks, This is more of an FYI vs anything else. I realize the intrepid/awesome Drill team has little control over MapR's speed of catching up to the latest releases but just in case others haven't upgraded to 1.17.0 (I just did today) and use ODBC, you'll see something like this in logs:

Re: [VOTE] Apache Drill Release 1.16.0 - RC0

2019-04-18 Thread Bob Rudis
0aee329976c2191 > > [1]: https://github.com/sohami/drill/commits/drill-1.16.0 > > Thanks, > Sorabh > > On Thu, Apr 18, 2019 at 1:12 PM Bob Rudis wrote: > > > Q abt the RC (and eventual full release): Does > > https://issues.apache.org/jira/browse/DRILL-5509 mean that the

Re: [VOTE] Apache Drill Release 1.16.0 - RC0

2019-04-18 Thread Bob Rudis
Q abt the RC (and eventual full release): Does https://issues.apache.org/jira/browse/DRILL-5509 mean that the ODBC drivers will need to be updated to avoid warning messages (and potential result set errors) as the was the case with a previous release or is this purely (as I read the ticket) a

Re: [DISCUSS] Including Features that Need Regular Updating?

2019-03-22 Thread Bob Rudis
Two rly  UDFs, too! Charles: many of the FOSS (or even non-FOSS) security tools we both likely tend to use in either of those contexts put the onus of updating the core DBs on the end-user (and try to go out of their way in the docs re: need to do that). I think as long as the version

Re: Compatibility Matrix for the ODBC/JDBC clients

2019-02-13 Thread Bob Rudis
Hey Joel… My experience has been the JDBC-side was more tolerant than the free ODBC driver. It took a bit for the ODBC driver to be released after Drill 1.13.0 and the macOS and Linux ODBC versions (I don't use legacy Windows operating systems) kept tossing messages in logs to the point where

Re: Drill fails to query pcap files

2019-02-09 Thread Bob Rudis
Agreed. The reason we have some malformed PCAPs from the global honeypot network is those pesky attackers trying to be clever as they "scapy" their way into breaking their attacks due to shoddy code (more incompetence in this case than capable maliciousness). And, I did indeed find a few and

Re: Drill fails to query pcap files

2019-02-07 Thread Bob Rudis
Sir Givre: I'll be able to (likely this weekend) go back ~18mos and re-test a bunch of our honeypot PCAP files (I remember various ones failing at the time). If I do find "bad" ones, they'll be old enough that we won't care abt a sample PCAP or two exposing honeypots IPs (since they're not there

FYI: Open Drill instances on the internet

2019-01-15 Thread Bob Rudis
Hey Drillers, There's been a spate of attacker groups looking for (for lack of a better term) "big data-ish" open servers on the internet. We've caught quite a few going after Hadoop, Spark and other things but I've also recently seen some hits to our global sensor network on 8047 (a port I

Some updates to the Drill R interface (the "sergeant" package) to start the new year

2019-01-09 Thread Bob Rudis
HNY Drillers! I've dropped a couple of posts on the R sergeant package enhancements as a result of the epic 1.15.0 release (it was a great present for the holidays!). This: https://rud.is/b/2019/01/09/on-the-road-to-0-8-0-some-additional-new-features-coming-in-the-sergeant-package/ covers

Re: Drill 1.14.0 and mapr ODBC drivers (linux/macos)

2018-08-05 Thread Bob Rudis
with a JDBC driver that > will work). There will be new MapR ODBC and JDBC drivers coming out with > the new MapR Drill release to address this. > > Thanks. > > --Robert > > On Sun, Aug 5, 2018 at 1:09 PM, Bob Rudis wrote: > >> Is anyone else getting libprotobu

Drill 1.14.0 and mapr ODBC drivers (linux/macos)

2018-08-05 Thread Bob Rudis
Is anyone else getting libprotobuf messages/errors when using the MapR ODBC drivers (maprdrill-1.3.16.1049-1.x86_64.rpm / MapR_Drill_1.3.dmg via http://package.mapr.com/tools/MapR-ODBC/MapR_Drill/MapRDrill_odbc_v1.3.16.1049/)? Trying to see if it's just me before I bug the dev list (tho I know

A few UDFs for retrieving and working with HTML content

2018-07-23 Thread Bob Rudis
There is a WIP collection of HTML retrieval and processing UDFs available at - GitLab: https://gitlab.com/hrbrmstr/drill-html-tools - GitHub: https://github.com/hrbrmstr/drill-html-tools The UDFs use the lightweight jsoup Java library (https://github.com/jhy/jsoup/) included with the project

Re: New UDF for processing Twitter text

2018-07-22 Thread Bob Rudis
8, at 5:38 PM, Charles Givre wrote: > > Hey Bob, > This looks pretty cool. Have you thought about submitting this as a PR for > Drill? I’d be happy to help with that. > — C > >> On Jul 22, 2018, at 17:36, Bob Rudis wrote: >> >> This post -- >> https://

Re: Required help in apache drill

2018-04-20 Thread Bob Rudis
That's base 64 encoded data which turns out to be '1114502378'. I cld have sworn Drill has built-in base64 conversion but I also wrote a UDF a while back that has a simple decoding function to turn base64 to text (which is dangerous since base64 content could and often is binary).

Re: PCAP files with Apache Drill and Sergeant R

2018-02-07 Thread Bob Rudis
Thank you :-) And, I've poked at PCAPs with Drill & sergeant to great effect (not on S3, but that — as you said — should work fine, too). I just wish I had time to PR into the project to have it not totally bork on imperfect packets, support more PCAP formats and add in/port some helper UDF

Re: drill in 10 minutes tutorial problem with osx

2018-01-28 Thread Bob Rudis
I run drill in single-node cluster mode (vs standalone) on macOS and use the installation procedure in this gist to install/run it (change the drill d/l version since it's still at 10) : https://gist.github.com/hrbrmstr/82a3f74b0b353a63a09061207e9f2725 > On Jan 26, 2018, at 3:32 AM,

Re: Apache Drill Cookbook

2017-11-03 Thread Bob Rudis
>From what I recall there was a Drill book in the works but I haven't seen much on it in over a year. On Fri, Nov 3, 2017 at 8:53 PM, Saurabh Mahapatra wrote: > Hi all, > > I was curious if there is anyone in the community writing a book on Apache > Drill. I think

Re: Date Conversion Question

2017-10-18 Thread Bob Rudis
FWIW I was doing very similar substring (etc) machinations until we started converting output from back-end data-generation tools directly into parquet (using other tools). IMO it's a common enough format (at least in the types of data you and I likely have to work with :-) that it'd be great if

Re: Does Drill Use Apache Struts

2017-09-08 Thread Bob Rudis
le to use that as the > starting point). Also, it's good to raise awareness on all of these points > in general so I always appreciate lively discussions :) > > > > On Fri, Sep 8, 2017 at 10:42 AM, Bob Rudis <b...@rud.is> wrote: > >> I personally haven't had t

Re: Does Drill Use Apache Struts

2017-09-08 Thread Bob Rudis
ing about attackers using Struts to get to a zone where SQLi was >>> possible. I will be clear(er): I have not verified that Equifax is wholly >>> struts, or even related to Struts, but my fear right now is focused on open >>> source projects that may use Struts and I thi

Re: Does Drill Use Apache Struts

2017-09-08 Thread Bob Rudis
Equifax was likely unrelated SQL injection. Don't spread rumors. Struts had yet-another-remote exploit (three of 'em, actually). I do this for a living (cybersecurity research). Drill is not impacted which can be verified by looking at dependencies in

Re: 1.11.0 RC question

2017-07-26 Thread Bob Rudis
t; > > > Is DRILL-5432 the one you are talking about? I saw it's merged and should > > have been put in the release candidate. > > > > What type of error did you see when you tried to query a PCAP? Also, it > may > > help to provide the commit id of your build,

1.11.0 RC question

2017-07-26 Thread Bob Rudis
I wasn't sure if this belonged on the dev list or not but I was peeking around the JIRA for 1.11.0 RC and noticed that it _looked_ like PCAP support is/was going to be in 1.11.0 but when I did a quick d/l and test of the RC (early yesterday) and tried to query a PCAP it did not work. I'm

R interface to Apache Drill now on CRAN

2017-07-17 Thread Bob Rudis
Hey folks, Those using R or contemplating dabbling in R can now grab the 'sergeant' package — an R interface to Apache Drill — directly from CRAN : https://cran.r-project.org/package=sergeant As stated previously, the package provides an RJDBC wrapper (you supply the jar file), a DBI wrapper to

R interface to Drill heading to CRAN (last call for issues/features)

2017-06-17 Thread Bob Rudis
Hey folks, I've mentioned sergeant - - before. It's an R package that provides an RJDBC driver, R DBI driver, dplyr interface (with some custom functions mapped) and a REST interface client to Apache Drill. Most of the focus/dev has been on the dplyr

Re: Drill Summit/Conference Proposal

2017-06-14 Thread Bob Rudis
I grok this is the Drill list and I'm also a big user of Drill (and have made some UDFs) but there might be some efficacy in expanding the scope to the Presto and Redshift Spectrum communities. I'm not claiming there's 100% equivalence, but the broader view of being able to access multiple types

Re: Reg; Apache Drill

2017-06-06 Thread Bob Rudis
You should likely spend some time studying statistics and machine learning then examine the pluses and minuses of a few "data science"-oriented programming languages and focus on one that has idioms that make sense to you. Then you'll see just how inappropriate your question is. On Tue, Jun 6,

Re: [ANNOUNCE] Apache Drill 1.10.0 Released

2017-03-23 Thread Bob Rudis
Drill core & supporting devs: 1.10.0 release is incredibly timely, helpful & ++gd. It's faster (I'll benchmark at some point), appears (I'll measure at some point) to be more memory-efficient and many of the fixes came just in time for an annual project that uses data I had to spend time

Re: Drill memory configuration in windows

2017-03-20 Thread Bob Rudis
Follow this thread : http://mail-archives.apache.org/mod_mbox/drill-user/201703.mbox/%3CCAJgUOA7OyXFeX4WCTZv6fAzvOxQt_kjTJBbxk8_QEGDW_3nZbA%40mail.gmail.com%3E Searching mailing list archives is generally step #2 before posting (step #1 is generally google, but it doesn't catch the list archives

Re: REST API queries with PAM

2017-02-25 Thread Bob Rudis
You have to hit the auth check endpoint first and save off the session cookies: curl -X POST \ -H "Content-Type: application/x-www-form-urlencoded" \ -k \ -c cookies.txt \ -s \ -d "j_username=USERNAME" \ -d "j_password=PASSWORD" \

WARC files

2017-01-16 Thread Bob Rudis
Hey folks, Does anyone know if there have been UDFs made to enable working with WARC files in Drill? WARC: http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml thx, -Bob

Re: New Drill R "dplyr" interface

2016-12-30 Thread Bob Rudis
quot; when there's Drill, Spark, H2O, etc that do a good amt of that work better, albeit lacking in many of the more advanced stats/ML pkgs R has available to it). On Mon, Dec 26, 2016 at 3:24 PM, Ted Dunning <ted.dunn...@gmail.com> wrote: > On Mon, Dec 26, 2016 at 7:22 AM, Bob Rudis

Re: New Drill R "dplyr" interface

2016-12-26 Thread Bob Rudis
aint, Drill is at heart an EDA/BI workhorse and while it provides the data format transformation functions such a use of VALUES is a bit extreme. On Sun, Dec 25, 2016 at 2:40 PM, Ted Dunning <ted.dunn...@gmail.com> wrote: > On Sun, Dec 25, 2016 at 11:04 AM, Bob Rudis <b...@rud.is> wr

New Drill R "dplyr" interface

2016-12-22 Thread Bob Rudis
First time poster; short-time lurker. I've built an alpha R package — sergeant — https://github.com/hrbrmstr/sergeant — that provides direct access to the Drill REST and JDBC interfaces and has a very nascent R "dplyr" interface (if you know R you prbly know what that means). Please kick the