JSON_SUB_SAN is the Json reader. It uses Jackson to do actual parsing, and 
converts the data into Drill's internal value vector format. TEXT_SUB_SCAN is 
the corresponding operator for csv.

If the Drill system has access to the /log/profile directory then you can, in 
fact, use Drill to query the json in the query profile. You might want to setup 
an nfs location for the query profiles,so that the directory is visible to all 
drillbits.  The simply create a new workspace pointing to the directory. You 
will be able to read the profiles like any other Json file.

________________________________
From: Nikos R. Katsipoulakis <nick.kat...@gmail.com>
Sent: Wednesday, January 11, 2017 7:37:30 AM
To: user@drill.apache.org
Subject: Additional information on JSON_SUB_SCAN operator and access to query 
profiles not from the Web UI

Hello all,

I am a new user of Apache Drill and I am in the process of better
understanding its internals. To that end, I have two questions, for which I
was unable to find more information online.

First, when I execute an EXPLAIN command for a query that gets its data
from JSON files, I see a physical operator named JSON_SUB_SCAN. What does
that operator exactly do? Is it only used for parsing (extracting) fields
from JSON data? Or does it perform additional processing? As far as I know,
Drill uses Jackson Streaming API for extracting JSON data. Is that still
true? Finally, what is the equivalent operator for CSV files?

Second, I need to access query profiles from a server that is behind a
firewall. Therefore, accessing the URL of that machine on port 8047 is a
headache (since I have to submit a ticket to IT Support). My question is
whether I can access the Query Profiles in any other way? Like from the
sqlline or through log/profile files created while executing queries.

Thank you and Kind Regards,

--
Nikos R. Katsipoulakis,
Department of Computer Science
University of Pittsburgh

Reply via email to