Hi Marc, I should have asked, are you running Drill on a single windows machine? If so, Drill will be able to query anything you throw at it. If your data starts to get bigger than a single machine can handle, you'll need to set up a Drill cluster with multiple nodes. This is no different than Spark. I would suggest using Drill to convert the data to parquet format. Often you can achieve a 10x reduction in file size and extreme improvements in query speed.
As for configuring Drill, take a look here: https://drill.apache.org/docs/workspaces/. This explains how to set up a workspace. What you'll want to do is set the workspace to the path to your desktop. Then you can query the files as noted below. Best, -- C > On Nov 24, 2022, at 6:05 PM, marc nicole <[email protected]> wrote: > > also how to execute such queries as SELECT * > FROM dfs.desktop.`file.json` in Java ? > > Le jeu. 24 nov. 2022 à 23:31, Charles Givre <[email protected]> a écrit : > >> Hi Marc, >> Welcome to Drill! Firstly, take a look at the docs for querying a file >> system: https://drill.apache.org/docs/querying-a-file-system-introduction/ >> >> When you start up drill out of the box, there is a connector called dfs >> which points to the local filesystem. You can configure a workspace to >> your desktop folder, then all you have to do is write a query like: >> >> SELECT * >> FROM dfs.desktop.`file.json` >> >> If you're looking to do this programmatically from Java and your data >> isn't too big, the easiest way is probably to use Drill's REST API ( >> https://drill.apache.org/docs/rest-api-introduction/). You can make a >> simple HtTP call to Drill and get the data that way. >> >> Hope this helps! >> -- C >> >> >> >>> On Nov 24, 2022, at 5:02 PM, marc nicole <[email protected]> wrote: >>> >>> Hi, >>> >>> I want to query a JSON file placed in Desktop folder (Windows). >>> How to do that in Java ? >>> >>> PS: i saw this type of code : >>> >>> Connection con = null; >>> >>> con = new Driver().connect(DRILL_JDBC_LOCAL_URI, >> getDefaultProperties()); >>> Statement stmt = con.createStatement(); >>> ResultSet rs = stmt.executeQuery(DRILL_SAMPLE_QUERY);... >>> >>> >>> But that requires using JDBC and to place JSON in jar file within CP of >>> Drill which i don't want; >>> >>> Thanks. >> >>
