[ https://issues.apache.org/jira/browse/METRON-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16607678#comment-16607678 ]
ASF GitHub Bot commented on METRON-1772: ---------------------------------------- GitHub user nickwallen opened a pull request: https://github.com/apache/metron/pull/1191 METRON-1772 Support alternative input formats in the Batch Profiler [Feature Branch] Additional formats, like ORC, should be supported when reading the input telemetry in the Batch Profiler. The user should be able to customize the Batch Profiler based on how they have chosen to archive their telemetry. - Added an additional command line option that allows the user to pass custom options to the `DataFrameReader`. This may be needed by a user depending on how they've archive their telemetry. - Added an integration test that validates that the Batch Profiler can read CSV data. I added CSV as a test so that I could validate the user providing custom options to the `DataFrameReader`. - Added an integration test that validates that the Batch Profiler can read ORC data. This is a pull request against the `METRON-1699-create-batch-profiler` feature branch. This is dependent on the following PRs. By filtering on the last commit, this PR can be reviewed before the others are reviewed and merged. - [ ] #1189 ## Testing - [ ] TODO I need to provide a solid set of test instructions for Full Dev. ## Pull Request Checklist - [x] Have you included steps to reproduce the behavior or problem that is being changed or addressed? - [x] Have you included steps or a guide to how the change may be verified and tested manually? - [x] Have you ensured that the full suite of tests and checks have been executed in the root metron folder via: - [x] Have you written or updated unit tests and or integration tests to verify your changes? - [x] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)? - [ ] Have you verified the basic functionality of the build by building and running locally with Vagrant full-dev environment or the equivalent? You can merge this pull request into a Git repository by running: $ git pull https://github.com/nickwallen/metron METRON-1772 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/metron/pull/1191.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1191 ---- commit eb419d8bdc6e60900bb08142d4d2bb50b2bd8c23 Author: Nick Allen <nick@...> Date: 2018-09-06T18:47:28Z METRON-1770 Add Docs for Running the Profiler with Spark on YARN commit b17e328fed0f3f51f14383ae48cb526b0993d314 Author: Nick Allen <nick@...> Date: 2018-09-07T21:07:44Z METRON-1772 Support alternative input formats in the Batch Profiler ---- > Support alternative input formats in the Batch Profiler > ------------------------------------------------------- > > Key: METRON-1772 > URL: https://issues.apache.org/jira/browse/METRON-1772 > Project: Metron > Issue Type: Sub-task > Reporter: Nick Allen > Assignee: Nick Allen > Priority: Major > > Additional formats, like ORC, should be supported when reading the input > telemetry in the Batch Profiler. The user should be able to customize the > Batch Profiler based on how they have chosen to archive their telemetry. -- This message was sent by Atlassian JIRA (v7.6.3#76005)