[jira] [Updated] (PIG-4457) Error is thrown by JobStats.getOutputSize() when storing to a MySql table

Rohini Palaniswamy (JIRA) Tue, 24 Mar 2015 02:39:07 -0700

     [ 
https://issues.apache.org/jira/browse/PIG-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Rohini Palaniswamy updated PIG-4457:
------------------------------------
    Attachment: PIG-4457-1.patch

> Error is thrown by JobStats.getOutputSize() when storing to a MySql table
> -------------------------------------------------------------------------
>
>                 Key: PIG-4457
>                 URL: https://issues.apache.org/jira/browse/PIG-4457
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>            Reporter: Kunal Kumar
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.15.0
>
>         Attachments: PIG-4457-1.patch
>
>
> Here is an example of stack trace printed to console output. Actually, this 
> is a warning message and does not make the job fail. The data is getting 
> stored to mysql table, but i have no idea why pig is looking to store output 
> on hdfs. I am using PIg along with Tez.
> using output size reader: 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader
> unable to find the output file
> java.io.FileNotFoundException: File 
> hdfs://pts0021.persistent.co.in:9000/user/shareinsights/filtered_stock_data 
> does not exist.
>         at 
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:647)
>         at 
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:101)
>         at 
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:705)
>         at 
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:701)
>         at 
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
>         at 
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:701)
>         at 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.FileBasedOutputSizeReader.getOutputSize(FileBasedOutputSizeReader.java:81)
>         at 
> org.apache.pig.tools.pigstats.JobStats.getOutputSize(JobStats.java:351)
>         at 
> org.apache.pig.tools.pigstats.tez.TezVertexStats.addOutputStatistics(TezVertexStats.java:270)
>         at 
> org.apache.pig.tools.pigstats.tez.TezVertexStats.accumulateStats(TezVertexStats.java:188)
>         at 
> org.apache.pig.tools.pigstats.tez.TezDAGStats.accumulateStats(TezDAGStats.java:209)
>         at 
> org.apache.pig.tools.pigstats.tez.TezPigScriptStats.accumulateStats(TezPigScriptStats.java:180)
>         at 
> org.apache.pig.backend.hadoop.executionengine.tez.TezJob.run(TezJob.java:194)
>         at 
> org.apache.pig.backend.hadoop.executionengine.tez.TezLauncher$1.run(TezLauncher.java:167)
>         at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>         at java.lang.Thread.run(Thread.java:745)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (PIG-4457) Error is thrown by JobStats.getOutputSize() when storing to a MySql table

Reply via email to