[ 
https://issues.apache.org/jira/browse/HADOOP-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12647030#action_12647030
 ] 

Marc-Olivier Fleury commented on HADOOP-4635:
---------------------------------------------

Yes, after posting my last comment I noticed exactly the problem you mentioned.

I see two ways of correcting this : either increment the number of groups, or 
delete one more item than the number of groups. It merely depends on the 
meaning of num_groups. 

I chose the second way to solve it, adding  {code}free(groups[i]){code} at the 
end of the for loop, to mirror the creation code.

I also changed
{code}
groupnames = (char**)malloc(sizeof(char*)* (*num_groups) + 1);
{code}
to
{code}
groupnames = (char**)malloc(sizeof(char*)* (*num_groups + 1) );
{code}

just in case sizeof(char*) != 1  (we never know...)

Thanks for the details.

> Memory leak ?
> -------------
>
>                 Key: HADOOP-4635
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4635
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/fuse-dfs
>    Affects Versions: 0.19.0, 0.20.0
>            Reporter: Marc-Olivier Fleury
>
> I am running a process that needs to crawl a tree structure containing ~10K 
> images, copy the images to the local disk, process these images, and copy 
> them back to HDFS.
> My problem is the following : after about 10h of processing, the processes 
> crash, complaining about a std::bad_alloc exception (I use hadoop pipes to 
> run existing software). When running fuse_dfs in debug mode, I get an 
> outOfMemoryError, telling that there is no more room in the heap.
> While the process is running, using top or ps, I notice that fuse is using up 
> an increasing amount of memory, until some limit is reached. At that point , 
> the memory used is oscillating. I suppose that this is due to the use of the 
> virtual memory.
> This leads me to the conclusion that there is some memory leak in fuse_dfs, 
> since the only other programs running are Hadoop and the existing software, 
> both thoroughly tested in the past.
> My problem is that my knowledge concerning memory leak tracking is rather 
> limited, so I will need some instructions to get more insight concerning this 
> issue.
> Thank you

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to