[jira] [Created] (FLINK-1139) HadoopOutputFormat fails for DOP > 1

2014-10-06 Thread Fabian Hueske (JIRA)
Fabian Hueske created FLINK-1139: Summary: HadoopOutputFormat fails for DOP > 1 Key: FLINK-1139 URL: https://issues.apache.org/jira/browse/FLINK-1139 Project: Flink Issue Type: Bug

Re: load broadcast set into searchable set

2014-10-06 Thread Stephan Ewen
You actually can check directly in the broadcast set. But since it is a list, searching in is slower than in a hash set. That is all... On Mon, Oct 6, 2014 at 1:09 PM, Martin Neumann wrote: > Yes, thanks. > > I was wondering if I can check directly in the broadcast set but since I > have to get

Re: load broadcast set into searchable set

2014-10-06 Thread Martin Neumann
Yes, thanks. I was wondering if I can check directly in the broadcast set but since I have to get it local anyway it should be not to much overhead. cheers Martin On Mon, Oct 6, 2014 at 12:34 PM, Stephan Ewen wrote: > Hej! > > Yes, the "getRuntimeEnvironment().getBroadcastVariable() returns a

Re: load broadcast set into searchable set

2014-10-06 Thread Stephan Ewen
Hej! Yes, the "getRuntimeEnvironment().getBroadcastVariable() returns a list, which you can add to set: // in the function: private Set specials; public void open(Configuration conf) { List bc = getRuntimeContect().getBroadcastVariable("the-bc-var-name"); specials = new HashSet(bc); }

load broadcast set into searchable set

2014-10-06 Thread Martin Neumann
Hej, I have a Flink job with with a filter step. I now have a list of exceptions where I need to do some extra work (300k data). I thought I just use a boradcast set and then for each like compare if its in the exception set. What is the best way to implement this in Flink? Is there an efficient