Re: [math] threading redux

2015-04-22 Thread Gilles
On Wed, 22 Apr 2015 11:33:30 -0500, Ole Ersoy wrote: On Mon, Apr 20, 2015 at 6:05 PM Phil Steitz wrote: There are lots of ways to allow distributed processes to share common data. Spark has a very nice construct called a Resilient Distributed Dataset (RDD) designed for exactly this purpose.

Re: [COMPRESS and others] FW: Any interest in running Apache Tika as part of CommonCrawl?

2015-04-22 Thread Dominik Stadler
Hi, I have now published a first version of a tool to download binary data of certain file types from the Common Crawl URL Index. Currently it only supports the previous index format, so the data is from around 2012/2013, but this also provides tons of files for mass-testing of our frameworks. I

Re: [LANG] Add ThreadUtils

2015-04-22 Thread Hendrik Dev
based on what i understand from this discussion i did some changes, pls see https://github.com/salyh/commons-lang/commit/12edd41ec7563368b03fa540af5c9a127bbfd89d for the diff and https://github.com/salyh/commons-lang/blob/12edd41ec7563368b03fa540af5c9a127bbfd89d/src/main/java/org/apache/commons/la

[dbutils][dbcp] SQL states

2015-04-22 Thread Gary Gregory
Hi All: JDBC's SQLException has a String SQLState field. I have a need for a formal OO SQLState class and I've started to hack up something very limited for the purpose our app. I feels hacky and less than satisfactory ATM. I wonder: What do you do for handling and generating SQL states in your

Re: [math] threading redux

2015-04-22 Thread Ole Ersoy
On Mon, Apr 20, 2015 at 6:05 PM Phil Steitz wrote: There are lots of ways to allow distributed processes to share common data. Spark has a very nice construct called a Resilient Distributed Dataset (RDD) designed for exactly this purpose. Are there any examples of a class in commons math wher

Re: [math] threading redux

2015-04-22 Thread James Carman
On Mon, Apr 20, 2015 at 6:05 PM Phil Steitz wrote: > > > There are lots of ways to allow distributed processes to share > common data. Spark has a very nice construct called a Resilient > Distributed Dataset (RDD) designed for exactly this purpose. > To take the abstraction layer a step further

Re: [math] threading redux

2015-04-22 Thread Ole Ersoy
That is one way to achieve parallelism. The Executor is one way to manage concurrently executing threads in a single process. There are other ways to do this. My challenge is to find a way to make it possible for users to plug in alternatives. Some of the methods on CompletableFuture allow th

[LANG] Contribution adding some features in StringUtils

2015-04-22 Thread Loic Guibert
Hello, I wrote a personal class "StringHelper" using the StringUtils class and containing some classic routines. I think those could be added to the StringUtils class. There are some of these methods (JavaDoc is actually in French, so I didn't include it here) : --