I'd convert the whole DataFrame to a matrix and use a loop over rows.
On Nov 22, 2015 2:54 AM, "Fred" <fred.softwa...@gmail.com> wrote:

> In my last example, the function mean() is not well chosen. In fact, what
> I would like to calculate is a statistical test line by lline, like TTest,
> or Wilcoxon. This is why I need to iterate thought 2 DataFrames at the same
> time if I subset the DataFrame first to increase speed :)
>
>
> Something like :
>
> julia> for r1,r2 in eachrow(df1, df2)
>               println(TTest(r1,r2))
>        end
> ERROR: syntax: invalid iteration specification
>
>
>
>
> Le samedi 21 novembre 2015 19:17:27 UTC+1, Fred a écrit :
>>
>> It is a good idea but how is it possible to iterate two dataframes at the
>> same time ? Something like :
>>
>> julia> df = DataFrame(a=1:5, b=7:11, c=10:14, d=20:24)
>> 5x4 DataFrames.DataFrame
>> | Row | a | b  | c  | d  |
>> |-----|---|----|----|----|
>> | 1   | 1 | 7  | 10 | 20 |
>> | 2   | 2 | 8  | 11 | 21 |
>> | 3   | 3 | 9  | 12 | 22 |
>> | 4   | 4 | 10 | 13 | 23 |
>> | 5   | 5 | 11 | 14 | 24 |
>>
>> julia> df1 = df[1:2,]
>> 5x2 DataFrames.DataFrame
>> | Row | a | b  |
>> |-----|---|----|
>> | 1   | 1 | 7  |
>> | 2   | 2 | 8  |
>> | 3   | 3 | 9  |
>> | 4   | 4 | 10 |
>> | 5   | 5 | 11 |
>>
>> julia> df1 = df[3:4,]
>> 5x2 DataFrames.DataFrame
>> | Row | c  | d  |
>> |-----|----|----|
>> | 1   | 10 | 20 |
>> | 2   | 11 | 21 |
>> | 3   | 12 | 22 |
>> | 4   | 13 | 23 |
>> | 5   | 14 | 24 |
>>
>> julia> for r1,r2 in eachrow(df1, df2)
>>               println(mean(r1,r2))
>>        end
>> ERROR: syntax: invalid iteration specification
>>
>>
>>
>>
>> Le samedi 21 novembre 2015 15:08:34 UTC+1, tshort a écrit :
>>>
>>> For the subset, do the indexing after the conversion to an array, or
>>> subset the DataFrame first (probably faster).
>>>
>>

Reply via email to