If hashing would work, then keying on ": would work.  I expect though
that I would need to hash at least twice (adding epsilon before the
second hash) and I expect that I would need to do something similar if
I used ":

-- 
Raul

On Mon, Jan 16, 2012 at 2:14 PM, Roger Hui <rogerhui.can...@gmail.com> wrote:
> Hashing has expected O(n) time.
>
>
>
> On Mon, Jan 16, 2012 at 11:11 AM, Raul Miller <rauldmil...@gmail.com> wrote:
>
>> I think that the "monster case" would be a case where all values are
>> similar enough that they all map to the same index but different
>> enough that they cannot be recognized as literal equivalents.
>>
>> In the case I am currently interested in, the original values would be
>> 32 bit floating point numbers and only a relatively few bits of the
>> available precision would allow values to be treated as "tolerantly
>> equal".  This would suggest that the size of the "monster case" is
>> limited based on the number of bits being ignored.
>>
>> So, in principle at least, this should limit the size of the
>> "quadratic part" of the problem, for the cases I am trying to address.
>>
>> --
>> Raul
>>
>> On Mon, Jan 16, 2012 at 12:04 PM, Roger Hui <rogerhui.can...@gmail.com>
>> wrote:
>> > The paper I cited,  *Hashing for Tolerant
>> > Index-Of<http://www.jsoftware.com/papers/Hashing.htm>
>> > * , presents a "monster" that defeats a sorting algorithm.   (Defeat in
>> the
>> > sense of causing it take quadratic time.)
>> >
>> >
>> >
>> > On Mon, Jan 16, 2012 at 8:07 AM, Henry Rich <henryhr...@nc.rr.com>
>> wrote:
>> >
>> >> You can sort the lists and then compare adjacent values; find
>> >> superfluous ones; then i.!.0 to find them in the original list.
>> >>
>> >> A tricky part is that proximity is not a transitive property.  If the
>> >> tolerance is 2, and the data is
>> >>
>> >> 1 2 3 4 5 6 7
>> >>
>> >> what should the result of the i.~ be?
>> >>
>> >> Henry Rich
>> >>
>> >> On 1/16/2012 10:06 AM, Raul Miller wrote:
>> >> > First:  I like Roger Hui's response.  And, in essence, it's doing
>> >> > exactly what you suggest.  However, this requires comparing every
>> >> > number in the left list with every number in the right list.  I am
>> >> > currently pondering algorithms which rely on I. so that when the lists
>> >> > are long computation times are still reasonable (perhaps with 100000
>> >> > members in each list).
>> >> >
>> >> > Second:  I would want the three PI values in my original message to be
>> >> > treated as equal.  I want to be able to specify a magnitude of
>> >> > acceptable difference which is greater than any of the differences in
>> >> > that data sample.
>> >> >
>> >> > FYI,
>>
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to