Hi all,

I've been looking at the patient matching module standalone, and trying to
understand the matching concept.

Assuming I have table A as follows,
0|1|2|3 (Column IDs)
---------
1|max|m|A
2|jack|m|A
3|jim|m|A

and table B-
0|1|2|3 (Column IDs)
---------
1|jim|m|A
2|jim|m|B

If I block on Column 1, and include column 2, then I would expect it to
identify that the third record in table A is similar to records 1 and 2 of
table B.
After running the linkage process, I get the following results,

the _groups.txt file says  0|0.0|3|2|jim|jim|m|m|A|B

while the _new_match file says

0.0|3|1|jim|jim|m|m|A|A
0.0|3|2|jim|jim|m|m|A|B    (i'm assuming that the double (0.0) in both cols
mean that both these records match evenly ?)


   - any idea why the _groups.txt file fails to identify the relationship
   between 3|1 ?


Instead if I were to run the same test blocking on Column 1, and including
columns 2 and 3,
I would expect it to identify that the third record in table A is similar to
the first record of table B.

the _groups.txt file says 0|3.1699250014423126|3|1|jim|jim|m|m|A|A

while the _new_match file says

3.1699250014423126|3|1|jim|jim|m|m|A|A
-3.1699250014423126|3|2|jim|jim|m|m|A|B


   - Can someone please help explain the data values given here
   ? I'm assuming that this means that the first record (3|1) has a high chance
   of matching  while the second record (3|2) has really low chance of doing so
   ?



-- 
Best Regards,

Suranga

_________________________________________

To unsubscribe from OpenMRS Implementers' mailing list, send an e-mail to 
[email protected] with "SIGNOFF openmrs-implement-l" in the  body 
(not the subject) of your e-mail.

[mailto:[email protected]?body=SIGNOFF%20openmrs-implement-l]

Reply via email to