Hi all, I've been looking at the patient matching module standalone, and trying to understand the matching concept.
Assuming I have table A as follows:

0|1|2|3 (Column IDs)
---------
1|max|m|A
2|jack|m|A
3|jim|m|A

and table B:

0|1|2|3 (Column IDs)
---------
1|jim|m|A
2|jim|m|B

If I block on column 1 and include column 2, I would expect it to identify that the third record in table A is similar to records 1 and 2 of table B. After running the linkage process, I get the following results.

The _groups.txt file says:

0|0.0|3|2|jim|jim|m|m|A|B

while the _new_match file says:

0.0|3|1|jim|jim|m|m|A|A
0.0|3|2|jim|jim|m|m|A|B

(I'm assuming that the double value (0.0) in both lines means that both these records match evenly?)

- Any idea why the _groups.txt file fails to identify the relationship between 3|1?

If I instead run the same test blocking on column 1 and including columns 2 and 3, I would expect it to identify that the third record in table A is similar to the first record of table B.

The _groups.txt file says:

0|3.1699250014423126|3|1|jim|jim|m|m|A|A

while the _new_match file says:

3.1699250014423126|3|1|jim|jim|m|m|A|A
-3.1699250014423126|3|2|jim|jim|m|m|A|B

- Can someone please help explain the data values given here? I'm assuming this means that the first pair (3|1) has a high chance of matching, while the second pair (3|2) has a really low chance of doing so?

--
Best Regards,
Suranga
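P.S. To make my expectation concrete, here is a minimal Python sketch of how I understand blocking and column comparison. It is not the module's code: the function names are hypothetical, and the m = 0.9 and u = 0.1 values in log2_weight are purely an assumption on my part. I only include that function because log2(0.9 / 0.1) = log2(9) happens to equal the 3.1699250014423126 in my output, so the scores may be log2 agreement and disagreement weights. If so, column 2 would seem to contribute 0 and column 3 would contribute plus or minus log2(9), but that is a guess.

```python
import math

# Tables from the message above; column 0 is the record ID.
TABLE_A = [("1", "max", "m", "A"), ("2", "jack", "m", "A"), ("3", "jim", "m", "A")]
TABLE_B = [("1", "jim", "m", "A"), ("2", "jim", "m", "B")]


def candidate_pairs(table_a, table_b, block_col):
    """Pair only records whose blocking column values are equal."""
    return [(a, b) for a in table_a for b in table_b if a[block_col] == b[block_col]]


def agreement(a, b, include_cols):
    """Report, per included column, whether the two records agree."""
    return {col: a[col] == b[col] for col in include_cols}


def log2_weight(agrees, m=0.9, u=0.1):
    """Hypothetical log2 weight for one column; m and u are assumed values."""
    return math.log2(m / u) if agrees else math.log2((1 - m) / (1 - u))


# Blocking on column 1 (the name) leaves only the two "jim" pairs.
pairs = candidate_pairs(TABLE_A, TABLE_B, block_col=1)
for a, b in pairs:
    print(a[0], b[0], agreement(a, b, include_cols=[2, 3]))
```

With blocking on column 1, only the pairs 3|1 and 3|2 are compared. Pair 3|1 agrees on columns 2 and 3, while pair 3|2 agrees on column 2 and disagrees on column 3.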

