#145: transform (ETL) GPC i2b2 data to PCORNet CDM --------------------------+---------------------------- Reporter: dconnolly | Owner: ngraham Type: enhancement | Status: assigned Priority: major | Milestone: data-domains2 Component: data-sharing | Resolution: Keywords: | Blocked By: 109 Blocking: 160 | --------------------------+----------------------------
Comment (by ngraham): Using the `c_dimcode` in comment:14 doesn't work well for cases where there are multiple concepts in the GPC/HERON hierarchy map to a single concept in the PCORI hierarchy. One example is Hispanic: ||PCORI path||HERON path ||\PCORI\DEMOGRAPHIC\HISPANIC\Y\||\i2b2\Demographics\Ethnicity\Hispanic\|| ||\PCORI\DEMOGRAPHIC\HISPANIC\Y\||\i2b2\Demographics\Ethnicity\Hispanic, Latino or Spanish Origin\|| So, instead of using the `c_dimcode` I plan to insert rows into the concept dimension that map the PCORI paths to the appropriate concept codes. I plan to do this by building a mapping between the PCORI paths and the GPC paths (much like we did in [https://bitbucket.org/njgraham /pcori-annotated-data- dictionary/src/1c1c0f980377bbd1adbe839edf84d8de92f5b1de/heron_to_pcori.csv?at=default heron_to_pcori.csv] for the Annotated data Dictionary). I tried this out for diagnosis and was able to run a query for `ICD-9-CM` in the PCORI hierarchy (item key `<item_key>\\PCORI_\PCORI\DIAGNOSIS\DX_TYPE\09\</item_key>` against the small KU test data set and return a patient count: {{{ Finished Query: "ICD-9-CM@08:59:10" [2.4 secs] Compute Time: 1 secs Number of patients for "ICD-9-CM@08:59:10" patient_count: 141 }}} Also, demographics (and, specifically, the Hispanic flag noted above). {{{ Finished Query: "HISPANIC@09:01:26" [1.7 secs] Compute Time: 1 secs Number of patients for "HISPANIC@09:01:26" patient_count: 721 }}} -- Ticket URL: <http://informatics.gpcnetwork.org/trac/Project/ticket/145#comment:15> gpc-informatics <http://informatics.gpcnetwork.org/> Greater Plains Network - Informatics _______________________________________________ Gpc-dev mailing list Gpc-dev@listserv.kumc.edu http://listserv.kumc.edu/mailman/listinfo/gpc-dev