#56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ... --------------------------+---------------------------- Reporter: dconnolly | Owner: bos Type: enhancement | Status: assigned Priority: minor | Milestone: data-domains3 Component: data-sharing | Resolution: Keywords: | Blocked By: 388 Blocking: | --------------------------+----------------------------
Comment (by dconnolly): Russ just pointed out the **Synthetic File Creation Process**: The DE-SynPUF was created by starting with an actual beneficiary as a “seed” for a synthetic beneficiary. The variables of the seed beneficiary profile were changed by taking characteristics from similar but different “donor” beneficiaries within the source data. The claims from the seed beneficiary were then replaced with claims from other donor beneficiary claims sets. ... -- section 6. Methodology and Limitation \\[https://www.cms.gov/Research-Statistics-Data-and-Systems /Downloadable-Public-Use-Files/SynPUFs/Downloads/SynPUF_DUG.pdf User Manual: CMS Linkable 2008–2010 Medicare DE-SynPUF] \\[https://www.cms.gov/Research-Statistics-Data-and-Systems /Downloadable-Public-Use-Files/SynPUFs/ Medicare Claims Synthetic Public Use Files (SynPUFs)] See also SamTheEagle CMS->i2b2 ETL work. -- Ticket URL: <http://informatics.gpcnetwork.org/trac/Project/ticket/56#comment:31> gpc-informatics <http://informatics.gpcnetwork.org/> Greater Plains Network - Informatics _______________________________________________ Gpc-dev mailing list Gpc-dev@listserv.kumc.edu http://listserv.kumc.edu/mailman/listinfo/gpc-dev