RE: Gather 2015 4th quarterly QA results

Dan Connolly Mon, 15 Feb 2016 12:19:14 -0800

Thanks for elaborating, but  I'm having trouble finding that statement in our 
shared code. "JOIN  #Q4_2015_GPC_QA_ACTIVE_PATIENT" doesn't occur in 
qa_mssql.sql<https://bitbucket.org/gpcnetwork/gpc-qa-quarterly/src/e6d33c16e8691529f37025c36163bd3377d7e9e8/qa_mssql.sql?at=default>
 of 2016-01-20 e6d33c16e869. nor does "QA_ACTIVE_TMP".


Are you referring to some previous version of the code? If so, which version?

Regarding editing val_list, I'm not entirely clear on how it works. I sort of 
naively hope that practice diminishes over time. The instructions in the 
REAME<https://bitbucket.org/gpcnetwork/gpc-qa-quarterly> say:
For the script to return values, the columns titled "CONCEPT_PATH_OR_CODE" need 
to be updated with your organization's paths or codes. By default this column 
contains the paths or codes used in Babel. Do not edit the values in the 
"BABEL_PATH_OR_CODE" columns. These are used for reference to identify if your 
organization's path matches the babel path.
--
Dan

________________________________
From: Green, Timothy A. [green...@health.missouri.edu]
Sent: Monday, February 15, 2016 1:54 PM
To: Thomas Mish; Dan Connolly; Mosa, Abu S.
Cc: McNeeley, Todd A.; <gpc-dev@listserv.kumc.edu>
Subject: RE: Gather 2015 4th quarterly QA results

Tom, thanks.  We’ve not yet started doing anything related to making it neutral 
with regards to location of patient attributes.  It’s a tall order…I’ve taken a 
look at the way the i2b2 service layer builds it’s queries.  It’s fairly 
complex object oriented code that will be hard to adapt to straight SQL I think.

I think we will approach the first round with some hard coded queries against 
patient_dimension.

Dan:

This statement returns no results:
select v.VARIABLE_NAME ,count(distinct o.PATIENT_NUM) as RESULT
  into #Q4_2015_GPC_QA_ACTIVE_TMP
  FROM I2B2Data.Q4_2015_GPC_QA_VAL_LIST v
        JOIN I2B2Data.CONCEPT_DIMENSION c on (c.CONCEPT_PATH like 
v.CONCEPT_PATH_OR_CODE or c.CONCEPT_CD like v.CONCEPT_PATH_OR_CODE)
        JOIN I2B2Data.OBSERVATION_FACT o on o.concept_cd = c.concept_cd
        JOIN  #Q4_2015_GPC_QA_ACTIVE_PATIENT p on p.patient_num = o.patient_num
        where v.POPULATION = 'Active'
        and v.QUERY_TYPE = 'COUNT PATIENT'
        group by v.variable_name;

We of course do have patients in our database, but this query assumes the 
information of interest is in observation_fact when it is in fact in 
patient_dimension in our database.

Regarding the ontology mapping, what I got from the instructions was to map our 
concept paths into the Q4_2015_GPC_QA_VAL_LIST file so it would return results. 
 Am I to understand that we should not be mapping our ontology paths to the 
Q4_2015_GPC_QA_VAL_LIST, but instead alter our metadata in i2b2 to match that 
which is in the base Q4_2015_GPC_QA_VAL_LIST file?


From: Thomas Mish [mailto:m...@wisc.edu]
Sent: Monday, February 15, 2016 12:17 PM
To: Dan Connolly; Green, Timothy A.; Mosa, Abu S.
Cc: McNeeley, Todd A.; <gpc-dev@listserv.kumc.edu>
Subject: RE: Gather 2015 4th quarterly QA results

I think that very early on in the GPC conversations we covered a lot of ground 
in the fact vs patient dimension conversation. I know that here at Madison we 
altered the way we organized our i2b2 data at the time to that we were more 
like the rest of the GPC. I understand the angst over changing. I’d love to see 
what you doing/have done to make the script more neutral on this issues. We’ll 
incorporate them as best as possible for next time to minimize the churn.

-TM

From: 
gpc-dev-boun...@listserv.kumc.edu<mailto:gpc-dev-boun...@listserv.kumc.edu> 
[mailto:gpc-dev-boun...@listserv.kumc.edu] On Behalf Of Dan Connolly
Sent: Monday, February 15, 2016 11:50 AM
To: Green, Timothy A. 
<green...@health.missouri.edu<mailto:green...@health.missouri.edu>>; Mosa, Abu 
S. <mo...@health.missouri.edu<mailto:mo...@health.missouri.edu>>
Cc: McNeeley, Todd A. 
<mcneele...@health.missouri.edu<mailto:mcneele...@health.missouri.edu>>; 
<gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>> 
<gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>>
Subject: RE: Gather 2015 4th quarterly QA results

To the extent that I did any of the writing of the QA scripts, I made no 
assumption about fact table vs. patient dimension. All our ontology design 
decisions regard paths. I haven't looked at all the code in detail, though. If 
there is such an assumption, it should be straightforward to refer us to one or 
more lines or statements that exhibit the assumption. Would you please give 
such details? Or just run the script and give us one or more diagnostic 
messages that show the assumption in action?

And as to "the relatively simple task of mapping..." that's the point of the QA 
scripts: to measure progress on that mapping. If you run the scripts and get 
some zeros, the script is still working correctly. It correctly shows that 
you're not finished. (You're not alone. KUMC has plenty of work to do in this 
area, for example.)

--
Dan
________________________________
From: Green, Timothy A. [green...@health.missouri.edu]
Sent: Monday, February 15, 2016 10:38 AM
To: Mosa, Abu S.; Dan Connolly
Cc: <gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>>; McNeeley, 
Todd A.
Subject: RE: Gather 2015 4th quarterly QA results
The patient demographics are stored in our i2b2 in the person dimension.  The 
QA scripts are written with the assumption that all data is stored in the fact 
table.  There is no provision for the script to leverage the ontology 
architecture to determine the storage location of the various pieces of data.  
I’m sure it was done that way to simplify the script, but it does ignore a 
pretty major feature of i2b2: the ontology mapping functionality.  This makes 
it less portable to sites that have a different implementation of the ontology. 
 I don’t characterize a difference in implementation a problem in the data 
warehouse.

Keep in mind that MU had an i2b2 instance back in 2014, and the MU architecture 
was established before joining the GPC.  It would be work to either shift our 
demographics storage (and additional storage capacity), or to rewrite the QA 
scripts.  We will opt to rewrite the QA scripts, but to do it in a portable way 
(i.e. we could share our changes back to the GPC) will require some complexity 
of coding to leverage the ontology framework in i2b2.  We may have to opt for 
hardcoding our version against the dimension table.

Aside from that, we still have the relatively simple task of mapping our 
concept codes/paths to get the remainder (non-demographic part) of the script 
to work correctly.

Tim



From: Mosa, Abu S.
Sent: Monday, February 15, 2016 10:20 AM
To: Dan Connolly
Cc: <gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>>; Green, 
Timothy A.; McNeeley, Todd A.
Subject: RE: Gather 2015 4th quarterly QA results

I would defer this to Tim and Todd to share their thoughts on this. Thanks. 
--Mosa

From: Dan Connolly [mailto:dconno...@kumc.edu]
Sent: Monday, February 15, 2016 10:03 AM
To: Mosa, Abu S. <mo...@health.missouri.edu<mailto:mo...@health.missouri.edu>>
Cc: <gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>> 
<gpc-dev@listserv.kumc.edu<mailto:gpc-dev@listserv.kumc.edu>>
Subject: RE: Gather 2015 4th quarterly QA results

What are the symptoms when you try to run the QA scripts, Mosa? What 
diagnostics do you get?

The premise of the QA scripts is that if they don't run, that's a problem with 
your data warehouse. Perhaps there are actually problems with the scripts, but 
if so, we need details on what those problems are. Also, we have a shared 
source code repository in 
gpc-qa-quarterly<https://bitbucket.org/gpcnetwork/gpc-qa-quarterly>. So if you 
make changes, please strive to make them portable to other sites.

p.s. I presume it's OK to share this thread with gpc-dev, as it regards group 
technical work.

--
Dan
________________________________
From: Mosa, Abu S. [mo...@health.missouri.edu]
Sent: Friday, February 12, 2016 4:41 PM
To: Dan Connolly
Subject: Gather 2015 4th quarterly QA results
Hi Dan,

I talked with our technical team about generating the 2015 4th quarterly 
report. They looked into the MSSQL code that you referred to me on BitBucket. 
They advised that major re-writing effort is needed in order to localize the 
scripts to be able to run on our i2b2 data model. They are currently focusing 
on the Phase I implementation (specifically tumor registry data load) which 
they target to complete by end of this month. So, a viable timeline for 
generating the 2015 4th quarterly report for us would be Mid-march. Let me know 
if you have any questions. If needed, we can discuss this during the next Dev 
call.

Regards,
Abu Saleh Mohammad Mosa, PhD
Director, Research Informatics
Institute for Clinical and Translational Science
University of Missouri

_______________________________________________
Gpc-dev mailing list
Gpc-dev@listserv.kumc.edu
http://listserv.kumc.edu/mailman/listinfo/gpc-dev

RE: Gather 2015 4th quarterly QA results

Reply via email to