In regards to PMC, you have to build solr core for pmc (check if into 
your solr instance there is pmc core configurations). Download the csv 
file from pubmed ftp://ftp.ncbi.nlm.nih.gov/pub/pmc/PMC-ids.csv.gz
After that try to run this script (you need a basic experience with the 
shell script and the curl tool):

split -d -l 50000 <file.pmc.csv> split_files.
for file in `ls split_files.*`
do
curl 
'http://<host:port>/solr/pmc/update?update.contentType=text/csv&fieldnames=,,,,,,,DOI,PMCID,PMID,,'
 
-H 'Content-type:text/csv; charset=utf-8' --data-binary @$file;
done

The script split the csv file in n files to avoid problem due to the max 
upload limit on your server and send it to your solr to ingest it. Tune 
it with your need. Good luck :-)

After that launch the main script at 
org.dspace.app.cris.pmc.script.RetrieveCitationInPMC with "-x 0" 
parameter, this query pmc to retrieve the actual count and citation 
informations.
After all launch org.dspace.app.cris.pmc.script.RetrievePubMedID (with 
no params) to retrieve pmc/pmid identifier for items that have a doi.

Hope this help. Let me know.

Luigi Andrea Pascarelli

Il 07/07/2014 11:50, Andrea Bollini ha scritto:
> Hi Alexander,
> I'm out-of-office these days I'm sure that Luigi will reply with information 
> about the script need to update the pmc count.
> There are two script one is able to retrieve pmc/pmid identifier for items 
> that have a doi, the other query pmc to retrieve the actual count and 
> citation information.
>
> About the statistics you need to look to the solr log files. When you hit a 
> researcher page do you see an insert on  solr (call to the /update handler?). 
> What if you hit an item page?
>
> Andrea
>
>

-- 
Cineca - Sede di Roma
via dei Tizii 6 - 00185 Roma

ph. +39 06 44486626
email: l.pascare...@cineca.it


------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to