On Thu, Oct 24, 2013 at 9:16 AM, Smith, Ina <[email protected]> wrote:
> We would very much like to demonstrate the impact of our repository, and 
> would like to query the database for the following information (our 
> repository is at http://scholar.sun.ac.za) – specifically the downloads per 
> item according to:
>
> dc.type - Thesis
> dc.title
> dc.date.issued – for the years 2012, 2011, 2010, 2009, 2008, 2007, 2006, 2005
> Pdf download statistics
>
> The above needs to be exported to a spreadsheet in the end. Can somebody
> perhaps advise how we should go about it? Do we need a Java programmer, or
> what skills are required?


Hi Ina,

No, you don't need a Java programmer, though you do need a technically
skilled person. Someone with a good knowledge of SQL would do (Solr
knowledge would be preferable, but that might be rare).

You didn't mention which DSpace statistics module you're using.
Assuming you're using the Solr statistics (available since DSpace
1.6), there are two ways you can get the information.

The statistics information is stored in the "statistics" Solr core.
The item metadata is stored both in the "search" Solr core and the
DSpace SQL database (schema description here [3] for DSpace 1.8 but
completely valid for this purpose regardless of your version).

You'll have to get the data in two queries and correlate them. It's up
to you if you use Solr+SQL or Solr+Solr. This page describes how to
connect to Solr and how to query it [1].
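
As a rough illustration of querying a Solr core over HTTP, here is a
minimal Python sketch. The base URL http://localhost:8080/solr and the
helper names solr_select_url and solr_select are assumptions made for
this example; check [1] for where Solr actually lives in your
installation.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: Solr is reachable at this address; adjust to your installation.
SOLR_BASE = "http://localhost:8080/solr"


def solr_select_url(core, query, params=None):
    """Build a URL for the /select handler of one Solr core."""
    args = {"q": query, "wt": "json"}  # wt=json asks Solr for a JSON response
    if params:
        args.update(params)
    return "%s/%s/select?%s" % (SOLR_BASE, core, urlencode(args))


def solr_select(core, query, params=None):
    """Run the query and return the decoded JSON response as a dict."""
    with urlopen(solr_select_url(core, query, params)) as response:
        return json.load(response)


# Only builds the URL; nothing is sent until solr_select() is called.
print(solr_select_url("statistics", "type:0", {"rows": 10}))
```

Opening the printed URL in a browser on the server is a quick way to
check that the core answers before scripting anything further.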

To get the access events, query the "statistics" core with "type:0"
(0=bitstream, see constants in [2]); the "id" field of each result is
the bitstream_id. Then make a second query to the "search" core for
"search.resourcetype:0" to get the metadata, filtering by metadata
values as needed, e.g. "dateissued.year:2011" to filter by year issued
or "dc.type:Thesis" to filter by type. Join the two result sets by
bitstream_id (outside of Solr).
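
The join outside of Solr and the spreadsheet export could look roughly
like the Python sketch below. It assumes the download counts were
obtained by faceting the statistics query on "id" (the standard Solr
parameters facet=true, facet.field=id, rows=0), which returns a flat
list of alternating values and counts in the JSON response. The sample
data and the record keys "id", "title" and "year" are made up for
illustration; they are not actual Solr field names, so map them to
whatever your second query returns.

```python
import csv
import io


def facet_counts(flat):
    """Turn Solr's flat facet list [value, count, value, count, ...] into a dict."""
    return {str(value): int(count) for value, count in zip(flat[0::2], flat[1::2])}


def join_downloads(downloads, metadata, key="id"):
    """Attach a download count to each metadata record (0 if it has no hits)."""
    rows = []
    for record in metadata:
        row = dict(record)  # copy, so the input records stay untouched
        row["downloads"] = downloads.get(str(record[key]), 0)
        rows.append(row)
    return rows


def to_csv(rows, columns):
    """Render the selected columns as CSV text that a spreadsheet can open."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample data standing in for the two query results.
facet_list = ["101", 42, "102", 7]
metadata = [
    {"id": 101, "title": "Thesis A", "year": 2011},
    {"id": 103, "title": "Thesis B", "year": 2012},
]

report = to_csv(join_downloads(facet_counts(facet_list), metadata),
                ["title", "year", "downloads"])
print(report)
```

Records with no matching statistics entry are kept with a count of 0,
so theses that were never downloaded still show up in the spreadsheet.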

[1] https://wiki.duraspace.org/display/DSPACE/Solr
[2] https://github.com/DSpace/DSpace/blob/dspace-3.2/dspace-api/src/main/java/org/dspace/core/Constants.java#L20
[3] https://wiki.duraspace.org/display/DSDOC3x/Storage+Layer#StorageLayer-RDBMS/DatabaseStructure

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
