Re: [dspace-tech] check pdf files

2017-07-11 Thread mariafernanda . ruiz . cepal
Thanks for your answer, Emilio.

I will check dspace-tech for more information.

Cheers!!!





-- 
You received this message because you are subscribed to the Google Groups 
"DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to dspace-tech+unsubscr...@googlegroups.com.
To post to this group, send email to dspace-tech@googlegroups.com.
Visit this group at https://groups.google.com/group/dspace-tech.
For more options, visit https://groups.google.com/d/optout.


Re: [dspace-tech] check pdf files

2017-07-05 Thread emilio lorenzo

Hi Fernanda,

I suppose you are referring to PDF/A (PDF/A-1, PDF/A-2, and PDF/A-3) versus 
plain PDF. Checking this is difficult because of the complexity of the ISO 
standard, as explained in 
https://en.wikipedia.org/wiki/PDF/A


Nevertheless, probably the best option for accessing files in the 
assetstore (an object contains multiple files, as you have discovered) 
is to use the Java methods in DSpace (a good starting point would be the 
curation task called Bitstream Format Profiler) or to use specific 
database queries (there are similar questions on dspace-tech about 
this identification and access).
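
As a rough alternative to the two routes above, the assetstore can also be scanned directly on disk. The sketch below is only an illustration and not a DSpace API: it assumes that PDF bitstreams can be recognised by the `%PDF-` marker at the very start of the file, and the names `is_pdf` and `collect_pdfs` are made up for this example. It copies every match into a single directory with a `.pdf` suffix so that a desktop tool can open them.

```python
import shutil
from pathlib import Path

# Marker that opens a PDF file (assumption: it sits at byte 0).
PDF_MAGIC = b"%PDF-"


def is_pdf(path: Path) -> bool:
    """Return True if the file starts with the PDF header marker."""
    with path.open("rb") as fh:
        return fh.read(len(PDF_MAGIC)) == PDF_MAGIC


def collect_pdfs(assetstore: Path, target: Path) -> list[Path]:
    """Copy every PDF found under assetstore into target, adding '.pdf'.

    The assetstore itself is only read, never modified.
    """
    target.mkdir(parents=True, exist_ok=True)
    copied = []
    # sorted() builds the full file list before anything is copied.
    for path in sorted(assetstore.rglob("*")):
        if path.is_file() and is_pdf(path):
            dest = target / (path.name + ".pdf")
            shutil.copy2(path, dest)
            copied.append(dest)
    return copied
```

A call such as `collect_pdfs(Path("/dspace/assetstore"), Path("/tmp/pdf-check"))` would be one way to use it; both paths are placeholders, and the target should be outside the assetstore. Files whose PDF header does not begin at the first byte would be missed by this strict check.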


Once you can access the files, you will need a validator to test their 
compliance (see 
https://softwarerecs.stackexchange.com/questions/18161/is-there-free-or-open-source-software-for-checking-pdf-a-compliance)
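
Before running a full validator, a quick pre-filter can separate files that claim to be PDF/A from those that do not. PDF/A files declare their part in the XMP metadata through the `pdfaid:part` property. The sketch below only reads that declaration, so it reports what a file claims and says nothing about actual compliance. The function name `claimed_pdfa_part` is hypothetical, and the check assumes the XMP packet is stored uncompressed in an ASCII-compatible encoding.

```python
import re
from pathlib import Path
from typing import Optional

# pdfaid:part can appear as an element (<pdfaid:part>1</pdfaid:part>)
# or as an attribute (pdfaid:part="1") in the XMP packet.
PDFA_PART = re.compile(rb"pdfaid:part(?:>\s*|\s*=\s*[\"'])(\d)")


def claimed_pdfa_part(path: Path) -> Optional[int]:
    """Return the PDF/A part the file declares (1, 2, 3, ...) or None.

    This is a heuristic on the declared metadata, not a validation.
    The whole file is read into memory, which is fine for a sketch.
    """
    match = PDFA_PART.search(path.read_bytes())
    return int(match.group(1)) if match else None
```

Files for which this returns `None` are either plain PDFs or carry metadata the simple pattern cannot see; files that return a number still need a real validator to confirm the claim.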


Good luck,

Emilio



[dspace-tech] check pdf files

2017-07-04 Thread mariafernanda . ruiz . cepal
Hi everybody

I need to check which files (bitstreams) are PDF and which are PDF/A. Adobe 
Professional can report what kind of PDF a file is, but for that I need to put 
all the files to check under one directory.

I know that all the files (PDF and others) are stored in nested directories 
under the /assetstore folder, and that the directories are derived from the 
name of the file, for example:

A bitstream with the internal ID 12345678901234567890123456789012345678 is 
stored in the directory

[dspace]/assetstore/12/34/56/12345678901234567890123456789012345678
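
The layout in this example can be written down as a small helper. This is only a sketch of the pattern shown above (three directory levels, each taken from two consecutive characters of the internal ID); the function name `bitstream_path` is made up for illustration and is not part of DSpace.

```python
from pathlib import PurePosixPath


def bitstream_path(assetstore_dir: str, internal_id: str) -> PurePosixPath:
    """Build the file path for a bitstream from its internal ID."""
    if len(internal_id) < 6:
        raise ValueError("internal ID is too short to derive the directories")
    # Characters 1-2, 3-4 and 5-6 of the ID name the three directory levels.
    levels = [internal_id[i:i + 2] for i in (0, 2, 4)]
    return PurePosixPath(assetstore_dir, *levels, internal_id)


print(bitstream_path("[dspace]/assetstore",
                     "12345678901234567890123456789012345678"))
# prints [dspace]/assetstore/12/34/56/12345678901234567890123456789012345678
```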

My question is: how can I directly access the bitstreams that are PDF files? 
If I open the bitstream 12345678901234567890123456789012345678, I only see 
plain text.

Thanks for your help!!

Fernanda
