Thanks Pete.
It worked for me.

Regards,
Anuj

-----Original Message-----
From: general-boun...@developer.marklogic.com
[mailto:general-boun...@developer.marklogic.com] On Behalf Of Pete Aven
Sent: Tuesday, August 18, 2009 6:03 PM
To: General Mark Logic Developer Discussion
Subject: RE: [MarkLogic Dev General] Problem in unzipping the MS Office
Word2007 document when loaded as binary into MarkLogic server database.

Hi Anuj,

   Open XML Extract will unzip Office 2007 documents and insert their
associated parts.
   WordprocessingML Process merges split runs of text within the
document.xml piece of an unzipped .docx (Word 2007) package.
   MS Office conversion is for converting Office 2003 documents.

Assuming you've attached the pipelines, you need to save your document
to the domain specified by the pipeline.
Assuming the Documents database, you can check the domain in the Admin UI
by navigating to: Databases -> Documents -> Content Processing -> Domains

The default domain is "/". That means if you have a document,
'foo.docx', you need to save it as '/foo.docx' in MarkLogic for the
pipeline to work.

.docx packages are extracted into a folder named for the original
package.  So when you save "/foo.docx", it will be extracted to a
directory, "/foo_docx_parts/".
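
As a rough illustration of the two points above (the URI must fall under
the domain root, and the parts land in a directory named after the
package), here is a minimal Python sketch. It is not MarkLogic code: the
helper names in_domain and parts_directory are hypothetical, and the
naming rule is inferred only from the single "/foo.docx" example, so
treat it as an assumption for any other file name.

```python
import posixpath


def in_domain(uri: str, domain_root: str = "/") -> bool:
    """Return True if the URI sits under the domain root (assumed to be a directory scope)."""
    return uri.startswith(domain_root)


def parts_directory(uri: str) -> str:
    """Guess the extraction directory for a package URI.

    Assumption: the rule generalizes from the one example in this thread,
    "/foo.docx" -> "/foo_docx_parts/".
    """
    stem, ext = posixpath.splitext(uri)
    if not ext:
        raise ValueError(f"URI has no file extension: {uri!r}")
    return f"{stem}_{ext.lstrip('.')}_parts/"


if __name__ == "__main__":
    for uri in ("foo.docx", "/foo.docx"):
        if in_domain(uri):
            print(uri, "-> parts expected under", parts_directory(uri))
        else:
            print(uri, "is outside the default domain; the pipeline will not run")
```

With the default domain "/", a document saved as 'foo.docx' fails the
check, while '/foo.docx' passes and maps to "/foo_docx_parts/".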


Hope this helps,
Pete

________________________________________
From: general-boun...@developer.marklogic.com
[general-boun...@developer.marklogic.com] On Behalf Of
anuj.kum...@cognizant.com [anuj.kum...@cognizant.com]
Sent: Tuesday, August 18, 2009 12:40 AM
To: general@developer.marklogic.com
Subject: [MarkLogic Dev General] Problem in unzipping the MS Office Word
2007 document when loaded as binary into MarkLogic server database.

Hi All,
I want to load a Word 2007 document into a MarkLogic Server database and
have the server unzip it and load the document parts into the database.
I have attached the following pipelines to my database:

  WordprocessingML Process
  <http://localhost:8001/cpf-pipeline-admin.xqy?section=database&database=17593846791238263848&pipeline=2946334788674590107>
  MS Office Conversion
  <http://localhost:8001/cpf-pipeline-admin.xqy?section=database&database=17593846791238263848&pipeline=13606748502382551960>
  Office OpenXML Extract
  <http://localhost:8001/cpf-pipeline-admin.xqy?section=database&database=17593846791238263848&pipeline=14999971590861047845>

However, the document is still not unzipped and its parts are not loaded
into the database. Please help.
Thanks in advance.

Regards,
Anuj Kumar
Cognizant,Kolkata
Vnet:306409

This e-mail and any files transmitted with it are for the sole use of
the intended recipient(s) and may contain confidential and privileged
information.
If you are not the intended recipient, please contact the sender by
reply e-mail and destroy all copies of the original message.
Any unauthorised review, use, disclosure, dissemination, forwarding,
printing or copying of this email or any action taken in reliance on
this e-mail is strictly
prohibited and may be unlawful.

_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
