[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-18 Thread theone
There is not any error in the logs. I think it is because of protection. The website use https connection. Is there anyway to pass over it? On 18 Şubat, 19:45, Ernesto Karim Oltra wrote: > Perhaps the page has been protected so you need UserAgent headers, > etc. Anyway, check the Logs section in

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-18 Thread Ernesto Karim Oltra
Perhaps the page has been protected so you need UserAgent headers, etc. Anyway, check the Logs section in Dashboard to check for errors. On 17 feb, 23:18, theone wrote: > Actually for my case using google docs api very complicated and tiring > because I just want to get the content to make search

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-17 Thread theone
Actually for my case using google docs api very complicated and tiring because I just want to get the content to make search in files. I won't make any modification on the document. I think that using an external service like https://secure.convert-doc.com/converters/doc-to-txt.html might be approp

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-17 Thread Ernesto Karim Oltra
Taskqueues, that's what I'm using now. To update a word document of about ten pages, it takes more or less 1500ms average (at least for me =) ). And this have a big advantage, you can take control of retries if docs is temporaly unavailable. Another useful option would be to store a cache in datas

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-16 Thread Calvin
I think he means that importing and retrieving the converted document using the gdata api may not always be possible within the 30 second limit of a user-facing app engine request. If that's the case it would be a good idea to do the conversion using a task queue, which has a much higher limit.

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-16 Thread theone
What I want to do is just get all content of the document and save it with some other form values such as name, surname, birthdate etc. We can do that for docx format because their format is zip as you said. import zipfile, re docx = zipfile.ZipFile('/path/to/file/mydocument.docx') content = docx.

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-16 Thread Ernesto Karim Oltra
Well, maybe it's not a viable solution, but Word document is a ZIP file with a lot of plain text files inside, so if you know the format, you can store the words the document contains. The option I will implement would be sending the document, storing it in datastore, enqueue a task to re-send it

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-16 Thread theone
Do you mean that I cannot convert and store word document contents with google docs api in a cheap way? On 16 Şubat, 22:00, "Brandon Wirtz" wrote: > You would need to convert the word doc to text so that it is searchable.  It > is not an issues of Storing, but conversion.  I believe there is enou

[google-appengine] Re: Storing word document contents in datastore(Python)

2011-02-16 Thread Calvin
Maybe you could have users import their documents into Google Docs and then read them from there using the gdata api? http://code.google.com/apis/documents/docs/1.0/developers_guide_python.html -- You received this message because you are subscribed to the Google Groups "Google App Engine" gro