I upload a *"Person.csv"* file to my GAE application via bulkloader, which 
looks like this;

Name,Age,Gender
Sercan,22,M
Tabitha,21,F

once I upload the table, I can properly see it in datastore viewer. But 
then when I want to download it back, again via bulkloader, file comes back 
wit corrupt characters in it like this;

<https://lh3.googleusercontent.com/-wxAsP6Soz44/UiXzN_z-TMI/AAAAAAAAAFw/hqdyxiFE6Ww/s1600/Ads%C4%B1z.png>

cheezy eh?

So first I thought application kinda encripted it, but I deleted my Person 
entity form datastore and tried to upload the one I just download (the one 
looks corrupted). Well, it was unable to upload back and gave me an utf-8 
error. Here is my console output.

Name of your application: banamsgbirak
Name of the table to upload: Person
Your gmail adress: sercan.altundas
()
05:39 PM Uploading data records.
[INFO    ] Logging to bulkloader-log-20130903.173928
[INFO    ] Throttling transfers:
[INFO    ] Bandwidth: 250000 bytes/second
[INFO    ] HTTP connections: 8/second
[INFO    ] Entities inserted/fetched/modified: 20/second
[INFO    ] Batch Size: 10
[INFO    ] Opening database: bulkloader-progress-20130903.173928.sql3
[INFO    ] Connecting to banamsgbirak.appspot.com/_ah/remote_api
[INFO    ] Starting import; maximum 10 entities per post
[ERROR   ] [Thread-12] DataSourceThread:
Traceback (most recent call last):
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\tools\bulkloader.py", line 
1601, in run self.PerformWork()
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\tools\bulkloader.py", line 
1720, in PerformWork for item in content_gen.Batches():
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\tools\bulkloader.py", line 
556, in Batches self._ReadRows(key_start, key_end)
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\tools\bulkloader.py", line 
466, in _ReadRows row = self.reader.next()
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\ext\bulkload\csv_connector.py", 
line 218, in generate_import_record for input_dict in self.dict_generator:
  File "C:\Python27\lib\csv.py", line 103, in next self.fieldnames
  File "C:\Python27\lib\csv.py", line 90, in fieldnames self._fieldnames = 
self.reader.next()
  File "C:\Program Files 
(x86)\Google\google_appengine\google\appengine\ext\bulkload\csv_connector.py", 
line 54, in utf8_recoder for line in codecs.getreader(encoding)(stream):
  File "C:\Python27\lib\codecs.py", line 615, in next line = self.readline()
  File "C:\Python27\lib\codecs.py", line 530, in readline data = 
self.read(readsize, firstline=True)
  File "C:\Python27\lib\codecs.py", line 477, in read newchars, 
decodedbytes = self.decode(data, self.errors)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xfc in position 0: 
invalid start byte [INFO    ] An error occurred. Shutting down...
[ERROR   ] Error in data source thread: 'utf8' codec can't decode byte 0xfc 
in position 0: invalid start byte
[INFO    ] 0 entities total, 0 previously transferred
[INFO    ] 0 entities (2590 bytes) transferred in 2.6 seconds
[INFO    ] Some entities not successfully transferred
Press ENTER to exit...


I looked for appcfg.py options but could not find any about charset, so now 
I am setting a trap to catch your ideas? What sould I do about this?

-- 
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to google-appengine+unsubscr...@googlegroups.com.
To post to this group, send email to google-appengine@googlegroups.com.
Visit this group at http://groups.google.com/group/google-appengine.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to