About: xindice-1.1b4 - commandline tool Question: How to preserve encodings across export/import ???
--------------------------------
Scenario explaining the problem
--------------------------------
Step 1.
Adding document:
xindice ad -c xmldb:xindice://localhost:8080/db/foo/ -n 36.xml -f 36.xml
on document 36.xml:
<?xml version="1.0" encoding="iso-8859-1"?> ... etc ...
This works OK
Step 2.
Retrieving document:
xindice rd -c xmldb:xindice://localhost:8080/db/foo/ -n 36.xml -f 36a.xml
results in document 36a.xml:
<?xml version="1.0"?> ... etc ...
Payload of extracted doc (36a.xml) is very much identical to payload of original document (36.xml)
Step 3.
Adding document:
xindice ad -c xmldb:xindice://localhost:8080/db/foo/ -n 36a.xml -f 36a.xml
on document 36a.xml above.
This results in error: ERROR : Invalid byte 2 of 3-byte UTF-8 sequence.
This seems like a bug. Please file bug report into the bugzilla. And if you know how to fix it, attach a patch to the bug report.
Thanks, Vadim
