I am quite new to lucene, but am trying to learn quite quickly because: I am trying to convert a dictionary which has been stored in a lucene database (several in fact) into Stardict format so that I can ultimately import it into Dictionary.app in OS X The dictionary in question has a java front-end which doesn't seem to get past the startup screen. Using the tool Luke to look into the databases, there is basically one database with 88455 documents (which i think is the number of words in the dictionary) and 936811 terms: this database has 3 populated fields - WORD, ORDER, and docid There are two other databases, one of which also has 88455 documents with fields FILE and docid My guess is that the docid corresponds between these databases, and the contents of the FILE field is the definition. the problem is that the contents of this field is just garbled text. is there any obvious compression technique which might have been used to store this? obviously, the dictionary entires would just have text (and I believe some formatting, which might be in html) the dictionary is spanish, so some of the custom java code has spanish variable names, making it harder to see what's going on, but there are some java files for crypto, although it looks like a premade java crypto package. The text in the files looks like this:
*8G04)B0e0000000000ortpTAnRG3KU3rXB8CC1qoOfRCN8JYF7LIgcTUwlCfQjnPCL5rOKMGybEhe5 *8GXuLvt6ESsnc7OJJCMlIE7vAPMy(xbim2Ic8lI0TT0)T3TTp)dGfktY1LLdFzg4k8K8(U6)EDRrmk *8GYkkR0sj8qRidSbi5LgOJ2Nnj2AnlnpfGBIYnDomJCt(UyAy2Ly7BN8qKzPg6MExX8qAOU9XfbReI *8G3p65Cc866TeeNtIW8QwfsMR6TZQTe3Xq6nBu5SvEUUU)bt2CcfK8bB5IoS05mrKYyuld1xsYdy7Q *8GauqKMZ6Sh1S0uU9XqmfT90lO8kAp3RK0dNczm2Kx(ch(LNZq3gUxdPfkdv8iNdlSYzV8lNaqqjOR *8G5JLY06yzQCYW8ZBRQ9nvoceOy6kwpHvpKFQN2co9PR8JV3M4YKJBd(J0KhGuhI1GuQNi3GGSkRRY *8G6DxTOeOTYeGM5Y7q1XZJQeIJGtAqiFVKPLHWBAocfanQ02UehDbpwPg8nN0CrYWxHS8)Lxd72jhv *8GdwuWnmgRXHWne)LMOK35RpWurEMh0detASuPYgdMVNGO9KqT4)D8kIT7vqT9mlIcPylV5WYSuVjx *8GeLBy7Ofb7TeeKspjTkI5d0fsidiPUK9DQHLlN2vVLLIABbAuM1FGBoC5j3wBEPOL4wcTaYbg9A4L *8G9AibxcClXPs0rGuIgjuWMh8DU20ckjsIGSLBIJVHLukJfRlduDDw5CU6r)SsQBGXNuMMiIvL8gOV *8GABZdZ8AwuDZBYNlGCinBcmaJxDLvQYLClWnTP3Ksu7)V0lnh87HQCVZcVhDPmUkpwLSZBUbMhswY *8GhRT3WJ0LMHihWUU3fhQnmQyLQMbvbEeQKo10ns0paPZ5H6QRSdMgb7IJq)fji8njIQ7vLtWQUb70 *8GCfy2h6dgIMEtOEi61FSaZtp9r0Rk2yBdB(Sp(Ta5V9xTrNdgVM(rONyrFNkeytX7s6ht9MiyS(by *8GjZ79g3WNPusjwuxYNBq(We(J0Wqrh)kvxWfT2nRnFcZEnFMbYaog(lO6kKhOK2ytKJDHZLprOEnw *8GkCbQ(QhE()iezUjjneoFD2X2b2IEqCb7jB75JwbQsUyseRRRsaxZuL9wWiVaMoequbC0a59LEjwR *8GF5TFO9hOhUDgN)cnpFNbnjGBjTRFJtC)xl0sRzx6x2O(f3TTpnsu23LFEQndbnTzZqK)tugalR5h *8GmaL4HBtqynz1sAzLJPwMSA4p(1NPAG(BbpXBDCDPWZC6sUryYl5PPI1IWJGiaKZc8zF8WOtkxpNm *8GHEP)81dxuscQ00s5yxfZBd3Us3sHZ796XvrwU(qF0T)qwZpU5FtQSe8s3X)ubUe6rF)Fif1)xADL *0dI)(gRIa0k00003aN http://www.nabble.com/file/p21420932/Fich2 Fich2 Any help would be really appreciated - this is purchased software that ought to work, but i have turned it into a project rather than sending it back. -- View this message in context: http://www.nabble.com/stuck-with-Encoded-%28possibly-%29-Database-entries-tp21420932p21420932.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.