I am quite new to Lucene, but am trying to learn quickly because I am
trying to convert a dictionary that is stored in a Lucene database
(several, in fact) into StarDict format, so that I can ultimately import it
into Dictionary.app on OS X.
The dictionary in question has a Java front-end which doesn't seem to get
past the startup screen.
Looking into the databases with the tool Luke, I can see essentially one
main database with 88455 documents (which I think is the number of words in
the dictionary) and 936811 terms. This database has 3 populated fields:
WORD, ORDER, and docid.
There are two other databases, one of which also has 88455 documents, with
fields FILE and docid.
My guess is that the docid values correspond between these databases, and
that the contents of the FILE field are the definitions.
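
To make that guess concrete: if the two sets of fields were exported into
maps keyed by docid, pairing each word with its definition would be a simple
lookup. A minimal Java sketch, assuming docid really is the shared key; the
class DocidJoin, the method join and the placeholder values are made up for
illustration, and nothing here uses the Lucene API:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch: pair each headword with its (still encoded) FILE value via the shared docid.
class DocidJoin {

    // wordByDocid: docid -> WORD, as exported from the first index.
    // fileByDocid: docid -> FILE, as exported from the second index.
    static Map<String, String> join(Map<String, String> wordByDocid,
                                    Map<String, String> fileByDocid) {
        Map<String, String> fileByWord = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : wordByDocid.entrySet()) {
            String file = fileByDocid.get(entry.getKey());
            if (file != null) {
                // Only keep docids that appear in both indexes.
                fileByWord.put(entry.getValue(), file);
            }
        }
        return fileByWord;
    }

    public static void main(String[] args) {
        // Placeholder values only, not real entries from the dictionary.
        Map<String, String> words = new LinkedHashMap<>();
        words.put("1", "word-1");
        words.put("2", "word-2");
        Map<String, String> files = new LinkedHashMap<>();
        files.put("1", "encoded-1");
        System.out.println(join(words, files));
    }
}
```
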
The problem is that the contents of this field are just garbled text.
Is there any obvious compression technique which might have been used to
store this? The dictionary entries should contain just text (and, I
believe, some formatting, which might be in HTML).
The dictionary is Spanish, so some of the custom Java code has Spanish
variable names, making it harder to see what's going on. There are some
Java files for crypto, although they look like a premade Java crypto package.
The text in the files looks like this:

*8G04)B0e0000000000ortpTAnRG3KU3rXB8CC1qoOfRCN8JYF7LIgcTUwlCfQjnPCL5rOKMGybEhe5
*8GXuLvt6ESsnc7OJJCMlIE7vAPMy(xbim2Ic8lI0TT0)T3TTp)dGfktY1LLdFzg4k8K8(U6)EDRrmk
*8GYkkR0sj8qRidSbi5LgOJ2Nnj2AnlnpfGBIYnDomJCt(UyAy2Ly7BN8qKzPg6MExX8qAOU9XfbReI
*8G3p65Cc866TeeNtIW8QwfsMR6TZQTe3Xq6nBu5SvEUUU)bt2CcfK8bB5IoS05mrKYyuld1xsYdy7Q
*8GauqKMZ6Sh1S0uU9XqmfT90lO8kAp3RK0dNczm2Kx(ch(LNZq3gUxdPfkdv8iNdlSYzV8lNaqqjOR
*8G5JLY06yzQCYW8ZBRQ9nvoceOy6kwpHvpKFQN2co9PR8JV3M4YKJBd(J0KhGuhI1GuQNi3GGSkRRY
*8G6DxTOeOTYeGM5Y7q1XZJQeIJGtAqiFVKPLHWBAocfanQ02UehDbpwPg8nN0CrYWxHS8)Lxd72jhv
*8GdwuWnmgRXHWne)LMOK35RpWurEMh0detASuPYgdMVNGO9KqT4)D8kIT7vqT9mlIcPylV5WYSuVjx
*8GeLBy7Ofb7TeeKspjTkI5d0fsidiPUK9DQHLlN2vVLLIABbAuM1FGBoC5j3wBEPOL4wcTaYbg9A4L
*8G9AibxcClXPs0rGuIgjuWMh8DU20ckjsIGSLBIJVHLukJfRlduDDw5CU6r)SsQBGXNuMMiIvL8gOV
*8GABZdZ8AwuDZBYNlGCinBcmaJxDLvQYLClWnTP3Ksu7)V0lnh87HQCVZcVhDPmUkpwLSZBUbMhswY
*8GhRT3WJ0LMHihWUU3fhQnmQyLQMbvbEeQKo10ns0paPZ5H6QRSdMgb7IJq)fji8njIQ7vLtWQUb70
*8GCfy2h6dgIMEtOEi61FSaZtp9r0Rk2yBdB(Sp(Ta5V9xTrNdgVM(rONyrFNkeytX7s6ht9MiyS(by
*8GjZ79g3WNPusjwuxYNBq(We(J0Wqrh)kvxWfT2nRnFcZEnFMbYaog(lO6kKhOK2ytKJDHZLprOEnw
*8GkCbQ(QhE()iezUjjneoFD2X2b2IEqCb7jB75JwbQsUyseRRRsaxZuL9wWiVaMoequbC0a59LEjwR
*8GF5TFO9hOhUDgN)cnpFNbnjGBjTRFJtC)xl0sRzx6x2O(f3TTpnsu23LFEQndbnTzZqK)tugalR5h
*8GmaL4HBtqynz1sAzLJPwMSA4p(1NPAG(BbpXBDCDPWZC6sUryYl5PPI1IWJGiaKZc8zF8WOtkxpNm
*8GHEP)81dxuscQ00s5yxfZBd3Us3sHZ796XvrwU(qF0T)qwZpU5FtQSe8s3X)ubUe6rF)Fif1)xADL
*0dI)(gRIa0k00003aN
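
One check I can do without knowing the scheme is to count the distinct
characters in the field. In the sample above every line starts with '*' and
the rest appears to use only letters, digits, '(' and ')'. If the full field
never uses more than 64 distinct characters, that would hint at a
base64-like text encoding of binary data (which could itself be compressed
or encrypted) rather than a compression format as such. A minimal Java
sketch of that check; the class AlphabetCheck and the helper distinctChars
are made up for illustration:

```java
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

// Sketch: list which characters the encoded lines actually use.
class AlphabetCheck {

    // Collect the distinct characters across all lines,
    // skipping a leading '*' marker on each line if present.
    static TreeSet<Character> distinctChars(List<String> lines) {
        TreeSet<Character> seen = new TreeSet<>();
        for (String line : lines) {
            String payload = line.startsWith("*") ? line.substring(1) : line;
            for (char c : payload.toCharArray()) {
                seen.add(c);
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        // Two lines copied from the sample; a real check would read the whole FILE field.
        List<String> sample = Arrays.asList(
            "*8G04)B0e0000000000ortpTAnRG3KU3rXB8CC1qoOfRCN8JYF7LIgcTUwlCfQjnPCL5rOKMGybEhe5",
            "*0dI)(gRIa0k00003aN");
        TreeSet<Character> alphabet = distinctChars(sample);
        System.out.println(alphabet.size() + " distinct characters: " + alphabet);
    }
}
```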

http://www.nabble.com/file/p21420932/Fich2 Fich2 
Any help would be really appreciated. This is purchased software that ought
to work, but I have turned it into a project rather than sending it back.
-- 
View this message in context: 
http://www.nabble.com/stuck-with-Encoded-%28possibly-%29-Database-entries-tp21420932p21420932.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
