Getting junk characters while indexing

2009-04-29 Thread Koushik Mitra
Hi,

We are trying to index a .doc file. However, after indexing the dot( . ) and 
apostrophe( ' ) present in the file, getting converted to junk values.

How to resolve the issue?

Thanks,
Koushik

 CAUTION - Disclaimer *
This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely 
for the use of the addressee(s). If you are not the intended recipient, please 
notify the sender by e-mail and delete the original message. Further, you are 
not 
to copy, disclose, or distribute this e-mail or its contents to any other 
person and 
any such actions are unlawful. This e-mail may contain viruses. Infosys has 
taken 
every reasonable precaution to minimize this risk, but is not liable for any 
damage 
you may sustain as a result of any virus in this e-mail. You should carry out 
your 
own virus checks before opening the e-mail or attachment. Infosys reserves the 
right to monitor and review the content of all messages sent to or from this 
e-mail 
address. Messages sent to or from this e-mail address may be stored on the 
Infosys e-mail system.
***INFOSYS End of Disclaimer INFOSYS***


Getting incorrect value while trying to extract content from xlsx

2009-04-28 Thread Koushik Mitra
HI,

I was trying to extract content from an xlsx file for indexing.
However, I am getting julian date value for a cell with date format and '1.0' 
in place of '100%'.
I want to retain the value as present in that xlsx file.

Solution appreciated.

Thanks,
Koushik

 CAUTION - Disclaimer *
This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely 
for the use of the addressee(s). If you are not the intended recipient, please 
notify the sender by e-mail and delete the original message. Further, you are 
not 
to copy, disclose, or distribute this e-mail or its contents to any other 
person and 
any such actions are unlawful. This e-mail may contain viruses. Infosys has 
taken 
every reasonable precaution to minimize this risk, but is not liable for any 
damage 
you may sustain as a result of any virus in this e-mail. You should carry out 
your 
own virus checks before opening the e-mail or attachment. Infosys reserves the 
right to monitor and review the content of all messages sent to or from this 
e-mail 
address. Messages sent to or from this e-mail address may be stored on the 
Infosys e-mail system.
***INFOSYS End of Disclaimer INFOSYS***


Some characters are searchable

2009-04-23 Thread Koushik Mitra
Hi,

I am trying to search the following characters present through solr.

`, @, #, $, %, _ , , , .

But I am not getting any result back, even if those characters are present in 
the document .
So my question is are these characters getting indexed?

Thanks,
Koushik

 CAUTION - Disclaimer *
This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely 
for the use of the addressee(s). If you are not the intended recipient, please 
notify the sender by e-mail and delete the original message. Further, you are 
not 
to copy, disclose, or distribute this e-mail or its contents to any other 
person and 
any such actions are unlawful. This e-mail may contain viruses. Infosys has 
taken 
every reasonable precaution to minimize this risk, but is not liable for any 
damage 
you may sustain as a result of any virus in this e-mail. You should carry out 
your 
own virus checks before opening the e-mail or attachment. Infosys reserves the 
right to monitor and review the content of all messages sent to or from this 
e-mail 
address. Messages sent to or from this e-mail address may be stored on the 
Infosys e-mail system.
***INFOSYS End of Disclaimer INFOSYS***


Create incremental snapshot

2009-04-18 Thread Koushik Mitra
Hi,

We want to create snapshot incrementally.

What we want is every time the snap shooter script runs, it should not create a 
snapshot with pre-existing (last snapshot indexes) + delta (newly created 
indexes), rather just create a snapshot with the delta (newly created indexes).

Any references here would be highly appreciated.

Regards,
Koushik

 CAUTION - Disclaimer *
This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely 
for the use of the addressee(s). If you are not the intended recipient, please 
notify the sender by e-mail and delete the original message. Further, you are 
not 
to copy, disclose, or distribute this e-mail or its contents to any other 
person and 
any such actions are unlawful. This e-mail may contain viruses. Infosys has 
taken 
every reasonable precaution to minimize this risk, but is not liable for any 
damage 
you may sustain as a result of any virus in this e-mail. You should carry out 
your 
own virus checks before opening the e-mail or attachment. Infosys reserves the 
right to monitor and review the content of all messages sent to or from this 
e-mail 
address. Messages sent to or from this e-mail address may be stored on the 
Infosys e-mail system.
***INFOSYS End of Disclaimer INFOSYS***


Re: Create incremental snapshot

2009-04-18 Thread Koushik Mitra
When we run the snapshooter script, it creates a snapshot folder e.g. 
snapshot.20090418064010 and this snapshot folder contains physical index files 
which take space on the file system (as shown below). Are we missing anything 
here?

-rw-r-  46 test  test 59 Apr 17 23:26 _i.tii
-rw-r-  46 test  test507 Apr 17 23:26 _i.prx
-rw-r-  46 test  test 14 Apr 17 23:26 _i.nrm
-rw-r-  46 test  test333 Apr 17 23:26 _i.frq
-rw-r-  46 test  test135 Apr 17 23:26 _i.fnm
-rw-r-  46 test  test 12 Apr 17 23:26 _i.fdx
-rw-r-  46 test  test   1433 Apr 17 23:26 _i.fdt

Regards,
Koushik



On 18/04/09 12:17 PM, Noble Paul നോബിള്‍  नोब्ळ् noble.p...@gmail.com wrote:

the snapshooter does not really copy any files. They ar just hardlinks
(does not consume disk space) so even a full copy is not very
expensive

On Sat, Apr 18, 2009 at 12:06 PM, Koushik Mitra
koushik_mi...@infosys.com wrote:
 Hi,

 We want to create snapshot incrementally.

 What we want is every time the snap shooter script runs, it should not create 
 a snapshot with pre-existing (last snapshot indexes) + delta (newly created 
 indexes), rather just create a snapshot with the delta (newly created 
 indexes).

 Any references here would be highly appreciated.

 Regards,
 Koushik

  CAUTION - Disclaimer *
 This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely
 for the use of the addressee(s). If you are not the intended recipient, please
 notify the sender by e-mail and delete the original message. Further, you are 
 not
 to copy, disclose, or distribute this e-mail or its contents to any other 
 person and
 any such actions are unlawful. This e-mail may contain viruses. Infosys has 
 taken
 every reasonable precaution to minimize this risk, but is not liable for any 
 damage
 you may sustain as a result of any virus in this e-mail. You should carry out 
 your
 own virus checks before opening the e-mail or attachment. Infosys reserves the
 right to monitor and review the content of all messages sent to or from this 
 e-mail
 address. Messages sent to or from this e-mail address may be stored on the
 Infosys e-mail system.
 ***INFOSYS End of Disclaimer INFOSYS***




--
--Noble Paul



Re: Create incremental snapshot

2009-04-18 Thread Koushik Mitra
Ok

If these are hard links, then where does the index data get stored? Those must 
be getting stored somewhere in the file system.

Regards,
Koushik


On 18/04/09 12:35 PM, Noble Paul നോബിള്‍  नोब्ळ् noble.p...@gmail.com wrote:

yeah ,that is right but those are hardlinks

http://linux.about.com/cs/linux101/g/hardlinks.htm


On Sat, Apr 18, 2009 at 12:20 PM, Koushik Mitra
koushik_mi...@infosys.com wrote:
 When we run the snapshooter script, it creates a snapshot folder e.g. 
 snapshot.20090418064010 and this snapshot folder contains physical index 
 files which take space on the file system (as shown below). Are we missing 
 anything here?

 -rw-r-  46 test  test 59 Apr 17 23:26 _i.tii
 -rw-r-  46 test  test507 Apr 17 23:26 _i.prx
 -rw-r-  46 test  test 14 Apr 17 23:26 _i.nrm
 -rw-r-  46 test  test333 Apr 17 23:26 _i.frq
 -rw-r-  46 test  test135 Apr 17 23:26 _i.fnm
 -rw-r-  46 test  test 12 Apr 17 23:26 _i.fdx
 -rw-r-  46 test  test   1433 Apr 17 23:26 _i.fdt

 Regards,
 Koushik



 On 18/04/09 12:17 PM, Noble Paul നോബിള്‍  नोब्ळ् noble.p...@gmail.com 
 wrote:

 the snapshooter does not really copy any files. They ar just hardlinks
 (does not consume disk space) so even a full copy is not very
 expensive

 On Sat, Apr 18, 2009 at 12:06 PM, Koushik Mitra
 koushik_mi...@infosys.com wrote:
 Hi,

 We want to create snapshot incrementally.

 What we want is every time the snap shooter script runs, it should not 
 create a snapshot with pre-existing (last snapshot indexes) + delta (newly 
 created indexes), rather just create a snapshot with the delta (newly 
 created indexes).

 Any references here would be highly appreciated.

 Regards,
 Koushik

  CAUTION - Disclaimer *
 This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely
 for the use of the addressee(s). If you are not the intended recipient, 
 please
 notify the sender by e-mail and delete the original message. Further, you 
 are not
 to copy, disclose, or distribute this e-mail or its contents to any other 
 person and
 any such actions are unlawful. This e-mail may contain viruses. Infosys has 
 taken
 every reasonable precaution to minimize this risk, but is not liable for any 
 damage
 you may sustain as a result of any virus in this e-mail. You should carry 
 out your
 own virus checks before opening the e-mail or attachment. Infosys reserves 
 the
 right to monitor and review the content of all messages sent to or from this 
 e-mail
 address. Messages sent to or from this e-mail address may be stored on the
 Infosys e-mail system.
 ***INFOSYS End of Disclaimer INFOSYS***




 --
 --Noble Paul





--
--Noble Paul