Hi there, I have been reading with interest the docs on processing Excel documents. In particular, I am interested in text extraction, for which I am currently using Aperture 1.6.0, which includes POI 3.8-beta5.
I would like to know how to fine tune the process because in addition to expected text (column titles, workbook names, etc.), it also extracts a lot of floating point numbers which bear no visible relation to the numbers in the cells. I can only think that perhaps they are dates (or similar) in an internal format? Anyway, for my application they are unwanted and need to be suppressed somehow. Could someone kindly point me in the right direction?

Thanks,
- Chris

Chris Bamford
Senior Developer
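P.S. To make the question concrete, here is a minimal sketch of what I suspect is happening and of the crude workaround I could fall back on. It uses only the standard Java library, not POI or Aperture. The class and method names (ExcelTextCleanup, serialToDate, stripFloats) are hypothetical, made up for illustration. It assumes the workbook uses Excel's default 1900 date system, where a date is stored as a day count, and it assumes the unwanted values appear in the extracted text as standalone decimal tokens.

```java
import java.time.LocalDate;
import java.util.regex.Pattern;

// Hypothetical helper, not part of POI or Aperture.
class ExcelTextCleanup {

    // Standalone decimal token such as 40909.0 or -3.5E-4, not glued to
    // other word characters or dots (so "1.2.3" and "v1.5x" are left alone).
    private static final Pattern FLOAT_TOKEN =
            Pattern.compile("(?<![\\w.])-?\\d+\\.\\d+(?:[eE][+-]?\\d+)?(?![\\w.])");

    // Assumption: 1900 date system and serial > 60, so the day count can be
    // added to 1899-12-30. The fractional part (time of day) is dropped.
    static LocalDate serialToDate(double serial) {
        return LocalDate.of(1899, 12, 30).plusDays((long) Math.floor(serial));
    }

    // Crude post-filter: drops every standalone decimal token, then collapses
    // the gaps left behind. This also removes decimals that are wanted.
    static String stripFloats(String extracted) {
        String without = FLOAT_TOKEN.matcher(extracted).replaceAll("");
        return without.replaceAll("[ \\t]{2,}", " ").trim();
    }

    public static void main(String[] args) {
        // A value like this in the extracted text would really be a date.
        System.out.println(serialToDate(40909.0)); // prints 2012-01-01

        String sample = "Revenue 40909.0 Q1 3.14159 Total";
        System.out.println(stripFloats(sample)); // prints Revenue Q1 Total
    }
}
```

Filtering after extraction like this is a blunt instrument, since it cannot tell a stray serial date from a genuine decimal value in a cell, so I would much prefer a way to control this in the extractor itself if one exists.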
