xlsx somewhat recently switched to Scientific notation for long sequences of digits?

2016-06-29 Thread Allison, Timothy B.
All, On https://issues.apache.org/jira/browse/TIKA-2025, a Tika user noted that, at least for xlsx, what used to be rendered as a long sequence of digits (e.g. 340229177292566) is now being extracted as scientific notation (3.40229E+14). This new behavior mimics Excel more closely, but is the

Re: xlsx somewhat recently switched to Scientific notation for long sequences of digits?

2016-06-29 Thread Javen O'Neal
Excel and POI don't make a distinction between double/decimal and int. Does Excel make any guarantees that doubles won't have precision issues? 16-digit credit cards are not storable as 32-bit ints, but require 64-bit longs. On Jun 29, 2016 5:53 AM, "Allison, Timothy B." wrote: All, On https:/

RE: xlsx somewhat recently switched to Scientific notation for long sequences of digits?

2016-06-29 Thread Allison, Timothy B.
columns are of a certain width, so in some ways this is progress. The question: is there an easy way for us to get the old behavior? -Original Message- From: Javen O'Neal [mailto:javenon...@gmail.com] Sent: Wednesday, June 29, 2016 11:13 AM To: POI Users List Subject: Re: xlsx

RE: xlsx somewhat recently switched to Scientific notation for long sequences of digits?

2016-06-29 Thread Allison, Timothy B.
As Aeham Abushwashi pointed out on TIKA-2025, this was caused by the improvement/closer alignment to Excel's spec in org.apache.poi.ss.usermodel.ExcelGeneralNumberFormat. https://bz.apache.org/bugzilla/show_bug.cgi?id=58471 and http://svn.apache.org/viewvc?diff_format=h&view=revision&revision=