Re: RFR 8177552: Compact Number Formatting support

Nishit Jain Mon, 26 Nov 2018 23:14:05 -0800

Hi Naoto,

On 26-11-2018 21:01, naoto.s...@oracle.com wrote:

Hi Nishit,
On 11/26/18 12:41 AM, Nishit Jain wrote:
Hi Naoto,
To add to my previous mail comment, the DecimalFormat spec also saysthat
/*"DecimalFormat can be instructed to format and parse scientificnotation only via a pattern; there is currently no factory methodthat creates a scientific notation format. In a pattern, the exponentcharacter immediately followed by one or more digit charactersindicates scientific notation. "
*/That is, exponent formatting and parsing is instructed only via ascientific notation pattern and I think should not be there with*general number* formatting.
I am not sure the quoted sentence should be interpreted that way. Myunderstanding is that the section means there is no publicNumberFormat.getScientificInstance() method (cf. line 601 atNumberFormat.java), so that users will have to use 'E' in theirpattern string.
Anyway, my point is that if you prefer to treat the scientificnotation differently between DecimalFormat and CompactDecimalFormat,then it will need to be clarified in the spec. Personally I agree thatit is not practical to interpret E in the CNF.

OK. If it is better to specify the parsing behavior w.r.t. theexponential numbers, I have added a statement in the parse() API.

*/"CompactNumberFormat parse does not allow parsing exponential numberstrings. For example, parsing a string "1.05E4K" in US locale breaks atcharacter 'E' and returns 1.05."/*


Updated the webrev
http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.03/

Will also update the CSR and refinalize it.

Regards,
Nishit Jain

Naoto
Updated webrev based on the other comments

http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.02/

 > Some more comments (all in CompactNumberFormat.java)
> line 807: expandAffix() seems to treat localizable special patterncharacters, but currently the implementation only cares for the minussign. Should other localizable pattern chars be taken care of, suchas percent sign?- Other special characters like '%' percent sign are not allowed asper CNF compact pattern spec
 > line 869, 888: Define what -1 means as a ret value.
- OK.
> line 897: iterMultiplier be better all capitalized as it is aconstant. And it could be statically defined in the class to beshared with other locations that use "10" for arithmetic operation.
- OK, made it static final and renamed it as RANGE_MULTIPLIER

 > line 1531: Any possibility this could lead to divide-by-zero?
- None which I am aware of, unless you are pointing to the issue likeJDK-8211161, which we know is not an issue.
Regards,
Nishit Jain
On 23-11-2018 15:55, Nishit Jain wrote:
Hi Naoto,
> I think DecimalFormat and CNF should behave the same, ie. 'E'should be treated as the exponent without a quote.
Personally I don't think that the exponential parsing should besupported by CompactNumberFormat, because the objective of compactnumbers is to represent numbers in short form. So, parsing of numberformat like "1.05E4K" should not be expected fromCompactNumberFormat, I am even doubtful that such forms ("1.05E4K")are used anywhere where exponential and compact form are togetherused. If formatting and parsing of exponential numbers are needed itshould be done by DecimalFormat scientific instance *not *with thegeneral number instance.So, I don't think that we should allowparsing of exponential numbers.Comments welcome.
Regards,
Nishit Jain
On 22-11-2018 02:02, naoto.s...@oracle.com wrote:
Hi Nishit,

On 11/21/18 12:53 AM, Nishit Jain wrote:
Hi Naoto,

Updated the webrev based on suggestions

http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.01/

Changes made:
- Replaced List<String> with String[] to be added to the theresource bundles
Good.
- refactored DecimalFormat.subparse() to be used by theCNF.parse(), to reduce code duplication.
I presume CNF is calling package-private methods in DF to share thesame code. Some comments noting the sharing would be helpful.
- Also updated it with other changes as suggested in the comments
Sorry I missed your question the last time:
>>> Do you think this is an issue with DecimalFormat.parse() and CNF
>>> should avoid parsing exponential numbers? Or, shouldCNF.parse() be>>> modified to be consistent with DecimalFormat.parse() in thisaspect?
I think DecimalFormat and CNF should behave the same, ie. 'E'should be treated as the exponent without a quote.
Some more comments (all in CompactNumberFormat.java)
line 807: expandAffix() seems to treat localizable special patterncharacters, but currently the implementation only cares for theminus sign. Should other localizable pattern chars be taken careof, such as percent sign?
line 869, 888: Define what -1 means as a ret value.
line 897: iterMultiplier be better all capitalized as it is aconstant. And it could be statically defined in the class to beshared with other locations that use "10" for arithmetic operation.
line 1531: Any possibility this could lead to divide-by-zero?

Naoto
Regards,
Nishit Jain
On 20-11-2018 00:33, naoto.s...@oracle.com wrote:
Hi Nishit,

On 11/18/18 10:29 PM, Nishit Jain wrote:
Hi Naoto,

Please check my comments inline.

On 17-11-2018 04:52, naoto.s...@oracle.com wrote:
Hi Nishit,

Here are my comments:
- CLDRConverter: As the compact pattern no more employsList<String>, can we eliminate stringListEntry/Element, and useArray equivalent instead?
Since the CNF design does not put any limit on the size ofcompact pattern, so at the time of parsing the CLDR xmls usingSAX parser, it becomes difficult to identify the size of arraywhen the parent element of compact pattern is encountered, so Ithink it is better to keep the List<String> while extracting theresources.
OK. However I'd not keep the List<String> format on generatingthe resource bundle, as there is no reason to introduce yetanother bundle format other than the existing array of String.
- CompactNumberFormat.java

Multiple locations: Use StringBuilder instead of StringBuffer.
OK
line 268: The link points toNumberFormat.getNumberInstance(Locale) instead of DecimalFormat
OK. Changed it at line 165 also.
line 855: no need to do toString(). length() can detect whetherit's empty or not.
line 884: "Overloaded method" reads odd here. I'd preferspecializing in the "given number" into either long or biginteger.
OK
line 1500: subparseNumber() pretty much shares the same codewith DecimalFormat.subparse(). can they be merged?
The existing CNF.subParseNumber differs in the wayparseIntegerOnly is handled, DecimalFormat.parse()/subparse()behaviour is unpredictable with parseIntegeronly = true whenmultipliers are involved (Please see JDK-8199223).
Also, I had thought that the CNF.parse()/subparseNumber() should*not *parse the exponential notation e.g. while parsing"1.05E4K" the parsing should break at 'E' and returns 1.05,because 'E' should be considered as unparseable character forgeneral number format pattern or compact number pattern, butthis is not the case with DecimalFormat.parse(). The belowDecimalFormat general number format instance
NumberFormat nf =  NumberFormat.getNumberInstance();
nf.parse("1.05E4")
Successfully parse the string and returns 10500. The samebehaviour is there with other DecimalFormat instances also e.g.currency instance.
Do you think this is an issue with DecimalFormat.parse() and CNFshould avoid parsing exponential numbers? Or, should CNF.parse()be modified to be consistent with DecimalFormat.parse() in thisaspect?
No, I understand there are differences. But I see a lot ofduplicated piece of code which I would like to eliminate.
line 1913-1923, 1950-1960, 1987-1997, 2024-2034: It simplycalls super. No need to override them.
Since setters are overridden, I think that it is better tooverride getters also (even if they are just calling super andhave same javadoc) to keep them at same level. But, if you seeno point in keeping them in CNF, I will remove them. Does thatneed CSR change?
I don't see any point for override. I don't think there needs aCSR, but better ask Joe about it.
line 2231: You need to test the type before cast. OtherwiseClassCastException may be thrown.
The type is checked in the superclass equals method getClass()!= obj.getClass(), so I think there is no need to check the typehere.
OK.

Naoto
Regards,
Nishit Jain
Naoto

On 11/16/18 9:54 AM, Nishit Jain wrote:
Hi,
Please review this non trivial feature addition toNumberFormat API.
The existing NumberFormat API provides locale based supportfor formatting and parsing numbers which includes formattingdecimal, percent, currency etc, but the support for formattinga number into a human readable or compact form is missing.This RFE adds that feature to format a decimal number in acompact format (e.g. 1000 -> 1K, 1000000 -> 1M in en_USlocale) , which is useful for the environment where displayspace is limited, so that the formatted string can bedisplayed in that limited space. It is defined by LDML'sspecification for Compact Number Formats.
http://unicode.org/reports/tr35/tr35-numbers.html#Compact_Number_Formats
RFE: https://bugs.openjdk.java.net/browse/JDK-8177552
Webrev:http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.00/
CSR: https://bugs.openjdk.java.net/browse/JDK-8188147

Request to please help review the the change.

Regards,
Nishit Jain

Re: RFR 8177552: Compact Number Formatting support

Reply via email to