[
https://issues.apache.org/jira/browse/XERCESJ-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16607094#comment-16607094
]
Yitzhak Khabinsky edited comment on XERCESJ-1701 at 9/7/18 1:25 PM:
--------------------------------------------------------------------
It shall be fixed internally in XercesJ.
I experienced the same issue with Saxon long time ago.
They changed the hashing formula and the issue was resolved.
Please see it here:
*{color:#555555}"Hash function for XSD identity constraints has poor worst-case
performance{color}{color:#333333}|{color}"*
[{color:#333333}[https://saxonica.plan.io/issues/3487]{color}]
was (Author: ykhabins):
It shall be done internally in XercesJ.
I experienced the same issue with Saxon long time ago.
They changed the hashing formula and the issue was resolved.
Please see it here:
*{color:#555555}"Hash function for XSD identity constraints has poor worst-case
performance{color}{color:#333333}|{color}"*
[{color:#333333}[https://saxonica.plan.io/issues/3487]{color}]
> Xerces-J 2.12.0: XSD 1.1 PK constraint scalability issue
> --------------------------------------------------------
>
> Key: XERCESJ-1701
> URL: https://issues.apache.org/jira/browse/XERCESJ-1701
> Project: Xerces2-J
> Issue Type: Bug
> Components: JAXP (javax.xml.validation)
> Affects Versions: 2.12.0
> Environment: Windows 10, 64-bit
> Reporter: Yitzhak Khabinsky
> Priority: Major
> Labels: XSD, key
> Fix For: 2.12.0
>
> Attachments: SubscriberCountFact.zip
>
>
> Hello,
> A test case is very simple:
> * XML file, size 700 MB
> * XSD file
> XSD is enforcing the following:
> * XML structure
> * Data elements/attributes data types
> * *PK constraint, composite primary key based on four elements*
> * No asserts/assertions/CTAs
> <xs:key name="PK">
> <xs:selector xpath="r"/>
> <xs:field xpath="CountryCode"/>
> <xs:field xpath="Date"/>
> <xs:field xpath="AnalyticsArrangementKey"/>
> <xs:field xpath="PaymentType"/>
> </xs:key>
>
> Saxon Java EE runs XSD validation for 2 minutes
> Xerces-J 2.12.0 cannot finish it at all, running for many hours.
> If I comment out the *xs:key* constraint, Xerces has no problems to finish
> the validation.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]