[ https://issues.apache.org/jira/browse/CONNECTORS-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13127598#comment-13127598 ]
Karl Wright commented on CONNECTORS-274: ---------------------------------------- Looking at the api_result.xml, the only thing that raises a warning flag for me are the double brackets ("[[" and "]]"). I can imagine that xerces might get confused here looking for CDATA segments. The only way I'm going to know for sure is by looking what xerces spits out during parsing. I may be able to look at that a bit this afternoon, with luck. > Wiki connector loses some data > ------------------------------ > > Key: CONNECTORS-274 > URL: https://issues.apache.org/jira/browse/CONNECTORS-274 > Project: ManifoldCF > Issue Type: Bug > Components: Wiki connector > Affects Versions: ManifoldCF 0.4 > Reporter: Karl Wright > Assignee: Karl Wright > Fix For: ManifoldCF 0.4 > > Attachments: api_result.xml, api_result.xml, fetch_result_content.txt > > > The wiki connector reportedly does not capture the entire content for a > document properly; it deletes some in a pattern which is mysterious. For > example: > As for an example of the content I got and what I was expecting: > content indexed: > > "ff Nienhagen *> " > content expected: > "... > == Akquise == > * Riff Nienhagen > ..." > -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira