Hi Janne,
I added code as your suggestions. So now I am using text generated from
cleanTextRender to buld index on the page content.
However there seems to be some problem whenever I have '[{TableOfContents
}]' mark up in my page then CleanTextRender possibly generates HTML code or
somethig similar. For other markups like %%strike and !!!, the html code is
not generated and I get pure text. Its just '[{TableOfContents }]' that
seems to be translated to html (or something similar) code.
How I produce this issue:
1) I create a page named TestPage with the following content
[{TableOfContents }]
!!! Chapter 1
Test page without any helpful info
!! Chapter 1.1
hi
2) I save the page and then I search for keywords like 'href' and 'div'
3) The page that I just created comes as search result
However when I edit TestPage and remove the above contents and just put
following code
%%strike God is great!!/%?
and save and Then search using 'div' keyword, it does not show any result
and works correctly.That confirms CleanTextRender did NOT return html
equivalent for %%strike and thus no html code was used for index creation.
Note that '<div class="strike">' is html translation for %%strike
So, the problem seems to be with pure text conversion of -
[{TableOfContents }]
Is this some bug with Clean text renderer, or am I making a mistake
somewhere.
Please suggest how to resolve it.
Thanks and regards!
Pushker Chaubey
On Apr 21, 2009 10:06pm, Janne Jalkanen <[email protected]> wrote:
Try the CleanTextRenderer. Get a WikiDocument from the
JSPWikiMarkupParser, then create a CleanTextRenderer instance and pass
the document to it. Look at RenderingManager for help.
/Janne
On 21 Apr 2009, at 18:39, [email protected] wrote:
Hi experts,
I am new to JSPWIKI.
I want to do Indexing based on page contents. But I want to remove the
wiki mark-ups from the page and index only plain textual data. Please let
me know how this can be done.
Thanks and regards,
Pushker Chaubey