Hello,
I have a problem with the way Magnolia (5.3.12) indexes page content. It
indexes not just the page content but also the CSS class names, so the search
returns inaccurate results.
The search index config:
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <param name="path" value="${wsp.home}/index" />
  <param name="useCompoundFile" value="true" />
  <param name="minMergeDocs" value="100" />
  <param name="volatileIdleTime" value="3" />
  <param name="maxMergeDocs" value="100000" />
  <param name="mergeFactor" value="10" />
  <param name="maxFieldLength" value="10000" />
  <param name="bufferSize" value="10" />
  <param name="cacheSize" value="1000" />
  <param name="queryClass" value="org.apache.jackrabbit.core.query.QueryImpl" />
  <param name="respectDocumentOrder" value="true" />
  <param name="resultFetchSize" value="2147483647" />
  <param name="extractorPoolSize" value="3" />
  <param name="extractorTimeout" value="100" />
  <param name="extractorBackLogSize" value="100" />
  <param name="enableConsistencyCheck" value="false" />
  <param name="forceConsistencyCheck" value="false" />
  <param name="autoRepair" value="false" />
  <param name="onWorkspaceInconsistency" value="log" />
</SearchIndex>
The SQL query pattern:
QUERY_PATTERN = "select * from nt:base where jcr:path like ''{0}/%''"
    + " and contains(*, ''{1}'') order by jcr:path";
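To make clear what query actually reaches the repository, here is a minimal sketch of how the pattern expands. It assumes the pattern is filled in with java.text.MessageFormat (suggested by the doubled single quotes, but not stated above). The class name, the method name, and the sample path and search term are made up for illustration.

```java
import java.text.MessageFormat;

class QueryPatternDemo {
    // Same pattern as above. In MessageFormat, '' is the escape for one literal single quote.
    static final String QUERY_PATTERN =
        "select * from nt:base where jcr:path like ''{0}/%''"
        + " and contains(*, ''{1}'') order by jcr:path";

    // Hypothetical helper: substitutes the base path for {0} and the search term for {1}.
    static String buildQuery(String basePath, String term) {
        return MessageFormat.format(QUERY_PATTERN, basePath, term);
    }

    public static void main(String[] args) {
        // Sample values only; not taken from the real site.
        System.out.println(buildQuery("/mysite", "banner"));
    }
}
```

With these sample values the expanded statement is
select * from nt:base where jcr:path like '/mysite/%' and contains(*, 'banner') order by jcr:path
The contains(*, ...) clause is a full-text match over all indexed properties of each nt:base node under the path, not only the visible text.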
Is there something I am missing so that the search covers only the actual
content and not the whole HTML page?
Thanks,
Roxana
--
Context is everything:
http://forum.magnolia-cms.com/forum/thread.html?threadId=3393d9ea-35de-457d-b0e9-60345ec9e5a9