[ https://issues.apache.org/jira/browse/CONNECTORS-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615550#comment-14615550 ]
Shinichiro Abe commented on CONNECTORS-1219:
--------------------------------------------

r1689485.

Regarding StringBuilder(int capacity): the capacity in this case was approximately 700 MB. I found in the past that Solrj has the same limitation, even though Solrj uses String.getBytes() rather than StringBuilder. org.apache.lucene.util.ArrayUtil.grow also works on a byte array, so it may hit an OOM when that size is exceeded.

> Lucene Output Connector
> -----------------------
>
>                 Key: CONNECTORS-1219
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1219
>             Project: ManifoldCF
>          Issue Type: New Feature
>            Reporter: Shinichiro Abe
>            Assignee: Shinichiro Abe
>         Attachments: CONNECTORS-1219-v0.1patch.patch, CONNECTORS-1219-v0.2.patch
>
>
> An output connector that writes to a local Lucene index directly, not via a remote search
> engine. It would be nice if we could use Lucene's various APIs against the index
> directly, even though we could do the same thing with a Solr or Elasticsearch
> index. I assume we could do things such as classification, categorization, and
> tagging, using e.g. the lucene-classification package.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
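
The capacity concern in the comment above can be illustrated with a minimal sketch. This is not the change made in r1689485, and it is not taken from the connector source. The class name BoundedReader, the method readBounded, and the maxChars parameter are all hypothetical. The idea is to stop reading before the StringBuilder grows past a configured limit, so the failure is a checked exception rather than an OutOfMemoryError.

```java
import java.io.IOException;
import java.io.Reader;

public class BoundedReader {
    /**
     * Reads the whole stream into a String, but refuses to buffer more
     * than maxChars characters (hypothetical limit chosen by the caller).
     */
    static String readBounded(Reader in, int maxChars) throws IOException {
        // Start small; StringBuilder grows on demand up to the limit.
        StringBuilder sb = new StringBuilder(Math.min(maxChars, 8192));
        char[] buf = new char[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            // Written as a subtraction to avoid int overflow near Integer.MAX_VALUE.
            if (n > maxChars - sb.length()) {
                throw new IOException("content exceeds " + maxChars + " chars");
            }
            sb.append(buf, 0, n);
        }
        return sb.toString();
    }
}
```

A caller would pick maxChars according to the available heap, keeping in mind that the buffered characters and the final String both occupy memory at the same time.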