I would say definitely investigate the performance of the query, but also
since you're using CachedSqlEntityProcessor, you might want to back off on
the transaction isolation to READ_COMMITTED, which I think is the lowest
one that Oracle supports:

http://wiki.apache.org/solr/DataImportHandler#Configuring_JdbcDataSource

Michael Della Bitta

Applications Developer

o: +1 646 532 3062  | c: +1 917 477 7906

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions <https://twitter.com/Appinions> | g+:
plus.google.com/appinions
w: appinions.com <http://www.appinions.com/>


On Fri, Jun 28, 2013 at 2:52 PM, Otis Gospodnetic <
otis.gospodne...@gmail.com> wrote:

> Hi,
>
> I'd go talk to the DBA.  How long does this query take if you run it
> directly against Oracle?  How long if you run it locally vs. from a
> remove server (like Solr is in relation to your Oracle server(s)).
> What happens if you increase batchSize?
>
> Otis
> --
> Solr & ElasticSearch Support -- http://sematext.com/
> Performance Monitoring -- http://sematext.com/spm
>
>
>
> On Thu, Jun 27, 2013 at 6:41 PM, srinalluri <nallurisr...@yahoo.com>
> wrote:
> > Hello,
> >
> > I am using Solr 4.3.2 and Oracle DB. The sub entity is using
> > CachedSqlEntityProcessor. The dataSource is having batchSize="500". The
> > full-import is failed with 'ORA-01555: snapshot too old: rollback segment
> > number  with name "" too small ORA-22924: snapshot too old' Exception
> after
> > 5 hours.
> >
> > We already increased the undo space 4 times at the database end. Number
> of
> > records in the jan_story table is 800,000 only. Tomcat is with 4GB JVM
> > memory.
> >
> > Following is the entity (there are other sub-entities, I didn't mention
> them
> > here. As the import failed with article_details entity. article_details
> is
> > the first sub-entity)
> >
> > <entity name="par8-article-testingprod" dataSource="par8_prod" pk="VCMID"
> >     preImportDeleteQuery="content_type:article AND
> > repository:par8qatestingprod"
> >     query="select ID as VCMID from jan_story">
> > <entity name="article_details" dataSource="par8_prod"
> > transformer="TemplateTransformer,ClobTransformer,RegexTransformer"
> >       query="select bb.recordid, aa.ID as DID,aa.STORY_TITLE,
> > aa.STORY_HEADLINE, aa.SOURCE, aa.DECK, regexp_replace(aa.body,
> > '\<p\>\[(pullquote|summary)\]\</p\>|\[video [0-9]+?\]|\[youtube
> > .+?\]', '') as BODY, aa.PUBLISHED_DATE, aa.MODIFIED_DATE, aa.DATELINE,
> > aa.REPORTER_NAME, aa.TICKER_CODES,aa.ADVERTORIAL_CONTENT from jan_story
> > aa,mapp bb where aa.id=bb.keystring1" cacheKey="DID"
> > cacheLookup="par8-article-testingprod.VCMID"
> > processor="CachedSqlEntityProcessor" >
> >         <field column="content_type" template="article" />
> >         <field column="RECORDID" name="native_id" />
> >         <field column="repository" template="par8qatestingprod" />
> >         <field column="STORY_TITLE" name="title" />
> >         <field column="DECK" name="description" clob="true" />
> >         <field column="PUBLISHED_DATE" name="date" />
> >         <field column="MODIFIED_DATE" name="last_modified_date" />
> >         <field column="BODY" name="body" clob="true" />
> >         <field column="SOURCE" name="source" />
> >         <field column="DATELINE" name="dateline" />
> >         <field column="STORY_HEADLINE" name="export_headline" />
> >       </entity>
> >       </entity>
> >
> >
> > The full-import without CachedSqlEntityProcessor is taking 7 days. That
> is
> > why I am doing all this.
> >
> >
> >
> > --
> > View this message in context:
> http://lucene.472066.n3.nabble.com/full-import-failed-after-5-hours-with-Exception-ORA-01555-snapshot-too-old-rollback-segment-number-wd-tp4073822.html
> > Sent from the Solr - User mailing list archive at Nabble.com.
>

Reply via email to