Good [morning|day|evening|night],

A new message has been posted to DataparkSearch Engine forum at 
http://www.dataparksearch.org/

- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: SimonB
Subject: Indexer segfaults

I am indexing a bunch of PDF files. When I do "sbin/indexer -a" it processes about 5 
files and then segfaults. No coredump visible in the directory.

I recompiled with the debug option enabled. Here is what happends when I run index -a 
in gdb (I have changed the website name and the words it was indexing with xxxx as the 
stuff is company confidential for a protected extranet).

(gdb) run
Starting program: /usr/local/dpsearch/sbin/indexer -a
Reading symbols from shared object read from target memory...done.
Loaded system supplied DSO at 0xffffe000
[Thread debugging using libthread_db enabled]
[New Thread 1080210464 (LWP 19788)]
indexer[19788]: indexer from dpsearch-4.25-11102004-mysql-pqsql started with 
'/usr/local/dpsearch/etc/indexer.conf'
[New Thread 1089248176 (LWP 19799)]
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=M;O=A
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=M;O=D
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am001125.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am001231.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=S;O=A
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am001111.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=N;O=A
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am900110.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=D;O=A
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=D;O=D
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am010220.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/sm990310.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am010118.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am000330.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/sm990309.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/sm990311.pdf
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/?C=N;O=D
indexer[19788]: {01} URL: https://foo.bar.com/pdfs/am001216.pdf
Detaching after fork from child process 19804.

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 1089248176 (LWP 19799)]
0x4047eb2a in _int_malloc () from /lib/tls/libc.so.6



(gdb) backtrace
#0  0x4047eb2a in _int_malloc () from /lib/tls/libc.so.6
#1  0x4047de7d in malloc () from /lib/tls/libc.so.6
#2  0x40083ec2 in _DpsSQLQuery (db=0x85fe4a0, SQLRes=0x0,
    query=0x8649818 "INSERT INTO dict (word,url_id,intag) VALUES 
('xxxxx',9,196864),('xxxx',9,262400),('xxxx',9,327936),('xxxx',9,393472),('xxxx',9,524544),('xxxx',9,655616),('xxxx',9,721152),('xxxx',9,104"...,
 file=0x4009647a "sql.c",
    line=1250) at sqldbms.c:2446
#3  0x40031aa1 in StoreWordsSingle (Indexer=0x85ec2b0, Doc=0x8600b98, db=0x85fe4a0) at 
sql.c:1229
#4  0x40031df5 in DpsStoreWords (Indexer=0x85ec2b0, Doc=0x0, db=0x0) at sql.c:1330
#5  0x4003f3d8 in DpsURLActionSQL (A=0x85ec2b0, D=0x0, cmd=1079256832, db=0x0) at 
sql.c:4716
#6  0x4006dbc4 in DpsURLAction (A=0x85ec2b0, D=0x8600b98, cmd=5) at db.c:682
#7  0x4006c78c in DocUpdate (Indexer=0x85ec2b0, Doc=0x8600b98) at db.c:465
#8  0x4006d3c8 in DpsDocUpdate (Indexer=0x85ec2b0, Doc=0x8600b98) at db.c:522
#9  0x4006e049 in DpsURLAction (A=0x85ec2b0, D=0x8600b98, cmd=19) at db.c:599
#10 0x4002afcf in DpsIndexNextURL (Indexer=0x85ec2b0) at indexer.c:1249
#11 0x0804b263 in thread_main (arg=0x85ec2b0) at main.c:796
#12 0x40401dec in start_thread () from /lib/tls/libpthread.so.0
#13 0x404e919a in clone () from /lib/tls/libc.so.6
- - - - - - - - - - - - - - - - - - - - - - - - - - - -

Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=01;post=

Reply via email to