- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Emil
Subject: Segmentation fault on 64 bit
Hello Maxime!
I have problems running dpsearch on 64-bit linux (Gentoo). I was using dpsearch
for a while with no problems, but after a major upgrade I ran into problems
witch search.cgi and indexer giving me segmentation faults. Had a lot of head
ache trying to compile different versions to find out where the problem is. I
thought maybe I was using some different compile options which caused the
errors, but I tried with the default values and with default options in configs
but without success. I downgraded then back to version dpsearch-4.43 which is
running fine... But with any version above dpsearch-4.44 I get segmentation
faults. I tried compiling the latest versions of dpsearch on 32-bit and it runs
very well. On 64-bit I tried on AMD64 and on dual Intel 64 and it gives me the
same segfaults.
search.cgi gives Internal Server Error
indexer stops with segmentation fault when running more than one thread ( eg.
./indexer -N2 )
I tried also to compile without pthreads - then indexer doesn't work at all.
Here is what I get:
_______________________________________________________
I run indexer:
# ./indexer -a -N2
indexer[12268]: {00} indexer from dpsearch-4.46-06042007-mysql started with
'/usr/local/dpsearch/etc/indexer.conf'
indexer[12268]: {01} URL: http://news.bbc.co.uk/
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/africa/6528869.stm
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/africa/6528917.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/africa/6530227.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/africa/default.stm
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/americas/6526589.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/americas/6531727.stm
indexer[12268]: {01} URL:
http://news.bbc.co.uk/1/hi/world/south_asia/default.stm
indexer[12268]: {01} URL:
http://news.bbc.co.uk/1/hi/world/asia-pacific/default.stm
indexer[12268]: {02} URL:
http://news.bbc.co.uk/secure-uk.imrworldwide.com/cgi-bin/m?ci=bbc&cg=0
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/low/africa/6528869.stm
Segmentation fault (core dumped)
_______________________________________________________
debugger info:
gdb indexer core.12268
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu"...
Using host libthread_db library "/lib/libthread_db.so.1".
Reading symbols from /usr/local/dpsearch/lib/libdpsearch-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpsearch-4.so
Reading symbols from /lib64/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib/librt.so.1
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib/libm.so.6
Reading symbols from /lib64/libz.so.1...done.
Loaded symbols for /lib/libz.so.1
Reading symbols from /usr/lib64/libmysqlclient.so.15...done.
Loaded symbols for /usr/lib/libmysqlclient.so.15
Reading symbols from /usr/local/dpsearch/lib/libdpcharset-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpcharset-4.so
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libcrypt.so.1...done.
Loaded symbols for /lib/libcrypt.so.1
Reading symbols from /lib64/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /usr/lib64/libssl.so.0.9.8...done.
Loaded symbols for /usr/lib/libssl.so.0.9.8
Reading symbols from /usr/lib64/libcrypto.so.0.9.8...done.
Loaded symbols for /usr/lib/libcrypto.so.0.9.8
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /lib64/libnss_files.so.2...done.
Loaded symbols for /lib/libnss_files.so.2
Reading symbols from /lib64/libnss_dns.so.2...done.
Loaded symbols for /lib/libnss_dns.so.2
Reading symbols from /lib64/libresolv.so.2...done.
Loaded symbols for /lib/libresolv.so.2
Core was generated by `-[2] URL:http 2'.
Program terminated with signal 11, Segmentation fault.
#0 0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250,
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
at uconv-sys.c:33
33 *wb = *wc;
(gdb) bt
#0 0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250,
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
at uconv-sys.c:33
#1 0x00002aab86aa3853 in DpsConv (c=0x409e0420, d=0x2aaab0008250
"8\002�\207�*", dlen=1084097088, s=0x994731 "BC NEWS | World | Asia-Pacific",
slen=46912585632400) at uconv.c:93
#2 0x00002aab8622245b in DpsPrepareWords (Indexer=0x5412a0, Doc=0x605960) at
parsehtml.c:385
#3 0x00002aab861de045 in DpsIndexNextURL (Indexer=0x5412a0) at indexer.c:1511
#4 0x0000000000403ecc in thread_main (arg=0x409e0420) at main.c:868
#5 0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
#6 0x00002aab86d09f5d in clone () from /lib/libc.so.6
#7 0x0000000000000000 in ?? ()
(gdb) thread apply all bt full
Thread 3 (process 12268):
#0 0x00002aab86cde321 in nanosleep () from /lib/libc.so.6
No symbol table info available.
#1 0x00002aab86cde14e in sleep () from /lib/libc.so.6
No symbol table info available.
#2 0x0000000000405b2d in main (argc=0, argv=0x2) at main.c:1218
num = 2
i = 2
env = 0xfffffffffffffdfc <Address 0xfffffffffffffdfc out of bounds>
pid_fd = 1
cfg_res = -516
pidbuf = "[EMAIL PROTECTED]"...
Thread 2 (process 12272):
#0 dps_tolower (c=5772657) at charset-utils.c:30
No locals.
#1 0x0000000000581571 in ?? ()
No symbol table info available.
#2 0x0000000000000001 in ?? ()
No symbol table info available.
#3 0x0000000000581570 in ?? ()
No symbol table info available.
#4 0x00000000009964d0 in ?? ()
No symbol table info available.
#5 0x0000000000000010 in ?? ()
No symbol table info available.
#6 0x00002aab86243b14 in DpsWildCaseCmp (str=0x9964df
".co.uk/newswatch/ifs/low/notes_and_corrections/default.stm", wexp=0x581571
"D=D")
at wild.c:115
ret = 68
x = 0
y = 0
#7 0x00002aab86230fde in DpsMatchExec (Match=0x582fa0, string=0x9964d0
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm",
net_string=0x9964d0
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm",
sin=0x0, nparts=10, Parts=0x413e02a0)
at match.c:124
NET = {sin_family = 28800, sin_port = 88, sin_addr = {s_addr = 0},
sin_zero = "\000\000\000\000\000\000\000"}
bits = 68
i = 46
res = 68
subs = {{rm_so = 7, rm_eo = 0}, {rm_so = 117, rm_eo = 0}, {rm_so = 0,
rm_eo = 0}, {rm_so = 0, rm_eo = 0}, {rm_so = 10054624, rm_eo = 0}, {
rm_so = 9632048, rm_eo = 0}, {rm_so = 0, rm_eo = 0}, {rm_so = 0, rm_eo =
0}, {rm_so = 0, rm_eo = 0}, {rm_so = 0, rm_eo = 0}}
se = 0x581571 "D=D"
plen = 1094582944
slen = 68
#8 0x00002aab86231558 in DpsMatchListFind (L=0x50ede0, str=0x9964d0
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm",
nparts=10, Parts=0x413e02a0) at match.c:330
M = (DPS_MATCH *) 0x582fa0
i = 85
#9 0x00002aab861db4e9 in DpsFilterFind (log_level=5, L=0x2e, newhref=0x581571
"D=D", reason=0x413e0320 "", default_method=10) at indexer.c:107
P = {{beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1},
{beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1}, {
---Type <return> to continue, or q <return> to quit---
beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1,
end = -1}}
M = (DPS_MATCH *) 0x91f4d0
res = 1
#10 0x00002aab861db711 in DpsHrefCheck (Indexer=0x92f930, Href=0x921130,
newhref=0x9964d0
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm") at
indexer.c:224
reason = '\0' <repeats 4096 times>
newURL = (DPS_URL *) 0x996cb0
rule = (DPS_ROBOT_RULE *) 0x91f4d0
Srv = (DPS_SERVER *) 0x91f4d0
depth = 9565392
method = 0x2e <Address 0x2e out of bounds>
s = 0x44 <Address 0x44 out of bounds>
#11 0x00002aab861dc15b in DpsConvertHref (Indexer=0x92f930, CurURL=0x95c4b8,
Href=0x921130) at indexer.c:492
parse_res = 68
cascade = 0
newURL = (DPS_URL *) 0x996ab0
newhref = 0x9964d0
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm"
Alias = (DPS_MATCH *) 0x0
alias = 0x0
aliassize = 10052816
Parts = {{beg = 0, end = 0}, {beg = 9802720, end = 0}, {beg = 9632048,
end = 0}, {beg = -2044560553, end = 10923}, {beg = 256, end = 0}, {
beg = 1094591984, end = 0}, {beg = 10259323, end = 0}, {beg = 0, end = 0},
{beg = 0, end = 5}, {beg = 255, end = 0}}
#12 0x00002aab861dc52d in DpsDocStoreHrefs (Indexer=0x92f930, Doc=0x9593e0) at
indexer.c:516
i = 1392
weight = 4.06376555e-44
#13 0x00002aab861de12d in DpsIndexNextURL (Indexer=0x92f930) at indexer.c:1553
subsection = 0x0
min_size = 68
cont_len = 54720
lang = 0x9206c0 "en"
reason = '\0' <repeats 4096 times>
wordnum = 1368
state = 9569984
mp3type = 0
result = 1368
status = 200
Doc = (DPS_DOCUMENT *) 0x9593e0
url = 0x927db0 "http://news.bbc.co.uk/2/low/africa/6528869.stm"
alias = 0x0
origurl = 0x0
aliasurl = 0x0
Server = (DPS_SERVER *) 0x587080
#14 0x0000000000403ecc in thread_main (arg=0x2e) at main.c:868
NewConf = {freeme = 0, errstr = '\0' <repeats 2047 times>, bcs = 0x0,
lcs = 0x0, url_number = 0, Servers = {{nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}}, Cfg_Srv = 0x0, Robots =
{nrobots = 0, Robot = 0x0}, Aliases = {nmatches = 0, Match = 0x0},
ReverseAliases = {nmatches = 0, Match = 0x0}, MimeTypes = {nmatches = 0,
Match = 0x0}, Filters = {nmatches = 0, Match = 0x0}, SectionFilters = {
nmatches = 0, Match = 0x0}, StoreFilters = {nmatches = 0, Match = 0x0},
SectionMatch = {nmatches = 0, Match = 0x0}, HrefSectionMatch = {
nmatches = 0, Match = 0x0}, SubSectionMatch = {nmatches = 0, Match = 0x0},
BodyPatterns = {nmatches = 0, Match = 0x0}, Targets = {
total_found = 0, work_time = 0, first = 0, last = 0, fetched = 0, num_rows
= 0, cur_row = 0, offset = 0, memused = 0, PerSite = 0x0, freeme = 0,
Doc = 0x0, Suggest = 0x0, WWList = {nuniq = 0, nwords = 0, maxulen = 0,
Word = 0x0}, CoordList = {ncoords = 0, order = 0, word = 0x0,
Coords = 0x0, Data = 0x0}, nitems = 0, mitems = 0, ncmds = 0, orig_nitems
= 0, max_order = 0, phrase = 0, prepared = 0, items = 0x0},
---Type <return> to continue, or q <return> to quit---
Sections = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0} <repeats
256 times>}}, HrefSections = {freeme = 0, Root = {{nvars = 0, mvars = 0,
Var = 0x0} <repeats 256 times>}}, Vars = {freeme = 0, Root = {{nvars =
0, mvars = 0, Var = 0x0} <repeats 256 times>}}, LangMaps = {nmaps = 0,
Map = 0x0}, Synonyms = {nsynonyms = 0, msynonyms = 0, Synonym = 0x0, Back =
0x0}, Acronyms = {nacronyms = 0, macronyms = 0, Acronym = 0x0},
StopWords = {nstopwords = 0, StopWord = 0x0}, Parsers = {nparsers = 0, Parser
= 0x0}, dbl = {nitems = 0, currdbnum = 0, db = 0x0}, Spells = {
nspell = 0, mspell = 0, nLang = 0, Spell = 0x0, SpellTree = {{Left = {0
<repeats 256 times>}, Right = {0 <repeats 256 times>},
lang = "\000\000"} <repeats 16 times>}}, Affixes = {naffixes = 0,
maffixes = 0, Affix = 0x0, PrefixTree = {{Left = {0 <repeats 256 times>},
Right = {0 <repeats 256 times>}, lang = "\000\000"} <repeats 16
times>}, SuffixTree = {{Left = {0 <repeats 256 times>}, Right = {
0 <repeats 256 times>}, lang = "\000\000"} <repeats 16 times>}},
WordParam = {min_word_len = 0, max_word_len = 0, correct_factor = 0,
incorrect_factor = 0, number_factor = 0, alnum_factor = 0}, Chi = {nwords =
0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Thai = {
nwords = 0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Korean =
{nwords = 0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Flags = {
bind_addr = {sin_family = 0, sin_port = 0, sin_addr = {s_addr = 0},
sin_zero = "\000\000\000\000\000\000\000"}, hold_cache = 0,
robots_period = 0, do_store = 0, do_excerpt = 0, CVS_ignore = 0,
collect_links = 0, use_crc32_url_id = 0, use_crosswords = 0, use_newsext = 0,
use_accentext = 0, use_aspellext = 0, use_meta = 0, update_lm = 0,
provide_referer = 0, OptimizeAtUpdate = 0, PreloadURLData = 0, cold_var = 0,
PopRankNeoIterations = 0, GuesserBytes = 0, skip_unreferred = 0, track_hops
= 0, poprank_postpone = 0, limits = 0, nmaps = 0, URLInfoSQL = 0,
CheckInsertSQL = 0, mark_for_index = 0, use_date_header = 0, MaxSiteLevel =
0, SEASentences = 0, SEASentenceMinLength = 0, Resegment = 0,
poprank_method = DPS_POPRANK_GOO, cmd = DPS_IND_INDEX}, flags = 0,
URLDataFile = 0x0, CharsToEscape = 0x0, logs_only = 0, is_log_open = 0,
logFD = 0x0, ThreadInfo = 0, LockProc = 0, RefInfo = 0}
rc = 68
z = 0
Indexer = (DPS_AGENT *) 0x92f930
res = 0
done = 0
i_sleep = 0
notarget = 0
now = 0
#15 0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
No symbol table info available.
#16 0x00002aab86d09f5d in clone () from /lib/libc.so.6
No symbol table info available.
#17 0x0000000000000000 in ?? ()
No symbol table info available.
Thread 1 (process 12270):
#0 0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250,
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
at uconv-sys.c:33
wb = (dpsunicode_t *) 0xb0008250
#1 0x00002aab86aa3853 in DpsConv (c=0x409e0420, d=0x2aaab0008250
"8\002�\207�*", dlen=1084097088, s=0x994731 "BC NEWS | World | Asia-Pacific",
slen=46912585632400) at uconv.c:93
i = 0
codes = 1
res = 66
wc = {66, 10922, 6314336, 0, 2128, 0, 2261490665, 10923, 2250696272,
10923, 5509792, 0, 2113, 0, 2160, 0, 6320664, 0, 2250406743, 10923,
6320664, 0, 2250406743, 10923, 1084097520, 95, 2952790048, 10922, 2113, 0,
10462880, 0}
zero = 0
d_o = 0x2aaab0008250 "8\002�\207�*"
s_e = 0x994750 "\020U\226"
d_e = 0x2aaab0008a90 "\003"
s_o = 0x994730 "BBC NEWS | World | Asia-Pacific"
#2 0x00002aab8622245b in DpsPrepareWords (Indexer=0x5412a0, Doc=0x605960) at
parsehtml.c:385
dstlen = 2112
ustr = (dpsunicode_t *) 0x2aaab0008250
UStr = (dpsunicode_t *) 0x605960
srclen = 32
reslen = 2112
src = 0x2aaab0008254 "�*"
Item = (DPS_TEXTITEM *) 0x9fa6a0
i = 0
doccset = 0x42 <Address 0x42 out of bounds>
doccs = (DPS_CHARSET *) 0x20
sys_int = (DPS_CHARSET *) 0x42
dc_uni = {from = 0x2aab86c4caa0, to = 0x2aab86c4d980, CharsToEscape =
0x52c130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
flags = 12, istate = 0, ostate = 0}
tlist = (DPS_TEXTLIST *) 0x608a20
Sec = (DPS_VAR *) 0x42
crc32 = 0
crossec = 10
seasec = 0
uword = (dpsunicode_t *) 0x2aaab0007aa0
lcsword = 0x2aaab0007b30 "\200"
max_word_len = 32
min_word_len = 1
indexed_size = 0
indexed_limit = 0
content_lang = 0x2aaab00078a0 "it"
exrpt = {allocated_size = 256, data_size = 0, page_size = 256, freeme =
0, data = 0x2aaab0007990 ""}
#3 0x00002aab861de045 in DpsIndexNextURL (Indexer=0x5412a0) at indexer.c:1511
subsection = 0x0
min_size = 66
cont_len = 0
lang = 0x2aaab00078a0 "it"
reason = '\0' <repeats 4096 times>
wordnum = 0
state = -1342146400
mp3type = 0
result = 0
status = 200
Doc = (DPS_DOCUMENT *) 0x605960
url = 0x99d9f0
"http://news.bbc.co.uk/1/hi/world/asia-pacific/default.stm"
alias = 0x0
origurl = 0x0
aliasurl = 0x0
Server = (DPS_SERVER *) 0x587080
#4 0x0000000000403ecc in thread_main (arg=0x409e0420) at main.c:868
NewConf = {freeme = 0, errstr = '\0' <repeats 2047 times>, bcs = 0x0,
lcs = 0x0, url_number = 0, Servers = {{nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
min_ordre = 0, sorted = 0, Server = 0x0}}, Cfg_Srv = 0x0, Robots =
{nrobots = 0, Robot = 0x0}, Aliases = {nmatches = 0, Match = 0x0},
ReverseAliases = {nmatches = 0, Match = 0x0}, MimeTypes = {nmatches = 0,
Match = 0x0}, Filters = {nmatches = 0, Match = 0x0}, SectionFilters = {
nmatches = 0, Match = 0x0}, StoreFilters = {nmatches = 0, Match = 0x0},
SectionMatch = {nmatches = 0, Match = 0x0}, HrefSectionMatch = {
nmatches = 0, Match = 0x0}, SubSectionMatch = {nmatches = 0, Match = 0x0},
BodyPatterns = {nmatches = 0, Match = 0x0}, Targets = {
total_found = 0, work_time = 0, first = 0, last = 0, fetched = 0, num_rows
= 0, cur_row = 0, offset = 0, memused = 0, PerSite = 0x0, freeme = 0,
Doc = 0x0, Suggest = 0x0, WWList = {nuniq = 0, nwords = 0, maxulen = 0,
Word = 0x0}, CoordList = {ncoords = 0, order = 0, word = 0x0,
Coords = 0x0, Data = 0x0}, nitems = 0, mitems = 0, ncmds = 0, orig_nitems
= 0, max_order = 0, phrase = 0, prepared = 0, items = 0x0},
Sections = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0} <repeats
256 times>}}, HrefSections = {freeme = 0, Root = {{nvars = 0, mvars = 0,
Var = 0x0} <repeats 256 times>}}, Vars = {freeme = 0, Root = {{nvars =
0, mvars = 0, Var = 0x0} <repeats 256 times>}}, LangMaps = {nmaps = 0,
Map = 0x0}, Synonyms = {nsynonyms = 0, msynonyms = 0, Synonym = 0x0, Back =
0x0}, Acronyms = {nacronyms = 0, macronyms = 0, Acronym = 0x0},
StopWords = {nstopwords = 0, StopWord = 0x0}, Parsers = {nparsers = 0, Parser
= 0x0}, dbl = {nitems = 0, currdbnum = 0, db = 0x0}, Spells = {
nspell = 0, mspell = 0, nLang = 0, Spell = 0x0, SpellTree = {{Left = {0
<repeats 256 times>}, Right = {0 <repeats 256 times>},
lang = "\000\000"} <repeats 16 times>}}, Affixes = {naffixes = 0,
maffixes = 0, Affix = 0x0, PrefixTree = {{Left = {0 <repeats 256 times>},
Right = {0 <repeats 256 times>}, lang = "\000\000"} <repeats 16
times>}, SuffixTree = {{Left = {0 <repeats 256 times>}, Right = {
0 <repeats 256 times>}, lang = "\000\000"} <repeats 15 times>, {Left
= {0 <repeats 256 times>}, Right = {0 <repeats 209 times>,
-2046085092, 10923, 0, 0, -2024287952, 10923, 8, 0, 15, 0,
-1472036457, 0, -2046084150, 10923, 0, 0, 1084227376, 0, -1472036458, 0,
1084227680, 0, 1084227704, 0, -2033900728, 10923, 0, 0, -2024287368,
10923, -2033917952, 10923, -2043048383, 10923, -2033850832, 10923,
-2043054696, 10923, 0, 1, 1852, 1, 0, 0, -2046003952, 10923,
1084227760}, lang = "\000\000"}}}, WordParam = {min_word_len = 1084227680,
max_word_len = 1084227704, correct_factor = 2822930839, incorrect_factor =
46916198454136, number_factor = 46916176657961, alnum_factor = 0},
Chi = {nwords = 46916198454136, mwords = 1, total = 0, ChiWord = 0x1, hash =
0x0}, Thai = {nwords = 0, mwords = 0, total = 0,
ChiWord = 0x100000000, hash = 0x2aab860c7510}, Korean = {nwords =
1084227808, mwords = 46916176736688, total = 46916179693121, ChiWord = 0x0,
hash = 0x0}, Flags = {bind_addr = {sin_family = 0, sin_port = 0, sin_addr =
{s_addr = 0}, sin_zero = "\000\000\000\000\000\000\000"},
hold_cache = 0, robots_period = 0, do_store = 0, do_excerpt = 0, CVS_ignore
= 0, collect_links = 0, use_crc32_url_id = -2033856280,
use_crosswords = 10923, use_newsext = -2033917952, use_accentext = 10923,
use_aspellext = 0, use_meta = 0, update_lm = 110932256,
provide_referer = 0, OptimizeAtUpdate = 0, PreloadURLData = 0, cold_var =
1130576, PopRankNeoIterations = 0, GuesserBytes = -2043035806,
skip_unreferred = 10923, track_hops = 614437152, poprank_postpone = 32767,
limits = 3, nmaps = 0, URLInfoSQL = 1084231680, CheckInsertSQL = 0,
mark_for_index = -2043060224, use_date_header = 10923, MaxSiteLevel =
-2046069473, SEASentences = 10923, SEASentenceMinLength = 1, Resegment = 0,
poprank_method = DPS_POPRANK_GOO, cmd = DPS_IND_INDEX}, flags = 0,
URLDataFile = 0x0, CharsToEscape = 0x2aab86c5d0e8 "\236H",
logs_only = 1084229952, is_log_open = 0, logFD = 0x0, ThreadInfo =
0x2aab860bc852 <_dl_runtime_resolve+66>, LockProc = 0x40a00940,
RefInfo = 0x2aab8639bfb6 <start_thread+84>}
rc = 66
z = 0
Indexer = (DPS_AGENT *) 0x5412a0
res = 0
done = 0
i_sleep = 0
notarget = 0
now = 0
#5 0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
No symbol table info available.
#6 0x00002aab86d09f5d in clone () from /lib/libc.so.6
No symbol table info available.
#7 0x0000000000000000 in ?? ()
No symbol table info available.
________________________________________________________________________
now search.cgi from console:
#./search.cgi news
Segmentation fault (core dumped)
# gdb search.cgi core
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu"...
Using host libthread_db library "/lib/libthread_db.so.1".
Reading symbols from /usr/local/dpsearch/lib/libdpsearch-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpsearch-4.so
Reading symbols from /lib64/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib/librt.so.1
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib/libm.so.6
Reading symbols from /lib64/libz.so.1...done.
Loaded symbols for /lib/libz.so.1
Reading symbols from /usr/lib64/libmysqlclient.so.15...done.
Loaded symbols for /usr/lib/libmysqlclient.so.15
Reading symbols from /usr/local/dpsearch/lib/libdpcharset-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpcharset-4.so
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libcrypt.so.1...done.
Loaded symbols for /lib/libcrypt.so.1
Reading symbols from /lib64/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /usr/lib64/libssl.so.0.9.8...done.
Loaded symbols for /usr/lib/libssl.so.0.9.8
Reading symbols from /usr/lib64/libcrypto.so.0.9.8...done.
Loaded symbols for /usr/lib/libcrypto.so.0.9.8
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Core was generated by `./search.cgi news'.
Program terminated with signal 11, Segmentation fault.
#0 0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0,
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
at uconv-sys.c:33
33 *wb = *wc;
(gdb)
bt
#0 0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0,
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
at uconv-sys.c:33
#1 0x00002b983b1b9853 in DpsConv (c=0x7fff702e04b0, d=0x2b983bce1010 "",
dlen=140735075451504,
s=0x2b983bc95011 "ome News Sport Radio TV Weather Languages UK version
International version About the versions Low graphics Accessibility help The
World Today Mobiles Get news headlines on your handheld device News Fr"...,
slen=47932839005488) at uconv.c:93
#2 0x00002b983a94919d in DpsExcerptDoc (query=0x523e80, Res=0x551320, Doc=0x0,
size=256, padding=40) at store.c:731
#3 0x00002b983a9430a0 in DpsFind (A=0x523e80) at db.c:1280
#4 0x00000000004026f9 in main (argc=5507120, argv=0x0) at search.c:490
(gdb)
bt full
#0 0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0,
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
at uconv-sys.c:33
wb = (dpsunicode_t *) 0x3bce1010
#1 0x00002b983b1b9853 in DpsConv (c=0x7fff702e04b0, d=0x2b983bce1010 "",
dlen=140735075451504,
s=0x2b983bc95011 "ome News Sport Radio TV Weather Languages UK version
International version About the versions Low graphics Accessibility help The
World Today Mobiles Get news headlines on your handheld device News Fr"...,
slen=47932839005488) at uconv.c:93
i = 0
codes = 1
res = 72
wc = {72, 0, 1, 0, 983066810, 11160, 982620655, 11160, 0, 0, 122, 0,
5631664, 0, 982769587, 11160, 982908012, 11160, 0, 0, 982908012, 11160,
983066810, 11160, 5600344, 0, 5600344, 0, 122, 0, 982748390, 11160}
zero = 0
d_o = 0x2b983bce1010 ""
s_e = 0x2b983bc95112 ""
d_e = 0x2b983bd78d30 ""
s_o = 0x2b983bc95010 "Home News Sport Radio TV Weather Languages UK
version International version About the versions Low graphics Accessibility
help The World Today Mobiles Get news headlines on your handheld device News
F"...
#2 0x00002b983a94919d in DpsExcerptDoc (query=0x523e80, Res=0x551320, Doc=0x0,
size=256, padding=40) at store.c:731
HDoc = 0x2b983bc95010 "Home News Sport Radio TV Weather Languages UK
version International version About the versions Low graphics Accessibility
help The World Today Mobiles Get news headlines on your handheld device News
F"...
HEnd = 0x2b983bc95111 ""
htok = 0x0
last = 0x55b790 ""
lcharset = 0x2b983b362aa0 ""
doclang = 0x299 <Address 0x299 out of bounds>
bcs = (DPS_CHARSET *) 0x2b983b362aa0
dcs = (DPS_CHARSET *) 0x2b983b362aa0
sys_int = (DPS_CHARSET *) 0x2b983b363980
tag = {type = 2, script = 0, style = 0, title = 0, body = 0, follow =
0, index = 0, comment = 0, select = 0, lasthref = 0x0,
next_b = 0x2b983a938bc0 <DpsNextCharB>, next_e = 0x2b983a938bd0
<DpsNextCharE>, e = 0x55b790 "", b = 0x0, lt = 0x7fff702e0448, s = 0x55b790 "",
socket_sd = 0, socket_rv = 0, chunks = 0, Content = 0x0, finished = 0, ntoks
= 0, toks = {{name = 0x0, val = 0x0, nlen = 0,
vlen = 0} <repeats 65 times>}}
start = (dpsunicode_t *) 0x2b983b362aa0
end = (dpsunicode_t *) 0x3
prevend = (dpsunicode_t *) 0x0
uni = (dpsunicode_t *) 0x2b983bce1010
ures = 665
p = (dpsunicode_t *) 0xc
oi = (dpsunicode_t *) 0x55e4a0
dot = {32, 46, 46, 46, 0}
c = (dpsunicode_t *) 0x55b2f0
os = 0x2b983b362aa0 ""
s = 0
r = 665
wlen = (size_t *) 0x55b310
i = 665
len = 257
maxwlen = 3
ulen = 140735075451688
prevlen = 5617620
osl = 665
index_limit = 0
dc_uni = {from = 0x2b983b362aa0, to = 0x2b983b363980, CharsToEscape =
0x506130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
flags = 12, istate = 0, ostate = 0}
uni_bc = {from = 0x2b983b363980, to = 0x2b983b362aa0, CharsToEscape =
0x506130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
flags = 12, istate = 0, ostate = 0}
rec_id = -856889641
ChunkSize = 10
DocSize = 155454
dbnum = 140735075451504
Source = 0x299 <Address 0x299 out of bounds>
SourceToFree = 0x55b690 "Home News Sport Radio TV Weather Languages UK
version International version About the versions Low graphics Accessibility
help The World Today Mobiles Get news headlines on your handheld device News
F"...
needFreeSource = 1
NOprefixHL = 1
#3 0x00002b983a9430a0 in DpsFind (A=0x523e80) at db.c:1280
al = 0x55e480 "http://news.bbc.co.uk/"
db = (DPS_DB *) 0x55e480
Res = (DPS_RESULT *) 0x551320
res = 0
ticks = 1175814173286
ticks_ = 1175814173286
i = 0
dbfrom = 72
dbto = 1
num = 1
page_number = 5629056
page_size = 0
ExcerptSize = 256
ExcerptPadding = 40
Excerpt = 0x0
str = "[EMAIL PROTECTED]", '\0' <repeats 13 times>,
"\037�|:\230+\000\000\001", '\0' <repeats 15 times>,
"\220pR\000\000\000\000\000$�\223:\230+\000\000\020\034\216:\230+\000\000\200hP\000\000\000\000\0000\bT\000\000\000\000\000R(}:[EMAIL
PROTECTED]"
#4 0x00000000004026f9 in main (argc=5507120, argv=0x0) at search.c:490
s = 0x7fff702e0270 "H"
e = 0x506880 "\001"
env = 0x506880 "\001"
bcharset = 0x506880 "\001"
lcharset = 0x540830 "iso-8859-1"
conf_dir = 0x40502e "/usr/local/dpsearch/etc"
template_name = "/usr/local/dpsearch/etc/search.htm", '\0' <repeats
4067 times>
template_filename = 0x0
query_string = 0x53a8c0 "q=news"
self = '\0' <repeats 1023 times>
nav = 0x0
url = 0x0
searchwords = 0x0
storedstr = 0x0
ResultContentType = 0x3bce1010 <Address 0x3bce1010 out of bounds>
res = 72
httpd = 0
catcolumns = 0
page1 = 5269632
page2 = 5507120
npages = 140735075451504
ppp = 10
page_size = 10
page_number = 0
i = 0
swlen = 0
nav_len = 4214830
storedlen = 0
Env = (DPS_ENV *) 0x506880
Agent = (DPS_AGENT *) 0x523e80
Res = (DPS_RESULT *) 0x0
query_vars = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0}
<repeats 113 times>, {nvars = 1, mvars = 32, Var = 0x53ccd0}, {nvars = 0,
mvars = 0, Var = 0x0} <repeats 142 times>}}
site_id = 0
prev_site_id = 0
(gdb)
______________________________________________________________
Hope this would help .... 'cause I don't know what else to try... maybe the
problem is with my system... no idea...
Thanks in advance Maxime!
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;post=