- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Emil
Subject: Segmentation fault on 64 bit

Hello Maxime!

I have problems running dpsearch on 64-bit linux (Gentoo). I was using dpsearch 
for a while with no problems, but after a major upgrade I ran into problems 
witch search.cgi and indexer giving me segmentation faults. Had a lot of head 
ache trying to compile different versions to find out where the problem is. I 
thought maybe I was using some different compile options which caused the 
errors, but I tried with the default values and with default options in configs 
but without success. I downgraded then back to version dpsearch-4.43 which is 
running fine... But with any version above dpsearch-4.44 I get segmentation 
faults. I tried compiling the latest versions of dpsearch on 32-bit and it runs 
very well. On 64-bit I tried on AMD64 and on dual Intel 64 and it gives me the 
same segfaults.

search.cgi gives Internal Server Error
indexer stops with segmentation fault when running more than one thread ( eg. 
./indexer -N2 )

I tried also to compile without pthreads - then indexer doesn't work at all.

Here is what I get:
_______________________________________________________
I run indexer:
# ./indexer -a -N2
indexer[12268]: {00} indexer from dpsearch-4.46-06042007-mysql started with 
'/usr/local/dpsearch/etc/indexer.conf'
indexer[12268]: {01} URL: http://news.bbc.co.uk/
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/africa/6528869.stm
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/africa/6528917.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/africa/6530227.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/africa/default.stm
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/hi/americas/6526589.stm
indexer[12268]: {01} URL: http://news.bbc.co.uk/2/hi/americas/6531727.stm
indexer[12268]: {01} URL: 
http://news.bbc.co.uk/1/hi/world/south_asia/default.stm
indexer[12268]: {01} URL: 
http://news.bbc.co.uk/1/hi/world/asia-pacific/default.stm
indexer[12268]: {02} URL: 
http://news.bbc.co.uk/secure-uk.imrworldwide.com/cgi-bin/m?ci=bbc&cg=0
indexer[12268]: {02} URL: http://news.bbc.co.uk/2/low/africa/6528869.stm
Segmentation fault (core dumped)

_______________________________________________________

debugger info:

gdb indexer core.12268
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu"...
Using host libthread_db library "/lib/libthread_db.so.1".
Reading symbols from /usr/local/dpsearch/lib/libdpsearch-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpsearch-4.so
Reading symbols from /lib64/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib/librt.so.1
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib/libm.so.6
Reading symbols from /lib64/libz.so.1...done.
Loaded symbols for /lib/libz.so.1
Reading symbols from /usr/lib64/libmysqlclient.so.15...done.
Loaded symbols for /usr/lib/libmysqlclient.so.15
Reading symbols from /usr/local/dpsearch/lib/libdpcharset-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpcharset-4.so
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libcrypt.so.1...done.
Loaded symbols for /lib/libcrypt.so.1
Reading symbols from /lib64/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /usr/lib64/libssl.so.0.9.8...done.
Loaded symbols for /usr/lib/libssl.so.0.9.8
Reading symbols from /usr/lib64/libcrypto.so.0.9.8...done.
Loaded symbols for /usr/lib/libcrypto.so.0.9.8
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /lib64/libnss_files.so.2...done.
Loaded symbols for /lib/libnss_files.so.2
Reading symbols from /lib64/libnss_dns.so.2...done.
Loaded symbols for /lib/libnss_dns.so.2
Reading symbols from /lib64/libresolv.so.2...done.
Loaded symbols for /lib/libresolv.so.2
Core was generated by `-[2] URL:http  2'.
Program terminated with signal 11, Segmentation fault.
#0  0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250, 
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
    at uconv-sys.c:33
33        *wb = *wc;

(gdb) bt
#0  0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250, 
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
    at uconv-sys.c:33
#1  0x00002aab86aa3853 in DpsConv (c=0x409e0420, d=0x2aaab0008250 
"8\002�\207�*", dlen=1084097088, s=0x994731 "BC NEWS | World | Asia-Pacific",
    slen=46912585632400) at uconv.c:93
#2  0x00002aab8622245b in DpsPrepareWords (Indexer=0x5412a0, Doc=0x605960) at 
parsehtml.c:385
#3  0x00002aab861de045 in DpsIndexNextURL (Indexer=0x5412a0) at indexer.c:1511
#4  0x0000000000403ecc in thread_main (arg=0x409e0420) at main.c:868
#5  0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
#6  0x00002aab86d09f5d in clone () from /lib/libc.so.6
#7  0x0000000000000000 in ?? ()
(gdb) thread apply all bt full

Thread 3 (process 12268):
#0  0x00002aab86cde321 in nanosleep () from /lib/libc.so.6
No symbol table info available.
#1  0x00002aab86cde14e in sleep () from /lib/libc.so.6
No symbol table info available.
#2  0x0000000000405b2d in main (argc=0, argv=0x2) at main.c:1218
        num = 2
        i = 2
        env = 0xfffffffffffffdfc <Address 0xfffffffffffffdfc out of bounds>
        pid_fd = 1
        cfg_res = -516
        pidbuf = "[EMAIL PROTECTED]"...

Thread 2 (process 12272):
#0  dps_tolower (c=5772657) at charset-utils.c:30
No locals.
#1  0x0000000000581571 in ?? ()
No symbol table info available.
#2  0x0000000000000001 in ?? ()
No symbol table info available.
#3  0x0000000000581570 in ?? ()
No symbol table info available.
#4  0x00000000009964d0 in ?? ()
No symbol table info available.
#5  0x0000000000000010 in ?? ()
No symbol table info available.
#6  0x00002aab86243b14 in DpsWildCaseCmp (str=0x9964df 
".co.uk/newswatch/ifs/low/notes_and_corrections/default.stm", wexp=0x581571 
"D=D")
    at wild.c:115
        ret = 68
        x = 0
        y = 0
#7  0x00002aab86230fde in DpsMatchExec (Match=0x582fa0, string=0x9964d0 
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm";,
    net_string=0x9964d0 
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm";, 
sin=0x0, nparts=10, Parts=0x413e02a0)
    at match.c:124
        NET = {sin_family = 28800, sin_port = 88, sin_addr = {s_addr = 0}, 
sin_zero = "\000\000\000\000\000\000\000"}
        bits = 68
        i = 46
        res = 68
        subs = {{rm_so = 7, rm_eo = 0}, {rm_so = 117, rm_eo = 0}, {rm_so = 0, 
rm_eo = 0}, {rm_so = 0, rm_eo = 0}, {rm_so = 10054624, rm_eo = 0}, {
    rm_so = 9632048, rm_eo = 0}, {rm_so = 0, rm_eo = 0}, {rm_so = 0, rm_eo = 
0}, {rm_so = 0, rm_eo = 0}, {rm_so = 0, rm_eo = 0}}
        se = 0x581571 "D=D"
        plen = 1094582944
        slen = 68
#8  0x00002aab86231558 in DpsMatchListFind (L=0x50ede0, str=0x9964d0 
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm";,
    nparts=10, Parts=0x413e02a0) at match.c:330
        M = (DPS_MATCH *) 0x582fa0
        i = 85
#9  0x00002aab861db4e9 in DpsFilterFind (log_level=5, L=0x2e, newhref=0x581571 
"D=D", reason=0x413e0320 "", default_method=10) at indexer.c:107
        P = {{beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1}, 
{beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1}, {
---Type <return> to continue, or q <return> to quit---
    beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, end = -1}, {beg = -1, 
end = -1}}
        M = (DPS_MATCH *) 0x91f4d0
        res = 1
#10 0x00002aab861db711 in DpsHrefCheck (Indexer=0x92f930, Href=0x921130,
    newhref=0x9964d0 
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm";) at 
indexer.c:224
        reason = '\0' <repeats 4096 times>
        newURL = (DPS_URL *) 0x996cb0
        rule = (DPS_ROBOT_RULE *) 0x91f4d0
        Srv = (DPS_SERVER *) 0x91f4d0
        depth = 9565392
        method = 0x2e <Address 0x2e out of bounds>
        s = 0x44 <Address 0x44 out of bounds>
#11 0x00002aab861dc15b in DpsConvertHref (Indexer=0x92f930, CurURL=0x95c4b8, 
Href=0x921130) at indexer.c:492
        parse_res = 68
        cascade = 0
        newURL = (DPS_URL *) 0x996ab0
        newhref = 0x9964d0 
"http://news.bbc.co.uk/newswatch/ifs/low/notes_and_corrections/default.stm";
        Alias = (DPS_MATCH *) 0x0
        alias = 0x0
        aliassize = 10052816
        Parts = {{beg = 0, end = 0}, {beg = 9802720, end = 0}, {beg = 9632048, 
end = 0}, {beg = -2044560553, end = 10923}, {beg = 256, end = 0}, {
    beg = 1094591984, end = 0}, {beg = 10259323, end = 0}, {beg = 0, end = 0}, 
{beg = 0, end = 5}, {beg = 255, end = 0}}
#12 0x00002aab861dc52d in DpsDocStoreHrefs (Indexer=0x92f930, Doc=0x9593e0) at 
indexer.c:516
        i = 1392
        weight = 4.06376555e-44
#13 0x00002aab861de12d in DpsIndexNextURL (Indexer=0x92f930) at indexer.c:1553
        subsection = 0x0
        min_size = 68
        cont_len = 54720
        lang = 0x9206c0 "en"
        reason = '\0' <repeats 4096 times>
        wordnum = 1368
        state = 9569984
        mp3type = 0
        result = 1368
        status = 200
        Doc = (DPS_DOCUMENT *) 0x9593e0
        url = 0x927db0 "http://news.bbc.co.uk/2/low/africa/6528869.stm";
        alias = 0x0
        origurl = 0x0
        aliasurl = 0x0
        Server = (DPS_SERVER *) 0x587080
#14 0x0000000000403ecc in thread_main (arg=0x2e) at main.c:868
        NewConf = {freeme = 0, errstr = '\0' <repeats 2047 times>, bcs = 0x0, 
lcs = 0x0, url_number = 0, Servers = {{nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}}, Cfg_Srv = 0x0, Robots = 
{nrobots = 0, Robot = 0x0}, Aliases = {nmatches = 0, Match = 0x0},
  ReverseAliases = {nmatches = 0, Match = 0x0}, MimeTypes = {nmatches = 0, 
Match = 0x0}, Filters = {nmatches = 0, Match = 0x0}, SectionFilters = {
    nmatches = 0, Match = 0x0}, StoreFilters = {nmatches = 0, Match = 0x0}, 
SectionMatch = {nmatches = 0, Match = 0x0}, HrefSectionMatch = {
    nmatches = 0, Match = 0x0}, SubSectionMatch = {nmatches = 0, Match = 0x0}, 
BodyPatterns = {nmatches = 0, Match = 0x0}, Targets = {
    total_found = 0, work_time = 0, first = 0, last = 0, fetched = 0, num_rows 
= 0, cur_row = 0, offset = 0, memused = 0, PerSite = 0x0, freeme = 0,
    Doc = 0x0, Suggest = 0x0, WWList = {nuniq = 0, nwords = 0, maxulen = 0, 
Word = 0x0}, CoordList = {ncoords = 0, order = 0, word = 0x0,
      Coords = 0x0, Data = 0x0}, nitems = 0, mitems = 0, ncmds = 0, orig_nitems 
= 0, max_order = 0, phrase = 0, prepared = 0, items = 0x0},
---Type <return> to continue, or q <return> to quit---
  Sections = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0} <repeats 
256 times>}}, HrefSections = {freeme = 0, Root = {{nvars = 0, mvars = 0,
        Var = 0x0} <repeats 256 times>}}, Vars = {freeme = 0, Root = {{nvars = 
0, mvars = 0, Var = 0x0} <repeats 256 times>}}, LangMaps = {nmaps = 0,
    Map = 0x0}, Synonyms = {nsynonyms = 0, msynonyms = 0, Synonym = 0x0, Back = 
0x0}, Acronyms = {nacronyms = 0, macronyms = 0, Acronym = 0x0},
  StopWords = {nstopwords = 0, StopWord = 0x0}, Parsers = {nparsers = 0, Parser 
= 0x0}, dbl = {nitems = 0, currdbnum = 0, db = 0x0}, Spells = {
    nspell = 0, mspell = 0, nLang = 0, Spell = 0x0, SpellTree = {{Left = {0 
<repeats 256 times>}, Right = {0 <repeats 256 times>},
        lang = "\000\000"} <repeats 16 times>}}, Affixes = {naffixes = 0, 
maffixes = 0, Affix = 0x0, PrefixTree = {{Left = {0 <repeats 256 times>},
        Right = {0 <repeats 256 times>}, lang = "\000\000"} <repeats 16 
times>}, SuffixTree = {{Left = {0 <repeats 256 times>}, Right = {
          0 <repeats 256 times>}, lang = "\000\000"} <repeats 16 times>}}, 
WordParam = {min_word_len = 0, max_word_len = 0, correct_factor = 0,
    incorrect_factor = 0, number_factor = 0, alnum_factor = 0}, Chi = {nwords = 
0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Thai = {
    nwords = 0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Korean = 
{nwords = 0, mwords = 0, total = 0, ChiWord = 0x0, hash = 0x0}, Flags = {
    bind_addr = {sin_family = 0, sin_port = 0, sin_addr = {s_addr = 0}, 
sin_zero = "\000\000\000\000\000\000\000"}, hold_cache = 0,
    robots_period = 0, do_store = 0, do_excerpt = 0, CVS_ignore = 0, 
collect_links = 0, use_crc32_url_id = 0, use_crosswords = 0, use_newsext = 0,
    use_accentext = 0, use_aspellext = 0, use_meta = 0, update_lm = 0, 
provide_referer = 0, OptimizeAtUpdate = 0, PreloadURLData = 0, cold_var = 0,
    PopRankNeoIterations = 0, GuesserBytes = 0, skip_unreferred = 0, track_hops 
= 0, poprank_postpone = 0, limits = 0, nmaps = 0, URLInfoSQL = 0,
    CheckInsertSQL = 0, mark_for_index = 0, use_date_header = 0, MaxSiteLevel = 
0, SEASentences = 0, SEASentenceMinLength = 0, Resegment = 0,
    poprank_method = DPS_POPRANK_GOO, cmd = DPS_IND_INDEX}, flags = 0, 
URLDataFile = 0x0, CharsToEscape = 0x0, logs_only = 0, is_log_open = 0,
  logFD = 0x0, ThreadInfo = 0, LockProc = 0, RefInfo = 0}
        rc = 68
        z = 0
        Indexer = (DPS_AGENT *) 0x92f930
        res = 0
        done = 0
        i_sleep = 0
        notarget = 0
        now = 0
#15 0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
No symbol table info available.
#16 0x00002aab86d09f5d in clone () from /lib/libc.so.6
No symbol table info available.
#17 0x0000000000000000 in ?? ()
No symbol table info available.

Thread 1 (process 12270):
#0  0x00002aab86aa5202 in dps_wc_mb_sys_int (conv=0x409e0420, cs=0xb0008250, 
wc=0x409e0240, s=0x2aaab0008254 "�*", e=0x2aaab0008a90 "\003")
    at uconv-sys.c:33
        wb = (dpsunicode_t *) 0xb0008250
#1  0x00002aab86aa3853 in DpsConv (c=0x409e0420, d=0x2aaab0008250 
"8\002�\207�*", dlen=1084097088, s=0x994731 "BC NEWS | World | Asia-Pacific",
    slen=46912585632400) at uconv.c:93
        i = 0
        codes = 1
        res = 66
        wc = {66, 10922, 6314336, 0, 2128, 0, 2261490665, 10923, 2250696272, 
10923, 5509792, 0, 2113, 0, 2160, 0, 6320664, 0, 2250406743, 10923,
  6320664, 0, 2250406743, 10923, 1084097520, 95, 2952790048, 10922, 2113, 0, 
10462880, 0}
        zero = 0
        d_o = 0x2aaab0008250 "8\002�\207�*"
        s_e = 0x994750 "\020U\226"
        d_e = 0x2aaab0008a90 "\003"
        s_o = 0x994730 "BBC NEWS | World | Asia-Pacific"
#2  0x00002aab8622245b in DpsPrepareWords (Indexer=0x5412a0, Doc=0x605960) at 
parsehtml.c:385
        dstlen = 2112
        ustr = (dpsunicode_t *) 0x2aaab0008250
        UStr = (dpsunicode_t *) 0x605960
        srclen = 32
        reslen = 2112
        src = 0x2aaab0008254 "�*"
        Item = (DPS_TEXTITEM *) 0x9fa6a0
        i = 0
        doccset = 0x42 <Address 0x42 out of bounds>
        doccs = (DPS_CHARSET *) 0x20
        sys_int = (DPS_CHARSET *) 0x42
        dc_uni = {from = 0x2aab86c4caa0, to = 0x2aab86c4d980, CharsToEscape = 
0x52c130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
  flags = 12, istate = 0, ostate = 0}
        tlist = (DPS_TEXTLIST *) 0x608a20
        Sec = (DPS_VAR *) 0x42
        crc32 = 0
        crossec = 10
        seasec = 0
        uword = (dpsunicode_t *) 0x2aaab0007aa0
        lcsword = 0x2aaab0007b30 "\200"
        max_word_len = 32
        min_word_len = 1
        indexed_size = 0
        indexed_limit = 0
        content_lang = 0x2aaab00078a0 "it"
        exrpt = {allocated_size = 256, data_size = 0, page_size = 256, freeme = 
0, data = 0x2aaab0007990 ""}
#3  0x00002aab861de045 in DpsIndexNextURL (Indexer=0x5412a0) at indexer.c:1511
        subsection = 0x0
        min_size = 66
        cont_len = 0
        lang = 0x2aaab00078a0 "it"
        reason = '\0' <repeats 4096 times>
        wordnum = 0
        state = -1342146400
        mp3type = 0
        result = 0
        status = 200
        Doc = (DPS_DOCUMENT *) 0x605960
        url = 0x99d9f0 
"http://news.bbc.co.uk/1/hi/world/asia-pacific/default.stm";
        alias = 0x0
        origurl = 0x0
        aliasurl = 0x0
        Server = (DPS_SERVER *) 0x587080
#4  0x0000000000403ecc in thread_main (arg=0x409e0420) at main.c:868
        NewConf = {freeme = 0, errstr = '\0' <repeats 2047 times>, bcs = 0x0, 
lcs = 0x0, url_number = 0, Servers = {{nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0, 
min_ordre = 0, sorted = 0, Server = 0x0}, {nservers = 0, mservers = 0,
      min_ordre = 0, sorted = 0, Server = 0x0}}, Cfg_Srv = 0x0, Robots = 
{nrobots = 0, Robot = 0x0}, Aliases = {nmatches = 0, Match = 0x0},
  ReverseAliases = {nmatches = 0, Match = 0x0}, MimeTypes = {nmatches = 0, 
Match = 0x0}, Filters = {nmatches = 0, Match = 0x0}, SectionFilters = {
    nmatches = 0, Match = 0x0}, StoreFilters = {nmatches = 0, Match = 0x0}, 
SectionMatch = {nmatches = 0, Match = 0x0}, HrefSectionMatch = {
    nmatches = 0, Match = 0x0}, SubSectionMatch = {nmatches = 0, Match = 0x0}, 
BodyPatterns = {nmatches = 0, Match = 0x0}, Targets = {
    total_found = 0, work_time = 0, first = 0, last = 0, fetched = 0, num_rows 
= 0, cur_row = 0, offset = 0, memused = 0, PerSite = 0x0, freeme = 0,
    Doc = 0x0, Suggest = 0x0, WWList = {nuniq = 0, nwords = 0, maxulen = 0, 
Word = 0x0}, CoordList = {ncoords = 0, order = 0, word = 0x0,
      Coords = 0x0, Data = 0x0}, nitems = 0, mitems = 0, ncmds = 0, orig_nitems 
= 0, max_order = 0, phrase = 0, prepared = 0, items = 0x0},
  Sections = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0} <repeats 
256 times>}}, HrefSections = {freeme = 0, Root = {{nvars = 0, mvars = 0,
        Var = 0x0} <repeats 256 times>}}, Vars = {freeme = 0, Root = {{nvars = 
0, mvars = 0, Var = 0x0} <repeats 256 times>}}, LangMaps = {nmaps = 0,
    Map = 0x0}, Synonyms = {nsynonyms = 0, msynonyms = 0, Synonym = 0x0, Back = 
0x0}, Acronyms = {nacronyms = 0, macronyms = 0, Acronym = 0x0},
  StopWords = {nstopwords = 0, StopWord = 0x0}, Parsers = {nparsers = 0, Parser 
= 0x0}, dbl = {nitems = 0, currdbnum = 0, db = 0x0}, Spells = {
    nspell = 0, mspell = 0, nLang = 0, Spell = 0x0, SpellTree = {{Left = {0 
<repeats 256 times>}, Right = {0 <repeats 256 times>},
        lang = "\000\000"} <repeats 16 times>}}, Affixes = {naffixes = 0, 
maffixes = 0, Affix = 0x0, PrefixTree = {{Left = {0 <repeats 256 times>},
        Right = {0 <repeats 256 times>}, lang = "\000\000"} <repeats 16 
times>}, SuffixTree = {{Left = {0 <repeats 256 times>}, Right = {
          0 <repeats 256 times>}, lang = "\000\000"} <repeats 15 times>, {Left 
= {0 <repeats 256 times>}, Right = {0 <repeats 209 times>,
          -2046085092, 10923, 0, 0, -2024287952, 10923, 8, 0, 15, 0, 
-1472036457, 0, -2046084150, 10923, 0, 0, 1084227376, 0, -1472036458, 0,
          1084227680, 0, 1084227704, 0, -2033900728, 10923, 0, 0, -2024287368, 
10923, -2033917952, 10923, -2043048383, 10923, -2033850832, 10923,
          -2043054696, 10923, 0, 1, 1852, 1, 0, 0, -2046003952, 10923, 
1084227760}, lang = "\000\000"}}}, WordParam = {min_word_len = 1084227680,
    max_word_len = 1084227704, correct_factor = 2822930839, incorrect_factor = 
46916198454136, number_factor = 46916176657961, alnum_factor = 0},
  Chi = {nwords = 46916198454136, mwords = 1, total = 0, ChiWord = 0x1, hash = 
0x0}, Thai = {nwords = 0, mwords = 0, total = 0,
    ChiWord = 0x100000000, hash = 0x2aab860c7510}, Korean = {nwords = 
1084227808, mwords = 46916176736688, total = 46916179693121, ChiWord = 0x0,
    hash = 0x0}, Flags = {bind_addr = {sin_family = 0, sin_port = 0, sin_addr = 
{s_addr = 0}, sin_zero = "\000\000\000\000\000\000\000"},
    hold_cache = 0, robots_period = 0, do_store = 0, do_excerpt = 0, CVS_ignore 
= 0, collect_links = 0, use_crc32_url_id = -2033856280,
    use_crosswords = 10923, use_newsext = -2033917952, use_accentext = 10923, 
use_aspellext = 0, use_meta = 0, update_lm = 110932256,
    provide_referer = 0, OptimizeAtUpdate = 0, PreloadURLData = 0, cold_var = 
1130576, PopRankNeoIterations = 0, GuesserBytes = -2043035806,
    skip_unreferred = 10923, track_hops = 614437152, poprank_postpone = 32767, 
limits = 3, nmaps = 0, URLInfoSQL = 1084231680, CheckInsertSQL = 0,
    mark_for_index = -2043060224, use_date_header = 10923, MaxSiteLevel = 
-2046069473, SEASentences = 10923, SEASentenceMinLength = 1, Resegment = 0,
    poprank_method = DPS_POPRANK_GOO, cmd = DPS_IND_INDEX}, flags = 0, 
URLDataFile = 0x0, CharsToEscape = 0x2aab86c5d0e8 "\236H",
  logs_only = 1084229952, is_log_open = 0, logFD = 0x0, ThreadInfo = 
0x2aab860bc852 <_dl_runtime_resolve+66>, LockProc = 0x40a00940,
  RefInfo = 0x2aab8639bfb6 <start_thread+84>}
        rc = 66
        z = 0
        Indexer = (DPS_AGENT *) 0x5412a0
        res = 0
        done = 0
        i_sleep = 0
        notarget = 0
        now = 0
#5  0x00002aab8639c027 in start_thread () from /lib/libpthread.so.0
No symbol table info available.
#6  0x00002aab86d09f5d in clone () from /lib/libc.so.6
No symbol table info available.
#7  0x0000000000000000 in ?? ()
No symbol table info available.


________________________________________________________________________


now search.cgi from console:
#./search.cgi news
Segmentation fault (core dumped)

# gdb search.cgi core
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu"...
Using host libthread_db library "/lib/libthread_db.so.1".
Reading symbols from /usr/local/dpsearch/lib/libdpsearch-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpsearch-4.so
Reading symbols from /lib64/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib64/librt.so.1...done.
Loaded symbols for /lib/librt.so.1
Reading symbols from /lib64/libm.so.6...done.
Loaded symbols for /lib/libm.so.6
Reading symbols from /lib64/libz.so.1...done.
Loaded symbols for /lib/libz.so.1
Reading symbols from /usr/lib64/libmysqlclient.so.15...done.
Loaded symbols for /usr/lib/libmysqlclient.so.15
Reading symbols from /usr/local/dpsearch/lib/libdpcharset-4.so...done.
Loaded symbols for /usr/local/dpsearch/lib/libdpcharset-4.so
Reading symbols from /lib64/libc.so.6...done.
Loaded symbols for /lib/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libcrypt.so.1...done.
Loaded symbols for /lib/libcrypt.so.1
Reading symbols from /lib64/libnsl.so.1...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /usr/lib64/libssl.so.0.9.8...done.
Loaded symbols for /usr/lib/libssl.so.0.9.8
Reading symbols from /usr/lib64/libcrypto.so.0.9.8...done.
Loaded symbols for /usr/lib/libcrypto.so.0.9.8
Reading symbols from /lib64/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Core was generated by `./search.cgi news'.
Program terminated with signal 11, Segmentation fault.
#0  0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0, 
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
    at uconv-sys.c:33
33        *wb = *wc;
(gdb)  
bt
#0  0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0, 
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
    at uconv-sys.c:33
#1  0x00002b983b1b9853 in DpsConv (c=0x7fff702e04b0, d=0x2b983bce1010 "", 
dlen=140735075451504,
    s=0x2b983bc95011 "ome News Sport Radio TV Weather Languages UK version 
International version About the versions Low graphics Accessibility help The 
World Today Mobiles Get news headlines on your handheld device News Fr"..., 
slen=47932839005488) at uconv.c:93
#2  0x00002b983a94919d in DpsExcerptDoc (query=0x523e80, Res=0x551320, Doc=0x0, 
size=256, padding=40) at store.c:731
#3  0x00002b983a9430a0 in DpsFind (A=0x523e80) at db.c:1280
#4  0x00000000004026f9 in main (argc=5507120, argv=0x0) at search.c:490
(gdb)    
bt full
#0  0x00002b983b1bb202 in dps_wc_mb_sys_int (conv=0x7fff702e04b0, 
cs=0x3bce1010, wc=0x7fff702e0270, s=0x2b983bce1014 "", e=0x2b983bd78d30 "")
    at uconv-sys.c:33
        wb = (dpsunicode_t *) 0x3bce1010
#1  0x00002b983b1b9853 in DpsConv (c=0x7fff702e04b0, d=0x2b983bce1010 "", 
dlen=140735075451504,
    s=0x2b983bc95011 "ome News Sport Radio TV Weather Languages UK version 
International version About the versions Low graphics Accessibility help The 
World Today Mobiles Get news headlines on your handheld device News Fr"..., 
slen=47932839005488) at uconv.c:93
        i = 0
        codes = 1
        res = 72
        wc = {72, 0, 1, 0, 983066810, 11160, 982620655, 11160, 0, 0, 122, 0, 
5631664, 0, 982769587, 11160, 982908012, 11160, 0, 0, 982908012, 11160,
  983066810, 11160, 5600344, 0, 5600344, 0, 122, 0, 982748390, 11160}
        zero = 0
        d_o = 0x2b983bce1010 ""
        s_e = 0x2b983bc95112 ""
        d_e = 0x2b983bd78d30 ""
        s_o = 0x2b983bc95010 "Home News Sport Radio TV Weather Languages UK 
version International version About the versions Low graphics Accessibility 
help The World Today Mobiles Get news headlines on your handheld device News 
F"...
#2  0x00002b983a94919d in DpsExcerptDoc (query=0x523e80, Res=0x551320, Doc=0x0, 
size=256, padding=40) at store.c:731
        HDoc = 0x2b983bc95010 "Home News Sport Radio TV Weather Languages UK 
version International version About the versions Low graphics Accessibility 
help The World Today Mobiles Get news headlines on your handheld device News 
F"...
        HEnd = 0x2b983bc95111 ""
        htok = 0x0
        last = 0x55b790 ""
        lcharset = 0x2b983b362aa0 ""
        doclang = 0x299 <Address 0x299 out of bounds>
        bcs = (DPS_CHARSET *) 0x2b983b362aa0
        dcs = (DPS_CHARSET *) 0x2b983b362aa0
        sys_int = (DPS_CHARSET *) 0x2b983b363980
        tag = {type = 2, script = 0, style = 0, title = 0, body = 0, follow = 
0, index = 0, comment = 0, select = 0, lasthref = 0x0,
  next_b = 0x2b983a938bc0 <DpsNextCharB>, next_e = 0x2b983a938bd0 
<DpsNextCharE>, e = 0x55b790 "", b = 0x0, lt = 0x7fff702e0448, s = 0x55b790 "",
  socket_sd = 0, socket_rv = 0, chunks = 0, Content = 0x0, finished = 0, ntoks 
= 0, toks = {{name = 0x0, val = 0x0, nlen = 0,
      vlen = 0} <repeats 65 times>}}
        start = (dpsunicode_t *) 0x2b983b362aa0
        end = (dpsunicode_t *) 0x3
        prevend = (dpsunicode_t *) 0x0
        uni = (dpsunicode_t *) 0x2b983bce1010
        ures = 665
        p = (dpsunicode_t *) 0xc
        oi = (dpsunicode_t *) 0x55e4a0
        dot = {32, 46, 46, 46, 0}
        c = (dpsunicode_t *) 0x55b2f0
        os = 0x2b983b362aa0 ""
        s = 0
        r = 665
        wlen = (size_t *) 0x55b310
        i = 665
        len = 257
        maxwlen = 3
        ulen = 140735075451688
        prevlen = 5617620
        osl = 665
        index_limit = 0
        dc_uni = {from = 0x2b983b362aa0, to = 0x2b983b363980, CharsToEscape = 
0x506130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
  flags = 12, istate = 0, ostate = 0}
        uni_bc = {from = 0x2b983b363980, to = 0x2b983b362aa0, CharsToEscape = 
0x506130 "\"&<>", ibytes = 0, obytes = 0, icodes = 1, ocodes = 1,
  flags = 12, istate = 0, ostate = 0}
        rec_id = -856889641
        ChunkSize = 10
        DocSize = 155454
        dbnum = 140735075451504
        Source = 0x299 <Address 0x299 out of bounds>
        SourceToFree = 0x55b690 "Home News Sport Radio TV Weather Languages UK 
version International version About the versions Low graphics Accessibility 
help The World Today Mobiles Get news headlines on your handheld device News 
F"...
        needFreeSource = 1
        NOprefixHL = 1
#3  0x00002b983a9430a0 in DpsFind (A=0x523e80) at db.c:1280
        al = 0x55e480 "http://news.bbc.co.uk/";
        db = (DPS_DB *) 0x55e480
        Res = (DPS_RESULT *) 0x551320
        res = 0
        ticks = 1175814173286
        ticks_ = 1175814173286
        i = 0
        dbfrom = 72
        dbto = 1
        num = 1
        page_number = 5629056
        page_size = 0
        ExcerptSize = 256
        ExcerptPadding = 40
        Excerpt = 0x0
        str = "[EMAIL PROTECTED]", '\0' <repeats 13 times>, 
"\037�|:\230+\000\000\001", '\0' <repeats 15 times>, 
"\220pR\000\000\000\000\000$�\223:\230+\000\000\020\034\216:\230+\000\000\200hP\000\000\000\000\0000\bT\000\000\000\000\000R(}:[EMAIL
 PROTECTED]"
#4  0x00000000004026f9 in main (argc=5507120, argv=0x0) at search.c:490
        s = 0x7fff702e0270 "H"
        e = 0x506880 "\001"
        env = 0x506880 "\001"
        bcharset = 0x506880 "\001"
        lcharset = 0x540830 "iso-8859-1"
        conf_dir = 0x40502e "/usr/local/dpsearch/etc"
        template_name = "/usr/local/dpsearch/etc/search.htm", '\0' <repeats 
4067 times>
        template_filename = 0x0
        query_string = 0x53a8c0 "q=news"
        self = '\0' <repeats 1023 times>
        nav = 0x0
        url = 0x0
        searchwords = 0x0
        storedstr = 0x0
        ResultContentType = 0x3bce1010 <Address 0x3bce1010 out of bounds>
        res = 72
        httpd = 0
        catcolumns = 0
        page1 = 5269632
        page2 = 5507120
        npages = 140735075451504
        ppp = 10
        page_size = 10
        page_number = 0
        i = 0
        swlen = 0
        nav_len = 4214830
        storedlen = 0
        Env = (DPS_ENV *) 0x506880
        Agent = (DPS_AGENT *) 0x523e80
        Res = (DPS_RESULT *) 0x0
        query_vars = {freeme = 0, Root = {{nvars = 0, mvars = 0, Var = 0x0} 
<repeats 113 times>, {nvars = 1, mvars = 32, Var = 0x53ccd0}, {nvars = 0,
      mvars = 0, Var = 0x0} <repeats 142 times>}}
        site_id = 0
        prev_site_id = 0
(gdb)                  

______________________________________________________________


Hope this would help ....  'cause I don't know what else to try... maybe the 
problem is with my system... no idea...

Thanks in advance Maxime!









- - - - - - - - - - - - - - - - - - - - - - - - - - - -

Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;post=


Reply via email to