- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Индексация от обеда до забора
Если имеется в виду индексирование всего Рунета, то
Realm regex ^http://[^/\.]*\.ru/
Realm regex ^http://www.[^/\.]*\.ru/
Если имеется в виду индексирование всех ссылок, найденых на как
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: FTP поиск по именам дирокторий и файлов
Честно говоря удивлен, что работает :) Вчерашний снапшот был недоделаным,
сегодня пофиксил: http://www.dataparksearch.org/dpsearch-4.50-05072008.tar.bz2
deb пэкадж я не собир
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: MF
Subject: Re: FTP поиск по именам дирокторий и файлов
Вот официальная дока дебиана.
http://www.us.debian.org/doc/manuals/maint-guide/
я плохо представляю как dataparksearch и mnogosearch могут совмещаться в 1
сисетеме, просто много
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: RSS выборочное срабатывание
Проверьте, какой именно лог вы смотрите, эта команда включает максимальный
уровень выдачи отладочной информации, поэтому вывод в error_log должен
увеличиться.
- - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: RSS выборочное срабатывание
Попробуйте выполнить из командной строки:
QUERY_STRING="%F1%EE%E1%E0%EA%E8&c=&site=&m=all&sp=1&sy=0&s=DRP&tmplt=rss.htm"
./search.cgi 2>err
и покажите, что выводится в файл err.
- - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: dalex
Subject: Re: Segmentation fault при индексировании
Вот бэктрейс дампа версии 1.50 от 5-го числа этого месяца. Так же вываливается
в segfault, только я удалил документ на котором валилось в прошлый раз. Сейчас
валится на другом
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: RSS выборочное срабатывание
Попробуйте пересобрать указав для configure ключ --enable-syslog вместо
--disable-syslog.
Появится ли после этого отладочная информация в error_lor/файле err ?
- - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Андрей
Subject: Проблема при configure
Здравствуйте! Только что нашел Вашу технологию, очень заинтересовала.
Решил установить на сервак (ASPLinux 11) к себе, но...
checking for daemon... yes
checking for inet_addr... yes
checking for s
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Андрей
Subject: Re: Проблема при configure
Вот еще вырезки из config.log
This file contains any messages produced by compilers while
running configure, to aid debugging if configure makes a mistake.
It was created by configure, which
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: dalex
Subject: Re: Segmentation fault при индексировании
> At 19:55:38 10/07/08, Maxime wrote:
>А в генерируемой вами таблице могут попадаться "слова" длиной более 256
>символов ?
Просмотрел - да, были сочетания символов (знак подчер
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: zabar
Subject: Re: RSS выборочное срабатывание
> At 18:52:16 09/07/08, Maxime wrote:
>Попробуйте пересобрать указав для configure ключ --enable-syslog вместо
>--disable-syslog.
>Появится ли после этого отладочная информация в error_lo
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Андрей
Subject: Re: Проблема при configure
> At 14:50:11 11/07/08, Maxime wrote:
>Проверьте, стоят ли у вас пэкаджи, необходимы для сборки ПО из исходников, в
>Линуксах обычно они не ставятся по-умолчанию.
А какие именно?
- - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Проблема при configure
Инструментарий, необходимый для сборки, указан в документации:
http://www.dataparksearch.org/dpsearch-toolsreq.ru.html
Я не могу назвать имена пэкаджей для линукса, но кроме перечисленных на э
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Anup Nair
Subject: Configuration of Dataparksearch utility with Cygwin linux utility?
Hi,
I have been trying to install DataparkSearch using Cygwin on a Windows XP SP2
system.
I have downloaded the entire installation of Cygwin, all re
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Configuration of Dataparksearch utility with Cygwin linux utility?
DataparkSearch is a Unix software. I can't believe it would be compiled on
Windows successfully. Although I know nothing about Cygwin, so I can't a
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Протестил новый поиск
Похожие запросы - это отдельный поиск, когда таблица qtrack проиндексирована
средствами DataparkSearch, обращение к этому поиску идет через HttpRequest, по
сути это отдельный поиск.
Номера те
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Roman
Subject: Re: Протестил новый поиск
Вижу, а не лучше как у nigma.ru сделать (парсить из текста) - так и базу
дёргать не нужно?
Вот ещё распространённый глук - в большенстве страниц ошибочно распознаётся
язык, на русские страници
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: will harris
Subject: How To Use DPSearch
It's not entirely clear to me how to use this progam. The documentation lists
several options but I am new, and am not exactly sure why I would want to do
certain steps over other ones. I have
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: Re: segfault | Can
при индексирование,
после "indexer -Ecreate"
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=05;topic_id=121673
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: install
I would like to either offer my server (high spec dedicated) for testing in
exchange for install support, or find someone I can pay to help with the
initial install.
- - - - - - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: zabar
Subject: Индексаторы запирают базу
FreeBSD 7.0/amd64
mysql 5.0.51a
при сканировании после подобных записей
[74103]{12} Can't connect to host dreamtour.info:80
[74103]{15} Download timeout
[74103]{17} Download timeout
в процесса
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: install
Hi,
I installed the script but when I run (make install) after successfully running
./install.pl and make I get these errors
make[2]: *** [install-includeHEADERS] Error 1
make[2]: Leaving directory `/usr/loc
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: install
It looks like you have put sources under /usr/local/dpsearch and you're trying
to install into the same directory.
Try to move sources into another directory, i.e. into your home directory, and
repeat insta
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: No
It looks like you have entered a Server command without trailing slash. Try
correct it like this one:
Server http://www.sina.com.cn/
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: ssharry
Subject: Re: No
Thank you!
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.com/cgi-bin/simpleforum.cgi?fid=02;topic_id=1217405250
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Problem with install of 4.50
Please look inside config.log file in the directory where you have ran
configure/install.pl, especially for the line which starts with
checking for MySQL support...
How this line looks l
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: ssharry
Subject: About Chinese charset
Hi,
I configured the project as follow,but still can't see the right chinese words
through cgi.
./configure --prefix=/home/sc/ --with-pgsql=/usr/local/pgsql/
--with-extra-charsets=chinese --wit
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: About Chinese charset
did you uncomment all chinese language maps in langmap.conf file ? They are
commented out by default, since the support for chinese charsets doesn't
compiled in by default.
If you need to unco
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: ssharry
Subject: An Error about client_encoding
Hi
Here is the log of an error when indexing.
{sql.c:1990} Query: SELECT rec_id, hops FROM url WHERE
url='http://www.verycd.com/tags/动漫/'
SQL-server message: ERROR: invalid by
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Install for people like me cpanel - linux
As couldnt find an install for dummies like me this is what I did
In cpanel make sure you create a new mysql database and give a user ALL
priviliges the account and database name
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Roman
Subject: Re: Протестил новый поиск
stored, я где-то в мануале видел команду к indexer переиндексировать базу
поиска из сохранённых копии (что то счас не найду как точно она выглядит).
Правда не заглючит ли она, при условии что с
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Протестил новый поиск
Я пока не знаю о причинах пропадания ссылок, поэтому при индексировании из базы
stored (это ключ -B для indexer), возможно, вы получите только 30% документов
из базы stored, остальные будут пр
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Right now all is dbmode multi
As soon as I change this to cache the following happens
I search for mason -- no results
I search for Mason -- some results
I search for 1 -- No results
I
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Using dbmode cache you have to write down fresh URL data and limits using the
command
./indexer -THW
after each indexing/reindexing (or periodically if indexing takes long run).
Pleas
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: Search for XYZ. Search results: lait: 95421 / 95421 and don
Thank you,
I have dont this and started indexing, also ran the THW,
However one thing is weird
Search for Masons and you get results, Search for masons and
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Thank you,
I have dont this and started indexing, also ran the THW,
However one thing is weird
Search for Masons and you get results, Search for masons and you get no results.
Also if
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: How to crawl from one site to other sites using links?
Please describe more what are expecting to get ?
By default, dpsearch crawls all links betwen site which are having a
corresponding Server/Realm/Subnet command
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Kicked of the indexer last night, and just came back to my office now..
17,000,000 indexed dict definitions.. its going well!
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Re
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
This GroupBySites=yes can I not put this in the indexer or search.htm template?
If not, how do I pass this to my search.cgi
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Rea
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
You may put it as a hidden CGI parameter into your search form:
You don't need to put it into your search template search.htm, since it already
put here and take the value by defaul
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: pending
Subject: ? in url
Generally speaking, dpsearch indexes my site correctly, which is using a php
framework.
Although after indexing the site, it indeed indexed all required urls including
those like http://mySiteDomain/product
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: segfault
Перевел баэу с 4.48 на 4.50
indexer -Erehashstored
поиск отказывется работать с появлением такого сообщения в логах системы
search.cgi[2681]: segfault at 8 ip 7ff145dcb932 sp 7fff4f530190 error 4 in
libc-2.8.so[
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Thank you so much!
cache works
group by page works
indexer is running hard
Aspell is working
awesome!... thank you sooo much!
1 question for today
How to Disallow a url, for example
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: ? in url
Please run the command:
./indexer -qamv5 -u http://mySiteDomain/products/1/index.html?id=353
the -v5 switch here enables full debug output, include information why this
page has been indexed or not.
Please
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Put this command into your indexer.conf file:
Disallow regex amazon\.com
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.o
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
When I do that it gives me
indexer[9452]: {01} SubDoc.robots.txt: 'Disallow /'
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksear
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
No, this message means, that a subdocument is disallowed by a rule in
robots.txt of remote site.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
htt
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I am not sure what happens,
but all my indexer seem to be stuck amazon, nothing goes along... it gets worse
if I put the line
Disallow regex amazon\.com (or Regex)
in my indexer.co
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
What do you mean under "stuck amazon" ?
Probably, you've got a vast number of URLs from amazon.com and indexer deletes
all of them according to this Disallow command.
- - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I am not sure what happened... but I guess your right, it now has to delete all
the amazon entries.
Its a lot of fine tuning hey!
- - - - - - - - - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: Re: segfault | Can
но уже после "indexer -Erehashstored" назад дороги нет, Видимо придется
переиндексировать с нуля
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksear
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: segfault | Can
Включите, пожалуйста, создание посмертных дампов для пользователя, из-под
которого запускается search.cgi, командой
limits -c unlimited
затем создайте по полученому дампу отчет как написано здесь:
ht
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: Re: segfault | Can
Запустил индексацию с нуля все ok, появился шанс это сделать :) думаю нет
смысла тратить время на проблемы с совместимостью, пока. Дальше будут проблемы
выложу дамп. Спасибо.
- - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: Re: segfault | Can
trouble с каткгориями в версии 4.50
индексация произведена с ключами:
## LIMITS !!!
Limit c:category
...
##
Category 01
Server site http://site.name
...
при поиске добавляем "&c=01" в URL результат "did
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: segfault | Can
Эта же команда Limit присутствует в шаблоне search.htm или в файле конфигурации
searchd.conf, если используется searchd ?
Добавьте в шаблон searchd.htm или в searchd.conf команду
LogLevel 5
что при
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Hi,
I have no idea what I did wrong,
But when I start my indexer (I did a ./indexer -C) It show me the following
[EMAIL PROTECTED] ~]# /usr/local/dpsearch/sbin/indexer
indexer[4172]:
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
What the output is for the command:
/usr/local/dpsearch/sbin/indexer -S
?
Try to run
/usr/local/dpsearch/sbin/indexer -a
which is force reindexing for all documents in the database.
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Ok. it is running, but no dict is filled,
Database statistics
StatusExpired Total
-
0 108210 111937 Not indexed yet
200
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
If you use dbmode cache, dict table isn't filles. All data stores under
/usr/local/dpserach/var directory.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I must have broken something, because there are no results anymore...
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Very confused,
If I search for "bible" i get over a thousand results, but if I then search for
other words in the results of "bible" they dont show.. What am I doing wrong?
- - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: pending
Subject: Re: ? in url
thanks a lot, i have figured out what the problem is. session issue for cgi
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simplefor
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
When dbmode cache is used, it use caching to reduce disk usage. It looks like
the "bible" word is one of most used in your collection and its buffer have
been already flushed while o
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Hi, thanks as always,
I give up on cache mode, it is too much trouble... but multi is working nicely
About the amazon exclusion, I put the line you gave me in the indexer.conf but
it
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Please show the output for the command
/usr/local/dpsearch/sbin/indexer -v5 -n1 -u http://www.amazon.com/%
Yes, it will be huge, post it anyway.
- - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Place
Allow *
command in your indexer.conf file below any of Disallow command.
All Allow/Disallow commands are trying on order of appearance in the
indexer.conf and only the first ma
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
This is my indexer.conf
Am I doing something wrong?
#VarDir /usr/local/dpsearch/var
#NewsExtensions no
#AccentExtensions no
#SyslogFacility local7
#LocalCharset iso-8859-1
#LocalChars
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Yes, it seems you need to comment in the
Allow *
command on 31st line.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Like this
CrossWords yes
CollectLinks yes
DoStore yes
StopwordFile stopwords/en.sl
Include stopwords.conf
Include langmap.conf
MinWordLength 1
MaxWordLength 32
#Allow *
Allow Case *.HT
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Yes, it is.
Please note, the commands
Disallow regex amazon\.com
Allow *
doesn't play anything, since all documents are dissalowed by the command
Disallow *
above.
If you need to disa
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Thats confusing, sorry
Like this , it looks silly!
Allow .html .txt .php .php* .htm */ .shtml .pl
Disallow regex amazon\.com
Allow *
Disallow *
- - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Once again, the
Allow *
command just after
Disallow regex amazon\.com
command allows indexing of everything except amazon.com and makes any of Allow
/ Disallow command after it. It se
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I did that.. thanks a lot for your patience,
One thing keeps happening,
My indexer keeps freezing or something.. it starts, and then it stops after a
few minutes...
at different pages
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
How many indexing threads do you start at same time ? (what is the value for -N
switch for indexer ?)
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Fox
Subject: Re: segfault | Can
Limit присутствует в шаблоне search.htm и searchd.conf
файл error_log не появляется, смог вывести в syslog, следующую инфу, если это
моможет:
###
search.cgi started with '/home/indexer/dpsearch/etc/sear
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: segfault | Can
Выглядит, как будто нет данных в лимите по категориям. Выполнялась ли команда
indexer -TW
по окончании индексирования и searchd отправлялся сигнал -HUP на перезагрузку
данных об URL и лимитов, если
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Hi,
At the end of the day this is the message
SQL-server message: MySQL driver: #1203: User biblers_search has already more
than 'max_user_connections' active connections
indexer[431
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
What value for max_user_connections do you have for the User biblers_search ?
How many indexers running simultaneously do you have an ow many indexing
threads each of them have ?
By d
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I just increased max connections to 100 so it should be ok, I have 2 indexers
running now,
BUT>.. I wanted to use cache mode and
changed all dbmode multi to cache
Added this line to
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Have you stopped
/usr/local/dpsearch/sbin/indexer
?
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
ctrl Z before I did all the other work...
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=0
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Ctrl Z suspends the program. To stop it, use Ctrl C.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforu
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I had to make a decission anyways on multi or cache, and as multi works very
well now its just the easier choice.
Thank you for your patience and kind advise!
- - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
Please note, dbmode cache works much faster with huge number of URLs indexed.
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksear
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: mike
Subject: show total sites
I would like to out a blib if info on the site
total sites indexed
total size of index
can someone please advise me how to do this...
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the ful
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: show total sites
You may find the number of site indexed with this SQL-query to the search
database;
SELECT COUNT(*) FROM (SELECT distinct site_id FROM url) AS foo;
Please note, this query works only for PgSQL and
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
Yeah I know, but I keep doing something wrong and cant get cache to work...
weird!
The multi database is useless if you want cache after right? it is one or the
other I think...
I wis
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: show total sites
Thanks a lot!
Also.. i was wondering, is there anywhere that the search terms are kept? It
would be a great statistic to keep track of!
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read t
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: show total sites
You need to enable search query tracking, see:
http://www.dataparksearch.org/dpsearch-track.en.html
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.datap
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: gagrilli
Subject: Cannot display search results
Hi,
Trying to setup DPsearch for the first time, so this is probably some stupid
mistake, but here it is..
Apache 2.2.9, MySQL 5.0.51b, Perl 5.10.0 , (just search.cgi no mod or searchd)
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Cannot display search results
You can use both socket and dbmode parameters in DBAddr in that way:
DBAddr mysql://?socket=...&dbmode=...
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full top
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: gagrilli
Subject: Re: Cannot display search results
Thanks for your quick reply, Maxime.
I got the indexer working, it told me it had 805 documents indexed, but, again
nothing(!)
I think I am doing something wrong woth the Server comm
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: gagrilli
Subject: Re: Cannot display search results
I really don't understand what else can I add to my indexer.conf so that the
basic functionality appears..
I tried changing the dbmode , I tried altering the Server directive, I tried
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: show total sites
Do I need to Edrop Ecreate again to change it over?
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;topi
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: show total sites
No, you don't need it.
You can add any URL to the database using the following SQL command:
INSERT INTO url (url, next_index_time) VALUES ('http://server.ext/', 0);
Attention: don't delete any URL i
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: Cannot display search results
Have you created your sections.conf file and include it from your indexer.conf
file ?
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full topic here:
http://www.datap
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
If you would like ti try cache mode once again, add the following command to
your search.htm template
LogLevel 5
and show the output to the server error log when your perform a search
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
I am so sorry, but which error log? The servers error log shows no errors when
I add LogLevel 5 to the search.htm
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Read the full
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Maxime
Subject: Re: getting closer to my end result
It's web-server error log for a web-server where search.cgi is calling.
Or you can run search.cgi from command line:
/usr/local/dpsearch/bin/search.cgi bible 2>err.log
then show the
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
Name: Mike
Subject: Re: getting closer to my end result
search.cgi[3292]: {00} search.cgi started with
'/usr/local/dpsearch/etc/search.htm'
search.cgi[3292]: {00} VarDir: '/usr/local/dpsearch/var'
search.cgi[3292]: {00} Affixes: 0, Spells: 0
Результаты 601 - 700 из 1663 matches
Mail list logo