[dataparksearch] [Forum] Re: Индексация от обеда до забора

2008-07-04 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Индексация от обеда до забора Если имеется в виду индексирование всего Рунета, то Realm regex ^http://[^/\.]*\.ru/ Realm regex ^http://www.[^/\.]*\.ru/ Если имеется в виду индексирование всех ссылок, найденых на как

[dataparksearch] [Forum] Re: FTP поиск по именам дирокторий и файлов

2008-07-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: FTP поиск по именам дирокторий и файлов Честно говоря удивлен, что работает :) Вчерашний снапшот был недоделаным, сегодня пофиксил: http://www.dataparksearch.org/dpsearch-4.50-05072008.tar.bz2 deb пэкадж я не собир

[dataparksearch] [Forum] Re: FTP поиск по именам дирокторий и файлов

2008-07-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: MF Subject: Re: FTP поиск по именам дирокторий и файлов Вот официальная дока дебиана. http://www.us.debian.org/doc/manuals/maint-guide/ я плохо представляю как dataparksearch и mnogosearch могут совмещаться в 1 сисетеме, просто много

[dataparksearch] [Forum] Re: RSS выборочное срабатывание

2008-07-08 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: RSS выборочное срабатывание Проверьте, какой именно лог вы смотрите, эта команда включает максимальный уровень выдачи отладочной информации, поэтому вывод в error_log должен увеличиться. - - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: RSS выборочное срабатывание

2008-07-09 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: RSS выборочное срабатывание Попробуйте выполнить из командной строки: QUERY_STRING="%F1%EE%E1%E0%EA%E8&c=&site=&m=all&sp=1&sy=0&s=DRP&tmplt=rss.htm" ./search.cgi 2>err и покажите, что выводится в файл err. - - - -

[dataparksearch] [Forum] Re: Segmentation fault при индексировании

2008-07-09 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: dalex Subject: Re: Segmentation fault при индексировании Вот бэктрейс дампа версии 1.50 от 5-го числа этого месяца. Так же вываливается в segfault, только я удалил документ на котором валилось в прошлый раз. Сейчас валится на другом

[dataparksearch] [Forum] Re: RSS выборочное срабатывание

2008-07-09 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: RSS выборочное срабатывание Попробуйте пересобрать указав для configure ключ --enable-syslog вместо --disable-syslog. Появится ли после этого отладочная информация в error_lor/файле err ? - - - - - - - - - - - - - -

[dataparksearch] [Forum] Проблема при configure

2008-07-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Андрей Subject: Проблема при configure Здравствуйте! Только что нашел Вашу технологию, очень заинтересовала. Решил установить на сервак (ASPLinux 11) к себе, но... checking for daemon... yes checking for inet_addr... yes checking for s

[dataparksearch] [Forum] Re: Проблема при configure

2008-07-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Андрей Subject: Re: Проблема при configure Вот еще вырезки из config.log This file contains any messages produced by compilers while running configure, to aid debugging if configure makes a mistake. It was created by configure, which

[dataparksearch] [Forum] Re: Segmentation fault при индексировании

2008-07-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: dalex Subject: Re: Segmentation fault при индексировании > At 19:55:38 10/07/08, Maxime wrote: >А в генерируемой вами таблице могут попадаться "слова" длиной более 256 >символов ? Просмотрел - да, были сочетания символов (знак подчер

[dataparksearch] [Forum] Re: RSS выборочное срабатывание

2008-07-12 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: zabar Subject: Re: RSS выборочное срабатывание > At 18:52:16 09/07/08, Maxime wrote: >Попробуйте пересобрать указав для configure ключ --enable-syslog вместо >--disable-syslog. >Появится ли после этого отладочная информация в error_lo

[dataparksearch] [Forum] Re: Проблема при configure

2008-07-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Андрей Subject: Re: Проблема при configure > At 14:50:11 11/07/08, Maxime wrote: >Проверьте, стоят ли у вас пэкаджи, необходимы для сборки ПО из исходников, в >Линуксах обычно они не ставятся по-умолчанию. А какие именно? - - - - - -

[dataparksearch] [Forum] Re: Проблема при configure

2008-07-14 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Проблема при configure Инструментарий, необходимый для сборки, указан в документации: http://www.dataparksearch.org/dpsearch-toolsreq.ru.html Я не могу назвать имена пэкаджей для линукса, но кроме перечисленных на э

[dataparksearch] [Forum] Configuration of Dataparksearch utility with Cygwin linux utility?

2008-07-17 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Anup Nair Subject: Configuration of Dataparksearch utility with Cygwin linux utility? Hi, I have been trying to install DataparkSearch using Cygwin on a Windows XP SP2 system. I have downloaded the entire installation of Cygwin, all re

[dataparksearch] [Forum] Re: Configuration of Dataparksearch utility with Cygwin linux utility?

2008-07-19 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Configuration of Dataparksearch utility with Cygwin linux utility? DataparkSearch is a Unix software. I can't believe it would be compiled on Windows successfully. Although I know nothing about Cygwin, so I can't a

[dataparksearch] [Forum] Re: Протестил новый поиск

2008-07-20 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Протестил новый поиск Похожие запросы - это отдельный поиск, когда таблица qtrack проиндексирована средствами DataparkSearch, обращение к этому поиску идет через HttpRequest, по сути это отдельный поиск. Номера те

[dataparksearch] [Forum] Re: Протестил новый поиск

2008-07-20 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Roman Subject: Re: Протестил новый поиск Вижу, а не лучше как у nigma.ru сделать (парсить из текста) - так и базу дёргать не нужно? Вот ещё распространённый глук - в большенстве страниц ошибочно распознаётся язык, на русские страници

[dataparksearch] [Forum] How To Use DPSearch

2008-07-20 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: will harris Subject: How To Use DPSearch It's not entirely clear to me how to use this progam. The documentation lists several options but I am new, and am not exactly sure why I would want to do certain steps over other ones. I have

[dataparksearch] [Forum] Re: segfault | Can

2008-07-26 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: Re: segfault | Can при индексирование, после "indexer -Ecreate" - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=05;topic_id=121673

[dataparksearch] [Forum] install

2008-07-28 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: install I would like to either offer my server (high spec dedicated) for testing in exchange for install support, or find someone I can pay to help with the initial install. - - - - - - - - - - - - - - - - - - - - - - -

[dataparksearch] [Forum] Индексаторы запирают базу

2008-07-28 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: zabar Subject: Индексаторы запирают базу FreeBSD 7.0/amd64 mysql 5.0.51a при сканировании после подобных записей [74103]{12} Can't connect to host dreamtour.info:80 [74103]{15} Download timeout [74103]{17} Download timeout в процесса

[dataparksearch] [Forum] Re: install

2008-07-28 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: install Hi, I installed the script but when I run (make install) after successfully running ./install.pl and make I get these errors make[2]: *** [install-includeHEADERS] Error 1 make[2]: Leaving directory `/usr/loc

[dataparksearch] [Forum] Re: install

2008-07-28 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: install It looks like you have put sources under /usr/local/dpsearch and you're trying to install into the same directory. Try to move sources into another directory, i.e. into your home directory, and repeat insta

[dataparksearch] [Forum] Re: No

2008-07-30 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: No It looks like you have entered a Server command without trailing slash. Try correct it like this one: Server http://www.sina.com.cn/ - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic

[dataparksearch] [Forum] Re: No

2008-07-30 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: ssharry Subject: Re: No Thank you! - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.com/cgi-bin/simpleforum.cgi?fid=02;topic_id=1217405250

[dataparksearch] [Forum] Re: Problem with install of 4.50

2008-07-31 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Problem with install of 4.50 Please look inside config.log file in the directory where you have ran configure/install.pl, especially for the line which starts with checking for MySQL support... How this line looks l

[dataparksearch] [Forum] About Chinese charset

2008-08-01 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: ssharry Subject: About Chinese charset Hi, I configured the project as follow,but still can't see the right chinese words through cgi. ./configure --prefix=/home/sc/ --with-pgsql=/usr/local/pgsql/ --with-extra-charsets=chinese --wit

[dataparksearch] [Forum] Re: About Chinese charset

2008-08-01 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: About Chinese charset did you uncomment all chinese language maps in langmap.conf file ? They are commented out by default, since the support for chinese charsets doesn't compiled in by default. If you need to unco

[dataparksearch] [Forum] An Error about client_encoding

2008-08-04 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: ssharry Subject: An Error about client_encoding Hi Here is the log of an error when indexing. {sql.c:1990} Query: SELECT rec_id, hops FROM url WHERE url='http://www.verycd.com/tags/动漫/' SQL-server message: ERROR: invalid by

[dataparksearch] [Forum] Install for people like me cpanel - linux

2008-08-04 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Install for people like me cpanel - linux As couldnt find an install for dummies like me this is what I did In cpanel make sure you create a new mysql database and give a user ALL priviliges the account and database name

[dataparksearch] [Forum] Re: Протестил новый поиск

2008-08-04 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Roman Subject: Re: Протестил новый поиск stored, я где-то в мануале видел команду к indexer переиндексировать базу поиска из сохранённых копии (что то счас не найду как точно она выглядит). Правда не заглючит ли она, при условии что с

[dataparksearch] [Forum] Re: Протестил новый поиск

2008-08-04 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Протестил новый поиск Я пока не знаю о причинах пропадания ссылок, поэтому при индексировании из базы stored (это ключ -B для indexer), возможно, вы получите только 30% документов из базы stored, остальные будут пр

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Right now all is dbmode multi As soon as I change this to cache the following happens I search for mason -- no results I search for Mason -- some results I search for 1 -- No results I

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Using dbmode cache you have to write down fresh URL data and limits using the command ./indexer -THW after each indexing/reindexing (or periodically if indexing takes long run). Pleas

[dataparksearch] [Forum] Re: Search for XYZ. Search results: lait: 95421 / 95421 and don

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: Search for XYZ. Search results: lait: 95421 / 95421 and don Thank you, I have dont this and started indexing, also ran the THW, However one thing is weird Search for Masons and you get results, Search for masons and

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Thank you, I have dont this and started indexing, also ran the THW, However one thing is weird Search for Masons and you get results, Search for masons and you get no results. Also if

[dataparksearch] [Forum] Re: How to crawl from one site to other sites using links?

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: How to crawl from one site to other sites using links? Please describe more what are expecting to get ? By default, dpsearch crawls all links betwen site which are having a corresponding Server/Realm/Subnet command

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Kicked of the indexer last night, and just came back to my office now.. 17,000,000 indexed dict definitions.. its going well! - - - - - - - - - - - - - - - - - - - - - - - - - - - - Re

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result This GroupBySites=yes can I not put this in the indexer or search.htm template? If not, how do I pass this to my search.cgi - - - - - - - - - - - - - - - - - - - - - - - - - - - - Rea

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-05 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result You may put it as a hidden CGI parameter into your search form: You don't need to put it into your search template search.htm, since it already put here and take the value by defaul

[dataparksearch] [Forum] ? in url

2008-08-06 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: pending Subject: ? in url Generally speaking, dpsearch indexes my site correctly, which is using a php framework. Although after indexing the site, it indeed indexed all required urls including those like http://mySiteDomain/product

[dataparksearch] [Forum] segfault

2008-08-06 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: segfault Перевел баэу с 4.48 на 4.50 indexer -Erehashstored поиск отказывется работать с появлением такого сообщения в логах системы search.cgi[2681]: segfault at 8 ip 7ff145dcb932 sp 7fff4f530190 error 4 in libc-2.8.so[

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Thank you so much! cache works group by page works indexer is running hard Aspell is working awesome!... thank you sooo much! 1 question for today How to Disallow a url, for example

[dataparksearch] [Forum] Re: ? in url

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: ? in url Please run the command: ./indexer -qamv5 -u http://mySiteDomain/products/1/index.html?id=353 the -v5 switch here enables full debug output, include information why this page has been indexed or not. Please

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Put this command into your indexer.conf file: Disallow regex amazon\.com - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.o

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result When I do that it gives me indexer[9452]: {01} SubDoc.robots.txt: 'Disallow /' - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksear

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result No, this message means, that a subdocument is disallowed by a rule in robots.txt of remote site. - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: htt

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I am not sure what happens, but all my indexer seem to be stuck amazon, nothing goes along... it gets worse if I put the line Disallow regex amazon\.com (or Regex) in my indexer.co

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result What do you mean under "stuck amazon" ? Probably, you've got a vast number of URLs from amazon.com and indexer deletes all of them according to this Disallow command. - - - - - - - -

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I am not sure what happened... but I guess your right, it now has to delete all the amazon entries. Its a lot of fine tuning hey! - - - - - - - - - - - - - - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: segfault | Can

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: Re: segfault | Can но уже после "indexer -Erehashstored" назад дороги нет, Видимо придется переиндексировать с нуля - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksear

[dataparksearch] [Forum] Re: segfault | Can

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: segfault | Can Включите, пожалуйста, создание посмертных дампов для пользователя, из-под которого запускается search.cgi, командой limits -c unlimited затем создайте по полученому дампу отчет как написано здесь: ht

[dataparksearch] [Forum] Re: segfault | Can

2008-08-07 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: Re: segfault | Can Запустил индексацию с нуля все ok, появился шанс это сделать :) думаю нет смысла тратить время на проблемы с совместимостью, пока. Дальше будут проблемы выложу дамп. Спасибо. - - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: segfault | Can

2008-08-08 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: Re: segfault | Can trouble с каткгориями в версии 4.50 индексация произведена с ключами: ## LIMITS !!! Limit c:category ... ## Category 01 Server site http://site.name ... при поиске добавляем "&c=01" в URL результат "did

[dataparksearch] [Forum] Re: segfault | Can

2008-08-08 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: segfault | Can Эта же команда Limit присутствует в шаблоне search.htm или в файле конфигурации searchd.conf, если используется searchd ? Добавьте в шаблон searchd.htm или в searchd.conf команду LogLevel 5 что при

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-09 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Hi, I have no idea what I did wrong, But when I start my indexer (I did a ./indexer -C) It show me the following [EMAIL PROTECTED] ~]# /usr/local/dpsearch/sbin/indexer indexer[4172]:

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-09 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result What the output is for the command: /usr/local/dpsearch/sbin/indexer -S ? Try to run /usr/local/dpsearch/sbin/indexer -a which is force reindexing for all documents in the database. -

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Ok. it is running, but no dict is filled, Database statistics StatusExpired Total - 0 108210 111937 Not indexed yet 200

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result If you use dbmode cache, dict table isn't filles. All data stores under /usr/local/dpserach/var directory. - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I must have broken something, because there are no results anymore... - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Very confused, If I search for "bible" i get over a thousand results, but if I then search for other words in the results of "bible" they dont show.. What am I doing wrong? - - - - - -

[dataparksearch] [Forum] Re: ? in url

2008-08-10 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: pending Subject: Re: ? in url thanks a lot, i have figured out what the problem is. session issue for cgi - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simplefor

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result When dbmode cache is used, it use caching to reduce disk usage. It looks like the "bible" word is one of most used in your collection and its buffer have been already flushed while o

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Hi, thanks as always, I give up on cache mode, it is too much trouble... but multi is working nicely About the amazon exclusion, I put the line you gave me in the indexer.conf but it

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Please show the output for the command /usr/local/dpsearch/sbin/indexer -v5 -n1 -u http://www.amazon.com/% Yes, it will be huge, post it anyway. - - - - - - - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Place Allow * command in your indexer.conf file below any of Disallow command. All Allow/Disallow commands are trying on order of appearance in the indexer.conf and only the first ma

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result This is my indexer.conf Am I doing something wrong? #VarDir /usr/local/dpsearch/var #NewsExtensions no #AccentExtensions no #SyslogFacility local7 #LocalCharset iso-8859-1 #LocalChars

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Yes, it seems you need to comment in the Allow * command on 31st line. - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Like this CrossWords yes CollectLinks yes DoStore yes StopwordFile stopwords/en.sl Include stopwords.conf Include langmap.conf MinWordLength 1 MaxWordLength 32 #Allow * Allow Case *.HT

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Yes, it is. Please note, the commands Disallow regex amazon\.com Allow * doesn't play anything, since all documents are dissalowed by the command Disallow * above. If you need to disa

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Thats confusing, sorry Like this , it looks silly! Allow .html .txt .php .php* .htm */ .shtml .pl Disallow regex amazon\.com Allow * Disallow * - - - - - - - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Once again, the Allow * command just after Disallow regex amazon\.com command allows indexing of everything except amazon.com and makes any of Allow / Disallow command after it. It se

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I did that.. thanks a lot for your patience, One thing keeps happening, My indexer keeps freezing or something.. it starts, and then it stops after a few minutes... at different pages

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result How many indexing threads do you start at same time ? (what is the value for -N switch for indexer ?) - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here

[dataparksearch] [Forum] Re: segfault | Can

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Fox Subject: Re: segfault | Can Limit присутствует в шаблоне search.htm и searchd.conf файл error_log не появляется, смог вывести в syslog, следующую инфу, если это моможет: ### search.cgi started with '/home/indexer/dpsearch/etc/sear

[dataparksearch] [Forum] Re: segfault | Can

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: segfault | Can Выглядит, как будто нет данных в лимите по категориям. Выполнялась ли команда indexer -TW по окончании индексирования и searchd отправлялся сигнал -HUP на перезагрузку данных об URL и лимитов, если

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Hi, At the end of the day this is the message SQL-server message: MySQL driver: #1203: User biblers_search has already more than 'max_user_connections' active connections indexer[431

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result What value for max_user_connections do you have for the User biblers_search ? How many indexers running simultaneously do you have an ow many indexing threads each of them have ? By d

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I just increased max connections to 100 so it should be ok, I have 2 indexers running now, BUT>.. I wanted to use cache mode and changed all dbmode multi to cache Added this line to

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Have you stopped /usr/local/dpsearch/sbin/indexer ? - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simpleforum

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result ctrl Z before I did all the other work... - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=0

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-11 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Ctrl Z suspends the program. To stop it, use Ctrl C. - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simpleforu

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-12 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I had to make a decission anyways on multi or cache, and as multi works very well now its just the easier choice. Thank you for your patience and kind advise! - - - - - - - - - - - - -

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-12 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result Please note, dbmode cache works much faster with huge number of URLs indexed. - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksear

[dataparksearch] [Forum] show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: mike Subject: show total sites I would like to out a blib if info on the site total sites indexed total size of index can someone please advise me how to do this... - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the ful

[dataparksearch] [Forum] Re: show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: show total sites You may find the number of site indexed with this SQL-query to the search database; SELECT COUNT(*) FROM (SELECT distinct site_id FROM url) AS foo; Please note, this query works only for PgSQL and

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result Yeah I know, but I keep doing something wrong and cant get cache to work... weird! The multi database is useless if you want cache after right? it is one or the other I think... I wis

[dataparksearch] [Forum] Re: show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: show total sites Thanks a lot! Also.. i was wondering, is there anywhere that the search terms are kept? It would be a great statistic to keep track of! - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read t

[dataparksearch] [Forum] Re: show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: show total sites You need to enable search query tracking, see: http://www.dataparksearch.org/dpsearch-track.en.html - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.datap

[dataparksearch] [Forum] Cannot display search results

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: gagrilli Subject: Cannot display search results Hi, Trying to setup DPsearch for the first time, so this is probably some stupid mistake, but here it is.. Apache 2.2.9, MySQL 5.0.51b, Perl 5.10.0 , (just search.cgi no mod or searchd)

[dataparksearch] [Forum] Re: Cannot display search results

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Cannot display search results You can use both socket and dbmode parameters in DBAddr in that way: DBAddr mysql://?socket=...&dbmode=... - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full top

[dataparksearch] [Forum] Re: Cannot display search results

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: gagrilli Subject: Re: Cannot display search results Thanks for your quick reply, Maxime. I got the indexer working, it told me it had 805 documents indexed, but, again nothing(!) I think I am doing something wrong woth the Server comm

[dataparksearch] [Forum] Re: Cannot display search results

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: gagrilli Subject: Re: Cannot display search results I really don't understand what else can I add to my indexer.conf so that the basic functionality appears.. I tried changing the dbmode , I tried altering the Server directive, I tried

[dataparksearch] [Forum] Re: show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: show total sites Do I need to Edrop Ecreate again to change it over? - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.dataparksearch.org/cgi-bin/simpleforum.cgi?fid=02;topi

[dataparksearch] [Forum] Re: show total sites

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: show total sites No, you don't need it. You can add any URL to the database using the following SQL command: INSERT INTO url (url, next_index_time) VALUES ('http://server.ext/', 0); Attention: don't delete any URL i

[dataparksearch] [Forum] Re: Cannot display search results

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: Cannot display search results Have you created your sections.conf file and include it from your indexer.conf file ? - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full topic here: http://www.datap

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result If you would like ti try cache mode once again, add the following command to your search.htm template LogLevel 5 and show the output to the server error log when your perform a search

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result I am so sorry, but which error log? The servers error log shows no errors when I add LogLevel 5 to the search.htm - - - - - - - - - - - - - - - - - - - - - - - - - - - - Read the full

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Maxime Subject: Re: getting closer to my end result It's web-server error log for a web-server where search.cgi is calling. Or you can run search.cgi from command line: /usr/local/dpsearch/bin/search.cgi bible 2>err.log then show the

[dataparksearch] [Forum] Re: getting closer to my end result

2008-08-13 Пенетрантность DataparkSearchForum
- - - - - - - - - - - - - - - - - - - - - - - - - - - - Name: Mike Subject: Re: getting closer to my end result search.cgi[3292]: {00} search.cgi started with '/usr/local/dpsearch/etc/search.htm' search.cgi[3292]: {00} VarDir: '/usr/local/dpsearch/var' search.cgi[3292]: {00} Affixes: 0, Spells: 0

<    2   3   4   5   6   7   8   9   10   11   >