Webboard: Can't find search.cgi
Author: Alex Barkov Email: [EMAIL PROTECTED] Message: Hello, I'm installing on SUSE Linux. I ran the install script, it went fine. Did the make/make install. However when I looked for the search.cgi it was not to be found. Is this a separate download/install / am I missing something? Any help much appreciated. It is in /bin directory of mnogosearch installation. Usually /usr/local/mnogosearch/bin/ Reply: http://www.mnogosearch.org/board/message.php?id=4943 ___ If you want to unsubscribe send unsubscribe general to [EMAIL PROTECTED]
Webboard: Search results contains deleted pages...
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Hi, The search results on our web site contain links to pages that were deleted. If I click on one of those results, I obtain a 404. I tried to use DeleteBad=yes in the indexer.conf and reindexing the whole site (indexer -a), but these broken links still appear. What could I do to remove these links from the results?
You can delete them using indexer with these arguments:
    ./indexer -Cw -s 404
However, this looks like a bug. Could you try the following, please: let's say http://site/bad.html is a page which no longer exists on your server but can still be found using search. Run the following:
    ./indexer -amv6 -u http://site/bad.html
and post its output.
Reply: http://www.mnogosearch.org/board/message.php?id=4944
Webboard: index.php disappeared from url
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Indexer stores urls in DB without index.php. I mean that the real url is http://www.xxx.ru/index.php?act=todo&smth=1 but indexer cuts index.php off. And in DB we see http://www.xxx.ru/?act=todo&smth=1 What can I do with it?
This is either the result of incorrect links on your pages or the result of a bug. The idea is to find the page on which this wrong link first appeared. Use:
    SELECT referrer FROM url WHERE url='wrong url';
    SELECT url FROM url WHERE rec_id=RRR;
(where RRR is the result of the first query). Then take a look at the source of this page and check whether all links on it are correct.
Reply: http://www.mnogosearch.org/board/message.php?id=4946
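The two queries in the reply can be chained into a single lookup. The following is only a sketch: it uses Python's sqlite3 with an in-memory stand-in for the url table, containing just the rec_id, url and referrer columns named in the message (the real table has more columns and lives in your own database). The page http://www.xxx.ru/links.html is a made-up example row, not something from the original report.

```python
import sqlite3


def find_referring_page(conn, wrong_url):
    """Return the URL of the page on which wrong_url was first found, or None."""
    # Step 1: the referrer column holds the rec_id of the page that linked here.
    row = conn.execute(
        "SELECT referrer FROM url WHERE url = ?", (wrong_url,)
    ).fetchone()
    if row is None:
        return None
    # Step 2: translate that rec_id back into a URL.
    ref = conn.execute(
        "SELECT url FROM url WHERE rec_id = ?", (row[0],)
    ).fetchone()
    return ref[0] if ref else None


# Stand-in table with hypothetical rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE url (rec_id INTEGER PRIMARY KEY, url TEXT, referrer INTEGER)")
conn.executemany(
    "INSERT INTO url VALUES (?, ?, ?)",
    [
        (1, "http://www.xxx.ru/links.html", 0),
        (2, "http://www.xxx.ru/?act=todo&smth=1", 1),
    ],
)

print(find_referring_page(conn, "http://www.xxx.ru/?act=todo&smth=1"))
```

The page returned by the lookup is the one whose HTML source should be checked for the malformed link.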
Webboard: Indexing hostnames and domainnames
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Yes but I want to stop it splitting the string "your-site.com" up into different words. I'd like to configure the indexer to treat it as a literal string. Is there a way to do that?
Unfortunately no.
Reply: http://www.mnogosearch.org/board/message.php?id=4947
Re: Indexer caught in a loop?
I had Alias http://www.myproductionsite.com file:/Library/Webserver/Documents (Mac OS X setup filesystem) Then Server http://www.myproductionsite.com/; Actually, I noticed it tended to get hung up inside one specific folder. I moved that one out and now it seems to get farther before getting an Alias error. I can't think of any sort of thing that could cause this... maybe there is a symlink that creates a circular reference in the folder? The folder I moved out is only one of 20 or so on the site and it has quite a few folders inside it and a couple hundred HTML pages, so it isn't easy to debug.
Now it simply stops indexing with Error in Aliased URL: xxx but I haven't really looked into this new problem yet... it is probably something with that file. I had a few really large HTML documents (like a 1.5MB converted Excel spreadsheet) and maxsize was being ignored and the indexer would seg fault on these...
Webboard: Server Table
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> mnoGoSearch v3.1.19 I am just looking at the structure of the server table and cannot find any documentation on some of the fields, could someone explain the alnum_factor field?
Take a look at indexer.conf-dist for the AlnumFactor explanation.
Reply: http://www.mnogosearch.org/board/message.php?id=4948
Webboard: site and path
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> If I enter the following site into my server table: http://members.tripod.com/mysportster/ how can I stop the indexer following links to http://members.tripod.com/
Write this in your indexer.conf:
    Server path http://members.tripod.com/
> and any other directories on this domain without physically excluding each directory, should I use 'site' or 'path'?
The same as above.
Reply: http://www.mnogosearch.org/board/message.php?id=4950
Webboard: site and path
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Mine is set up like your taking and I use path. no clue about signal 2. I am still stuck with my front end
Sorry, I didn't understand you.
Reply: http://www.mnogosearch.org/board/message.php?id=4951
Webboard: How to exclude URLs?
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Hi there. On our website, we have a few sections of content we would like people to be able to search, including:
> * Mailing Lists /mailinglists/archives/
> * Presentation /presentations/
> * Meeting Minutes /meeting/past/
> * All of the above /
> * Everything EXCEPT Mailing Lists
> Right now, the first four are trivial. I simply provide a pulldown menu for the "ul" form variable which looks something like:
> <select name="ul">
> <option value="/">Entire Site
> <option value="/mailinglists/archives/">Mailing Lists
> <option value="/presentations/">Presentations
> <option value="/meeting/past/">Meeting Minutes
> </select>
> It's the last one which is giving me difficulty. I tried a number of things so far, including: Tags - Couldn't seem to get anything tag-related to work! I thought Tags or Categories would be the solution... Doing something like this in the "indexer.conf" file:
> Server http://lugod.org/
> Tag 1
> Server http://www.lugod.org/mailinglists/archives/
> etc. Or SOMETHING. :(
Try something like this in your indexer.conf, then use the tag limit in search.htm:
    Tag 1
    Server http://www.lugod.org/mailinglists/archives/
    Tag 2
    Server http://www.lugod.org/presentations/
    Tag 3
    Server http://www.lugod.org/meeting/past/
    # All of the above /
    # Everything EXCEPT
    # Mailing Lists
    Tag 4
    Server http://www.lugod.org/
Reply: http://www.mnogosearch.org/board/message.php?id=4952
Webboard: search full URL
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> When I try to search for a full URL like (http://something.com) it always gives me no result, like this: Search results: http : 275, www : 0, something : 19, com : 0 Is there something I should correct? Is it in my indexer.conf or in the parameters passed to my search.cgi?
First of all, you have to enable URL indexing using either URLWeight in 3.1.19 or Section url.* in 3.2.x. Another thing is that you probably don't use the same set of stopwords in both indexer.conf and search.htm. Probably indexer ignores www and com as stopwords, but search doesn't know that these two words are stopwords.
Reply: http://www.mnogosearch.org/board/message.php?id=4953
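The stopword mismatch described in the reply can be pictured with a small sketch. Everything here is an assumption made for illustration: the word splitter is a simple regular expression, not the real mnoGoSearch one, and the stopword set is just the two words the reply guesses at. It only shows why a front end that does not know the indexer's stopwords ends up looking for words that were never stored.

```python
import re

# Assumption: the indexer's stopword list contains these two words, as the
# reply guesses; the real list depends on your configuration.
INDEXER_STOPWORDS = {"www", "com"}


def split_words(text):
    """Split text into lowercase alphanumeric words (a rough stand-in splitter)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def indexed_words(text, stopwords=INDEXER_STOPWORDS):
    """Words that survive once the indexer drops its stopwords."""
    return [w for w in split_words(text) if w not in stopwords]


query = "http://www.something.com"
print(split_words(query))    # words the search front end looks up
print(indexed_words(query))  # words the indexer would actually have stored
```

Under these assumptions the front end searches for four words, while only two of them can ever be found, which matches the zero counts reported for www and com.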
Webboard: How to ignore some HTML tags in body?
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Whenever I run indexer it works fine, but when I search for anything I get results like this:
> /">HOME > /articles">ARTICLES > /articles/southweststates/index.html">SOUTHWESTSTATES > /articles/southweststates/Santa_Fe/index.html">SANTA_FE Carlsbad, New Mexico: A Year-Round Treat Santa Fe, New Mexico showPics(0,100,5) ?> CARLSBAD, NEW MEXICO A...
> Obviously I don't want the php or those html tags in there, how can I only display the body of the document?
If it is your site, there are two ways.
1. Check HTTP_USER_AGENT in your pages and don't display menus when the robot is downloading the page.
2. Add <!--UdmComment--> ... <!--/UdmComment--> around menus. indexer will skip the text between these tags.
Reply: http://www.mnogosearch.org/board/message.php?id=4954
Webboard: Partial Keyword searching (Fuzzy)
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
The simplest way is to check what kind of SQL queries are sent to the server during search. Can you extract them from the query log?
> Hi Alexander, I followed the instructions of setting DBMode to multi and the substring setting. But the partial searching still does not work. Can you please help?
> indexer.conf settings:
> DBHost localhost
> DBName udmsearch
> DBUser foo
> DBPass bar
> DBMode multi
> ..
> search.htm settings:
> # Choose storage mode (see explanation in indexer.conf-dist):
> DBMode multi
> Match: <SELECT NAME="m">
> <OPTION VALUE="all" SELECTED="$m">All
> <OPTION VALUE="any" SELECTED="$m">Any
> <OPTION VALUE="substring" SELECTED="$m">Substring
> </SELECT>
> ..
> Thanks. Perry Luo
Reply: http://www.mnogosearch.org/board/message.php?id=4955
Webboard: Problems indexing word doc files
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Still no luck unfortunately, after adding in the type change as suggested. I've also come to run the package today to get all the new files that will be in there now, and all that happened was:
> Indexer[19177]: indexer from mnogosearch-3.1.19/MySQL started with '../etc/indexer.conf'
> Indexer[19177]: [1] http://myserver.mydomain.co.uk/cv/cvindex
> Indexer[19177]: [1] No Content-type in 'http://myserver.mydomain.co.uk/cv/cvindex'!
> Indexer[19177]: [1] Done (8 seconds)
> And no indexing of other documents? Anyone got any suggestions, could it be because I'm trying to index files that are physically located in a directory, rather than fetch them via a web url??
Does this page work fine when you open it in a browser? Do you have any *.doc files in your database? Check it using SELECT url FROM url. Also, looking at your previous letter, I noticed a mistake:
    CheckOnly *.doc
You should use
    Allow *.doc
(not CheckOnly *.doc) if you want the parser to be executed and then its output to be indexed.
Reply: http://www.mnogosearch.org/board/message.php?id=4956
Webboard: Search results cache
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> The search results cache has stopped working on my site, no files are created in the cache directory. I am using v3.1.19. Originally I had set "Cache yes" in search.htm and it seemed to work fine. I then had to move the var directory to a partition with more disk space and it seems to have stopped working around that time. Everything else seems to be normal. Any ideas why this has stopped?
Just specify the new location of your var directory using the VarDir command.
Reply: http://www.mnogosearch.org/board/message.php?id=4957
Webboard: URL table structure / Performance
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> I have just created the index and I noticed an immediate increase in the speed of the indexer. Will this have any adverse side effects?
It should not.
> If not, why is this not in the standard table structure?
We've added this index to the 3.2.x table structure. Thanks for the suggestion!
Reply: http://www.mnogosearch.org/board/message.php?id=4958
Re: Various errors and problems
Mark Roebuck wrote:
> Hi, I have posted these questions to the boards, but with no results. I am using v3.1.19 with mysql and cache mode
> 1, the ~ character does not exclude words from the search results.
You have to choose the boolean search type in the select box which has all/any/bool values. Take a look at the default template.
> 2, cached results are not being saved in the cache directory, cache is set to yes.
Search results cache files can be found in the /var/cache/ subdirectory of the mnogosearch installation, i.e. /usr/local/mnogosearch/var/cache/ by default. Do you have any files in this directory?
> 3, is it safe to create an index on the url table field next_index_time
Yes.
> 4, how can I stop indexer indexing certain parts of a site, for instance if I have http://www.domain.com/ in the server table with follow set to path or site how can I stop it indexing http://www.domain.com/directory/ but still index all the other directories.
Use either a robots.txt file or this command in indexer.conf:
    Disallow http://www.domain.com/directory/*
Webboard: conditional output from search.cgi
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Hi, I'm hoping to get some advice about how to tackle a specific situation. I am indexing a network of 5 domains. One of these domains requires a subscription. When you search on any of the 5 sites you get results for documents on all 5 domains. Tags are used to allow people to only search a single domain if they want. What I want to do is provide html like "subscription required -- click here to learn more" in the search results when the page found is on the subscription site.
> My initial solution to this problem was to use the php frontend and create a custom template variable that would provide me with this functionality. But I am changing servers (from redhat to freeBSD) and using the default freebsd ports has me increasing my php version to 4.2.1 and decreasing mnogosearch to 3.1.19-2. The php frontend doesn't work with this combination so I have been looking to switch to using search.cgi.
If the problem is only in software versions, you can install both php and mnogosearch from sources, not from ports.
> So far the best solution I have come up with to provide this functionality on search.cgi is to use javascript to do a check on $DU for the subscription url and write out the required html when I get a match. Can anyone think of a better option? I have complete control over the subscription site and can dynamically produce any meta tags I want...
You can use external includes in search.htm, please take a look at the documentation.
Reply: http://www.mnogosearch.org/board/message.php?id=4959
Webboard: search a blob
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Can mnogosearch search a blob datatype in the database? I tried running indexer through a table which has a blob datatype and it doesn't go through it. Just want to make it clear if it's possible or not. Thanks!
If you mean indexing blobs using HTDB, yes, it is possible in MySQL. Not sure about other databases, as most of them have a special API to access blobs.
Reply: http://www.mnogosearch.org/board/message.php?id=4960
Webboard: Disallow Indexing of particular files ?
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Hi all, what I want is just the opposite of this. I am running a site for a news agency. I want to index only the story pages and not the others (i.e. story.php?someid=123). I tried all the changes in my indexer.conf but all in vain. Thanks in advance if anyone can help me.
The simplest solution might look like this:
    Server http://server/
    Allow *story.php?someid=*
    HrefOnly *
Reply: http://www.mnogosearch.org/board/message.php?id=4961
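To see why the order of the two filter lines matters, here is a small sketch of ordered pattern matching. It rests on an assumption that should be checked against indexer.conf-dist for your version: that the filters are tried from top to bottom and the first matching one decides. Python's fnmatch is used as a stand-in for the indexer's own wildcard matching, and the classify function and RULES list are names invented for this illustration.

```python
from fnmatch import fnmatchcase

# The two filter lines from the reply, in the order they appear.
RULES = [
    ("Allow", "*story.php?someid=*"),
    ("HrefOnly", "*"),
]


def classify(url, rules=RULES):
    """Return the action of the first rule whose pattern matches url."""
    for action, pattern in rules:
        if fnmatchcase(url, pattern):
            return action
    return None  # no rule matched


print(classify("http://server/story.php?someid=123"))
print(classify("http://server/index.php"))
```

With the catch-all HrefOnly line last, story pages hit the Allow rule first, while every other page is only scanned for links. Swapping the two lines would send every URL to HrefOnly under the same first-match assumption.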
Webboard: 3.2.5 searchd problem
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> When using the searchd daemon with 3.2.5 I get the following error message
> #1044: Access denied for user: '@localhost' to database 'mnogosearch'
> The entry in search.htm is
> SearchdAddr localhost
> The searchd daemon is started on the machine. Starting search.cgi from the command line results in this error:
> An error occured! #1049: Unknown database 'mnogosearch'
> The database definition in searchd.conf is:
> DBAddr mysql://xxx:xxx@localhost/udm5/?dbmode=cache
> Limit t:tag:lim_tag
> Limit c:cat:lim_cat
It means searchd.conf has an incorrect user or password in the DBAddr string.
Reply: http://www.mnogosearch.org/board/message.php?id=4962
Re: Can not find server?
alexson wrote:
> Hi, why does the web browser show "Can not find server" after I click the search button?
Open search.cgi in your browser, then take a look at View page source. Check what is written in the FORM ACTION=??? tag.
Webboard: How to ignore a specific directory?
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> I also have this problem, some sites will have hundreds of thousands of pages within forums. This makes search results a complete mess. If you work it out could you let me know.
There is no automatic detection which can disallow indexing of various forums. But you can use the Disallow command for a particular site.
Reply: http://www.mnogosearch.org/board/message.php?id=4964
Re: Webboard: How to ignore a specific directory?
Penny Rand wrote:
> I have a similar problem. My web directory has more than my main site in it but I only want to index the main site. I can do this with:
> #Server [subsection] URL [alias]
> Server http://www.asix.com/ file:/web/html/
> But the URL on the results page is incorrect as it does not include the html directory in the path. For example the directory structure is:
> /web/html/asix/index.html
> But the results URL shows:
> http://www.asix.com/asix/index.html
If I understood you correctly, try something like this:
    Server http://www.asix.com/ file:/web/html/asix/
Webboard: path indexing
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
Run for example
    indexer -am -u http://www.fortunecity.com/
indexer will explain why it accepts this page.
> I have the following urls in the server table:
> http://members.fortunecity.com/freddie10/FEARLESS/indexx.html
> http://silverstone.fortunecity.com/austin/679/
> http://www.fortunecity.com/marina/dockers/270/
> http://www.fortunecity.com/olympia/mays/139/
> http://www.fortunecity.com/silverstone/tyre/167/smcc/index.htm
> http://www.fortunecity.com/skyscraper/quantum/1504/
> They are all set to index path, but I find the following urls have been indexed:
> http://www.fortunecity.com/
> http://www.fortunecity.com/corporate/about/q3current.shtml
> and others. I thought that using 'path' meant that only pages below the given url would be indexed. I am using v3.1.19 in cache mode. Could anyone explain what is happening? Thanks
Reply: http://www.mnogosearch.org/board/message.php?id=4965
Re: Allow/disallow (no)match ip ???
Hi! We already implemented this feature in 3.2.0, but haven't put it into the documentation yet. Use a command in this style:
    Realm subnet 129.27.0.*
Mario Lang wrote:
> Hello. I have a quite complicated allow/disallow statement structure in my indexer.conf. I recently had the idea that the whole configuration would be a lot simpler if I could limit the allowed hosts according to some subnets. We have a class B subnet here, so if I could do something like
> Allow ip 129.27.0.0/16
> that would simplify a lot of things. Is there any plan to implement this in 3.2.x, or could you advise how I should go about writing a patch for that? It shouldn't be too hard, should it?
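The question uses CIDR notation (129.27.0.0/16) while the reply uses a wildcard (129.27.0.*), and the two do not necessarily cover the same range. The sketch below compares them using Python's ipaddress module and plain glob matching from fnmatch. How indexer itself interprets the subnet pattern is not stated in this thread, so the glob semantics here are an assumption, and the helper names are invented for the illustration.

```python
import ipaddress
from fnmatch import fnmatchcase


def in_cidr(ip, cidr):
    """True if ip lies inside the CIDR network, e.g. 129.27.0.0/16."""
    return ipaddress.ip_address(ip) in ipaddress.ip_network(cidr)


def matches_wildcard(ip, pattern):
    """True if ip matches a glob-style pattern (assumed semantics, not confirmed)."""
    return fnmatchcase(ip, pattern)


for ip in ("129.27.0.17", "129.27.200.1"):
    print(
        ip,
        in_cidr(ip, "129.27.0.0/16"),
        matches_wildcard(ip, "129.27.0.*"),
        matches_wildcard(ip, "129.27.*"),
    )
```

If the pattern really is a plain wildcard, 129.27.0.* would cover only the first 256 addresses of the class B network, and something like 129.27.* would be needed for the whole /16. It is worth verifying this against a test run before relying on it.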
Re: How to set the mnogosearch to support Big5
Hi!
alexson wrote:
> Hi, how do I set mnogosearch to support big5, and how many parameters will be involved? Help me, please.
Just use this command in both indexer.conf and search.htm:
    LocalCharset big5
As well as this command in search.htm:
    BrowserCharset big5
Note that big5 works only in the 3.2.x branch of mnogosearch.
Re: Webboard: string search
To Ramil: any comments? Does phrase search work?
Chiara Biancheri wrote:
> I mean phrase search. I saw it is in the todo list. Do you have any idea about the time it would be implemented? (on a large scale: 1 or 6 months, 1 year...) thanks a lot chiara
> On Tue, 2002-06-11 at 13:20, Aleksey Trubin wrote:
> > [EMAIL PROTECTED] wrote:
> > > Author: chiara Email: Message: How do you search for a complete string? If it is not implemented, is it in the todo list? thanks chiara Reply: http://www.mnogosearch.org/board/message.php?id=4941
Re: indexer -Cw takes too much time
What does mysqladmin processlist display? Are there any active queries being executed by mnogosearch?
Kreso wrote:
> RedHat 7.2, 2x1GHz, 512MB, hardware RAID, dedicated search machine, no other tasks running. MySQL 3.23.49, default configuration. Maybe some mysql tuning would help. Unfortunately, indexer has terminated after 3 hours, without doing its job. It seems like a bug to me. Are there any known limitations on database size, i.e. on the number of URLs that can be indexed? thanks again, Kresimir
> On Tue, Jun 11, 2002 at 06:21:22PM +0500, Aleksey Trubin wrote:
> > Kreso wrote:
> > > Hallo, I have a problem with indexer -Cw -t xxx. It takes too much time. With a database of 1.5 million entries it has been working for longer than 2 hours already. mysqld doesn't take much CPU. Any ideas? mnogosearch v3.1.19 is the version in question. I have created an index on tags before. thanks, Kresimir
> > So... It can depend on many things. What is your OS, what tasks are executing on the server, and what is the mysql version?
Re: Indexer caught in a loop?
Hi! Please take a look here: http://www.mnogosearch.org/board/message.php?id=4946 They have the same problem: urls are stored incorrectly in the database. Can you please find the source of the error using the same recommendation as in the message above? Thanks.
Cal Zemelman wrote:
> I had Alias http://www.myproductionsite.com file:/Library/Webserver/Documents (Mac OS X setup filesystem) Then Server http://www.myproductionsite.com/; Actually, I noticed it tended to get hung up inside one specific folder. I moved that one out and now it seems to get farther before getting an Alias error. I can't think of any sort of thing that could cause this... maybe there is a symlink that creates a circular reference in the folder? The folder I moved out is only one of 20 or so on the site and it has quite a few folders inside it and a couple hundred HTML pages, so it isn't easy to debug.
> Now it simply stops indexing with Error in Aliased URL: xxx but I haven't really looked into this new problem yet... it is probably something with that file. I had a few really large HTML documents (like a 1.5MB converted Excel spreadsheet) and maxsize was being ignored and the indexer would seg fault on these...
Webboard: path indexing
Author: Alex Barkov Email: [EMAIL PROTECTED] Message:
> Run for example indexer -am -u http://www.fortunecity.com/ indexer will explain why it accepts this page.
Sorry, the proper command is:
    indexer -amv6 -u http://www.fortunecity.com/
Reply: http://www.mnogosearch.org/board/message.php?id=4967
Webboard: I'm a beginner and need help
Author: Cori Email: [EMAIL PROTECTED] Message:
I wrote to your support email this morning and still haven't received a response so I decided to try this. I am just beginning to use your software and am very confused about how to use it. I have downloaded many websites and saved them on a server. I would like to search through all of the sites and documents for different strings and have the search return the list of documents where the strings were found. The sites are not stored in a database, but from what I'm reading in your help section and on the Webboard I think I need to store them in a database. I don't know. As you can see I'm very confused about how to use your software. If possible please explain how to use it, beginning with the Wizard that pops up and what information I need to enter to configure your software to do what I want it to do. Thanks!
Reply: http://www.mnogosearch.org/board/message.php?id=4969
Webboard: Help with PHP front end MacOS X
Author: Clive Email: [EMAIL PROTECTED] Message:
Please excuse this ignorant Mac user. This is where I'm up to:
* I have MySQL running ok
* The mnoGoSearch indexer appears to work
* I can see data in the mnoGoSearch MySQL database via the phpMyAdmin interface
* I've set the required variables in the search.htm and search.php files
* I've built and installed PHP 4.2.1, which seems to be working and reports mnoGoSearch Support enabled and mnoGoSearch library version 30119 (via phpMyAdmin)
However, I can't get any results using the supplied PHP front end (3.1.3.5). The $Q variable doesn't seem to be set, and no warning/error messages show in the delivered search.php page, i.e.:
<!--notfound-->
Search Time: $SearchTime<br>
Search results: <small>$W</small><HR>
<CENTER>
Sorry, but search returned no results.<P>
<I>Try to produce less restrictive search query.</I>
</CENTER>
<!--/notfound-->
...
Can anyone give me some ideas how to track this problem down and fix it? I'm running mnoGoSearch 3.1.19 on MacOS X 10.1.4 with WebStar 5.1.2. TIA -- Clive
Reply: http://www.mnogosearch.org/board/message.php?id=4970
Problems with 3.2.5 Searching
I'm having some problems with searching in version 3.2.5, and am unsure if they're known bugs, or bits of the config I've messed up. I'm also after some clarification on some things not clearly documented.
Although I have toyed with mnoGoSearch in the past, it was missing a few features important to us. I've recently installed 3.2.5, which supposedly has some of these features now supported, but I'm running into some problems.
Firstly, indexing an ordinary site, and using the supplied search.cgi with a minimally modified search.htm, I cannot get the ul search filter/restriction to work. I've followed the docs, but none of my searches are sub-searches with this pattern match (they all search the entire content each time). What's more, after submitting it, the SELECTED option is not retained, although this seems to work for other select-style fields.
Secondly, I've had a stab at configuring it to support searching custom META tags, with mixed results. There seem to be little to no docs on how to go about this, but what I've tried is to add some Sections for each tag, preceding the tag name with "meta.", e.g.:
    Section meta.modified 11 128
    Section meta.modified.position 12 128
    Section meta.modified.email 13 128
This seems to've worked to a degree; searches will now search these fields. However, these seem to be global, and I can't figure out how to enable or disable them, or how to change the priority weights for them. The docs talk about using Section in conjunction with wf, but it's rather confusing.
Hmmm, actually, the main thing that threw me was that the example Section declarations are all on one line in the docs, but viewing the HTML source, they're on separate lines. And if that's the case, then I always need to set a wf field, with one character per Section? What if I have 50 Sections? (It would be nice to be able to assign multiple META tags to the same section, where they are related info, or synonyms for the same field...)
I think I'll have to experiment with this a bit more...
--
Trevor Phillips - http://jurai.murdoch.edu.au/
Web Technical Administrator - [EMAIL PROTECTED]
IT Services - Murdoch University
On nights such as this, evil deeds are done. And good deeds, of course. But mostly evil, on the whole.
    -- (Terry Pratchett, Wyrd Sisters)