Re: Getting started with writing parser

2011-01-25 Thread Gora Mohanty
On Tue, Jan 25, 2011 at 3:46 PM, Dinesh mdineshkuma...@karunya.edu.in wrote: no i actually changed the directory to mine where i stored the log files.. it is /home/exam/apa..solr/example/exampledocs i specified it in a solr schema.. i created an DataImportHandler for that in try.xml.. then

Re: Getting started with writing parser

2011-01-24 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 2:28 PM, Dinesh mdineshkuma...@karunya.edu.in wrote: my solrconfig.xml http://pastebin.com/XDg0L4di my schema.xml http://pastebin.com/3Vqvr3C0 my try.xml http://pastebin.com/YWsB37ZW [...] OK, thanks for the above. You also need to: * Give us a sample of your

Re: Indexing FTP Documents through SOLR??

2011-01-23 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 8:15 AM, Adam Estrada estrada.a...@gmail.com wrote: +1 on Nutch! [...] Would it be possible for Markus, and you to clarify on what the advantages of Nutch are in crawling a well-defined filesystem hierarchy? A simple shell script that POSTs to Solr works fine for this, so

Re: Getting started with writing parser

2011-01-23 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 10:47 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: i tried editing the schema file and indexing my own log..  the error that i got is root@karunya-desktop:/home/karunya/apache-solr-1.4.1/example/exampledocs# java -jar post.jar sample.txt SimplePostTool: version

Re: Indexing FTP Documents through SOLR??

2011-01-23 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 11:07 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: I'd be happy to comment: A simple shell script doesn't provide URL filtering and control of how you crawl those documents on the local file system. Nutch has several levels of URL filtering based

Re: Getting started with writing parser

2011-01-23 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 11:18 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: i tried those examples.. is it compulsory that i should make it into XML, how does it index CSV.. You will have to convert either into XML, or CSV, but neither of those should be too difficult.
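As an illustration of the XML conversion mentioned above, here is a minimal sketch in Python. The log layout (date, time, level, message separated by single spaces) and the field names id, timestamp, level and message are assumptions made purely for this example; the actual log format is not shown in the thread, and the field names would have to match whatever is declared in schema.xml. Only the <add><doc><field name="..."> wrapper is Solr's standard XML update format.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical log line layout: "<date> <time> <LEVEL> <message>".
LOG_LINE = re.compile(r"^(\S+) (\S+) (\w+) (.*)$")


def log_to_solr_xml(lines):
    """Turn raw log lines into a Solr <add> XML document string."""
    docs = []
    for number, line in enumerate(lines, start=1):
        match = LOG_LINE.match(line.strip())
        if match is None:
            continue  # skip lines that do not fit the assumed layout
        date, time, level, message = match.groups()
        # Field names are assumptions; they must exist in schema.xml.
        fields = {
            "id": str(number),
            "timestamp": f"{date}T{time}Z",
            "level": level,
            "message": message,
        }
        body = "".join(
            f'<field name="{name}">{escape(value)}</field>'
            for name, value in fields.items()
        )
        docs.append(f"<doc>{body}</doc>")
    return "<add>" + "".join(docs) + "</add>"


if __name__ == "__main__":
    sample = ["2011-01-24 10:47:03 ERROR disk <sda1> is full"]
    print(log_to_solr_xml(sample))
```

The resulting string could be saved to a file and sent with post.jar in the same way as the bundled example documents.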

Re: Getting started with writing parser

2011-01-23 Thread Gora Mohanty
On Mon, Jan 24, 2011 at 11:54 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: i did all the configurations correctly.. previously i missed a configuration file Sorry, what are you trying to configure now? The built-in Solr example, or the setup for your log files? Did you get the built-in Solr

Re: Indexing FTP Documents through SOLR??

2011-01-21 Thread Gora Mohanty
On Fri, Jan 21, 2011 at 1:31 PM, pankaj bhatt panbh...@gmail.com wrote: Hi Gora, Thanks for the answer. I want to index all the PDF, HTML documents lying within a tree hierarchy at FTP Server. In addition, can i add an attribute location whose value is the FTP FILE LOCATION.

Re: Indexing FTP Documents through SOLR??

2011-01-21 Thread Gora Mohanty
On Fri, Jan 21, 2011 at 1:47 PM, pankaj bhatt panbh...@gmail.com wrote: Hi Gora, Thanks, however i think it would be a cumbersome process, to do all this manually. Isn't there any plugin or extractor that does this automatically? Has anyone in the group done this previously?

Re: Indexing same data in multiple fields with different filters

2011-01-20 Thread Gora Mohanty
On Thu, Jan 20, 2011 at 4:08 PM, shm s...@dbc.dk wrote: Hi, I have a little problem regarding indexing, that i don't know how to solve, i need to index the same data in different ways into the same field. The problem is a normalization problem, and here is an example: I have a special

Re: Indexing FTP Documents through SOLR??

2011-01-20 Thread Gora Mohanty
On Fri, Jan 21, 2011 at 12:21 PM, pankaj bhatt panbh...@gmail.com wrote: Hi All,  Is there is any way in SOLR or any plug-in through which the folders and documents in FTP location can be indexed. [...] What format are these documents in? Which parts of the documents do you want to index? In

Re: lazy loading error?

2011-01-19 Thread Gora Mohanty
On Wed, Jan 19, 2011 at 7:35 PM, Jörg Agatz joerg.ag...@googlemail.com wrote: ok, but i cant find the folders in the Tomcat folder /varlib/tomcat6/solr/ no existing contrib folder or lib folder? The contrib/extraction/lib folder should be under the top-level directory of your Solr source

Re: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?

2011-01-18 Thread Gora Mohanty
On Wed, Jan 19, 2011 at 2:34 AM, Grant Ingersoll gsing...@apache.org wrote: [...] Where do you get your Lucene/Solr downloads from? [...] [X] ASF Mirrors (linked in our release announcements or via the Lucene website) [X] I/we build them from source via an SVN/Git checkout. Regards, Gora

Re: explicit field type descriptions

2011-01-17 Thread Gora Mohanty
On Tue, Jan 18, 2011 at 11:55 AM, Dennis Gearon gear...@sbcglobal.net wrote: Is there any tabular data anywhere on ALL field types and ALL options? There is this: http://search.lucidimagination.com/search/document/CDRG_ch04_4.4.2 Not sure if it meets your needs. For example, I've looked

Re: Getting started with writing parser

2011-01-17 Thread Gora Mohanty
On Tue, Jan 18, 2011 at 11:59 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: how to write a parser program that will convert log files into XML.. [...] There is no point to starting multiple threads on this issue, hoping that someone will somehow solve your problem. You have been given the

Re: DataimportHandler development issue

2011-01-14 Thread Gora Mohanty
On Fri, Jan 14, 2011 at 12:17 AM, Derek Werthmuller dwert...@ctg.albany.edu wrote: Its not clear why its not working.  Advice? Also is this the best way to load data?  We intent on loading several thousand docbook documents once we understand how this all works.  We stuck with the rss/atom

Re: Improving Solr performance

2011-01-14 Thread Gora Mohanty
On Fri, Jan 14, 2011 at 1:56 PM, supersoft elarab...@gmail.com wrote: The tests are performed with a selfmade program. [...] May I ask what language the program is written in? The reason to ask that is to eliminate the possibility that there is an issue with the threading model, e.g., if you

Re: Improving Solr performance

2011-01-13 Thread Gora Mohanty
On Thu, Jan 13, 2011 at 10:10 PM, supersoft elarab...@gmail.com wrote: On the one hand, I found those comments about the reasons for sharding really interesting. Documentation agrees with you about why to split an index into several shards (big size problems) but I don't find any explanation about

Re: Variable datasources

2011-01-13 Thread Gora Mohanty
On Fri, Jan 14, 2011 at 1:02 AM, tjpoe tanner.post...@gmail.com wrote: [...] I also tried creating datasources for each local and then using a variable datasource in the entity such as: <datasource url="jdbc:mysql://localhost/aaa" name="content_aaa" /> <datasource url="jdbc:mysql://localhost/bbb"

Re: Adding a new site to existing solr configuration

2011-01-13 Thread Gora Mohanty
On Thu, Jan 13, 2011 at 10:47 PM, PeterKerk vettepa...@hotmail.com wrote: I still have the default Solr example config running on Jetty. I use Cygwin to start my current site. Now I already have fully configured one solr instance with these files:

Re: Input raw log file

2011-01-12 Thread Gora Mohanty
On Wed, Jan 12, 2011 at 11:50 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: I have installed and tested the sample xml file and tried indexing.. everything went successful and when i tried with log files i got an error.. Please provide details of what you are doing, and of the error

Re: Input raw log file

2011-01-12 Thread Gora Mohanty
On Wed, Jan 12, 2011 at 12:10 PM, Dinesh mdineshkuma...@karunya.edu.in wrote: if i convert it to CSV or XML then it will be time consuming cause the indexing and getting data out of it should be real time.. is there any way i can do other than this.. if not what are the ways i can convert them

Re: Regex DataImportHandler

2011-01-12 Thread Gora Mohanty
On Wed, Jan 12, 2011 at 3:07 PM, Dinesh mdineshkuma...@karunya.edu.in wrote: Can anyone explain me how to create regex DataImportHandler.. [...] Dear Dinesh, No offence, but please do some basic leg work on your own first, and then ask more specific questions. Did you read the Hathi trust

Re: Resolve a DataImportHandler datasource based on previous entity

2011-01-12 Thread Gora Mohanty
is created with the same table structure. OK, I understand. Gora Mohanty-3 wrote: I meant a script that runs the query that defines the datasources for all fields, writes a Solr DIH configuration file, and then initiates a dataimport. Ok, so the query would select only the articles for which

Re: Specifying returned fields

2011-01-12 Thread Gora Mohanty
On Thu, Jan 13, 2011 at 1:11 AM, Dmitriy Shvadskiy dshvads...@gmail.com wrote: Hello, I know you can explicitly specify list of fields returned via fl=field1,field2,field3 Is there a way to specify return all fields but field1 and field2? Not that I know of, but below is an earlier

Re: Question on deleting all rows for an index

2011-01-12 Thread Gora Mohanty
On Thu, Jan 13, 2011 at 6:08 AM, Wilson, Robert rwil...@constantcontact.com wrote: We are just starting with Solr and have a multi core implementation and need to delete all the rows in the index to clean things up. When running an update via a url we are using something like the following

Re: spell suggest response

2011-01-11 Thread Gora Mohanty
On Tue, Jan 11, 2011 at 3:07 PM, satya swaroop satya.yada...@gmail.com wrote: Hi All, can we get just suggestions only without the files response?? Here I state an example when i query http://localhost:8080/solr/spellcheckCompRH?q=java daka

Re: Input raw log file

2011-01-11 Thread Gora Mohanty
On Tue, Jan 11, 2011 at 10:06 AM, Dinesh mdineshkuma...@karunya.edu.in wrote: can u give an example.. like something that is currently being used.. Sorry, I do not have anything like this at hand at the moment.

Re: Resolve a DataImportHandler datasource based on previous entity

2011-01-11 Thread Gora Mohanty
On Tue, Jan 11, 2011 at 11:10 PM, alexei achugu...@gmail.com wrote: Hi, I am in a situation where the data needed for one of the fields in my document may be sitting in a different datasource each time. [...] At what point of time will you be aware of which datasource the field is coming

Re: Resolve a DataImportHandler datasource based on previous entity

2011-01-11 Thread Gora Mohanty
On Wed, Jan 12, 2011 at 1:40 AM, alexei achugu...@gmail.com wrote: [...] The datasource number is stored in the database. The parent entity queries for this number and in theory it should become available to the child entity - Article in my case. I do not think that it is possible to have the

Re: Tuning StatsComponent

2011-01-10 Thread Gora Mohanty
On Mon, Jan 10, 2011 at 2:28 PM, stockii st...@shopgate.com wrote: Hello. i`m using the StatsComponent to get the sum of amounts. but solr statscomponent is very slow on a huge index of 30 Million documents. how can i tune the statscomponent ? Not sure about this problem. the problem is,

Re: Solr indexing socket timeout errors

2011-01-09 Thread Gora Mohanty
On Sat, Jan 8, 2011 at 3:44 AM, Burton-West, Tom tburt...@umich.edu wrote: Hello all, We are getting intermittent socket timeout errors (see below).  Out of about 600,000 indexing requests, 30 returned these socket timeout errors.  We haven't been able to correlate these with large merges,

Re: Input raw log file

2011-01-08 Thread Gora Mohanty
On Sat, Jan 8, 2011 at 3:50 PM, Dinesh mdineshkuma...@karunya.edu.in wrote: i don't have much idea about converting log into CSV and then giving it as input.. can u please specify how to do it exactly.. [...] As the format of the raw log file is known only to you, it is difficult for someone

Re: DIH - Closing ResultSet in JdbcDataSource

2011-01-08 Thread Gora Mohanty
On Sat, Jan 8, 2011 at 1:10 AM, Shane Perry thry...@gmail.com wrote: Hi, I am in the process of migrating our system from Postgres 8.4 to Solr 1.4.1.  Our system is fairly complex and as a result, I have had to define 19 base entities in the data-config.xml definition file.  Each of these

Re: DIH load only selected documents with XPathEntityProcessor

2011-01-08 Thread Gora Mohanty
On Fri, Jan 7, 2011 at 12:30 PM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: Hello list, is it possible to load only selected documents with XPathEntityProcessor? While loading docs I want to drop/skip/ignore documents with missing URL. Example: <documents> <document>

Re: DIH Transformer

2011-01-08 Thread Gora Mohanty
On Fri, Jan 7, 2011 at 6:15 PM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: Hi list, currently the Transformers return row but can I skip or drop a row from the Transformer? [...] I presume that you are referring to a ScriptTransformer. Just do not return anything, e.g.,you can make

Re: Show SQL-DIH datasource name in result list

2011-01-08 Thread Gora Mohanty
On Sat, Jan 8, 2011 at 7:47 PM, Patrick Kirsch p-kir...@gmx.de wrote: Hey, does somebody know, if there is a command option in Solr to show which datasource provided the result. Or with other words: is it possible to output in the result the tag name given in <datasource /> or <entity />? [...]

Re: [sqljdbc4.jar] Errors

2011-01-05 Thread Gora Mohanty
On Wed, Jan 5, 2011 at 10:18 AM, Estrada Groups estrada.adam.gro...@gmail.com wrote: I downloaded that driver today and will test it tomorrow. Thanks for the tip! Would you mind sending an XML code snippet if it's any different to load than the MS driver? [...] I presume that you are

Re: [sqljdbc4.jar] Errors

2011-01-05 Thread Gora Mohanty
On Wed, Jan 5, 2011 at 8:22 PM, Adam Estrada estrada.a...@gmail.com wrote: I can't tell any difference in performance but it does work like a charm. At least the messaging in the console is a lot more verbose. [...] Well, as far as the performance goes, we were also simultaneously making other

Re: How to integrate Solr with C/CPP client

2011-01-04 Thread Gora Mohanty
On Wed, Jan 5, 2011 at 9:54 AM, prasad deshpande prasad.deshpand...@gmail.com wrote: It seems Solr/Lucene is completely developed in Java. I want to integrate Solr with C/CPP client. Is it possible? If yes, how? [] Lucene has a C port called Lucy ( http://incubator.apache.org/lucy/ ) Solr

Re: [sqljdbc4.jar] Errors

2011-01-04 Thread Gora Mohanty
On Wed, Jan 5, 2011 at 5:01 AM, Adam Estrada estrada.adam.gro...@gmail.com wrote: Can anyone help me with the following error. I upgraded my database to SQL Server 2008 SP2 and now I get the following error. It was working with SQL Server 2005. Solr Error Stack Caused by:

Re: Apply a patch

2011-01-02 Thread Gora Mohanty
On Mon, Jan 3, 2011 at 10:33 AM, Darx Oman darxo...@gmail.com wrote: Hi guys would someone please explain to me how to apply a patch to solr lets say for example SOLR-2129-version3.patch i'm Running Solr Trunk apache-solr-4.0-2011-01-02_08-06-03 on Windows environment This link,

Re: Different behavior for q=goo.com vs q=@goo.com in queries?

2010-12-31 Thread Gora Mohanty
On Fri, Dec 31, 2010 at 2:40 AM, mrw mikerobertsw...@gmail.com wrote: Basically, just what you've suggested.  I did the field/query analysis piece with verbose output.  Not entirely sure how to interpret the results, of course.  Currently reading anything I can find on that. [...] From the

Re: Dynamic column names using DIH

2010-12-28 Thread Gora Mohanty
On Wed, Dec 29, 2010 at 2:59 AM, Mark static.void@gmail.com wrote: Is there a way to create dynamic column names using the values returned from the query? For example: <entity name="foo" dataSource="my_database" query="select * from foo where item_id=${item.id}"> <field

Re: Item precedence search problem

2010-12-23 Thread Gora Mohanty
On Wed, Dec 22, 2010 at 3:53 PM, Hasnain hasn...@hotmail.com wrote: [...] In my schema, i have another field by the name of Category and, for example's sake, let's assume that my application supports only two categories: computers and accessories. Now, what i require is a mechanism to assign

Re: Item precedence search problem

2010-12-22 Thread Gora Mohanty
On Wed, Dec 22, 2010 at 3:09 PM, Hasnain hasn...@hotmail.com wrote: [...] From my understanding, if i query for a keyword, let's say Laptop, preference is given to result rows having more occurrences of the search keyword Laptop in the field name. This, however, is producing undesirable

Re: Solr index directory '/solr/data/index' doesn't exist. Creating new index... on Geronimo

2010-12-22 Thread Gora Mohanty
On Wed, Dec 22, 2010 at 4:55 PM, Bac Hoang bac.ho...@axonactive.vn wrote: Hello Erick, Could you kindly give a hand on my problem. Any ideas, hints, suggestions are highly appreciated. Many thanks 1. The problem: Solr index directory '/solr/data/index' doesn't exist. Creating new index...

Re: Solr query to get results based on the word length (letter count)

2010-12-22 Thread Gora Mohanty
On Wed, Dec 22, 2010 at 9:06 PM, Giri giriprak...@gmail.com wrote: Hi, I have a Solr index that has thousands of records, the title is one of the Solr fields, and I would like to query for title values that are less than 50 characters long. Is there a way to construct the Solr query to

Re: Performance Monitoring Solution

2010-12-19 Thread Gora Mohanty
On Mon, Dec 20, 2010 at 3:13 AM, Cameron Hurst wakemaste...@z33k.com wrote: I am at the point in my set up that I am happy with how things are being indexed and my interface is all good to go but what I don't know how to judge is how often it will be queried and how much resources it needs to

Re: Solr Newbie - need a point in the right direction

2010-12-07 Thread Gora Mohanty
On Tue, Dec 7, 2010 at 9:12 AM, Mark markjl...@gmail.com wrote: [...] What I'm trying to do is extract some (presumably) structured information from non-uniform data (eg, prices from a nutch crawl) that needs to show in search queries, and I've come up against a wall. I've been unable to

Re: How to make a client in JSP which will take output from Solr Server

2010-12-04 Thread Gora Mohanty
On Sun, Dec 5, 2010 at 1:51 AM, Anurag anurag.it.jo...@gmail.com wrote: Ok, I solved it by just opening the connection and then parsing the output from xml to front page. Though it has some security issue... See AJAX Solr: http://evolvingweb.github.com/ajax-solr/ Regards, Gora

Re: ArrayIndexOutOfBoundsException in sort

2010-11-30 Thread Gora Mohanty
On Wed, Dec 1, 2010 at 10:56 AM, Jerry Li zongjie...@gmail.com wrote: Hi team My solr version is 1.4 There is an ArrayIndexOutOfBoundsException when i sort one field and the following is my code and log info, any help will be appreciated. Code: SolrQuery query = new SolrQuery();

Re: about the string type field, will it be analyzed?

2010-11-24 Thread Gora Mohanty
On Thu, Nov 25, 2010 at 9:34 AM, xu cheng xcheng@gmail.com wrote: hi all: I have a solr app, and there is a field named *filePath* whose type is *string* and the filePath fields in the documents are *unique* (supposed to be unique) because of <uniqueKey>filePath</uniqueKey> and now I wanna

Re: Export Index Data.

2010-11-19 Thread Gora Mohanty
On Fri, Nov 19, 2010 at 10:33 PM, Anderson vasconcelos anderson.v...@gmail.com wrote: Hi, is it possible to export one set of documents indexed in one Solr server in order to synchronize it with another Solr server? Yes. The easiest way probably is to set up replication:

Re: Is it fine to transmit indexes in this way?

2010-11-19 Thread Gora Mohanty
On Fri, Nov 19, 2010 at 10:53 PM, Bing Li lbl...@gmail.com wrote: Hi, all, Since I didn't find that Lucene presents updated indexes to us, may I transmit indexes in the following way? 1) One indexing machine, A, is busy with generating indexes; 2) After a certain time, the indexing process

Re: Is it fine to transmit indexes in this way?

2010-11-19 Thread Gora Mohanty
On Fri, Nov 19, 2010 at 11:39 PM, Bing Li lbl...@gmail.com wrote: [...] When updates are replicated to slave servers, it is supposed that the updates are merged with the existing indexes and readings on them can be done concurrently. If so, the queries must be responded instantly. That's what

Re: How to Transmit and Append Indexes

2010-11-19 Thread Gora Mohanty
On Sat, Nov 20, 2010 at 12:05 AM, Bing Li lbl...@gmail.com wrote: Dear Erick, Thanks so much for your help! I am new in Solr. So I have no idea about the version. The solr/admin/registry.jsp URL on your local Solr installation should show you the version at the top. But I wonder what are

Re: How to Transmit and Append Indexes

2010-11-19 Thread Gora Mohanty
On Sat, Nov 20, 2010 at 12:39 AM, Bing Li lbl...@gmail.com wrote: Hi, Gora, No, I really wonder if Solr is based on Hadoop? As far as I know, no, it isn't. Hadoop is efficient when using on search engines since it is suitable to the write-once-read-many model. After reading your emails,

Re: Need Middleware between search client and solr?

2010-11-19 Thread Gora Mohanty
On Sat, Nov 20, 2010 at 2:28 AM, cyang2010 ysxsu...@hotmail.com wrote: [...] Do you need a middleware/module between your search client and solr server? The response message is very solr specific.   Do you need to translate it to application object model and return back to search client?   In

Re: Looking for Developers

2010-10-29 Thread Gora Mohanty
On Fri, Oct 29, 2010 at 12:23 PM, scott chu (朱炎詹) scott@udngroup.com wrote: When I first saw this particular email, I wrote a letter intending to ask the sender to remove solr-user from its recipients because I thought this should go to solr-dev. But then I thought again, it's about 'job-offer' not

Re: Possible bug in query sorting

2010-10-29 Thread Gora Mohanty
On Fri, Oct 29, 2010 at 1:47 PM, Pablo Recio pre...@yaco.es wrote: That's my schema XML: <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.LowerCaseFilterFactory" />

Re: If I want to move a core from one physical machine to another....

2010-10-28 Thread Gora Mohanty
On Thu, Oct 28, 2010 at 3:42 AM, Ron Mayer r...@0ape.com wrote: If I want to move a core from one physical machine to another, is it as simple as just scp -r core5 otherserver:/path/on/other/server/ and then adding <core name="core5name" instanceDir="core5" /> on that other server's solr.xml

Re: Possible bug in query sorting

2010-10-28 Thread Gora Mohanty
On Thu, Oct 28, 2010 at 5:18 PM, Michael McCandless luc...@mikemccandless.com wrote: Is it somehow possible that you are trying to sort by a multi-valued field? [...] Either that, or your field gets processed into multiple tokens via the analyzer/tokenizer path in your schema. The reported

Re: Strange search

2010-10-27 Thread Gora Mohanty
On Wed, Oct 27, 2010 at 1:23 PM, ramzesua michaelnaza...@gmail.com wrote: Can anyone give me working schema.xml and solrconfig from own project? [...] Solr comes with an example configuration in example/solr/conf/ . Please see http://lucene.apache.org/solr/tutorial.html for an example of how to

Re: How to index on basis of a condition?

2010-10-26 Thread Gora Mohanty
On Tue, Oct 26, 2010 at 2:37 PM, Pawan Darira pawan.dar...@gmail.com wrote: Thanks Mr. Ephraim Ofir. I used the SELECT IF() for my requirement. The query result is correct. But when i see it in my index, the value stored is something unusual bunch of characters e.g. *...@6628ad5a* [...] Which

Re: How to index on basis of a condition?

2010-10-26 Thread Gora Mohanty
On Tue, Oct 26, 2010 at 3:56 PM, Pawan Darira pawan.dar...@gmail.com wrote: I am using mysql database, and, field type is date [...] Could you show us the exact SELECT statement, and some example values returned by running the SELECT directly at a mysql console? Regards, Gora

Re: RAM increase

2010-10-21 Thread Gora Mohanty
On Thu, Oct 21, 2010 at 10:46 AM, satya swaroop satya.yada...@gmail.com wrote: Hi all, I increased my RAM size to 8GB and i want 4GB of it to be used for solr itself. can anyone tell me the way to allocate the RAM for the solr. [...] You will need to set up the allocation of

Re: does solr support posting gzipped content?

2010-10-20 Thread Gora Mohanty
On Tue, Oct 19, 2010 at 9:34 PM, danomano dshopk...@earthlink.net wrote: Hi folks, I was wondering if there is any native support for posting gzipped files to solr? i.e. I'm testing a project where we inject our log files into solr for indexing, these logs files are gzipped, and I figure it

Re: how can i use solrj binary format for indexing?

2010-10-20 Thread Gora Mohanty
On Mon, Oct 18, 2010 at 8:22 PM, Jason, Kim hialo...@gmail.com wrote: Sorry for the delay in replying. Was caught up in various things this week. Thank you for reply, Gora But I still have several questions. Did you use separate index? If so, you indexed 0.7 million Xml files per instance

Re: how can i use solrj binary format for indexing?

2010-10-18 Thread Gora Mohanty
On Mon, Oct 18, 2010 at 5:26 PM, Jason, Kim hialo...@gmail.com wrote: Hi, Gora I haven't tried yet indexing huge amount of xml files through curl or pure java(like a post.jar). Indexing through xml is really fast? How many files did you index? And How did it(using curl or pure java)? [...]

Re: DIH - configure password in 1 place and store it in encrypted form?

2010-10-17 Thread Gora Mohanty
On Sun, Oct 17, 2010 at 7:02 PM, Arunkumar Ayyavu arunkumar.ayy...@gmail.com wrote: Hi! I have multiple cores reading from the same database and I've provided the user credentials in all data-config.xml files. Is there a way to tell JdbcDataSource in data-config.xml to read the username and

Re: how can i use solrj binary format for indexing?

2010-10-17 Thread Gora Mohanty
On Mon, Oct 18, 2010 at 8:31 AM, Jason, Kim hialo...@gmail.com wrote: Hi all I have a huge amount of xml files for indexing. I want to index using solrj binary format to get performance gain. Because I heard that using xml files to index is quite slow. [...] Do not know about SolrJ's binary

Re: searching while importing

2010-10-15 Thread Gora Mohanty
On Thu, Oct 14, 2010 at 4:08 AM, Shawn Heisey s...@elyograg.org wrote:  If you are using the DataImportHandler, you will not be able to search new data until the full-import or delta-import is complete and the update is committed.  When I do a full reindex, it takes about 5 hours, and until it

Re: Problem with Indexing

2010-10-12 Thread Gora Mohanty
On Tue, Oct 12, 2010 at 5:57 PM, Jörg Agatz joerg.ag...@googlemail.com wrote: java -Xmx1024m -jar start.jar Hmm, could you try increasing the RAM, though this ought to be enough? Maybe I missed this earlier, but what is the size of the data being indexed? Regards, Gora

Re: Search within a subset of documents

2010-10-11 Thread Gora Mohanty
On Mon, Oct 11, 2010 at 8:20 PM, Sergey Bartunov sbos@gmail.com wrote: Whether it will be enough effective if the subset is really large? [...] If the subset of IDs is large, and disjoint (so that you cannot use ranges), the query might look ugly, but generating it should not be much of a
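For the case of a large, disjoint set of IDs discussed above, a rough Python sketch of generating the query clause might look as follows. The field name id and the assumption that the IDs are plain integers (so that no query-syntax escaping is needed) are illustrative only and do not come from the thread. Note also that Solr limits the number of boolean clauses per query (maxBooleanClauses in solrconfig.xml), so a very large set may need that limit raised or the query split into batches.

```python
def ids_to_query(ids, field="id"):
    """Build a Solr query clause such as: id:(3 OR 17 OR 42).

    Assumes integer IDs; string IDs containing query-syntax
    characters would need escaping first.
    """
    if not ids:
        raise ValueError("need at least one ID")
    terms = " OR ".join(str(int(i)) for i in ids)
    return f"{field}:({terms})"


if __name__ == "__main__":
    # The clause could be passed as a filter query (fq) alongside
    # the main search query.
    print(ids_to_query([3, 17, 42]))
```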

Re: Problem with Indexing

2010-10-11 Thread Gora Mohanty
On Mon, Oct 11, 2010 at 1:27 PM, Jörg Agatz joerg.ag...@googlemail.com wrote: ok, i have try it.. and now iget this error: POSTing file e067f59c-d046-11df-b552-000c29e17baa_SEARCH.xml SimplePostTool: FATAL: Solr returned an error:

Re: Changing schema and reindexing documents

2010-10-06 Thread Gora Mohanty
On Wed, Oct 6, 2010 at 11:59 AM, M.Rizwan griz...@gmail.com wrote: Hi, I have lots of documents in my solr index. Now I have a requirement to change its schema and add a new field. What should I do, so that all the documents keep working after schema change? [...] You will need to reindex

Re: Help needed on indexing Zope CMS content

2010-10-06 Thread Gora Mohanty
On Wed, Oct 6, 2010 at 1:58 PM, Marian Steinbach mar...@sendung.de wrote: Hi! We are planning to periodically index several MySQL database tables plus a Zope CMS document tree in Solr. Indexing the Zope DB seems to be tricky though. [...] Been a while since I touched Zope, but there seems

Re: having problem about Solr Date Field.

2010-10-06 Thread Gora Mohanty
On Wed, Oct 6, 2010 at 9:17 PM, Kouta Osabe kota0919was...@gmail.com wrote: Hi, Gora Thanks for your advice. and then I try to write these codes following your advice. Case1 pub_date column(MySQL) is 2010-09-27 00:00:00. I wrote like below. SolrJDto info = new SolrJDto(); TimeZone tz2

Re: having problem about Solr Date Field.

2010-10-04 Thread Gora Mohanty
On Mon, Oct 4, 2010 at 10:24 PM, Kouta Osabe kota0919was...@gmail.com wrote: Hi,All I have a problem about Solr Date Field. [...] the value of pub_date column comes from MySQL and actually value is 2010-10-05 00:00:00. I regist foo bean to Solr through SolrJ like new

Re: Tuning solr

2010-10-01 Thread Gora Mohanty
On Sat, Oct 2, 2010 at 5:21 AM, Stavros Korokithakis stav...@korokithakis.net wrote: [...] Is there a guide for tuning solr somewhere? We have about a million documents (the documents are 8 fields, one of which is the full text of webpages) and we'd like to give solr a bit more memory and

Re: General hardware requirements?

2010-09-30 Thread Gora Mohanty
On Thu, Sep 30, 2010 at 8:09 PM, Nicholas Swarr nsw...@hotmail.com wrote: Our index is about 10 gigs in size with about 3 million documents.  The documents range in size from dozens to hundreds of kilobytes.  Per week, we only get about 50k queries. Currently, we use lucene and have one box

Re: Automatic xslt to responses ??

2010-09-30 Thread Gora Mohanty
On Thu, Sep 30, 2010 at 10:47 PM, Sharma, Raghvendra sraghven...@corelogic.com wrote: Is there a way to specify a xslt at the server side, and make it default, i.e. whenever a response is returned, that xslt is applied to the response automatically... This should be of help:

Re: Faster loading to solr...

2010-09-30 Thread Gora Mohanty
On Thu, Sep 30, 2010 at 10:49 PM, Sharma, Raghvendra sraghven...@corelogic.com wrote: I have been able to load around a million rows/docs in around 5+ minutes.   The schema contains around 250+ fields.  For the moment, I have kept everything as string. I am sure there are ways to get better

Re: How can I delete the entire contents of the index?

2010-09-22 Thread Gora Mohanty
On Thu, Sep 23, 2010 at 9:05 AM, Igor Chudov ichu...@gmail.com wrote: Let's say that I added a number of elements to Solr (I use Webservice::Solr as the interface to do so). Then I change my mind and want to delete them all. How can I delete all contents of the database, but leave the
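One commonly used way to empty an index while keeping its configuration is to send a delete-by-query for *:* to the update handler, followed by a commit. The reply in this thread is not shown here, so the Python sketch below is only an illustration of that general approach, not necessarily what was suggested. The URL http://localhost:8983/solr/update is an assumed default for the example Jetty setup; adjust host, port and core path as needed.

```python
from urllib.request import Request, urlopen

# Assumed location of the update handler; not taken from the thread.
UPDATE_URL = "http://localhost:8983/solr/update"


def build_update_request(xml_command, url=UPDATE_URL):
    """Wrap an XML update command in a POST request object."""
    return Request(
        url,
        data=xml_command.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )


def delete_all_requests(url=UPDATE_URL):
    """Return the two requests that delete every document and commit."""
    return [
        build_update_request("<delete><query>*:*</query></delete>", url),
        build_update_request("<commit/>", url),
    ]


if __name__ == "__main__":
    # Sending the requests removes ALL documents; run with care.
    for request in delete_all_requests():
        with urlopen(request) as response:
            print(response.status)
```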

Re: How to make a client in JSP which will take output from Solr Server

2010-09-18 Thread Gora Mohanty
On Sat, Sep 18, 2010 at 6:22 PM, Anurag anurag.it.jo...@gmail.com wrote: Actually i have crawled the data using Nutch-1.0 and indexed it to Solr-1.3 (using this http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ tutorial ) Now i want to create a page in JSP that will be a Simple

Re: How will solr behave if data importing is called while another importing operation is still ongoing?

2010-09-15 Thread Gora Mohanty
On Wed, Sep 15, 2010 at 4:21 PM, yklxmas yklx...@gmail.com wrote: [...] I'm using standard data import handler with file data source and xpath processor. so my script will be calling http://host:8983/solr/dataimport?command=full-import I am not sure if you are aware of this, but unless you are

Re: Newbie question: no search results

2010-09-04 Thread Gora Mohanty
On Sat, 4 Sep 2010 01:15:11 -0700 (PDT) BobG b...@bitwise-bncc.nl wrote: Hi, I am trying to set up a new SOLR search engine on a windows platform. It seems like I managed to fill an index with the contents of my SQL server table. When I use the default *.* query I get a nice result:

Solr-related meeting in Delhi, India: At Dilli Haat: 5pm, Sun., 5th Sep.

2010-09-02 Thread Gora Mohanty
Hi, A Solr-related meeting will take place in Delhi, India, as per the details below. As the FOSS community in Delhi/NCR is quite small, and getting increasingly fragmented, we have tried to combine more than one topic, in the interest of getting more attendees. Event:FOSS meeting

Meeting of Solr users in Delhi/NCR, India?

2010-08-31 Thread Gora Mohanty
Hi, From past postings, I understand that discussions on meetings related to Solr are acceptable on this list. We have been talking about arranging a meeting in Delhi/National Capital region (NCR) of people using Solr, or those interested in search and related technologies. This first

Re: solr

2010-08-31 Thread Gora Mohanty
On Tue, 31 Aug 2010 07:03:08 -0700 ankita shinde ankitashinde...@gmail.com wrote: hello all, I have indexed database using DIH. But I am not able to search the data using each field. Dear Ankita, Do not mean to offend, but did you not post about this earlier, and Peter Karich followed up

Re: Is there any stress test tool for testing Solr?

2010-08-26 Thread Gora Mohanty
On Wed, 25 Aug 2010 19:58:36 -0700 Amit Nithian anith...@gmail.com wrote: I recommend JMeter. We use that to do load testing on a search server. [...] JMeter is certainly good, but we have also found Apache bench to be of much use. Maybe it is just us, and what we are familiar with, but

Re: Is there a SubstringTransformer?

2010-08-24 Thread Gora Mohanty
On Tue, 24 Aug 2010 08:46:52 +0200 Gonzalo Payo Navarro gpa...@gmail.com wrote: Hi everyone! I need to get the first 100 chars of a string-type field, but I am not able to find something like a SubstringTransformer, therefore I am using the RegexTransformer, but I suspect that it eats a

Re: Duplicate docs when merging indices?

2010-08-21 Thread Gora Mohanty
On Sat, 21 Aug 2010 05:26:59 -0700 (PDT) Andrew Clegg andrew.cl...@gmail.com wrote: [...] If I merge two indices with CoreAdmin, as detailed here... http://wiki.apache.org/solr/MergingSolrIndexes What happens to duplicate documents between the two? i.e. those that have the same unique key.

Re: improving search response time

2010-08-18 Thread Gora Mohanty
On Wed, 18 Aug 2010 05:18:34 -0700 (PDT) Muneeb Ali muneeba...@hotmail.com wrote: Hi All, I need some guidance on improving search response time for our catalog search. [...] I would appreciate it if anyone with similar background could shed some light on upgrading hardware in our

Re: Can a Solr Server be both master and slave?

2010-08-16 Thread Gora Mohanty
On Mon, 16 Aug 2010 10:43:38 -0700 Amit Nithian anith...@gmail.com wrote: I am not sure if this is the best approach to this problem but I was curious if a single Solr server could be both a master and a slave without causing index corruption? It seems that you could set up multiple

Re: Indexing large files using Solr Cell causes OutOfMemory error

2010-08-12 Thread Gora Mohanty
On Thu, 12 Aug 2010 14:32:19 +0200 Lannig Carina lan...@ssi-schaefer-noell.com wrote: Hi, I'm trying to index a txt-File (~150MB) using Solr Cell/Tika. The curl command aborts due to a java.lang.OutOfMemoryError. [...] AFAIK Tika keeps the whole file in RAM and posts it as one single

Re: how to query a string using solr URL in the browser

2010-08-09 Thread Gora Mohanty
On Mon, 9 Aug 2010 05:31:36 -0700 (PDT) e8en e...@tokobagus.com wrote: I forgot something, when I enter this: http://172.11.18.120:9000/search/select/?q=text:bracket&q.op=AND&start=0&rows=1000 the result will show all ITEM_ID that contain the word 'bracket' in both or one of ITEM_DESCR_SHORT
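
Query parameters in a Solr select URL such as the one quoted above are separated by `&`, and values should be percent-encoded. A small sketch of building that URL programmatically rather than by hand; the function name `build_select_url` is hypothetical, while the base URL, field, and parameter values are taken from the message:

```python
from urllib.parse import urlencode


def build_select_url(base, field, term, op="AND", start=0, rows=1000):
    """Assemble a select URL with properly separated, encoded parameters."""
    params = [
        ("q", f"{field}:{term}"),  # fielded query, e.g. text:bracket
        ("q.op", op),              # default operator between terms
        ("start", start),          # offset of the first result
        ("rows", rows),            # maximum number of results
    ]
    return base + "?" + urlencode(params)


url = build_select_url("http://172.11.18.120:9000/search/select/", "text", "bracket")
print(url)
```

Note that `urlencode` writes the colon in `text:bracket` as `%3A`; Solr decodes it back to a colon on receipt.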

Re: Indexing data on MSSQL failed: Caused by: org.apache.solr.common.SolrException: Error loading class 'com.microsoft.sqlserver.jdbc.SQLServerDriver'

2010-08-02 Thread Gora Mohanty
On Mon, 2 Aug 2010 02:22:50 -0700 (PDT) PeterKerk vettepa...@hotmail.com wrote: I'm running MSSQL Server 2008. I did this: set CLASSPATH= C:\Program Files\Microsoft SQL Server JDBC Driver 3.0\sqljdbc_3.0\enu\sqljdbc4.jar (I don't know where I can check if this has been executed correctly)

Re: advice on creating a solr index when data source is from many unrelated db tables

2010-07-30 Thread Gora Mohanty
On Thu, 29 Jul 2010 15:33:42 -0400 S Ahmed sahmed1...@gmail.com wrote: I understand (and it's straightforward) when you want to create an index for something simple like Products. But how do you go about creating a Solr index when you have data coming from 10-15 database tables, and the
