Re: Indexed data not searchable
Thanks a lot, so I will write an XSLT. Great community here!

--
View this message in context: http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4055258.html
Sent from the Solr - User mailing list archive at Nabble.com.
Re: Indexed data not searchable
Thanks for this! Now I have another problem. I tried to give the XML file the right format, so I made this:

    <?xml version="1.0" encoding="UTF-8"?>
    <add><doc>
      <field name="id">455HHS-2232</field>
      <field name="title">T0072-00031-DOWNLOAD - Blatt 12v</field>
      <field name="format">application/pdf</field>
      <field name="created">2012-11-07T11:15:19.887+01:00</field>
      <field name="lastModified">2012-11-07T11:15:19.887+01:00</field>
      <field name="issued">2012-11-07T11:15:19.887+01:00</field>
      <field name="revision">0</field>
      <field name="pid">hdl:11858/00-1734--0008-12C5-2</field>
      <field name="extent">1131033</field>
      <field name="project">St. Matthias Test 07</field>
      <field name="availability">public</field>
      <field name="rightsHolder">Stadtbibliothek und Stadtarchiv Trier</field>
    </doc></add>

I also made changes in schema.xml. I added these fields:

    <field name="identifier" type="text_general" indexed="true" stored="true"/>
    <field name="format" type="text_general" indexed="true" stored="true"/>
    <field name="created" type="date" indexed="true" stored="true"/>
    <field name="issued" type="date" indexed="true" stored="true"/>
    <field name="revision" type="int" indexed="true" stored="true"/>
    <field name="pid" type="text_general" indexed="true" stored="true"/>
    <field name="extent" type="int" indexed="true" stored="true"/>
    <field name="dataContributor" type="text_general" indexed="true" stored="true"/>
    <field name="project" type="text_general" indexed="true" stored="true"/>
    <field name="availability" type="text_general" indexed="true" stored="true"/>
    <field name="rightsholder" type="text_general" indexed="true" stored="true"/>

Did I do anything wrong?
Re: Indexed data not searchable
Just for information: I found that the problem occurs when I try to add the fields created, last_modified and issued (all three have the type date) and the field rightsholder. Maybe that is helpful!
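One thing that may be worth ruling out for the rightsholder field: Solr field names are case-sensitive, and the document posted in this thread uses rightsHolder and lastModified, while the schema additions list rightsholder and no lastModified field. Below is a minimal Python sketch of such a name check. The field lists are copied from the thread; the assumption that id and title are already defined by the stock example schema is an editorial guess, and the check ignores dynamicField and copyField rules.

```python
# Field names used in the posted <add><doc> message (copied from the thread).
DOC_FIELDS = [
    "id", "title", "format", "created", "lastModified", "issued",
    "revision", "pid", "extent", "project", "availability", "rightsHolder",
]

# Fields the poster reported adding to schema.xml (copied from the thread).
ADDED_SCHEMA_FIELDS = [
    "identifier", "format", "created", "issued", "revision", "pid",
    "extent", "dataContributor", "project", "availability", "rightsholder",
]

# Assumption: these two are already declared by the stock example schema.
ASSUMED_EXISTING_FIELDS = ["id", "title"]


def undeclared_fields(doc_fields, schema_fields):
    """Return document field names with no exact (case-sensitive) schema match."""
    declared = set(schema_fields)
    return [name for name in doc_fields if name not in declared]


if __name__ == "__main__":
    missing = undeclared_fields(
        DOC_FIELDS, ADDED_SCHEMA_FIELDS + ASSUMED_EXISTING_FIELDS
    )
    print(missing)
```

If the list is not empty, either the document or the schema needs renaming so that the two agree exactly.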
Re: Indexed data not searchable
On Wed, Apr 10, 2013 at 10:35 AM, Max Bo maximilian.brod...@gmail.com wrote:
> Just for information: I indicate that the problem occurs when I try to add the fields, created, last_modified, issued (all three have the type date) and the field rightsholder. Maybe it is helpful!

From the example you sent earlier, it appears that the format of your timestamps does not conform to Solr's expectations; they should be something like 1995-12-31T23:59:59.999Z (i.e., zero timezone offset, designated with a trailing Z).
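For readers who need to do this conversion before posting, here is a minimal Python sketch that turns a timestamp with a UTC offset into the trailing-Z form described above. The function name to_solr_date is made up for illustration, and the sketch assumes the input always carries an explicit offset, as the timestamps in this thread do.

```python
from datetime import datetime, timezone


def to_solr_date(stamp: str) -> str:
    """Convert an ISO 8601 timestamp with a UTC offset to UTC with a trailing Z."""
    parsed = datetime.fromisoformat(stamp)
    if parsed.tzinfo is None:
        # Without an offset we cannot know which instant is meant.
        raise ValueError("timestamp has no UTC offset: " + stamp)
    utc = parsed.astimezone(timezone.utc)
    # Keep millisecond precision, as in 1995-12-31T23:59:59.999Z.
    millis = utc.microsecond // 1000
    return utc.strftime("%Y-%m-%dT%H:%M:%S") + ".%03dZ" % millis


if __name__ == "__main__":
    print(to_solr_date("2012-11-07T11:15:19.887+01:00"))
```

Note that the conversion can change the calendar date: a timestamp shortly after midnight at +01:00 falls on the previous day in UTC.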
Re: Indexed data not searchable
Thank you. I changed it and now it works. But is there any way to make Solr accept the given timestamp format?
Re: Indexed data not searchable
Solr assumes you are using UTC. It is your job to do the conversion. If you want Solr to do it, you could use an UpdateProcessor, either one based on a regular expression or perhaps a ScriptUpdateProcessor.

In fact, if you're comfortable with XSLT, you can make Solr accept your old XML format by posting it to Solr with a tr= parameter that points to an XSLT stylesheet which transforms your XML into the <add><doc> format.

Upayavira

On Wed, Apr 10, 2013, at 10:36 AM, Max Bo wrote:
> Thank you. I changed it and now it works. But is there any possibility to make the given timestamp acceptable for solr?
Re: Indexed data not searchable
The XML files are formatted like this. I think that is where the problem is.

    <metadataContainerType>
      <ns3:object>
        <ns3:generic>
          <ns3:provided>
            <ns3:title>T0084-00371-DOWNLOAD - Blatt 184r</ns3:title>
            <ns3:identifier type="METSXMLID">T0084-00371-DOWNLOAD</ns3:identifier>
            <ns3:format>application/pdf</ns3:format>
          </ns3:provided>
          <ns3:generated>
            <ns3:created>2012-11-08T00:09:57.531+01:00</ns3:created>
            <ns3:lastModified>2012-11-08T00:09:57.531+01:00</ns3:lastModified>
            <ns3:issued>2012-11-08T00:09:57.531+01:00</ns3:issued>
            ..
Re: Indexed data not searchable
On 9 April 2013 13:10, Max Bo maximilian.brod...@gmail.com wrote:
> The XML files are formatted like this. I think there is the problem.
[...]

Yes, to use curl to post to /solr/update you need to have XML in the form described at http://wiki.apache.org/solr/UpdateXmlMessages

Else, you can use FileListEntityProcessor and XPathEntityProcessor with FileDataSource from the Solr DataImportHandler. Please see examples at http://wiki.apache.org/solr/DataImportHandler

Regards,
Gora
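A third option is to convert the files on the client before posting them. The following Python sketch shows the idea under several assumptions: the namespace URI bound to ns3 is not given in the thread, so the sample uses a placeholder and the code matches elements by local name only; the mapping from source elements to Solr field names (FIELD_MAP) is hypothetical; and the date elements are left out because they would first need converting to UTC.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from source element names to Solr field names.
FIELD_MAP = {"title": "title", "identifier": "id", "format": "format"}


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def to_solr_add(source_xml: str) -> str:
    """Build an <add><doc> update message from the mapped source elements."""
    root = ET.fromstring(source_xml)
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for element in root.iter():
        solr_name = FIELD_MAP.get(local_name(element.tag))
        text = (element.text or "").strip()
        if solr_name and text:
            field = ET.SubElement(doc, "field", {"name": solr_name})
            field.text = text
    return ET.tostring(add, encoding="unicode")


# Sample based on the fragment in the thread; the namespace URI is a placeholder.
SAMPLE = """<metadataContainerType xmlns:ns3="urn:example:ns3">
  <ns3:object><ns3:generic><ns3:provided>
    <ns3:title>T0084-00371-DOWNLOAD - Blatt 184r</ns3:title>
    <ns3:identifier type="METSXMLID">T0084-00371-DOWNLOAD</ns3:identifier>
    <ns3:format>application/pdf</ns3:format>
  </ns3:provided></ns3:generic></ns3:object>
</metadataContainerType>"""

if __name__ == "__main__":
    print(to_solr_add(SAMPLE))
```

The resulting string can be posted to /solr/update in place of the original file, provided every field name it uses is declared in schema.xml.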
Indexed data not searchable
Hello,

I'm very new to Solr and I have hit a problem I can't explain myself, so I need your help. I have indexed a huge number of XML files with a shell script:

    function solringest_rec {
        for SRCFILE in $(find "$1" -type f); do
            #DESTFILE=$URL${SRCFILE/$1/}
            echo "ingest $SRCFILE"
            curl "$URL" -H "Content-type: text/xml" --data-binary "@$SRCFILE"
        done
    }

The response I get every time is:

    <?xml version="1.0" encoding="UTF-8"?>
    <response>
    <lst name="responseHeader"><int name="status">0</int><int name="QTime">116</int></lst>
    </response>

Because of this I think everything should be fine, but the queries don't work. For everything other than the post operation, I use the stuff from the example folder. Maybe I have to configure something in schema.xml or solrconfig.xml?

Hope you can help me!

Kind regards,
Max

--
View this message in context: http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473.html
Re: Indexed data not searchable
What is the structure of your content? Is it formatted in the same XML structure as the example data? What URL are you posting to?

Upayavira

On Mon, Apr 8, 2013, at 02:08 PM, Max Bo wrote:
> Hello, I'm very new to Solr and I come to an unexplainable point by myself so I need your help. I have indexed a huge amount of xml-Files by a shell script.
[...]
Re: Indexed data not searchable
On 8 April 2013 18:38, Max Bo maximilian.brod...@gmail.com wrote:
> Hello, I'm very new to Solr and I come to an unexplainable point by myself so I need your help. I have indexed a huge amount of xml-Files by a shell script.
[...]

For posting XML files to Solr directly with curl the XML files need to be in a particular format, and you need to commit at least at the end of the indexing. Please see http://wiki.apache.org/solr/UpdateXmlMessages

If you are following the exact command there, and using XML files from the example/ directory, things should just work.

Regards,
Gora
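The "post, then commit at the end" pattern can be sketched in Python as follows. This is an illustration rather than the command from the wiki page: the helper names are made up, the update URL is the one quoted elsewhere in this thread, and the sketch only builds the requests. Sending them needs a running Solr instance.

```python
import urllib.request

# URL quoted in the thread (without the commit=true parameter).
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_update_request(body: bytes, url: str = SOLR_UPDATE_URL) -> urllib.request.Request:
    """Wrap one XML update message in a POST request with an XML content type."""
    headers = {"Content-Type": "text/xml; charset=utf-8"}
    return urllib.request.Request(url, data=body, headers=headers)


def build_requests(documents):
    """One request per <add> message, followed by a single <commit/> at the end."""
    requests = [build_update_request(document) for document in documents]
    requests.append(build_update_request(b"<commit/>"))
    return requests


if __name__ == "__main__":
    docs = [b'<add><doc><field name="id">example-1</field></doc></add>']
    for request in build_requests(docs):
        print(request.get_method(), request.full_url, request.data)
        # To actually send it (requires a running Solr):
        # urllib.request.urlopen(request)
```

Committing once after the last document, instead of on every request, avoids making Solr do the commit work once per file during a large ingest.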
Re: Indexed data not searchable
Thanks for your help. The URL I'm posting to is: http://localhost:8983/solr/update?commit=true

The XML files I've added contain fields like author, so I thought they would be searchable, since author is declared as indexed in the example schema.
Re: Indexed data not searchable
Are you sure your XML is formatted according to the SolrXML rules? See: http://wiki.apache.org/solr/UpdateXmlMessages

I have to ask, because sometimes people send raw XML to Solr, not realizing that Solr accepts a particular format of XML.

-- Jack Krupansky

-----Original Message-----
From: Max Bo
Sent: Monday, April 08, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Re: Indexed data not searchable

[...]
Re: Indexed data not searchable
On 8 April 2013 19:26, Max Bo maximilian.brod...@gmail.com wrote:
> Thanks for your help: The URL I'am positng to is: http://localhost:8983/solr/update?commit=true The XML-Filess I've added contains fields like author so I thought they have to serachable since it it declared as indexed in the example schema.

Please include an example of your .xml file and of Solr's schema.xml. It is difficult to keep guessing in the dark.

Regards,
Gora
Re: Indexed data not searchable
Hi,

I use the DataImportHandler. The current entity contains this:

    <field column="id_product" name="id" />
    <field column="quantity" name="inStock" />
    <field column="reference" name="ref" />
    <field column="supplier" name="brand" />
    <field column="manufacturer" name="brand" />
    <field column="name" name="brand" />
    <field column="comptabible_model" regex="Piéce détachée pour ([\w 0-9éèêîûô]+) Modèle" sourceColName="description_short" />
    <field column="version_model" regex="Modèle:([0-9a-zA-Zéèêîûô-]+),?" sourceColName="description_short" />

Data sample:

    Piéce détachée pour Skimmer COFIES Modèle:Premium-Design-Omega, Zipper5
    Piéce détachée pour Régulateur de niveau modèle 3150 Modèle:3150 depuis 2003

Ideal result:

    name = Couvercle SK siglé - HAYWARD
    manufacturer = HAYWARD
    compatibility = [Skimmer COFIES] - [Premium-Design-Omega, Zipper5]
                    [Régulateur de niveau modèle 3150] - [3150 depuis 2003]

Then I want to be able to get all products with HAYWARD as manufacturer, then retrieve the list of all compatible products, and finally the list of available models.

schema.xml contains:

    <field name="ref" type="string" indexed="true" stored="true" omitNorms="true" multiValued="false"/>
    <field name="name" type="text_fr" indexed="true" stored="true" />
    <field name="cat" type="text_fr" indexed="true" stored="true" multiValued="true" />
    <field name="brand" type="text_fr" indexed="true" stored="true" multiValued="true" />
    <field name="features" type="text_fr" indexed="true" stored="true" multiValued="true" />

where

    <fieldType name="text_fr" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory" />
        <!-- removes l', etc -->
        <filter class="solr.ElisionFilterFactory" ignoreCase="true" articles="lang/contractions_fr.txt" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fr.txt" format="snowball" enablePositionIncrements="true" />
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
        <filter class="solr.StandardFilterFactory" />
        <filter class="solr.FrenchLightStemFilterFactory" />
        <filter class="solr.FrenchMinimalStemFilterFactory" />
        <!-- more aggressive: <filter class="solr.SnowballPorterFilterFactory" language="French"/> -->
      </analyzer>
    </fieldType>

    <fieldType name="text_html_fr" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <charFilter class="solr.HTMLStripCharFilterFactory" />
        <tokenizer class="solr.StandardTokenizerFactory" />
        <!-- removes l', etc -->
        <filter class="solr.ElisionFilterFactory" ignoreCase="true" articles="lang/contractions_fr.txt" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_fr.txt" format="snowball" enablePositionIncrements="true" />
        <filter class="solr.ASCIIFoldingFilterFactory" />
        <filter class="solr.FrenchLightStemFilterFactory" />
        <filter class="solr.FrenchMinimalStemFilterFactory" />
        <filter class="solr.SnowballPorterFilterFactory" language="French"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
      </analyzer>
    </fieldType>

I do not see how to organize this specification correctly with Solr.

Regards,
Eric

On 08/04/2013 16:36, Gora Mohanty wrote:
> On 8 April 2013 19:26, Max Bo maximilian.brod...@gmail.com wrote:
> [...]
> Please include an example of your .xml file and of Solr's schema.xml. It is difficult to keep guessing in the dark.
Re: Indexed data not searchable
On 8 April 2013 21:35, It-forum it-fo...@meseo.fr wrote:
> hi I use dataimporter
[...]

Please do not hijack threads. Instead, start a new one for your questions, or follow up in a thread that you had started. Here is why this is bad practice: http://people.apache.org/~hossman/#threadhijack

Regards,
Gora