Re: Indexed data not searchable

2013-04-11 Thread Max Bo
Thanks alot, so I will make a XSLT. 

Great community here!



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4055258.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-10 Thread Max Bo
Thanks to this!

No I have another problem. I tried to give the XML file the right format so
I made this

?xml version=1.0 encoding=UTF-8?

adddoc
field name=id455HHS-2232/field
field name=titleT0072-00031-DOWNLOAD - Blatt 12v/field
field name=formatapplication/pdf/field
field name=created2012-11-07T11:15:19.887+01:00/field
field name=lastModified2012-11-07T11:15:19.887+01:00/field
field name=issued2012-11-07T11:15:19.887+01:00/field
field name=revision0/field
field name=pidhdl:11858/00-1734--0008-12C5-2/field
field name=extent1131033/field
field name=projectSt. Matthias Test 07/field
field name=availabilitypublic/field
field name=rightsHolderStadtbibliothek und Stadtarchiv Trier/field
/doc/add



I also made the changes in the schema.xml

I added this fields:

   field name=identifier type=text_general indexed=true
stored=true/
   field name=format type=text_general indexed=true stored=true/
   field name=created type=date indexed=true stored=true/
   field name=issued type=date indexed=true stored=true/
   field name=revision type=int indexed=true stored=true/
   field name=pid type=text_general indexed=true stored=true/
   field name=extent type=int indexed=true stored=true/
   field name=dataContributor type=text_general indexed=true
stored=true/
   field name=project type=text_general indexed=true stored=true/
   field name=availability type=text_general indexed=true
stored=true/
   field name=rightsholder type=text_general indexed=true
stored=true/

Did I made anything wrong?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054960.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-10 Thread Max Bo
Just for information: I indicate that the problem occurs when I try to add
the fields, created, last_modified, issued (all three have the type date)
and the field rightsholder.

Maybe it is helpful!



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054977.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-10 Thread Raymond Wiker
On Wed, Apr 10, 2013 at 10:35 AM, Max Bo maximilian.brod...@gmail.comwrote:

 Just for information: I indicate that the problem occurs when I try to add
 the fields, created, last_modified, issued (all three have the type date)
 and the field rightsholder.

 Maybe it is helpful!


From the example you sent earlier, it appears that the format of your
timestamps does not conform with SOLR's expectations; they should be
something like 1995-12-31T23:59:59.999Z (i.e, zero timezone offset,
designated with a trailing Z).


Re: Indexed data not searchable

2013-04-10 Thread Max Bo
Thank you. 

I changed it and now it works.

But is there any possibility to make the given timestamp acceptable for
solr?




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054985.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-10 Thread Upayavira
Solr assumes you are using UTC. It is your job to do a conversion.

If you want Solr to do it, you could use an UpdateProcessor to do it,
either using RegExp, or perhaps a ScriptUpdateProcessor.

In fact, if you're comfortable with XSLT, you can make Solr accept your
old format of XML by posting it to Solr providing a tr= parameter that
points to an XSLT stylesheet which transforms your XML into an
adddoc format.

Upayavira

On Wed, Apr 10, 2013, at 10:36 AM, Max Bo wrote:
 Thank you. 
 
 I changed it and now it works.
 
 But is there any possibility to make the given timestamp acceptable for
 solr?
 
 
 
 
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054985.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-09 Thread Max Bo
The XML files are formatted like this. I think there is the problem.

metadataContainerType
  ns3:object
ns3:generic
   ns3:provided
   ns3:titleT0084-00371-DOWNLOAD - Blatt 184r/ns3:title
   ns3:identifier
type=METSXMLIDT0084-00371-DOWNLOAD/ns3:identifier
   ns3:formatapplication/pdf/ns3:format
   /ns3:provided
 ns3:generated
   ns3:created2012-11-08T00:09:57.531+01:00/ns3:created
  
ns3:lastModified2012-11-08T00:09:57.531+01:00/ns3:lastModified
   ns3:issued2012-11-08T00:09:57.531+01:00/ns3:issued
   ..




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054651.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-09 Thread Gora Mohanty
On 9 April 2013 13:10, Max Bo maximilian.brod...@gmail.com wrote:
 The XML files are formatted like this. I think there is the problem.
[...]

Yes, to use curl to post to /solr/update you need to
have XML in the form described at
http://wiki.apache.org/solr/UpdateXmlMessages

Else, you can use  FileListEntityProcessor and
XPathEntityProcessor with FileDataSource from
the Solr DataImportHandler. Please see examples
at http://wiki.apache.org/solr/DataImportHandler

Regards,
Gora


Indexed data not searchable

2013-04-08 Thread Max Bo
Hello,

I'm very new to Solr and I come to an unexplainable point by myself so I
need your help.

I have indexed a huge amount of xml-Files by a shell script.

function solringest_rec {

for SRCFILE in $(find $1 -type f); do
#DESTFILE=$URL${SRCFILE/$1/}
echo ingest  $SRCFILE
curl $URL -H Content-type: text/xml --data-binary
@$SRCFILE
done

}


The respone I get is everytime: 

?xml version=1.0! encoding=UTF-8?
response
lst name=responseHeaderint name=status0int
name=QTime116/int/lst
/respone


Because of this I think that everything should be fine but the queries
doesn't work. 

For all other operations as the post operation I use the stuff from example
folder. 
Maybe I have to configure something in the schema.xml or solrconfig.xml ?


Hope you can help me!


Kind regards,

Max








--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-08 Thread Upayavira
That is the structure of your content? Is it formatted in the same XML
structure as the example data is? What URL are you posting to?

Upayavira

On Mon, Apr 8, 2013, at 02:08 PM, Max Bo wrote:
 Hello,
 
 I'm very new to Solr and I come to an unexplainable point by myself so I
 need your help.
 
 I have indexed a huge amount of xml-Files by a shell script.
 
 function solringest_rec {
 
 for SRCFILE in $(find $1 -type f); do
 #DESTFILE=$URL${SRCFILE/$1/}
 echo ingest  $SRCFILE
 curl $URL -H Content-type: text/xml --data-binary
 @$SRCFILE
 done
 
 }
 
 
 The respone I get is everytime: 
 
 ?xml version=1.0! encoding=UTF-8?
 response
 lst name=responseHeaderint name=status0int
 name=QTime116/int/lst
 /respone
 
 
 Because of this I think that everything should be fine but the queries
 doesn't work. 
 
 For all other operations as the post operation I use the stuff from
 example
 folder. 
 Maybe I have to configure something in the schema.xml or solrconfig.xml ?
 
 
 Hope you can help me!
 
 
 Kind regards,
 
 Max
 
 
 
 
 
 
 
 
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-08 Thread Gora Mohanty
On 8 April 2013 18:38, Max Bo maximilian.brod...@gmail.com wrote:
 Hello,

 I'm very new to Solr and I come to an unexplainable point by myself so I
 need your help.

 I have indexed a huge amount of xml-Files by a shell script.
[...]

For posting XML files to Solr directly with curl the XML files
need to be in a particular format, and you need to commit at
least at the end of the indexing. Please see
http://wiki.apache.org/solr/UpdateXmlMessages

If you are following the exact command there, and using
XML files from the example/ directory, things should just
work.

Regards,
Gora


Re: Indexed data not searchable

2013-04-08 Thread Max Bo
Thanks for your help:

The URL I'am positng to is: http://localhost:8983/solr/update?commit=true


The XML-Filess I've added contains fields like author so I thought they
have to serachable since it it declared as indexed in the example schema.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054481.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Indexed data not searchable

2013-04-08 Thread Jack Krupansky

Are you sure your XML is formatted according to the SolrXML rules?

See:
http://wiki.apache.org/solr/UpdateXmlMessages

I have to ask, because sometimes people send raw XML to Solr, not realizing 
that Solr accepts a particular format of XML.


-- Jack Krupansky

-Original Message- 
From: Max Bo

Sent: Monday, April 08, 2013 9:56 AM
To: solr-user@lucene.apache.org
Subject: Re: Indexed data not searchable

Thanks for your help:

The URL I'am positng to is: http://localhost:8983/solr/update?commit=true


The XML-Filess I've added contains fields like author so I thought they
have to serachable since it it declared as indexed in the example schema.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Indexed-data-not-searchable-tp4054473p4054481.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Indexed data not searchable

2013-04-08 Thread Gora Mohanty
On 8 April 2013 19:26, Max Bo maximilian.brod...@gmail.com wrote:
 Thanks for your help:

 The URL I'am positng to is: http://localhost:8983/solr/update?commit=true


 The XML-Filess I've added contains fields like author so I thought they
 have to serachable since it it declared as indexed in the example schema.

Please include an example of your .xml file and of Solr's
schema.xml. It is difficult to keep guessing in the dark.

Regards,
Gora


Re: Indexed data not searchable

2013-04-08 Thread It-forum

hi

I use dataimporter

the actual entity contain this :
field column=id_product name=id /
field column=quantity name=inStock /
field column=reference name=ref /
field column=supplier name=brand /
field column=manufacturer name=brand /
field column=name name=brand /
field column=comptabible_model regex=Piéce détachée pour ([\w 
0-9éèêîûô]+) Modèle sourceColName=description_short /
field column=version_model regex=Modèle:([0-9a-zA-Zéèêîûô-]+),? 
sourceColName=description_short /


data sample :
Piéce détachée pour Skimmer COFIES
Modèle:Premium-Design-Omega, Zipper5 Piéce détachée pour Régulateur de 
niveau modèle 3150

Modèle:3150 depuis 2003

Ideal result :
name = Couvercle SK siglé - HAYWARD
manufacturer = HAYWARD
compatibility =  [Skimmer COFIES] - [Premium-Design-Omega, 
Zipper5]
[Régulateur de niveau modèle 3150] 
- [3150 depuis 2003]




Then I wish to ba able to get all result for, all product with HAYWARD 
as Manufacturer. Then retreive the list of All Compatible product, in 
end the list of available model.


Schema.xml contains :
field name=ref type=string indexed=true stored=true 
omitNorms=true multiValued=false/

field name=name type=text_fr indexed=true stored=true /
field name=cat type=text_fr indexed=true stored=true 
multiValued=true /
field name=brand type=text_fr indexed=true stored=true 
multiValued=true /
field name=features type=text_fr indexed=true 
stored=true multiValued=true /

where

fieldType name=text_fr class=solr.TextField positionIncrementGap=100
analyzer
tokenizer class=solr.StandardTokenizerFactory /
!-- removes l', etc --
filter class=solr.ElisionFilterFactory 
ignoreCase=true articles=lang/contractions_fr.txt /

filter class=solr.LowerCaseFilterFactory /
filter class=solr.StopFilterFactory 
ignoreCase=true words=lang/stopwords_fr.txt format=snowball 
enablePositionIncrements=true /
filter class=solr.WordDelimiterFilterFactory  
generateWordParts=1 generateNumberParts=1 catenateWords=1 
catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/

filter class=solr.StandardFilterFactory /
filter class=solr.FrenchLightStemFilterFactory /
filter class=solr.FrenchMinimalStemFilterFactory /
!-- more aggressive: filter 
class=solr.SnowballPorterFilterFactory language=French/ --

/analyzer
/fieldType

fieldType name=text_html_fr class=solr.TextField 
positionIncrementGap=100

analyzer
charFilter class=solr.HTMLStripCharFilterFactory /
tokenizer class=solr.StandardTokenizerFactory /
!-- removes l', etc --
filter class=solr.ElisionFilterFactory 
ignoreCase=true articles=lang/contractions_fr.txt /

filter class=solr.LowerCaseFilterFactory /
filter class=solr.StopFilterFactory 
ignoreCase=true words=lang/stopwords_fr.txt format=snowball 
enablePositionIncrements=true /

filter class=solr.ASCIIFoldingFilterFactory /
filter class=solr.FrenchLightStemFilterFactory /
filter class=solr.FrenchMinimalStemFilterFactory /
filter class=solr.SnowballPorterFilterFactory 
language=French/

filter class=solr.RemoveDuplicatesTokenFilterFactory/
/analyzer

/fieldType


I do not see how to organize this specification correctly with solr.

regards

eric

Le 08/04/2013 16:36, Gora Mohanty a écrit :

On 8 April 2013 19:26, Max Bo maximilian.brod...@gmail.com wrote:

Thanks for your help:

The URL I'am positng to is: http://localhost:8983/solr/update?commit=true


The XML-Filess I've added contains fields like author so I thought they
have to serachable since it it declared as indexed in the example schema.

Please include an example of your .xml file and of Solr's
schema.xml. It is difficult to keep guessing in the dark.

Regards,
Gora




Re: Indexed data not searchable

2013-04-08 Thread Gora Mohanty
On 8 April 2013 21:35, It-forum it-fo...@meseo.fr wrote:
 hi

 I use dataimporter
[...]

Please do not hijack threads. Instead, start a new one for
your questions, or follow up in a thread that you had started.
Here is why this is bad practice:
http://people.apache.org/~hossman/#threadhijack

Regards,
Gora