Re: Displaying a particular field on result

2014-06-03 Thread Bayu Widyasanyata
Thank you Alexandre!
I will check my configurations again.


On Wed, Jun 4, 2014 at 9:14 AM, Alexandre Rafalovitch 
wrote:

> Ok, the question was if I understood it now:
>
> "I am importing data from Nutch into Solr. One of the fields is
> "author" and I have defined it in Solr's schema.xml. Unfortunately, it
> is always empty when I check the records in the Solr's AdminUI. How
> can I confirm that the field was actually indexed into Solr?"
>
> In which case, Bayu's answer should solve it. Nutch is either not
> extracting the field or trying to push it to Solr with a wrong name.
> Try switching the "*" field definition to catch it (stored=true).
> Alternatively, disable "*" definition all together and see what fields
> will fail (might be a lot of them though).
>
> Regards,
>Alex.
> Personal website: http://www.outerthoughts.com/
> Current project: http://www.solr-start.com/ - Accelerating your Solr
> proficiency
>
>
> On Wed, Jun 4, 2014 at 9:08 AM, Bayu Widyasanyata
>  wrote:
> > Hi Alexandre,
> >
> > I've already play with "fl" parameter in Admin UI but the result is not I
> > expected.
> >
> > From what I understand that Solr database structure is defined on Solr's
> > schema.xml.
> > On that file we defined in example "author" field to store author content
> > in Solr database.
> >
> > Even I put "author" as "fl" paramater in Admin UI, the query will never
> > show the contents, even I have (PDF/doc) document having author content.
> >
> > How to display that field?
> > Or take a previous step, how to ensure or check that field is already
> > stored on Solr?
> >
> >
> > On Wed, Jun 4, 2014 at 8:59 AM, Alexandre Rafalovitch <
> arafa...@gmail.com>
> > wrote:
> >
> >> Are you looking for the 'fl' parameter by any chance:
> >>
> >>
> https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefl(FieldList)Parameter
> >> ?
> >>
> >> It's in the Admin UI as well.
> >>
> >> If not, then you really do need to rephrase your question. Maybe by
> >> giving a very specific example.
> >>
> >> Regards,
> >>Alex.
> >> Personal website: http://www.outerthoughts.com/
> >> Current project: http://www.solr-start.com/ - Accelerating your Solr
> >> proficiency
> >>
> >>
> >> On Wed, Jun 4, 2014 at 8:51 AM, Bayu Widyasanyata
> >>  wrote:
> >> > Hi Ahmet,
> >> >
> >> > I just refering to Solr's schema.xml which described this field
> >> definition.
> >> > In this case for example "author" field.
> >> > Then also refer to Solr query's result which I queried through Solr
> Admin
> >> > page that didn't response author field.
> >> > CMIIW.
> >> >
> >> > Thanks.-
> >> >
> >> >
> >> > On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan
> 
> >> > wrote:
> >> >
> >> >> Hi Bayu,
> >> >>
> >> >> I think this is a nutch question, no?
> >> >>
> >> >> Ahmet
> >> >>
> >> >>
> >> >>
> >> >> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
> >> >> bwidyasany...@gmail.com> wrote:
> >> >> Hi,
> >> >>
> >> >> I'm sorry if this is a frequently asked question.
> >> >>
> >> >> In default Solr's schema.xml file we define an "author" field like
> >> >> following:
> >> >>  >> indexed="true"/>
> >> >>
> >> >> But this field seems not parsed (by nutch) and indexed (by Solr).
> >> >> My query is always return null result for "author" field even some
> >> >> documents (PDF) are have author contents.
> >> >>
> >> >> How to display them?
> >> >> What should I prepared during fetch & parsing which I missed out?
> >> >> Any documents/links for this issue?
> >> >>
> >> >> Thanks in advance.
> >> >>
> >> >> --
> >> >> wassalam,
> >> >> [bayu]
> >> >>
> >> >>
> >> >
> >> >
> >> > --
> >> > wassalam,
> >> > [bayu]
> >>
> >
> >
> >
> > --
> > wassalam,
> > [bayu]
>



-- 
wassalam,
[bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Alexandre Rafalovitch
I meant of course, "Ahmet's answer". Sorry, both.

Regards,
  Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency


On Wed, Jun 4, 2014 at 9:14 AM, Alexandre Rafalovitch
 wrote:
> Ok, the question was if I understood it now:
>
> "I am importing data from Nutch into Solr. One of the fields is
> "author" and I have defined it in Solr's schema.xml. Unfortunately, it
> is always empty when I check the records in the Solr's AdminUI. How
> can I confirm that the field was actually indexed into Solr?"
>
> In which case, Bayu's answer should solve it. Nutch is either not
> extracting the field or trying to push it to Solr with a wrong name.
> Try switching the "*" field definition to catch it (stored=true).
> Alternatively, disable "*" definition all together and see what fields
> will fail (might be a lot of them though).
>
> Regards,
>Alex.
> Personal website: http://www.outerthoughts.com/
> Current project: http://www.solr-start.com/ - Accelerating your Solr 
> proficiency
>
>
> On Wed, Jun 4, 2014 at 9:08 AM, Bayu Widyasanyata
>  wrote:
>> Hi Alexandre,
>>
>> I've already play with "fl" parameter in Admin UI but the result is not I
>> expected.
>>
>> From what I understand that Solr database structure is defined on Solr's
>> schema.xml.
>> On that file we defined in example "author" field to store author content
>> in Solr database.
>>
>> Even I put "author" as "fl" paramater in Admin UI, the query will never
>> show the contents, even I have (PDF/doc) document having author content.
>>
>> How to display that field?
>> Or take a previous step, how to ensure or check that field is already
>> stored on Solr?
>>
>>
>> On Wed, Jun 4, 2014 at 8:59 AM, Alexandre Rafalovitch 
>> wrote:
>>
>>> Are you looking for the 'fl' parameter by any chance:
>>>
>>> https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefl(FieldList)Parameter
>>> ?
>>>
>>> It's in the Admin UI as well.
>>>
>>> If not, then you really do need to rephrase your question. Maybe by
>>> giving a very specific example.
>>>
>>> Regards,
>>>Alex.
>>> Personal website: http://www.outerthoughts.com/
>>> Current project: http://www.solr-start.com/ - Accelerating your Solr
>>> proficiency
>>>
>>>
>>> On Wed, Jun 4, 2014 at 8:51 AM, Bayu Widyasanyata
>>>  wrote:
>>> > Hi Ahmet,
>>> >
>>> > I just refering to Solr's schema.xml which described this field
>>> definition.
>>> > In this case for example "author" field.
>>> > Then also refer to Solr query's result which I queried through Solr Admin
>>> > page that didn't response author field.
>>> > CMIIW.
>>> >
>>> > Thanks.-
>>> >
>>> >
>>> > On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan 
>>> > wrote:
>>> >
>>> >> Hi Bayu,
>>> >>
>>> >> I think this is a nutch question, no?
>>> >>
>>> >> Ahmet
>>> >>
>>> >>
>>> >>
>>> >> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
>>> >> bwidyasany...@gmail.com> wrote:
>>> >> Hi,
>>> >>
>>> >> I'm sorry if this is a frequently asked question.
>>> >>
>>> >> In default Solr's schema.xml file we define an "author" field like
>>> >> following:
>>> >> >> indexed="true"/>
>>> >>
>>> >> But this field seems not parsed (by nutch) and indexed (by Solr).
>>> >> My query is always return null result for "author" field even some
>>> >> documents (PDF) are have author contents.
>>> >>
>>> >> How to display them?
>>> >> What should I prepared during fetch & parsing which I missed out?
>>> >> Any documents/links for this issue?
>>> >>
>>> >> Thanks in advance.
>>> >>
>>> >> --
>>> >> wassalam,
>>> >> [bayu]
>>> >>
>>> >>
>>> >
>>> >
>>> > --
>>> > wassalam,
>>> > [bayu]
>>>
>>
>>
>>
>> --
>> wassalam,
>> [bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Alexandre Rafalovitch
Ok, the question was if I understood it now:

"I am importing data from Nutch into Solr. One of the fields is
"author" and I have defined it in Solr's schema.xml. Unfortunately, it
is always empty when I check the records in the Solr's AdminUI. How
can I confirm that the field was actually indexed into Solr?"

In which case, Bayu's answer should solve it. Nutch is either not
extracting the field or trying to push it to Solr with a wrong name.
Try switching the "*" field definition to catch it (stored=true).
Alternatively, disable "*" definition all together and see what fields
will fail (might be a lot of them though).

Regards,
   Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency


On Wed, Jun 4, 2014 at 9:08 AM, Bayu Widyasanyata
 wrote:
> Hi Alexandre,
>
> I've already play with "fl" parameter in Admin UI but the result is not I
> expected.
>
> From what I understand that Solr database structure is defined on Solr's
> schema.xml.
> On that file we defined in example "author" field to store author content
> in Solr database.
>
> Even I put "author" as "fl" paramater in Admin UI, the query will never
> show the contents, even I have (PDF/doc) document having author content.
>
> How to display that field?
> Or take a previous step, how to ensure or check that field is already
> stored on Solr?
>
>
> On Wed, Jun 4, 2014 at 8:59 AM, Alexandre Rafalovitch 
> wrote:
>
>> Are you looking for the 'fl' parameter by any chance:
>>
>> https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefl(FieldList)Parameter
>> ?
>>
>> It's in the Admin UI as well.
>>
>> If not, then you really do need to rephrase your question. Maybe by
>> giving a very specific example.
>>
>> Regards,
>>Alex.
>> Personal website: http://www.outerthoughts.com/
>> Current project: http://www.solr-start.com/ - Accelerating your Solr
>> proficiency
>>
>>
>> On Wed, Jun 4, 2014 at 8:51 AM, Bayu Widyasanyata
>>  wrote:
>> > Hi Ahmet,
>> >
>> > I just refering to Solr's schema.xml which described this field
>> definition.
>> > In this case for example "author" field.
>> > Then also refer to Solr query's result which I queried through Solr Admin
>> > page that didn't response author field.
>> > CMIIW.
>> >
>> > Thanks.-
>> >
>> >
>> > On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan 
>> > wrote:
>> >
>> >> Hi Bayu,
>> >>
>> >> I think this is a nutch question, no?
>> >>
>> >> Ahmet
>> >>
>> >>
>> >>
>> >> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
>> >> bwidyasany...@gmail.com> wrote:
>> >> Hi,
>> >>
>> >> I'm sorry if this is a frequently asked question.
>> >>
>> >> In default Solr's schema.xml file we define an "author" field like
>> >> following:
>> >> > indexed="true"/>
>> >>
>> >> But this field seems not parsed (by nutch) and indexed (by Solr).
>> >> My query is always return null result for "author" field even some
>> >> documents (PDF) are have author contents.
>> >>
>> >> How to display them?
>> >> What should I prepared during fetch & parsing which I missed out?
>> >> Any documents/links for this issue?
>> >>
>> >> Thanks in advance.
>> >>
>> >> --
>> >> wassalam,
>> >> [bayu]
>> >>
>> >>
>> >
>> >
>> > --
>> > wassalam,
>> > [bayu]
>>
>
>
>
> --
> wassalam,
> [bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Bayu Widyasanyata
Hi Alexandre,

I've already play with "fl" parameter in Admin UI but the result is not I
expected.

>From what I understand that Solr database structure is defined on Solr's
schema.xml.
On that file we defined in example "author" field to store author content
in Solr database.

Even I put "author" as "fl" paramater in Admin UI, the query will never
show the contents, even I have (PDF/doc) document having author content.

How to display that field?
Or take a previous step, how to ensure or check that field is already
stored on Solr?


On Wed, Jun 4, 2014 at 8:59 AM, Alexandre Rafalovitch 
wrote:

> Are you looking for the 'fl' parameter by any chance:
>
> https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefl(FieldList)Parameter
> ?
>
> It's in the Admin UI as well.
>
> If not, then you really do need to rephrase your question. Maybe by
> giving a very specific example.
>
> Regards,
>Alex.
> Personal website: http://www.outerthoughts.com/
> Current project: http://www.solr-start.com/ - Accelerating your Solr
> proficiency
>
>
> On Wed, Jun 4, 2014 at 8:51 AM, Bayu Widyasanyata
>  wrote:
> > Hi Ahmet,
> >
> > I just refering to Solr's schema.xml which described this field
> definition.
> > In this case for example "author" field.
> > Then also refer to Solr query's result which I queried through Solr Admin
> > page that didn't response author field.
> > CMIIW.
> >
> > Thanks.-
> >
> >
> > On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan 
> > wrote:
> >
> >> Hi Bayu,
> >>
> >> I think this is a nutch question, no?
> >>
> >> Ahmet
> >>
> >>
> >>
> >> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
> >> bwidyasany...@gmail.com> wrote:
> >> Hi,
> >>
> >> I'm sorry if this is a frequently asked question.
> >>
> >> In default Solr's schema.xml file we define an "author" field like
> >> following:
> >>  indexed="true"/>
> >>
> >> But this field seems not parsed (by nutch) and indexed (by Solr).
> >> My query is always return null result for "author" field even some
> >> documents (PDF) are have author contents.
> >>
> >> How to display them?
> >> What should I prepared during fetch & parsing which I missed out?
> >> Any documents/links for this issue?
> >>
> >> Thanks in advance.
> >>
> >> --
> >> wassalam,
> >> [bayu]
> >>
> >>
> >
> >
> > --
> > wassalam,
> > [bayu]
>



-- 
wassalam,
[bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Ahmet Arslan
Hi,

I don't know nutch, but I will answer as if you were using solr cell : 
http://wiki.apache.org/solr/ExtractingRequestHandler

When a pdf file is sent to extracting request handler, several meta data are 
extracted from pdf. These metadata are assigned to fields. I usually enable 
dynamic field * to capture all metadata and see their accociated field names 
and values. Afterwards I select useful ones (and define them in schema.xml like 
you did for author) and forward remaining ones to an ignored dynamic field. 
Wiki page has all info to manipulate metadata generated by extraction.

Hope this helps.



On Wednesday, June 4, 2014 4:51 AM, Bayu Widyasanyata  
wrote:



Hi Ahmet,

I just refering to Solr's schema.xml which described this field definition. In 
this case for example "author" field.
Then also refer to Solr query's result which I queried through Solr Admin page 
that didn't response author field.
CMIIW.

Thanks.-




On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan  wrote:

Hi Bayu,
>
>I think this is a nutch question, no?
>
>Ahmet
>
>
>
>
>On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata 
> wrote:
>Hi,
>
>I'm sorry if this is a frequently asked question.
>
>In default Solr's schema.xml file we define an "author" field like
>following:
>    
>
>But this field seems not parsed (by nutch) and indexed (by Solr).
>My query is always return null result for "author" field even some
>documents (PDF) are have author contents.
>
>How to display them?
>What should I prepared during fetch & parsing which I missed out?
>Any documents/links for this issue?
>
>Thanks in advance.
>
>--
>wassalam,
>[bayu]
>
>


-- 
wassalam,
[bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Alexandre Rafalovitch
Are you looking for the 'fl' parameter by any chance:
https://cwiki.apache.org/confluence/display/solr/Common+Query+Parameters#CommonQueryParameters-Thefl(FieldList)Parameter
?

It's in the Admin UI as well.

If not, then you really do need to rephrase your question. Maybe by
giving a very specific example.

Regards,
   Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency


On Wed, Jun 4, 2014 at 8:51 AM, Bayu Widyasanyata
 wrote:
> Hi Ahmet,
>
> I just refering to Solr's schema.xml which described this field definition.
> In this case for example "author" field.
> Then also refer to Solr query's result which I queried through Solr Admin
> page that didn't response author field.
> CMIIW.
>
> Thanks.-
>
>
> On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan 
> wrote:
>
>> Hi Bayu,
>>
>> I think this is a nutch question, no?
>>
>> Ahmet
>>
>>
>>
>> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
>> bwidyasany...@gmail.com> wrote:
>> Hi,
>>
>> I'm sorry if this is a frequently asked question.
>>
>> In default Solr's schema.xml file we define an "author" field like
>> following:
>> 
>>
>> But this field seems not parsed (by nutch) and indexed (by Solr).
>> My query is always return null result for "author" field even some
>> documents (PDF) are have author contents.
>>
>> How to display them?
>> What should I prepared during fetch & parsing which I missed out?
>> Any documents/links for this issue?
>>
>> Thanks in advance.
>>
>> --
>> wassalam,
>> [bayu]
>>
>>
>
>
> --
> wassalam,
> [bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Bayu Widyasanyata
Hi Ahmet,

I just refering to Solr's schema.xml which described this field definition.
In this case for example "author" field.
Then also refer to Solr query's result which I queried through Solr Admin
page that didn't response author field.
CMIIW.

Thanks.-


On Wed, Jun 4, 2014 at 5:19 AM, Ahmet Arslan 
wrote:

> Hi Bayu,
>
> I think this is a nutch question, no?
>
> Ahmet
>
>
>
> On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata <
> bwidyasany...@gmail.com> wrote:
> Hi,
>
> I'm sorry if this is a frequently asked question.
>
> In default Solr's schema.xml file we define an "author" field like
> following:
> 
>
> But this field seems not parsed (by nutch) and indexed (by Solr).
> My query is always return null result for "author" field even some
> documents (PDF) are have author contents.
>
> How to display them?
> What should I prepared during fetch & parsing which I missed out?
> Any documents/links for this issue?
>
> Thanks in advance.
>
> --
> wassalam,
> [bayu]
>
>


-- 
wassalam,
[bayu]


Re: Displaying a particular field on result

2014-06-03 Thread Ahmet Arslan
Hi Bayu,

I think this is a nutch question, no?

Ahmet



On Wednesday, June 4, 2014 1:13 AM, Bayu Widyasanyata  
wrote:
Hi,

I'm sorry if this is a frequently asked question.

In default Solr's schema.xml file we define an "author" field like
following:
    

But this field seems not parsed (by nutch) and indexed (by Solr).
My query is always return null result for "author" field even some
documents (PDF) are have author contents.

How to display them?
What should I prepared during fetch & parsing which I missed out?
Any documents/links for this issue?

Thanks in advance.

-- 
wassalam,
[bayu]