Fwding to user..
-- Forwarded message --
From: Andrew Musselman andrew.mussel...@gmail.com
Date: Wed, Jul 22, 2015 at 8:54 AM
Subject: Re: Parsing and indexing parts of the input file paths
To: d...@lucene.apache.org
Thanks, and tell it to index the id field, which eventually
, 2015 at 9:47 AM, Andrew Musselman
andrew.mussel...@gmail.com wrote:
Trying to figure out how to parse the file path, which when I run the
cloud instance becomes the id for each PDF document.
Is that id field the thing to parse with PatternReplaceFilterFactory in
the config
Erickson erickerick...@gmail.com
wrote:
Don't understand your question. If you're talking two different
fields, use copyField.
On Wed, Jul 22, 2015 at 8:55 AM, Andrew Musselman
andrew.mussel...@gmail.com wrote:
Fwding to user..
-- Forwarded message --
From: Andrew
Dear user and dev lists,
We are loading files from a directory and would like to index a portion of
each file path as a field as well as the text inside the file.
E.g., on HDFS we have this file path:
/user/andrew/1234/1234/file.pdf
And we would like the 1234 token parsed from the file path
I followed the instructions here
https://wiki.apache.org/lucene-java/HowtoConfigureIntelliJ, including `ant
idea`, but I'm still not getting the links in solr classes and methods; do
I need to add libraries, or am I missing something else?
Thanks!
, Jul 21, 2015, at 06:31 PM, Andrew Musselman wrote:
Dear user and dev lists,
We are loading files from a directory and would like to index a portion
of
each file path as a field as well as the text inside the file.
E.g., on HDFS we have this file path:
/user/andrew/1234/1234
Which can only happen if I post it to a web service, and won't happen if I
do it through config?
On Tue, Jul 21, 2015 at 2:19 PM, Upayavira u...@odoko.co.uk wrote:
yes, unless it has been added consciously as a separate field.
On Tue, Jul 21, 2015, at 09:40 PM, Andrew Musselman wrote
Upayavira
On Tue, Jul 21, 2015, at 07:37 PM, Andrew Musselman wrote:
I'm not sure, it's a remote team but will get more info. For now,
assuming
that a certain directory is specified, like /user/andrew/, and a regex
is
applied to capture anything two directories below matching
it
if switching between too diverged branches (e.g., 4.10 and 5_x).
вт, 21 июля 2015 г. в 21:53, Andrew Musselman andrew.mussel...@gmail.com
:
I followed the instructions here
https://wiki.apache.org/lucene-java/HowtoConfigureIntelliJ, including
`ant
idea`, but I'm still not getting