Fwd: Parsing and indexing parts of the input file paths

2015-07-22 Thread Andrew Musselman
Fwding to user.. -- Forwarded message -- From: Andrew Musselman andrew.mussel...@gmail.com Date: Wed, Jul 22, 2015 at 8:54 AM Subject: Re: Parsing and indexing parts of the input file paths To: d...@lucene.apache.org Thanks, and tell it to index the id field, which eventually

Re: Parsing and indexing parts of the input file paths

2015-07-22 Thread Andrew Musselman
, 2015 at 9:47 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Trying to figure out how to parse the file path, which when I run the cloud instance becomes the id for each PDF document. Is that id field the thing to parse with PatternReplaceFilterFactory in the config

Re: Parsing and indexing parts of the input file paths

2015-07-22 Thread Andrew Musselman
Erickson erickerick...@gmail.com wrote: Don't understand your question. If you're talking two different fields, use copyField. On Wed, Jul 22, 2015 at 8:55 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Fwding to user.. -- Forwarded message -- From: Andrew

Parsing and indexing parts of the input file paths

2015-07-21 Thread Andrew Musselman
Dear user and dev lists, We are loading files from a directory and would like to index a portion of each file path as a field as well as the text inside the file. E.g., on HDFS we have this file path: /user/andrew/1234/1234/file.pdf And we would like the 1234 token parsed from the file path

IntelliJ setup

2015-07-21 Thread Andrew Musselman
I followed the instructions here https://wiki.apache.org/lucene-java/HowtoConfigureIntelliJ, including `ant idea`, but I'm still not getting the links in solr classes and methods; do I need to add libraries, or am I missing something else? Thanks!

Re: Parsing and indexing parts of the input file paths

2015-07-21 Thread Andrew Musselman
, Jul 21, 2015, at 06:31 PM, Andrew Musselman wrote: Dear user and dev lists, We are loading files from a directory and would like to index a portion of each file path as a field as well as the text inside the file. E.g., on HDFS we have this file path: /user/andrew/1234/1234

Re: Parsing and indexing parts of the input file paths

2015-07-21 Thread Andrew Musselman
Which can only happen if I post it to a web service, and won't happen if I do it through config? On Tue, Jul 21, 2015 at 2:19 PM, Upayavira u...@odoko.co.uk wrote: yes, unless it has been added consciously as a separate field. On Tue, Jul 21, 2015, at 09:40 PM, Andrew Musselman wrote

Re: Parsing and indexing parts of the input file paths

2015-07-21 Thread Andrew Musselman
Upayavira On Tue, Jul 21, 2015, at 07:37 PM, Andrew Musselman wrote: I'm not sure, it's a remote team but will get more info. For now, assuming that a certain directory is specified, like /user/andrew/, and a regex is applied to capture anything two directories below matching

Re: IntelliJ setup

2015-07-21 Thread Andrew Musselman
it if switching between too diverged branches (e.g., 4.10 and 5_x). вт, 21 июля 2015 г. в 21:53, Andrew Musselman andrew.mussel...@gmail.com : I followed the instructions here https://wiki.apache.org/lucene-java/HowtoConfigureIntelliJ, including `ant idea`, but I'm still not getting