[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-26 Thread barrotsteindev
Github user barrotsteindev commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
Luckily for us the locale does not seem to cause a problem.
Unfortunately, the double space before single day digit test did not pass, 
forcing me to change the asctime configuration to "**EEE MMM [ ]d HH:mm:ss 
[z ]**",
which results in the test passing.


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-25 Thread dsmiley
Github user dsmiley commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
Another little detail I see that's different is the default locale.  
ParseDateFieldUpdateProcessorFactory uses ROOT but ExtractDateUtils uses 
ENGLISH.  AFAIK, such a difference would appear when parsing timezones like 
AKST.  I could be wrong.  Can we see if this is indeed a problem?  Ideally a 
test would demonstrate if it is or not.  If we do find an issue here... then we 
could simply always configure the locale in the configs (my preference) or we 
could change the internal default of `ParseDateFieldUpdateProcessorFactory`


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-25 Thread barrotsteindev
Github user barrotsteindev commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
Added test for RFC-2616 (ParsingFieldUpdateProcessorsTest#testRfc2616) :-).


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-24 Thread barrotsteindev
Github user barrotsteindev commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
> Alternatively, perhaps leave that part to me. Or take a 1st cut at it and 
I edit from there, which I'm likely to do any way. Either way.

I read the doc file and could not come up with the right words for the job.
Perhaps this part is better left for you, and I could learn from the final 
patch so I can do these kind of tasks next time. :)


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-24 Thread dsmiley
Github user dsmiley commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
Docs: 
`solr/solr-ref-guide/src/uploading-data-with-solr-cell-using-apache-tika.adoc`

Alternatively, perhaps leave that part to me.  Or take a 1st cut at it and 
I edit from there, which I'm likely to do any way.  Either way.

RE the default config:  I'm concerned we're not testing the default config 
well.  Here's what I recommend:  in 
solrconfig-parsing-update-processor-chains.xml.  I think there should be one 
date config that is intentionally identical to the `_default` configSet's 
configuration of this URP (and should have a name indicating this, like 
"parse-date-default-configSet").  All the other configurations of this URP in 
the file should have exactly one pattern and is only there to test a particular 
condition -- i.e. the effects of a locale or an interesting pattern or 
something like that.   I realize this recommendation would mean stomping over a 
bit of the changes in a previous issue, e.g. removing 
`parse-date-patterns-from-extract-contrib`, but the test code should stay the 
same, I think, notwithstanding referring to a new/renamed parse-date processor 
config per test.

To me, the merging of patterns in ExtractDateUtils into the default URP 
patterns seems distinct enough from removing ExtractDateUtils that it ought to 
occur in its own issue.

BTW I was looking at DateTimeFormatter.RFC_1123_DATE_TIME and noticed the 
leading day of week is optional.  We should update our pattern accordingly, and 
test it's optionality.  It'd be nice if we had a feature like 
`RFC_1123_DATE_TIME` where we could refer to a well known pattern by 
it's well-known name and in turn get the pre-compiled instance in 
DateTimeFormatter.  But that's not in scope here.


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-24 Thread barrotsteindev
Github user barrotsteindev commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
@dsmiley , I am having trouble figuring out which parts of the ref guide I 
should edit,
could you give me a hand as I am fairly new to this? 😅
Apologies, I am very new to all of this.


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[GitHub] lucene-solr issue #438: SOLR-12593: Remove date parsing functionality from e...

2018-08-22 Thread barrotsteindev
Github user barrotsteindev commented on the issue:

https://github.com/apache/lucene-solr/pull/438
  
Used the patterns from parse-date-patterns-from-extract-contrib,
which seem to work just fine :).


---

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org