[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-05 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15998895#comment-15998895
 ] 

Mikhail Khludnev commented on SOLR-9867:


Thanks, [~werder] the finding is priceless, let's tackle it SOLR-10614 later. 
Now, urging by SOLR-10611 we need to bring the latch back again under 
SOLR-10615. 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: 6x failure 0 docs.txt, Lucene-Solr-master-MacOSX #3986 
> Console [Jenkins].htm, SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-05 Thread Andrey Kudryavtsev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15998732#comment-15998732
 ] 

Andrey Kudryavtsev commented on SOLR-9867:
--

{code}
   [junit4]   1> POSTing file money.xml to [base]

   [junit4]   1> POSTing file ipod_other.xml to [base]
   [junit4]   1> 14 files indexed.
   [junit4]   1> COMMITting Solr index changes to 
http://localhost:43383/solr/techproducts/update...
   [junit4]   1> Time spent: 0:00:00.002

{code}

 if everything works as expected there should be Solr's logs among client's 
logs. All that
{code}
17062 INFO  (qtp638502344-15) [x:techproducts] 
o.a.s.u.p.LogUpdateProcessorFactory [techproducts]  webapp=/solr path=/update 
params={}{add=[EN7800GTX/2DHTV/256M (1566581601311129600), 100-435805 
(1566581601326858240)]} 0 20

{code}

and etc.

I feel like it might be a test issue, because if one sets 
{{SimplePostTool.mockMode}} to {{true}}, issue will reproduce. 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: 6x failure 0 docs.txt, Lucene-Solr-master-MacOSX #3986 
> Console [Jenkins].htm, SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15997580#comment-15997580
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit c6ebee6a49e82f53090adf95836996ffecf0bb1c in lucene-solr's branch 
refs/heads/master from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=c6ebee6 ]

SOLR-9867: @Ignore TestSolrCLIRunExample.testTechproductsExample()


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: 6x failure 0 docs.txt, Lucene-Solr-master-MacOSX #3986 
> Console [Jenkins].htm, SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15997574#comment-15997574
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 56851d6e6b781516f06b7203d60f6ad117ea5091 in lucene-solr's branch 
refs/heads/branch_6x from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=56851d6 ]

SOLR-9867: @Ignore TestSolrCLIRunExample.testTechproductsExample()


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: 6x failure 0 docs.txt, Lucene-Solr-master-MacOSX #3986 
> Console [Jenkins].htm, SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15997544#comment-15997544
 ] 

Mikhail Khludnev commented on SOLR-9867:


the same is 
[here|https://builds.apache.org/job/Lucene-Solr-Tests-6.x/862/consoleFull]
posttool throw xmls to nowhere in 2 ms, and .. nothing.
{code}
  [junit4]   2> 2054068 INFO  (qtp132288369-20302) [x:techproducts] 
o.a.s.s.HttpSolrCall [admin] webapp=null path=/admin/cores 
params={name=techproducts=CREATE=techproducts=json} 
status=0 QTime=454
   [junit4]   1> SimplePostTool version 5.0.0
   [junit4]   1> Posting files to [base] url 
http://localhost:43383/solr/techproducts/update using content-type 
application/xml...
   [junit4]   1> POSTing file money.xml to [base]

   [junit4]   1> POSTing file ipod_other.xml to [base]
   [junit4]   1> 14 files indexed.
   [junit4]   1> COMMITting Solr index changes to 
http://localhost:43383/solr/techproducts/update...
   [junit4]   1> Time spent: 0:00:00.002
   [junit4]   2> 2054075 INFO  (qtp132288369-20305) [x:techproducts] 
o.a.s.c.S.Request [techproducts]  webapp=/solr path=/select 
params={q=*:*=javabin=2} hits=0 status=0 QTime=0
   [junit4]   2> 2054075 WARN  
(TEST-TestSolrCLIRunExample.testTechproductsExample-seed#[51307655FACE2661]) [  
  ] o.a.s.u.TestSolrCLIRunExample Going to wait for 1 second before re-trying 
query for techproduct example docs ...
   [junit4]   2> 2055078 INFO  (qtp132288369-20305) [x:techproducts] 
o.a.s.c.S.Request [techproducts]  webapp=/solr path=/select 
params={q=*:*=javabin=2} hits=0 status=0 QTime=1
   [junit4]   2> 2055079 INFO  
(TEST-TestSolrCLIRunExample.testTechproductsExample-seed#[51307655FACE2661]) [  
  ] o.a.s.SolrTestCaseJ4 ###Ending testTechproductsExample
{code}  
Is it super asynchronous? 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: 6x failure 0 docs.txt, Lucene-Solr-master-MacOSX #3986 
> Console [Jenkins].htm, SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996932#comment-15996932
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit b59f816e6204e59dbb9bfde464032cabb8ba0d50 in lucene-solr's branch 
refs/heads/branch_6x from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=b59f816 ]

SOLR-9867: fixing JvmMetricsTest broken earlier, bring back 
testTechproductsExample()
and single SDF.cores assignment.


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996883#comment-15996883
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 5eb4a8d71b491ca7f389b4dc7414dbc1a2620ae2 in lucene-solr's branch 
refs/heads/master from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=5eb4a8d ]

SOLR-9867: fixing JvmMetricsTest broken earlier, bring back 
testTechproductsExample()
and single SDF.cores assignment.


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996780#comment-15996780
 ] 

Mikhail Khludnev commented on SOLR-9867:


[~werder], where have you been an hour ago?  

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Andrey Kudryavtsev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996760#comment-15996760
 ] 

Andrey Kudryavtsev commented on SOLR-9867:
--

But what if we, you know

{code}
-  private void setupJvmMetrics()  {
+  private void setupJvmMetrics(CoreContainer cores)  {
 SolrMetricManager metricManager = cores.getMetricManager();
 final Set hiddenSysProps = cores.getConfig().getHiddenSysProps();
 try {
@@ -247,7 +247,7 @@ public class SolrDispatchFilter extends BaseSolrFilter {
*/
   protected CoreContainer createCoreContainer(Path solrHome, Properties 
extraProperties) {
 NodeConfig nodeConfig = loadNodeConfig(solrHome, extraProperties);
-cores = new CoreContainer(nodeConfig, extraProperties, true);
+CoreContainer cores = new CoreContainer(nodeConfig, extraProperties, true);
 cores.load();
 return cores;
   }
{code}

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996716#comment-15996716
 ] 

Mikhail Khludnev commented on SOLR-9867:


rolled back {{SDF.createCoreContainer()}}, disabled {{testTechproductsExample}}
Since it seems like it breaks {{JvmMetricsTest.testSetupJvmMetrics}} at least. 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996712#comment-15996712
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 72e76138ce7d9ec21d3178eb6ee65908792d0614 in lucene-solr's branch 
refs/heads/branch_6x from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=72e7613 ]

SOLR-9867: rollback SDF.createCoreContainer(). disable testTechproductsExample


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996711#comment-15996711
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 1534b6219fa99f7a64f372778e727382e256b423 in lucene-solr's branch 
refs/heads/master from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1534b62 ]

SOLR-9867: rollback SDF.createCoreContainer(). disable testTechproductsExample


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996703#comment-15996703
 ] 

Mikhail Khludnev commented on SOLR-9867:


it seems like it wasn't a great idea. 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996664#comment-15996664
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit b4f936c7367dfaff6edc75fdce20ea383d0c1dd1 in lucene-solr's branch 
refs/heads/branch_6x from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=b4f936c ]

SOLR-9867: make sure cores are assigned in the end of 
SolrDispatchFilter.createCoreContainer() only


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996661#comment-15996661
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 30f1422ba955a38c37b1ec97244ebb7ede9ee7be in lucene-solr's branch 
refs/heads/master from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=30f1422 ]

SOLR-9867: make sure cores are assigned in the end of 
SolrDispatchFilter.createCoreContainer() only


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, 
> SOLR-9867-createCoreContainer-fix.patch, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Andrey Kudryavtsev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996505#comment-15996505
 ] 

Andrey Kudryavtsev commented on SOLR-9867:
--

{quote}
I've removed SDF.init() latch, and just assign volatile SDF.cores at the end of 
init() since SDF.doFilter() already has a precondition check SDF.cores!=null.
{quote}

Not sure whether it helped - there is an assignment for {{cores}} in 
{{createCoreContainer()}}

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: Lucene-Solr-master-MacOSX #3986 Console [Jenkins].htm, 
> SDF init and doFilter in parallel.png, SOLR-9867-ignore-whitespace.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch, stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996302#comment-15996302
 ] 

Mikhail Khludnev commented on SOLR-9867:


[~erickerickson] I spin off SOLR-10605 since SOLR-9867 is about 
{{TestSolrCLIRunExample.testTechproductsExample()}}.
Please comment on it.  

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996251#comment-15996251
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 62b0458166dda2a83aee3e348ae76ea75542f6b4 in lucene-solr's branch 
refs/heads/branch_6x from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=62b0458 ]

SOLR-9867: fixing TestSolrCLIRunExample.testTechproductsExample

- SolrDispatchFilter.doFilter rejects invocation until init() is completed.
- introducing isLoaded=false, isLoading=true core status
- blocking shutdown until core loading stops
- looping run example tool while core is loading 1 min max.


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-04 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15996238#comment-15996238
 ] 

ASF subversion and git services commented on SOLR-9867:
---

Commit 32b7791fa8019aff2fca65c06deda48a6360da41 in lucene-solr's branch 
refs/heads/master from [~mkhludnev]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=32b7791 ]

SOLR-9867: fixing TestSolrCLIRunExample.testTechproductsExample

- SolrDispatchFilter.doFilter rejects invocation until init() is completed.
- introducing isLoaded=false, isLoading=true core status
- blocking shutdown until core loading stops
- looping run example tool while core is loading 1 min max.


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995801#comment-15995801
 ] 

Erick Erickson commented on SOLR-9867:
--

It's nothing new I don't think. Possibly related to Shalin's comment on 
SOLR-10562?

So let's go ahead and commit and perhaps beast it again when something happens 
on 10562?

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch, 
> stdout_90
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995734#comment-15995734
 ] 

Mikhail Khludnev commented on SOLR-9867:


{code}
   [junit4]   2> 35073 INFO  (qtp1031286021-158) [n:localhost:32820_solr] 
o.a.s.s.HttpSolrCall [admin] webapp=null path=/admin/collections 
params={replicationFactor=2=4=testCloudExamplePrompt=testCloudExamplePrompt=CREATE=2=json}
 status=0 QTime=5117
   [junit4]   2> 35108 INFO  (qtp1031286021-207) [n:localhost:32820_solr 
c:testCloudExamplePrompt s:shard2 r:core_node1 
x:testCloudExamplePrompt_shard2_replica1] o.a.s.h.SolrConfigHandler Executed 
config commands successfully and persisted to ZK 
[{"set-property":{"updateHandler.autoSoftCommit.maxTime":"3000"}}]
   [junit4]   2> 35111 INFO  (qtp1031286021-207) [n:localhost:32820_solr 
c:testCloudExamplePrompt s:shard2 r:core_node1 
x:testCloudExamplePrompt_shard2_replica1] o.a.s.h.SolrConfigHandler Waiting up 
to 30 secs for 4 replicas to set the property overlay to be of version 0 for 
collection testCloudExamplePrompt
   [junit4]   2> 35112 INFO  (Thread-81) [n:localhost:32820_solr] 
o.a.s.c.SolrCore config update listener called for core 
testCloudExamplePrompt_shard2_replica2
   [junit4]   2> 35115 INFO  
(solrHandlerExecutor-81-thread-1-processing-n:localhost:32820_solr 
x:testCloudExamplePrompt_shard2_replica1 s:shard2 c:testCloudExamplePrompt 
r:core_node1) [n:localhost:32820_solr c:testCloudExamplePrompt s:shard2 
r:core_node1 x:testCloudExamplePrompt_shard2_replica1] 
o.a.s.h.SolrConfigHandler Time elapsed : 0 secs, maxWait 30
   [junit4]   2> 35115 INFO  (Thread-81) [n:localhost:32820_solr] 
o.a.s.c.SolrCore core reload testCloudExamplePrompt_shard2_replica2
{code}
Collection has been created, param update is sent, Zk listener {{(Thread-81)}} 
starts core reload 
{code}
   [junit4]   2> 39099 INFO  (qtp1031286021-155) [n:localhost:32820_solr] 
o.a.s.m.SolrMetricManager Closing metric reporters for 
registry=solr.core.testCloudExamplePrompt.shard2.replica2, tag=149464127
   [junit4]   2> 39099 INFO  (qtp1031286021-155) [n:localhost:32820_solr] 
o.a.s.m.SolrMetricManager Closing metric reporters for 
registry=solr.collection.testCloudExamplePrompt.shard2.leader, tag=149464127
   [junit4]   2> 39106 INFO  
(zkCallback-16-thread-1-processing-n:localhost:32820_solr) 
[n:localhost:32820_solr] o.a.s.c.c.ZkStateReader A cluster state change: 
[WatchedEvent state:SyncConnected type:NodeDataChanged 
path:/collections/testCloudExamplePrompt/state.json] for collection 
[testCloudExamplePrompt] has occurred - updating... (live nodes size: [1])
   [junit4]   2> 39106 INFO  (qtp1031286021-221) [n:localhost:32820_solr] 
o.a.s.s.HttpSolrCall [admin] webapp=null path=/admin/cores 
params={deleteInstanceDir=true=testCloudExamplePrompt_shard1_replica2=/admin/cores=true=UNLOAD=javabin=2}
 status=0 QTime=107
   [junit4]   2> 39107 INFO  (Thread-81) [n:localhost:32820_solr 
c:testCloudExamplePrompt s:shard2 r:core_node3 
x:testCloudExamplePrompt_shard2_replica2] o.a.s.m.r.SolrJmxReporter JMX 
monitoring for 'solr.core.testCloudExamplePrompt.shard1.replica2' (registry 
'solr.core.testCloudExamplePrompt.shard1.replica2') enabled at server: 
com.sun.jmx.mbeanserver.JmxMBeanServer@167c2a09
   [junit4]   2> 39108 INFO  (qtp1031286021-187) [n:localhost:32820_solr] 
o.a.s.s.HttpSolrCall [admin] webapp=null path=/admin/cores 
params={deleteInstanceDir=true=testCloudExamplePrompt_shard2_replica1=/admin/cores=true=UNLOAD=javabin=2}
 status=0 QTime=77
   [junit4]   2> 39109 WARN  
(zkCallback-16-thread-1-processing-n:localhost:32820_solr) 
[n:localhost:32820_solr] o.a.s.c.LeaderElector Our node is no longer in 
line to be leader
   [junit4]   2> 39109 WARN  
(zkCallback-16-thread-2-processing-n:localhost:32820_solr) 
[n:localhost:32820_solr] o.a.s.c.LeaderElector Our node is no longer in 
line to be leader
   [junit4]   2> 39113 WARN  (Thread-81) [n:localhost:32820_solr 
c:testCloudExamplePrompt s:shard2 r:core_node3 
x:testCloudExamplePrompt_shard2_replica2] o.a.s.c.ZkController listener throws 
error
   [junit4]   2> org.apache.solr.common.SolrException: Unable to reload core 
[testCloudExamplePrompt_shard1_replica2]
   [junit4]   2>at 
org.apache.solr.core.CoreContainer.reload(CoreContainer.java:1197)
   [junit4]   2>at 
org.apache.solr.core.SolrCore.lambda$getConfListener$18(SolrCore.java:2953)
   [junit4]   2>at 
org.apache.solr.cloud.ZkController.lambda$fireEventListeners$4(ZkController.java:2350)
   [junit4]   2>at java.lang.Thread.run(Thread.java:748)
   [junit4]   2> Caused by: java.lang.NullPointerException
   [junit4]   2>at 
org.apache.solr.metrics.SolrMetricManager.loadShardReporters(SolrMetricManager.java:1032)
   [junit4]   2>at 
org.apache.solr.metrics.SolrCoreMetricManager.loadReporters(SolrCoreMetricManager.java:89)
   [junit4]   

[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995503#comment-15995503
 ] 

Mikhail Khludnev commented on SOLR-9867:


It sounds like it can go off tomorrow.
[~erickerickson], can you share an output from _our old friend_ by any chance? 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Assignee: Mikhail Khludnev
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995485#comment-15995485
 ] 

Erick Erickson commented on SOLR-9867:
--

Out of 1,000 runs I see only one problem, our old friend ObjectTracker:

Throwable #1: java.lang.AssertionError: ObjectTracker found 5 object(s) that 
were not released!!! [NRTCachingDirectory, NRTCachingDirectory, 
NRTCachingDirectory, MDCAwareThreadPoolExecutor, TransactionLog]

All other 999 showed "BUILD SUCCESSFUL"


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995083#comment-15995083
 ] 

Erick Erickson commented on SOLR-9867:
--

Got it, running TestSolrCLIRunExample now.

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995077#comment-15995077
 ] 

Mikhail Khludnev commented on SOLR-9867:


[~erickerickson],
Here we need to test/beast {{TestSolrCLIRunExample}} for sure. We've done with 
{{SolrCloudExampleTest}} at SOLR-10588

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15995053#comment-15995053
 ] 

Erick Erickson commented on SOLR-9867:
--

got the patch, I'm starting to give it a beast run.

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-05-03 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15994993#comment-15994993
 ] 

Mikhail Khludnev commented on SOLR-9867:


[~mdrob],
bq. perhaps, instead of latch we can assign cores strictly at the end of init. 
What do you think?
Indeed! Thanks for the hint!
bq. Depends on if we want early requests to fail or to wait, no?
I suppose it's fine since we already have this check.
bq. couldn't this take a while? How did you decide 10 seconds?
I hardly remember how exactly I get to it. Now I set it to 30 sec (almost no 
one experience it, since most of the users wait till Solr load cores before 
stop it), internally it makes 0.5 sec polls, so it doesn't really matter. But 
I'm happy to put any number there which makes tests pass. 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, 
> SOLR-9867-ignore-whitespace.patch, SOLR-9867.patch, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-04-30 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15990284#comment-15990284
 ] 

Mikhail Khludnev commented on SOLR-9867:


Right. Heads up SOLR-10588

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-04-30 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15990278#comment-15990278
 ] 

Erick Erickson commented on SOLR-9867:
--

Well, it was a nice idea but apparently this JIRA doesn't fix the recent 
SolrCloudExampleTest failures. Pity.

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-04-30 Thread Mike Drob (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15990142#comment-15990142
 ] 

Mike Drob commented on SOLR-9867:
-

Great idea to add isLoading info.

Is latch.await() on every request going to be a performance impact? I don't 
have a jdk source on this computer to check.

bq. perhaps, instead of latch we can assign cores strictly at the end of init. 
What do you think?
Depends on if we want early requests to fail or to wait, no?

bq. patch also blocks container closing, until core loading is stopped with 
failure.
couldn't this take a while? How did you decide 10 seconds?

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867.patch, SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-04-28 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15989536#comment-15989536
 ] 

Mikhail Khludnev commented on SOLR-9867:


{code}
diff --git a/solr/core/src/java/org/apache/solr/servlet/SolrDispatchFilter.java 
b/solr/core/src/java/org/apache/solr/servlet/SolrDispatchFilter.java
index 39ccadc..ab46f19 100644
--- a/solr/core/src/java/org/apache/solr/servlet/SolrDispatchFilter.java
+++ b/solr/core/src/java/org/apache/solr/servlet/SolrDispatchFilter.java
@@ -96,6 +96,8 @@
   private Boolean testMode = null;
   private boolean isV2Enabled = 
!"true".equals(System.getProperty("disable.v2.api", "false"));
 
+  private volatile boolean initIsDone;
+
   /**
* Enum to define action that needs to be processed.
* PASSTHROUGH: Pass through to Restlet via webapp.
@@ -182,6 +184,7 @@
 }
 
 log.trace("SolrDispatchFilter.init() done");
+initIsDone = true;
   }
 
   private void setupJvmMetrics()  {
@@ -307,6 +310,7 @@
   }
   
   public void doFilter(ServletRequest request, ServletResponse response, 
FilterChain chain, boolean retry) throws IOException, ServletException {
+assert initIsDone:"I swear";
 if (!(request instanceof HttpServletRequest)) return;
 try {
{code}

{code}
305  INFO  (Thread-1) [] o.a.s.s.SolrDispatchFilter  ___  _   
Welcome to Apache Solr™ version 7.0.0
305  INFO  (Thread-1) [] o.a.s.s.SolrDispatchFilter / __| ___| |_ _   
Starting in standalone mode on port 52215
305  INFO  (Thread-1) [] o.a.s.s.SolrDispatchFilter \__ \/ _ \ | '_|  
Install dir: null
321  INFO  (Thread-1) [] o.a.s.s.SolrDispatchFilter |___/\___/_|_|Start 
time: 2017-04-28T22:25:03.738Z
335  INFO  (Thread-1) [] o.a.s.c.SolrResourceLoader solr home defaulted to 
'solr/' (could not find system property or JNDI)
341  INFO  (Thread-1) [] o.a.s.c.SolrXmlConfig Loading container 
configuration from 
/private/var/folders/rg/fr1t3mx9391f1_g0xs8wtq2d1xv078/T/solr.util.TestSolrCLIRunExample_E157FC17061E2B1D-001/tempDir-001/schemaless/solr/solr.xml
465  WARN  (qtp15120-21) [] o.e.j.s.ServletHandler Error for 
/solr/admin/info/system
java.lang.AssertionError: I swear
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:313)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:309)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1699)
at 
org.apache.solr.client.solrj.embedded.JettySolrRunner$DebugFilter.doFilter(JettySolrRunner.java:139)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1699)
at 
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)
at 
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:224)
at 
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1180)
at 
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)
at 
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
at 
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1112)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
at 
org.eclipse.jetty.server.handler.gzip.GzipHandler.handle(GzipHandler.java:462)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)
at org.eclipse.jetty.server.Server.handle(Server.java:534)
{code}
So, what we gonna do, then... 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
>Priority: Critical
> Fix For: 6.6, master (7.0)
>
> Attachments: SDF init and doFilter in parallel.png, SOLR-9867.patch, 
> SOLR-9867.patch, SOLR-9867-test.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my 

[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-04-27 Thread Mikhail Khludnev (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15987650#comment-15987650
 ] 

Mikhail Khludnev commented on SOLR-9867:


I've added printing response on core status requests. Here is how race looks 
like
{code}
Solr is running on 8983 in standalone mode with status:
{
  "solr_home":"/Use..olr/example/techproducts/solr",
  "version":"7.0.0-SNAPSHOT eb1671c8f9a472a6ba6ff2516c55110574a64a96 - 
khludnevm - 2017-04-27 18:02:58",
  "startTime":"2017-04-27T15:09:24.404Z",
  "uptime":"0 days, 0 hours, 0 minutes, 9 seconds",
  "memory":"35.9 MB (%7.3) of 490.7 MB",
  "baseUrl":"http://localhost:8983/solr"}
{code}
here are status responses
{code}
http://localhost:8983/solr/admin/cores?action=STATUS=techproducts
{responseHeader={status=0, QTime=2}, initFailures={}, status={techproducts={}}}
http://localhost:8983/solr/admin/cores?action=STATUS=techproducts
{responseHeader={status=0, QTime=0}, initFailures={}, status={techproducts={}}}

Creating new core 'techproducts' using command:
http://localhost:8983/solr/admin/cores?action=CREATE=techproducts=techproducts

ERROR: Error CREATEing SolrCore 'techproducts': Unable to create core 
[techproducts] Caused by: Lock held by this virtual machine: 
/Users/khludnevm/lucene-solr/solr/example/techproducts/solr/techproducts/data/index/write.lock
{code}
Here is how it looks like in logs, one core is loading, it responds on STATUS 
HttpSolrCall, and recieves core create request from SolrCLI. 
{code}
INFO  - 2017-04-27 15:09:33.021; [   x:techproducts] 
org.apache.solr.schema.IndexSchema; [techproducts] Schema name=example
INFO  - 2017-04-27 15:09:34.059; [   ] org.apache.solr.servlet.HttpSolrCall; 
[admin] webapp=null path=/admin/info/system params={wt=json} status=0 QTime=103
INFO  - 2017-04-27 15:09:34.132; [   ] 
org.apache.solr.core.TransientSolrCoreCacheDefault; Allocating transient cache 
for 2147483647 transient cores
INFO  - 2017-04-27 15:09:34.133; [   ] org.apache.solr.servlet.HttpSolrCall; 
[admin] webapp=null path=/admin/cores 
params={core=techproducts=STATUS=json} status=0 QTime=2
INFO  - 2017-04-27 15:09:34.202; [   x:techproducts] 
org.apache.solr.rest.RestManager$Registry; Registered ManagedResource impl 
org.apache.solr.rest.schema.analysis.ManagedWordSetResource
INFO  - 2017-04-27 15:09:34.202; [   ] org.apache.solr.servlet.HttpSolrCall; 
[admin] webapp=null path=/admin/info/system params={wt=json} status=0 QTime=61
INFO  - 2017-04-27 15:09:34.203; [   x:techproducts] 
org.apache.solr.rest.RestManager$Registry; Registered ManagedResource impl 
org.apache.solr.rest.schema.analysis.ManagedSynonymFilterFa
INFO  - 2017-04-27 15:09:34.215; [   x:techproducts] 
org.apache.solr.schema.IndexSchema; Loaded schema example/1.6 with uniqueid 
field id
INFO  - 2017-04-27 15:09:34.216; [   x:techproducts] 
org.apache.solr.rest.RestManager$Registry; Added observer of type 
org.apache.solr.rest.schema.analysis.ManagedStopFilterFactory to exi
INFO  - 2017-04-27 15:09:34.216; [   x:techproducts] 
org.apache.solr.rest.RestManager$Registry; Added observer of type 
org.apache.solr.rest.schema.analysis.ManagedSynonymFilterFactory to 
INFO  - 2017-04-27 15:09:34.270; [   ] org.apache.solr.servlet.HttpSolrCall; 
[admin] webapp=null path=/admin/info/system params={wt=json} status=0 QTime=57
INFO  - 2017-04-27 15:09:34.282; [   ] org.apache.solr.servlet.HttpSolrCall; 
[admin] webapp=null path=/admin/cores 
params={core=techproducts=STATUS=json} status=0 QTime=0
INFO  - 2017-04-27 15:09:34.287; [   ] 
org.apache.solr.handler.admin.CoreAdminOperation; core create command 
name=techproducts=CREATE=techproducts=json
INFO  - 2017-04-27 15:09:34.294; [   x:techproducts] 
org.apache.solr.core.CoreContainer; Creating SolrCore 'techproducts' using 
configuration from instancedir /Users/khl
{code} 

> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
> Fix For: 6.6, master (7.0)
>
> Attachments: SOLR-9867.patch, SOLR-9867.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't 

[jira] [Commented] (SOLR-9867) The Solr examples can not always be started after being stopped due to race with loading core.

2017-03-13 Thread Ishan Chattopadhyaya (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15907815#comment-15907815
 ] 

Ishan Chattopadhyaya commented on SOLR-9867:


Moving to 6.5, since 6.4 has already been released.


> The Solr examples can not always be started after being stopped due to race 
> with loading core.
> --
>
> Key: SOLR-9867
> URL: https://issues.apache.org/jira/browse/SOLR-9867
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mark Miller
> Fix For: 6.4, master (7.0)
>
> Attachments: SOLR-9867.patch, SOLR-9867.patch
>
>
> I'm having trouble when I start up the schemaless example after shutting down.
> I first tracked this down to the fact that the run example tool is getting an 
> error when it tries to create the SolrCore (again, it already exists) and so 
> it deletes the cores instance dir which leads to tlog and index lock errors 
> in Solr.
> The reason it seems to be trying to create the core when it already exists is 
> that the run example tool uses a core status call to check existence and 
> because the core is loading, we don't consider it as existing. I added a 
> check to look for core.properties.
> That seemed to let me start up, but my first requests failed because the core 
> was still loading. It appears CoreContainer#getCore  is supposed to be 
> blocking so you don't have this problem, but there must be an issue, because 
> it is not blocking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org