> On March 29, 2014, 12:26 a.m., kturner wrote:
> > server/src/main/java/org/apache/accumulo/server/master/Master.java, line 313
> > <https://reviews.apache.org/r/19804/diff/1/?file=539926#file539926line313>
> >
> >     I think this check can cause problems. Master.run()  starts 
> > StatusThread, StatusThread.run() will indirectly call upgradeMetadata().  
> > After Master.run() starts StatusThread, it seems like it will start Fate 
> > and the client service.  So it's possible that a 1.5 client could submit a 
> > fate op before the upgradeMetadata() is called. 
> >     
> >     Also, this check is probably not needed.  upgradeZookeeper() should be 
> > called before upgradeMetadata().  Could add a sanity check for this.
> >

I *think* it's fine, because I believe the client service doesn't start 
until state goes to NORMAL. So while Fate has been started, nothing has 
access to it yet. I could make that explicit by not starting Fate until 
upgradeMetadata() has run (or we've determined it isn't needed).

It'd probably be simpler, though, to add a sanity check that 
upgradeZookeeper() happened first.
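
A rough sketch of what that sanity check might look like, assuming a simple 
flag that upgradeZookeeper() sets and upgradeMetadata() checks. The class 
name, the flag fields, and isMetadataUpgraded() are hypothetical and only 
stand in for the relevant parts of Master; this is not the actual patch.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for the upgrade-related parts of Master; not the real class.
class UpgradeOrderSketch {
  // Set once the ZooKeeper upgrade step has run (assumed flag, not in the original code).
  private final AtomicBoolean zooKeeperUpgradeDone = new AtomicBoolean(false);
  // Set once the metadata upgrade step has run (assumed flag, for illustration only).
  private final AtomicBoolean metadataUpgradeDone = new AtomicBoolean(false);

  void upgradeZookeeper() {
    // ... the real ZooKeeper upgrade work would go here ...
    zooKeeperUpgradeDone.set(true);
  }

  void upgradeMetadata() {
    // Sanity check: refuse to touch metadata if the ZooKeeper step has not run yet.
    if (!zooKeeperUpgradeDone.get()) {
      throw new IllegalStateException(
          "upgradeMetadata() called before upgradeZookeeper()");
    }
    // ... the real metadata upgrade work would go here ...
    metadataUpgradeDone.set(true);
  }

  boolean isMetadataUpgraded() {
    return metadataUpgradeDone.get();
  }
}
```

AtomicBoolean is used here because the two steps may run on different 
threads (Master.run() versus StatusThread); a volatile boolean would 
probably work just as well.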


- Sean


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/19804/#review38972
-----------------------------------------------------------


On March 28, 2014, 9:22 p.m., Sean Busbey wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/19804/
> -----------------------------------------------------------
> 
> (Updated March 28, 2014, 9:22 p.m.)
> 
> 
> Review request for accumulo and kturner.
> 
> 
> Bugs: ACCUMULO-2519
>     https://issues.apache.org/jira/browse/ACCUMULO-2519
> 
> 
> Repository: accumulo
> 
> 
> Description
> -------
> 
> Adds "make sure Fate has no outstanding items" to the upgrade instructions. 
> Makes sure the master and tabletservers don't take upgrade steps if they see 
> fate ops waiting.
> 
> 
> Diffs
> -----
> 
>   README 115a9b7 
>   server/src/main/java/org/apache/accumulo/server/Accumulo.java 99ec7e4 
>   server/src/main/java/org/apache/accumulo/server/master/Master.java 8c4c864 
>   server/src/main/java/org/apache/accumulo/server/tabletserver/TabletServer.java d76946d 
> 
> Diff: https://reviews.apache.org/r/19804/diff/
> 
> 
> Testing
> -------
> 
> Took a 1.4.5-SNAP cluster
> 
> * triggered compactions
> * shut down cluster
> * verified waiting transactions
> * verified waiting local WALs
> * verified /accumulo/version showed 4
> * started upgrade to 1.5.2-SNAP
> * verified errors saying no upgrade would happen and pointing back to the 
> docs showed up in: monitor, master logs, tabletserver logs
> * verified waiting transactions
> * verified waiting local WALs
> * verified /accumulo/version showed 4
> * cleared Fate operations
> * started upgrade to 1.5.2-SNAP
> * verified no errors shown for upgrade
> * verified WALs copied to HDFS
> * verified /accumulo/version showed 5
> * verified monitor showed normal start up
> 
> Running verify job on existing data now. Should take ~6 hours. 
> 
> 
> Thanks,
> 
> Sean Busbey
> 
>
