Re: [DISCUSS] Proposed binary packaging changes

2016-07-01 Thread William Slacum
Yeah, I wasn't really suggesting it as a course of action. It was more of a mental exercise so I could grasp the issue better. On Fri, Jul 1, 2016 at 12:13 PM, Christopher <ctubb...@apache.org> wrote: > On Fri, Jul 1, 2016 at 3:07 PM William Slacum <wsla...@gmail.com> wrote:

Re: [DISCUSS] Proposed binary packaging changes

2016-07-01 Thread William Slacum
Could another action be adding profiles for each version of dependencies to include appropriate dependencies (and dependencies' metadata)? I guess right now the problem is we throw in a generic "one size fits all" distribution, and we're seeing the cracks in it? On Fri, Jul 1, 2016 at

Re: Apache Accumulo integrated with Presto

2016-06-13 Thread William Slacum
I think the generic hash-join strategy is, for some small set A, we can send the whole set to partitions of a larger set B and do the join in parallel. In this case, whichever is the smaller set would be consumed on some worker, and then distributed out to each worker participating in the hash
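The strategy described above can be sketched in plain Java. This is only an illustration under stated assumptions: in-memory lists stand in for the two sets, each row is a `{key, value}` pair, and the class and method names (`BroadcastHashJoinSketch`, `buildTable`, `probe`) are hypothetical, not Presto or Accumulo APIs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not Presto or Accumulo code.
final class BroadcastHashJoinSketch {

    // Build a hash table from the small set A: join key -> values from A.
    static Map<String, List<String>> buildTable(List<String[]> smallSet) {
        Map<String, List<String>> table = new HashMap<>();
        for (String[] row : smallSet) {
            table.computeIfAbsent(row[0], k -> new ArrayList<>()).add(row[1]);
        }
        return table;
    }

    // Probe one partition of the large set B against the broadcast table.
    // Each worker would run this independently on its own partition.
    static List<String> probe(Map<String, List<String>> table, List<String[]> partition) {
        List<String> joined = new ArrayList<>();
        for (String[] row : partition) {
            List<String> matches = table.get(row[0]);
            if (matches == null) {
                continue; // no row in A shares this key
            }
            for (String aValue : matches) {
                joined.add(row[0] + ":" + aValue + ":" + row[1]);
            }
        }
        return joined;
    }

    public static void main(String[] args) {
        List<String[]> a = Arrays.asList(
                new String[] {"k1", "a1"}, new String[] {"k2", "a2"});
        List<String[]> partition = Arrays.asList(
                new String[] {"k1", "b1"}, new String[] {"k3", "b3"});
        System.out.println(probe(buildTable(a), partition));
    }
}
```

The table built from A is the part that would be shipped to every worker; only the probe step touches the large set, which is why the approach depends on A being small.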

Re: measuring perf

2016-06-10 Thread William Slacum
I think it's reasonable to measure from the start of a for/while loop over the Scanner. Such as:
```
// .. my initialization code
scanner.setRange(someRange);
Stopwatch timer = Stopwatch.createStarted();
for (Entry<Key,Value> e : scanner) {
  // my logic
}
timer.stop();
```
I've personally done

Re: [DISCUSS] Java 8 support (was Fwd: [jira] [Commented] (ACCUMULO-4177) TinyLFU-based BlockCache)

2016-05-03 Thread William Slacum
; >>>> > >>> "public" > >>> > >>>> API. > >>>> > >>>> > >>>> > >>>> On Mon, May 2, 2016 at 6:42 PM, Christopher<ctubb...@apache.org> >

Re: [DISCUSS] Java 8 support (was Fwd: [jira] [Commented] (ACCUMULO-4177) TinyLFU-based BlockCache)

2016-05-02 Thread William Slacum
So my point about versioning WRT to the Java runtime is more about how there are incompatibilities within the granularity of Java versions we talk about (I'm specifically referencing a Kerberos incompatibility within versions of Java 7), so I think that just blanket saying "We support Java X or Y"

Re: Accumulo on s3

2016-04-25 Thread William Slacum
Ephemeral storage & EBS are more friendly. Ephemeral storage is generally the fastest and most HDFS-friendly. On Mon, Apr 25, 2016 at 1:13 PM, Dylan Hutchison wrote: > Hey Josh, > > Are there other platforms on AWS (or another cloud provider) that > Accumulo/HDFS

Re: Checking what a BatchWriter is stuck on; failure during split

2016-04-19 Thread William Slacum
Good digs, Dylan. I don't think it's too rare to matter. I notice often during MR jobs, and there's usually a point where I give up and just start writing RFiles. It could possibly be related to what I saw back in the dayoday with:

Re: Pros and Cons of moving SKVI to public API

2016-03-24 Thread William Slacum
It should be public API. It's one of the core reasons for choosing Accumulo over a similar project like HBase or Cassandra. Allegedly, Jeff "Mean Gene" Dean said we got the concept correct as well :) Personally I hate the current API from a usability standpoint (ie, the generic types are useless

Re: delete + insert case

2016-03-19 Thread William Slacum
Be aware of the OS's underlying granularity for time as well: http://docs.oracle.com/javase/6/docs/api/java/lang/System.html#currentTimeMillis%28%29 I almost wonder if it's better to use the RowDeletingIterator in this case. If the check it does is "if TS < delete marker TS", in theory you could

Re: git-based site and jekyll

2016-03-10 Thread William Slacum
I would like to request at least one frame and one scrolling marquee. Can we blingee the Accumulo logo? On Thursday, March 10, 2016, Josh Elser wrote: > * Some companies on http://ctubbsii.github.io/accumulo/people.html are > goofed as are the timezones. > * Some broken

Re: Trouble connecting to Kerberized Accumulo/Zookeeper

2016-03-08 Thread William Slacum
, 2016 at 7:31 PM, William Slacum <wsla...@gmail.com> wrote: > I think one thing is that we can at least guarantee you can connect to the > KDC. > > It kind of seems like there's an issue with communication between the > client and Accumulo. Can you try `new KerberosToken(prin

Re: Trouble connecting to Kerberized Accumulo/Zookeeper

2016-03-08 Thread William Slacum
channels.SocketChannel[connected local=/10.129.0.116:37532 > remote=dev/10.129.0.110:9997] > at > > org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:164) > at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:161) > at org.apache.hadoop.net.SocketInputStream.rea

Re: Trouble connecting to Kerberized Accumulo/Zookeeper

2016-03-08 Thread William Slacum
Any logs on the Accumulo and/or KDC side? On Tue, Mar 8, 2016 at 5:05 PM, Tristen Georgiou wrote: > Here is a simple Java program to attempt to get a connection to Accumulo > and list the local users: > > package com.phemi.testing; > > import

Re: [ATTN] Cleaning up extra refs in git

2016-03-04 Thread William Slacum
Any stats on what the repo size is after removing the refs and doing something like `git gc`? On Fri, Mar 4, 2016 at 4:25 PM, Christopher wrote: > I was able to delete 135 duplicate refs of the kind I described. Only one > resulted in a new branch being created

Re: Monitor Tablet Id mapping

2016-02-23 Thread William Slacum
It's md5sum'd then base64'd. I think you'd have to build a mapping of tablet id <-> md5sum to translate them. On Tue, Feb 23, 2016 at 3:59 PM, wrote: > Anyone know the utility to map the tablet id on the monitor to its actual > value? It looks like it's Base64 encoded on
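A rough Java sketch of the translation table suggested above. It assumes the monitor hashes the UTF-8 bytes of the tablet id string and uses the standard Base64 alphabet; both are assumptions, as is the hypothetical class name `TabletIdHashSketch`, so verify against the monitor source for your version before relying on it.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Base64;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper; not part of Accumulo.
final class TabletIdHashSketch {

    // MD5 the tablet id, then Base64 the 16-byte digest.
    // Assumption: the id is hashed as UTF-8 text.
    static String hash(String tabletId) {
        try {
            MessageDigest md5 = MessageDigest.getInstance("MD5");
            byte[] digest = md5.digest(tabletId.getBytes(StandardCharsets.UTF_8));
            return Base64.getEncoder().encodeToString(digest);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    // MD5 is one-way, so the only way back is a lookup table
    // built from tablet ids you already know.
    static Map<String, String> reverseLookup(Iterable<String> knownTabletIds) {
        Map<String, String> lookup = new HashMap<>();
        for (String id : knownTabletIds) {
            lookup.put(hash(id), id);
        }
        return lookup;
    }

    public static void main(String[] args) {
        // The ids passed on the command line are whatever you collected yourself.
        for (String id : args) {
            System.out.println(hash(id) + " <- " + id);
        }
    }
}
```

Feeding `reverseLookup` the tablet ids listed in the metadata table would give a map from the value shown on the monitor back to the real id, provided the hashing assumptions hold.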

Re: [DRAFT] [ANNOUNCE] Apache Accumulo 1.6.5

2016-02-17 Thread William Slacum
Thanks, Christopher! On Wed, Feb 17, 2016 at 4:06 PM, Christopher wrote: > Staging build looks stuck for extpaths. Might need to wait for it to time > out, but all the important stuff is published. > > On Wed, Feb 17, 2016, 18:55 Josh Elser wrote: > >

Re: On 1.7.1 rc1 (was Re: [VOTE] Accumulo 1.6.5-rc2)

2016-02-17 Thread William Slacum
"William" is my grandfather. Please refer to me as "Sir William". On Wed, Feb 17, 2016 at 8:44 AM, Josh Elser wrote: > > > Christopher wrote: > >> I'll wrap up this release tomorrow and get started on 1.7.1 soon. >> > > FYI, I want to ping William about ACCUMULO-4140. His

Re: [DISCUSS] Trivial changes and git

2016-01-06 Thread William Slacum
I've worked on teams who had perpetually open tickets like "Solve warnings" or "Fix typos". Those issues could be referenced in commits just to say they were involved with some changes. From your list, I think #1 is fine, and that #3 is preferable to #2. I don't feel strongly to be honest, as the

Re: delete rows test result

2015-11-16 Thread William Slacum
"Reading" all of the rows first implies you're bringing back the entire result to a client, which provides you serial access to the data. I think you should re-run test #3 that measures the time it takes to call deleteRows only. I'm emphasizing this because I've worked on projects that could

Re: delete rows test result

2015-11-16 Thread William Slacum
What happens when you subtract the time to read all of your rows? deleteRows is designed so you don't have to read any data-- you can compute a range to delete. For instance, in a time series table, it's trivial to give a start and end date as your rows and call deleteRows. On Mon, Nov 16, 2015 at
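A small Java sketch of computing such a range without reading any data. The row layout (`yyyyMMdd_<suffix>`), the class name `DeleteRangeSketch`, and the table name in the comment are all assumptions for illustration. My understanding is that `TableOperations.deleteRows(table, start, end)` treats `start` as exclusive and `end` as inclusive, but check the javadoc for your version.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

// Hypothetical helper; assumes rows look like "20151101_sensor7".
final class DeleteRangeSketch {

    // Returns {startExclusive, endInclusive} covering firstDay..lastDay.
    // The bare prefix of the first day sorts before every row of that day,
    // and the bare prefix of the day after lastDay sorts after every row
    // of lastDay. Assumes no row is exactly a bare date with no suffix.
    static String[] bounds(LocalDate firstDay, LocalDate lastDay) {
        DateTimeFormatter fmt = DateTimeFormatter.BASIC_ISO_DATE; // yyyyMMdd
        String startExclusive = firstDay.format(fmt);
        String endInclusive = lastDay.plusDays(1).format(fmt);
        return new String[] {startExclusive, endInclusive};
    }

    public static void main(String[] args) {
        String[] b = bounds(LocalDate.of(2015, 11, 1), LocalDate.of(2015, 11, 3));
        System.out.println("start (exclusive): " + b[0]);
        System.out.println("end (inclusive):   " + b[1]);
        // With an Accumulo Connector in hand, the call would be along the lines of:
        //   connector.tableOperations().deleteRows("mytable", new Text(b[0]), new Text(b[1]));
        // ("mytable" is a placeholder table name.)
    }
}
```

Timing only the `deleteRows` call with bounds computed this way would isolate the cost of the delete from the cost of scanning.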

Re: total table rows

2015-11-12 Thread William Slacum
There is a performance difference. You have an upper bound of returning all data to the client to be scanned, even with a FirstEntryInRowIterator. Imagine a table layout where each Key/Value pair represents a single row or document. Using a counting iterator will return a count (most likely a 64-bit

Re: total table rows

2015-11-09 Thread William Slacum
Pranked... you can't use a CountingIterator, because it can't be init'd. Can we get rid of that limitation? On Mon, Nov 9, 2015 at 10:43 AM, William Slacum <wsla...@gmail.com> wrote: > An iterator stack of FirstEntryInRowIterator + CountingIterator will > return the count of rows in

Re: total table rows

2015-11-09 Thread William Slacum
An iterator stack of FirstEntryInRowIterator + CountingIterator will return the count of rows in each tablet, which can then be combined on the client side. On Mon, Nov 9, 2015 at 10:25 AM, Josh Elser wrote: > Yeah, there's no explicit tracking of all rows in Accumulo,

Re: [DISCUSS] What to do about encryption at rest?

2015-11-05 Thread William Slacum
that some of the common data access patterns that we have in Accumulo (over HBase) is that the per-colfam encryption isn't quite as common a design pattern as it

[DISCUSS] What to do about encryption at rest?

2015-11-04 Thread William Slacum
<ke...@deenlo.com> wrote: > > > > > > > On Mon, Nov 2, 2015 at 12:27 PM, William Slacum <wsla...@gmail.com> wrote: > > > >> Is "the code being 'at rest'" you making a funny about a

[DISCUSS] What to do about encryption at rest?

2015-10-30 Thread William Slacum
So I've been looking into options for providing encryption at rest, and it seems like what Accumulo has is abandonware from a project perspective. There is no official documentation on how to perform encryption at rest, and the best information on its status comes from year (or greater) old

Re: HBase and Accumulo

2015-08-19 Thread William Slacum
If you drew a Venn diagram of HBase features compared to Accumulo features, it's pretty much going to be a single circle. If you want performance anecdotes, the most succinct summary I've seen is that Accumulo can handle heavier write loads whereas HBase will handle heavier read loads. From these

Re: [PROPOSAL] 1.7/2.0 branches and git workflow change

2014-10-07 Thread William Slacum
a maintenance branch. -- Christopher L Tubbs II http://gravatar.com/ctubbsii On Mon, Oct 6, 2014 at 10:59 PM, William Slacum wilhelm.von.cl...@accumulo.net wrote: It seems to me you can get everything you want by merely getting rid of master or making master just be the 1.7 branch. I'm

Re: [PROPOSAL] 1.7/2.0 branches and git workflow change

2014-10-07 Thread William Slacum
to remove/delete commits? Could release managers for 2.0 and 1.7 decide differently on whether or not they want to include a fix from 1.6? On Tue, Oct 7, 2014 at 10:17 AM, Keith Turner ke...@deenlo.com wrote: On Tue, Oct 7, 2014 at 6:24 AM, William Slacum wilhelm.von.cl...@accumulo.net wrote

Re: [VOTE] Apache Accumulo 1.6.1 RC1

2014-09-26 Thread William Slacum
: No, not after the vote closes. I was trying to say that the concerns you expressed might have had greatest impact if they were expressed with a -1 while the vote was open. -- Christopher L Tubbs II http://gravatar.com/ctubbsii On Fri, Sep 26, 2014 at 12:40 AM, William Slacum

Re: [VOTE] Apache Accumulo 1.6.1 RC1

2014-09-25 Thread William Slacum
I'm a little concerned we had two +1's that mention failures. The one time when we're supposed to have a clean run through, we have 50% of the participators noticing failure. It doesn't instill much confidence in me. On Thu, Sep 25, 2014 at 2:18 PM, Josh Elser josh.el...@gmail.com wrote: Please

Re: [VOTE] Apache Accumulo 1.5.2 RC1

2014-09-18 Thread William Slacum
+1 - verified source dist hash - built from tag - ran koverse integration tests against 1.5.2 On Thu, Sep 18, 2014 at 5:36 PM, Josh Elser josh.el...@gmail.com wrote: Reminder that this closes in a few hours. We're currently at about 25% of PMC participating, would be much nicer to more

Re: AccumuloInputFormat getters

2014-07-16 Thread William Slacum
It's dubious to say it's internals, when it gets hamjammed into a map of strings to other strings that's going to be passed around to many processes. Maybe we can make our own serializable pojo that implements some interface for consumers to use. That would at least let us hide internals and have

Re: Is the Column Family especially useful for iterators?

2014-07-10 Thread William Slacum
Like most things in big table, it really depends on your use case. The column family can potentially control the location of a given key/value pair on disk. I wouldn't say it's necessarily more useful than any other part of the key tuple. We have some built-ins that make searching for or

Re: Accumulo Scala

2014-06-30 Thread William Slacum
That's a really cool DSL, Kevin. Any plans for adding in some iterator support? I see in the FAQ an iterator is mentioned, but it'd be cool to be able to push the foreach declarations out to the tservers, if possible. On Sun, Jun 29, 2014 at 4:26 PM, Kevin Faro ke...@tetraconcepts.com wrote:

Re: Running Accumulo on the IBM JVM

2014-06-23 Thread William Slacum
Work on the oldest branch possible and merge forward, please. On Mon, Jun 23, 2014 at 6:00 AM, Hayden Marchant hay...@il.ibm.com wrote: Josh (and all who commented), Thanks for the comments. I'll take them into account, and will create the JIRAs. I was not intending on removing the CMS

Re: [DISCUSS] Should we support upgrading 1.4 - 1.6 w/o going through 1.5?

2014-06-16 Thread William Slacum
How much of this is a standalone utility? I think a magic button approach would be good for this case. On Mon, Jun 16, 2014 at 5:24 PM, Sean Busbey bus...@cloudera.com wrote: In an effort to get more users off of our now unsupported 1.4 release, should we support upgrading directly to 1.6

Re: Accumulo shell remote debugger settings.

2014-06-15 Thread William Slacum
Putting the flag in the process/module OPTs is fine. It's what I normally do when I want to debug. Are you suggesting we have remote debugging enabled by default? On Sun, Jun 15, 2014 at 9:11 AM, Vicky Kak vicky@gmail.com wrote: While trying to get the remote debugger running with accumulo

Re: Email list search links

2014-06-13 Thread William Slacum
we outta make our own search capability using Accumulo :) On Fri, Jun 13, 2014 at 1:44 PM, Billie Rinaldi billie.rina...@gmail.com wrote: It might be okay, as long as you note that isn't the official mail archive. I think some projects use Nabble. I've had decent luck just doing a google

Re: Using ZooCache in unit tests

2014-06-11 Thread William Slacum
What about mocking that call? On Wed, Jun 11, 2014 at 8:09 PM, Mike Drob mad...@cloudera.com wrote: When writing unit tests, I indirectly call code that invokes {{Tables.getZooCache(Instance)}} which sets up a connection to a zookeeper. However, there is not a server running, so this end up

Re: Proposal for splitting ACCUMULO-1242 into subtasks.

2014-05-13 Thread William Slacum
Sounds good, Ed. Just out of curiosity, are you planning on doing this with the goal of being able to swap out log4j for logback? In personal projects, I like slf4j solely for the message formatting feature. On Mon, May 12, 2014 at 10:45 PM, Sean Busbey bus...@cloudera.com wrote: +1 LGTM

Re: SQL layer over Accumulo?

2014-05-10 Thread William Slacum
So there may be a bit of confusion with storing index and data in the same row. By row I just mean the logical Accumulo unit, not a row as in thing in my relational table. Synonyms for row in this scheme are shard and document partition. You can store multiple documents and indices for those

Re: [VOTE] end of life plan for 1.4 branch

2014-05-06 Thread William Slacum
+1 for EOL'ing 1.4. -0 for any follow on actions. I don't see any particular value in doing anything beyond just not contributing to the 1.4 branch any more. On Tue, May 6, 2014 at 2:45 PM, Sean Busbey bus...@cloudera.com wrote: On Tue, May 6, 2014 at 12:26 PM, John Vines vi...@apache.org

Re: verifying name suitability

2014-05-05 Thread William Slacum
Jerry O'Connell and his merry band from 1995 would like to have a word with you. On Mon, May 5, 2014 at 7:23 PM, Billie Rinaldi billie.rina...@gmail.com wrote: Oops, sorry Accumulo devs, I'm having trouble with my mailing list autocomplete. I'll try to be more careful. On Mon, May 5, 2014

Re: Remove Row Data

2014-05-02 Thread William Slacum
I interpreted this as I want to delete an entire row based on specific column family and qualifier value. On Fri, May 2, 2014 at 12:31 PM, Christopher ctubb...@apache.org wrote: I think there's a terminology mismatch in your question. It sounds like you're trying to remove single entries

Re: SQL layer over Accumulo?

2014-05-01 Thread William Slacum
The wikisearch example provides something similar to a local index. Rather than stuff things into two tablets, a single row in accumulo contains both the index and data stored in separate column families. Iterator trees are used to execute queries and retrieve data with that row. On Thu, May 1,

Re: [VOTE] Accumulo 1.6.0-RC4

2014-04-28 Thread William Slacum
Do you think doing this on a Friday was a good idea? I know that point came up earlier, and it was possibly due to already discovered issues that would fail the release, but I think the lack of traffic on here is significant. On Fri, Apr 25, 2014 at 8:37 PM, Christopher ctubb...@apache.org

Re: [VOTE] Accumulo 1.6.0-RC4

2014-04-28 Thread William Slacum
? -- Christopher L Tubbs II http://gravatar.com/ctubbsii On Mon, Apr 28, 2014 at 8:24 AM, William Slacum wilhelm.von.cl...@accumulo.net wrote: Do you think doing this on a Friday was a good idea? I know that point came up earlier, and it was possibly due to already discovered issues

Re: increasing balancing problems to WARN

2014-04-18 Thread William Slacum
We could consider the use of markers to throw in more metadata about the relevance of a particular log message. On Fri, Apr 18, 2014 at 10:46 PM, Sean Busbey bus...@cloudera.com wrote: I also try to limit what goes at higher warning levels. One of my goals over the next few months is to

Re: 1.6.0 RCs release manager?

2014-04-07 Thread William Slacum
I was under the impression that John Heard It Through The Grape Vines was the release manager. On Mon, Apr 7, 2014 at 7:15 PM, Christopher ctubb...@apache.org wrote: Who is the volunteer for creating 1.6.0 RCs? I'm willing to build them and start the vote, but I had thought that somebody

Re: [DISCUSS] MiniAccumuloCluster goals and approach

2014-03-28 Thread William Slacum
I think this is better reserved for a version later than 1.6.0. It's an 11th hour change in addition to being a large overhaul of the interfaces to support functionality we never intended for 1.6.0. On Fri, Mar 28, 2014 at 4:04 PM, Josh Elser josh.el...@gmail.com wrote: Forgot to also add,

Re: [VOTE] Accumulo 1.4.5 RC-1

2014-03-27 Thread William Slacum
I was under the impression that a functioning Wikisearch was a requirement for 1.4.5, as it would be consistent with all previous 1.4.x releases. On Wed, Mar 26, 2014 at 11:35 PM, Josh Elser josh.el...@gmail.com wrote: Thanks, Sean. On 3/26/14, 8:24 PM, Sean Busbey wrote: Filed

Re: [DISCUSS] MiniAccumuloCluster goals and approach

2014-03-26 Thread William Slacum
[NOTE: I started this email when this thread was new, and it kind of blew up on me while writing it and being distracted. Apologies in advance if things were already covered or it's not relevant any more.] Is this a design quality discussion or a functionality discussion? The changes from

Re: [DISCUSS] MiniAccumuloCluster goals and approach

2014-03-26 Thread William Slacum
Correction from my previous email: "At this point, the MiniAccumuloCluster's interface of the MiniAccumuloClusterImpl's interface." should read "At this point, the MiniAccumuloCluster's interface is a subset of the MiniAccumuloClusterImpl's interface." On Wed, Mar 26, 2014 at 1:10 PM, William

Re: [DISCUSS] clarification of release guide

2014-03-21 Thread William Slacum
I agree with Chief Keith. Clarity in the docs would be good. On Fri, Mar 21, 2014 at 1:03 PM, Keith Turner ke...@deenlo.com wrote: I think the intention is 1 24h w/ agitation AND 1 24h w/o agitation On Fri, Mar 21, 2014 at 12:54 PM, Sean Busbey busbey+li...@cloudera.com wrote: Hi!

Re: Accumulo site Bootstrapped

2014-03-05 Thread William Slacum
I'm a fan of bootstrap and those pages are looking sexy. Not a big fan how the 1.4 / 1.5 links show up in the navigation bar on the left though. On Wed, Mar 5, 2014 at 5:40 PM, Bill Havanki bhava...@clouderagovt.com wrote: Some folks in the IRC room were discussing how nice the Spark [1] and

Re: [DISCUSS] Accumulo Bylaws

2014-02-18 Thread William Slacum
Mike, add the --all parameter to the log statement to go across the entire repo: `git log --all --pretty=format:%an --since="6 months ago" | sort | uniq -c` This is slightly more portable for those of us on OSX w/ BSD date. On Tue, Feb 18, 2014 at 4:56 PM, Mike Drob mad...@cloudera.com wrote: I

Re: New committers!

2014-01-10 Thread William Slacum
Congrats! On Fri, Jan 10, 2014 at 3:23 PM, Bill Havanki bhava...@clouderagovt.com wrote: Eric: :P ;) On Fri, Jan 10, 2014 at 2:37 PM, Eric Newton eric.new...@gmail.com wrote: Yay! No more patching their many contributions! :-) On Fri, Jan 10, 2014 at 2:27 PM, Arshak Navruzyan

Re: [DISCUSS] API changes to provide resource cleanup

2014-01-02 Thread William Slacum
Voting for the hammer/hacksawjimdugging. I like the concept of being able to track resources and clean them up, but the back end code isn't designed to deal with an instance in the way we're trying to model it. On Thu, Jan 2, 2014 at 2:46 PM, Josh Elser josh.el...@gmail.com wrote: Bill Slacum and I

Re: Resource leak warnings

2013-12-30 Thread William Slacum
, William Slacum wilhelm.von.cl...@accumulo.net wrote: We need to actually define the usage pattern and lifetime of a ZooKeeperInstance. Looking at the code, it's really masking a singleton usage pattern. The resources backing a given set of zookeepers+timeout pair all share a ZooCache, and we

Re: Resource leak warnings

2013-12-27 Thread William Slacum
We need to actually define the usage pattern and lifetime of a ZooKeeperInstance. Looking at the code, it's really masking a singleton usage pattern. The resources backing a given set of zookeepers+timeout pair all share a ZooCache, and we hand-rolled reference counting for ZooKeeperInstances

Re: Resource leak warnings

2013-12-23 Thread William Slacum
We're pretty clear on commit-then-review and lazy consensus, so I don't really have an issue with regards to the commits. That said, I still think ignoring the warnings is the best course of action. I compiled with warnings on from the command line and don't see a resource leak warning with Java

Re: Resource leak warnings

2013-12-13 Thread William Slacum
Voting for #1. On Fri, Dec 13, 2013 at 3:44 PM, Christopher ctubb...@apache.org wrote: What should we do about all these additional resource leak warnings added as a result of ACCUMULO-1984? (ACCUMULO-2010) As I see it, there's a few options: 0. Revert the previous patch for ACCUMULO-1984

Re: [accumulo-wikisearch] git workflow for accumulo wikisearch contrib

2013-12-06 Thread William Slacum
Regarding git workflows, http://cdn.memegenerator.net/instances/500x/43613593.jpg On Fri, Dec 6, 2013 at 6:28 PM, Josh Elser josh.el...@gmail.com wrote: I think Bill (ujustgotbi...@apache.org) is the component lead. He'd probably be a good start. But in all honesty, I don't know if anyone

Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-11-12 Thread William Slacum
A user of 1.4.a should be able to move to 1.4.b without any major infrastructure changes, such as swapping out HDFS or installing extra add-ons. I don't find much merit in debating local WAL vs HDFS WAL cost/benefit since the only quantifiable evidence we have supported the move. I should note,

Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-11-12 Thread William Slacum
? As a side note, 0.20.203.0 is 1.4, On Tue, Nov 12, 2013 at 3:28 PM, Sean Busbey busbey...@clouderagovt.com wrote: On Tue, Nov 12, 2013 at 1:28 PM, William Slacum wilhelm.von.cl...@accumulo.net wrote: A user of 1.4.a should be able to move to 1.4.b without any major infrastructure changes

Re: Accumulo Community Meeting Notes from Strata NYC

2013-11-09 Thread William Slacum
Thanks, Drew! On Thu, Nov 7, 2013 at 10:22 PM, Drew Farris d...@apache.org wrote: On October 29, a number of people got together prior to the Accumulo Meetup to present the work they've done with Accumulo and discuss a number of other topics. In the interests of tracking off-list

Re: [VOTE] add mvn dependency:analyze to release process

2013-11-08 Thread William Slacum
+1 On Fri, Nov 8, 2013 at 1:45 PM, Josh Elser josh.el...@gmail.com wrote: +1 On 11/8/13, 1:35 PM, Billie Rinaldi wrote: I would like to add a dependency clean up step (which can be assisted by running mvn dependency:analyze) to our release process for major and minor releases, to make

Re: [DISCUSS] Hadoop 2 and Accumulo 1.6.0

2013-10-23 Thread William Slacum
There weren't any discussions in those tickets as to what Hadoop 2 provides Accumulo. If we're going to still support 1, then any new features only possible with 2 have to become optional until we ditch support for 1. Is there anything people have in mind, feature wise, that Hadoop 2 would help

Re: [VOTE] 1.6.0 Feature freeze.

2013-09-28 Thread William Slacum
Plus One On Fri, Sep 27, 2013 at 5:02 PM, Mike Drob md...@mdrob.com wrote: +1 On Fri, Sep 27, 2013 at 4:02 PM, Brian Loss bfl...@praxiseng.com wrote: +1 On Sep 27, 2013, at 1:39 PM, John Vines vi...@apache.org wrote: Please vote on a feature freeze date of Nov 1 23:59 PDT for

Re: How do I use scan

2013-08-19 Thread William Slacum
You could use an indexing strategy such as a term index or a sharded index. I know there's an example for the sharded index packaged with Accumulo. On Mon, Aug 19, 2013 at 4:28 PM, Richard DeVita rdev...@us.ibm.com wrote: I have Accumulo version 1.4.3 I wrote a java program to create an

Re: github mirror

2013-08-01 Thread William Slacum
IIRC I don't believe a process is actually in place to accept pull requests off the chub. I'm open to being corrected by someone with better info, however. On Thu, Aug 1, 2013 at 5:17 PM, Michael Berman mber...@sqrrl.com wrote: Oh, actually I was looking at the wrong branch. It's only a month

Re: Java 6 EOLed

2013-06-20 Thread William Slacum
I think in the discussion previously, someone (John Vines?) mentioned RedHat was picking up the slack. On Thu, Jun 20, 2013 at 4:43 PM, Michael Allen mich...@sqrrl.com wrote: Here's another data point in the move to Java 7 debate: Oracle apparently just EOLed Java 6. Read the Slashdot

Re: Is C++ code still part of 1.5 release?

2013-05-17 Thread William Slacum
I think of the native maps as an add on and they should probably be treated as such. I think we should consider building a different package and installing them separately. Personally, for development and testing, I don't use them. Since we're building RPMs and debian packages, the steps to

Re: Is C++ code still part of 1.5 release?

2013-05-17 Thread William Slacum
arguments. Adam On Fri, May 17, 2013 at 2:49 PM, William Slacum wilhelm.von.cl...@accumulo.net wrote: I think of the native maps as an add on and they should probably be treated as such. I think we should consider building

Re: peformance

2013-05-03 Thread William Slacum
Does sqrrl provide an example framework to play around with? On Fri, May 3, 2013 at 2:20 PM, Adam Fuchs afu...@apache.org wrote: Hey Drew, This could be a very broad question, so I'll give a partial answer and encourage you to come back for more details. Impala is a mechanism that sits on

Re: JIRA Patch Conventions

2013-04-24 Thread William Slacum
Leave the tickets on there. I'm not trying to romance you Mike, I want more history and less mystery. On Wed, Apr 24, 2013 at 11:22 AM, Corey Nolet cno...@texeltek.com wrote: #2 as well. On Wed, Apr 24, 2013 at 11:08 AM, John Vines vi...@apache.org wrote: I too am in favor of the patch

Re: [VOTE] release 1.4.3?

2013-03-10 Thread William Slacum
+1 for a 1.4.3 On Sun, Mar 10, 2013 at 6:21 PM, Brian Loss bfl...@praxiseng.com wrote: +1 On Mar 9, 2013, at 8:14 PM, Josh Elser josh.el...@gmail.com wrote: Ditto. In favor. I can help with the release process, as well. On 03/08/2013 02:50 PM, John Vines wrote: Looking over the

Re: LICENSE and NOTICE

2013-02-13 Thread William Slacum
We were so close to "Good news, everyone!" On Wed, Feb 13, 2013 at 6:02 PM, Keith Turner ke...@deenlo.com wrote: That's awesome. I remember when we were initially constructing these files we were trying to figure this out. We looked at what other Apache projects did and could not find a clear

Re: Add Damon Brown to contributors list

2013-02-12 Thread William Slacum
Thanks, Damon! On Tue, Feb 12, 2013 at 1:18 PM, Keith Turner ke...@deenlo.com wrote: Damon, Thanks for your recent patches. I am going to add you to the contributors list on the web page. If you would like an org and timezone listed also, just shoot me an email Keith

Re: ACCUMULO-958 - Pluggable encryption in walogs

2013-01-30 Thread William Slacum
Bottom line, the patch has no value added to general users. The idea behind pushing back a release date to stuff in unoperational code is very bad practice. It sets a precedent for not considering alternative approaches while simultaneously having no justification for choosing the approach we did.

Re: Accumulo 1.6 and beyond feature summit

2013-01-29 Thread William Slacum
at 7:12 PM, William Slacum wilhelm.von.cl...@accumulo.net wrote: I'd like to see: - Data triggers on insertion - REST interface for looking up ranges of keys - A DSL or some other interpreted language for crafting iterators - there's the clojure iterator, but something like python (via

Re: Accumulo 1.6 and beyond feature summit

2013-01-28 Thread William Slacum
I'd like to see: - Data triggers on insertion - REST interface for looking up ranges of keys - A DSL or some other interpreted language for crafting iterators - there's the clojure iterator, but something like python (via jython) or javascript (via rhino) would be more adoptable - Adding a

Re: Accumulo 1.6 and beyond feature summit

2013-01-28 Thread William Slacum
know of an iterator that does this). -Original Message- From: William Slacum [mailto:wilhelm.von.cl...@accumulo.net] Sent: Monday, January 28, 2013 7:13 PM To: dev@accumulo.apache.org Subject: Re: Accumulo 1.6 and beyond feature summit I'd like to see: - Data triggers on insertion

Re: Contributing Organizations

2013-01-03 Thread William Slacum
I support it and a PMC vote. On Wed, Jan 2, 2013 at 6:42 PM, Dave Marion dlmar...@comcast.net wrote: I see 3 proponents and 0 opponents of this idea. Can we put it to a vote? Dave -Original Message- From: Dave Marion [mailto:dlmar...@comcast.net] Sent: Wednesday, December 19, 2012

Re: ingest performance oscillations and Xceivers

2013-01-03 Thread William Slacum
Have you also been tracking compactions? Did you have a query load? On Wed, Jan 2, 2013 at 7:25 PM, Kepner, Jeremy - 0553 - MITLL kep...@ll.mit.edu wrote: Hmmm, that's interesting, because in the past I didn't see this behavior. It might be worth having someone look into because it seems to

Re: ingest performance oscillations and Xceivers

2013-01-02 Thread William Slacum
How many disks do you have? That can be bottle-necking throughput as the number of Xceivers is related to the number of resources (threads, sockets: http://blog.cloudera.com/blog/2012/03/hbase-hadoop-xceivers/) used at once to perform operations. On Tue, Jan 1, 2013 at 6:45 PM, Eric Newton

Re: problems running accumuo

2012-12-28 Thread William Slacum
Did you run `accumulo init`? Do you have a `/accumulo` directory in HDFS? On Fri, Dec 28, 2012 at 9:54 AM, Tim Piety timpi...@gmail.com wrote: I have installed CDH3 and ZooKeeper on a CENTOS 6.3. VM (4G memory). Hadoop and ZooKeeper appear to run fine. I installed accumulo-1.4.2 and believe I

Re: SplitLarge Utility

2012-11-13 Thread William Slacum
If it's used by RFile during a system invoked task, then I'd say leave it. If you want to make a shell friendly interface for invoking it, I'm all for it. On Tue, Nov 13, 2012 at 5:59 AM, David Medinets david.medin...@gmail.com wrote: It is out of place, to me, because the Accumulo Shell should

Re: Key.getColumnFamilyAsBytes - comments about suggested new method?

2012-11-13 Thread William Slacum
For efficiency reasons, I'd leave the methods that take a Text object as-is. This avoids a third copy of the data when a user actually wants it in Text form. On Tue, Nov 13, 2012 at 12:25 PM, David Medinets david.medin...@gmail.com wrote: In Key.java, I see this: public Text

Re: IteratorSetting and priorities

2012-10-31 Thread William Slacum
It's because you're building a stack of iterators and the order you set on the scanner is the order of sources created and passed to init() for each iterator you create in the stack when the scan is executing on a TServer. Albeit deprecated, the filtering API in 1.3 does allow you to set multiple

Re: IteratorSetting and priorities

2012-10-30 Thread William Slacum
The issue with giving multiple iterators the same priority is that the API specifies that during the call to init(), one source is given to the iterator. Now, that iterator can make multiple copies of that source via deepCopy() to make a tree of iterators, but by default it's given one source. In the

Re: Setting Charset in getBytes() call.

2012-10-29 Thread William Slacum
Isn't it easier to just set the JVM property `file.encoding`? On Sun, Oct 28, 2012 at 3:18 PM, Ed Kohlwey ekohl...@gmail.com wrote: If you use a private static field in each class for the charset, it will basically be a singleton because charsets are cached in Charset.forName. IMHO this is a
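For readers comparing the two options in this thread, here is a short Java sketch of the difference between relying on the JVM default charset (which `-Dfile.encoding=...` at startup typically influences) and passing a charset explicitly. The class name `CharsetSketch` is hypothetical, and the sketch does not take a side on which approach the project should use.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration; not Accumulo code.
final class CharsetSketch {

    // Result depends on the JVM default charset, which varies by
    // platform and startup flags.
    static byte[] encodeWithDefault(String s) {
        return s.getBytes();
    }

    // Result is the same on every JVM regardless of configuration.
    static byte[] encodeExplicit(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String s = "caf\u00e9"; // 'e' with acute accent, written as an escape
        System.out.println("default charset: " + Charset.defaultCharset());
        System.out.println("default bytes:   " + encodeWithDefault(s).length);
        System.out.println("UTF-8 bytes:     " + encodeExplicit(s).length);
    }
}
```

On a JVM whose default is a single-byte charset such as ISO-8859-1, the two lengths printed differ, which is the kind of environment-dependent behavior the explicit form avoids.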

Re: Unapproved License Message From assemble/build.sh)

2012-10-22 Thread William Slacum
Billie-- any way around the issue with different versions of rat considering the odp files binary? I'm noticing they're getting marked for me on OSX 10.7.5, and seem to be the difference in the file counts. On Mon, Oct 22, 2012 at 4:55 PM, Michael Flester fles...@gmail.com wrote: I've checked

Re: Running Examples Within Eclipse (Missing Class)

2012-10-19 Thread William Slacum
You need to add the zookeeper jar to the run/debug profile for the class you're executing. On Thu, Oct 18, 2012 at 10:01 PM, David Medinets david.medin...@gmail.com wrote: I imported the Accumulo project into the Spring Tool Suite (which is Eclipse-based) as a maven project. Even seemed fine

Re: JIRA Etiquette / Hackathon Projects

2012-10-08 Thread William Slacum
At some point we had default assignees, so I believe they should be fair game. If there aren't any patches and it's been open for a while, I think that's an even stronger case to work on it (speaking of which, I think I have a ticket or two I need to finish up!). On Mon, Oct 8, 2012 at 11:13 AM,

Re: new committers!

2012-08-06 Thread William Slacum
Thanks guys! I hope to contribute to as many areas as possible, but I'm really interested in helping make Accumulo an easy tool to set up, throw some data at, and pull out data in some meaningful way. To start, I may be giving the Wikipedia example some TLC :) On Mon, Aug 6, 2012 at 1:13 PM, David

[jira] [Commented] (ACCUMULO-702) build on ubuntu hangs without required dependencies

2012-07-25 Thread William Slacum (JIRA)
[ https://issues.apache.org/jira/browse/ACCUMULO-702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13422844#comment-13422844 ] William Slacum commented on ACCUMULO-702: - I'd prefer it if things that make

[jira] [Commented] (ACCUMULO-703) Add PrintInfo shortcut to bin/accumulo

2012-07-25 Thread William Slacum (JIRA)
[ https://issues.apache.org/jira/browse/ACCUMULO-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13422851#comment-13422851 ] William Slacum commented on ACCUMULO-703: - Keith, how/where would I go about
