Let's just move to git. It's almost 2016. I suspect many contributors are probably primarily working off the github mirror anyway. Is there any great argument for delaying? On Dec 15, 2015 11:51 AM, "Mark Miller" <markrmil...@gmail.com> wrote:
> I don't think you will get a volunteer until someone sums up the > discussion with a proposal that someone is not going to veto or something. > We can't expect everyone to read the same tea leaves and come to the same > conclusion. > > Perhaps a stripped down mirror is the consensus. I'd rather we had some > agreement on what we were going to do though, rather than an agreement to > investigate. If we think stripping down is a technically feasible, and no > one is going to violently disagree still, then let's decide to do that. > > - Mark > > > > On Tue, Dec 15, 2015 at 11:39 AM Doug Turnbull < > dturnb...@opensourceconnections.com> wrote: > >> I thought the general consensus at minimum was to investigate a git >> mirror that stripped some artifacts out (jars etc) to lighten up the work >> of the process. If at some point the project switched to git, such a mirror >> might be a suitable git repo for the project with archived older versions >> in SVN. >> >> I think probably what is lacking is a volunteer to figure it all out. >> >> >> -Doug >> >> On Tue, Dec 15, 2015 at 11:32 AM, Mark Miller <markrmil...@gmail.com> >> wrote: >> >>> Anyone willing to lead this discussion to some kind of better >>> resolution? Did that whole back and forth help with any ideas on the best >>> path forward? I know it's a complicated issue, git / svn, the light side, >>> the dark side, but doesn't GitHub also depend on this mirroring? It's going >>> to be super annoying when I can no longer pull from a relatively up to date >>> git remote. >>> >>> Who has boiled down the correct path? >>> >>> - Mark >>> >>> On Wed, Dec 9, 2015 at 6:07 AM Dawid Weiss <dawid.we...@gmail.com> >>> wrote: >>> >>>> FYI. >>>> >>>> - All of Lucene's SVN, incremental deltas, uncompressed: 5.0G >>>> - the above, tar.bz2: 1.2G >>>> >>>> Sadly, I didn't succeed at recreating a local SVN repo from those >>>> incremental dumps. svnadmin load fails with a cryptic error related to >>>> the fact that revision number of node-copy operations refer to >>>> original SVN numbers and they're apparently renumbered on import. >>>> svnadmin isn't smart enough to somehow keep a reference of those >>>> original numbers and svndumpfilter can't work with incremental dump >>>> files... A seemingly trivial task of splitting a repo on a clean >>>> boundary seems incredibly hard with SVN... >>>> >>>> If anybody wishes to play with the dump files, here they are: >>>> http://goo.gl/m6q3J8 >>>> >>>> Dawid >>>> >>>> On Tue, Dec 8, 2015 at 10:49 PM, Upayavira <u...@odoko.co.uk> wrote: >>>> > You can't avoid having the history in SVN. The ASF has one large >>>> repo, and >>>> > won't be deleting that repo, so the history will survive in >>>> perpetuity, >>>> > regardless of what we do now. >>>> > >>>> > Upayavira >>>> > >>>> > On Tue, Dec 8, 2015, at 09:24 PM, Doug Turnbull wrote: >>>> > >>>> > It seems you'd want to preserve that history in a frozen/archiced >>>> Apache Svn >>>> > repo for Lucene. Then make the new git repo slimmer before switching. >>>> Folks >>>> > that want very old versions or doing research can at least go through >>>> the >>>> > original SVN repo. >>>> > >>>> > On Tuesday, December 8, 2015, Dawid Weiss <dawid.we...@gmail.com> >>>> wrote: >>>> > >>>> > One more thing, perhaps of importance, the raw Lucene repo contains >>>> > all the history of projects that then turned top-level (Nutch, >>>> > Mahout). These could also be dropped (or ignored) when converting to >>>> > git. If we agree JARs are not relevant, why should projects not >>>> > directly related to Lucene/ Solr be? >>>> > >>>> > Dawid >>>> > >>>> > On Tue, Dec 8, 2015 at 10:05 PM, Dawid Weiss <dawid.we...@gmail.com> >>>> wrote: >>>> >>> Don’t know how much we have of historic jars in our history. >>>> >> >>>> >> I actually do know. Or will know. In about ~10 hours. I wrote a >>>> script >>>> >> that does the following: >>>> >> >>>> >> 1) git log all revisions touching >>>> https://svn.apache.org/repos/asf/lucene >>>> >> 2) grep revision numbers >>>> >> 3) use svnrdump to get every single commit (revision) above, in >>>> >> incremental mode. >>>> >> >>>> >> This will allow me to: >>>> >> >>>> >> 1) recreate only Lucene/ Solr SVN, locally. >>>> >> 2) measure the size of SVN repo. >>>> >> 3) measure the size of any conversion to git (even if it's one-by-one >>>> >> checkout, then-sync with git). >>>> >> >>>> >> From what I see up until now size should not be an issue at all. Even >>>> >> with all binary blobs so far the SVN incremental dumps measure ~3.7G >>>> >> (and I'm about 75% done). There is one interesting super-large >>>> commit, >>>> >> this one: >>>> >> >>>> >> svn log -r1240618 https://svn.apache.org/repos/asf/lucene >>>> >> >>>> ------------------------------------------------------------------------ >>>> >> r1240618 | gsingers | 2012-02-04 22:45:17 +0100 (Sat, 04 Feb 2012) | >>>> 1 >>>> >> line >>>> >> >>>> >> LUCENE-2748: bring in old Lucene docs >>>> >> >>>> >> This commit diff weights... wait for it... 1.3G! I didn't check what >>>> >> it actually was. >>>> >> >>>> >> Will keep you posted. >>>> >> >>>> >> D. >>>> > >>>> > --------------------------------------------------------------------- >>>> > To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org >>>> > For additional commands, e-mail: dev-h...@lucene.apache.org >>>> > >>>> > >>>> > >>>> > >>>> > -- >>>> > Doug Turnbull | Search Relevance Consultant | OpenSource Connections, >>>> LLC | >>>> > 240.476.9983 >>>> > Author:Relevant Search >>>> > This e-mail and all contents, including attachments, is considered to >>>> be >>>> > Company Confidential unless explicitly stated otherwise, regardless of >>>> > whether attachments are marked as such. >>>> > >>>> > >>>> >>>> --------------------------------------------------------------------- >>>> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org >>>> For additional commands, e-mail: dev-h...@lucene.apache.org >>>> >>>> -- >>> - Mark >>> about.me/markrmiller >>> >> >> >> >> -- >> *Doug Turnbull **| *Search Relevance Consultant | OpenSource Connections >> <http://opensourceconnections.com>, LLC | 240.476.9983 >> Author: Relevant Search <http://manning.com/turnbull> >> This e-mail and all contents, including attachments, is considered to be >> Company Confidential unless explicitly stated otherwise, regardless >> of whether attachments are marked as such. >> > -- > - Mark > about.me/markrmiller >