On 02/01/2013, at 10:45 AM, Hervé BOUTEMY <herve.bout...@free.fr> wrote:

> Le lundi 24 décembre 2012 14:17:02 Brett Porter a écrit :
>> I had the same feeling pushing up Continuum's Maven site recently...
>> 
>> On 23/12/2012, at 9:36 AM, Olivier Lamy <ol...@apache.org> wrote:
>>> 2012/12/22 Kristian Rosenvold <kristian.rosenv...@zenior.no>:
>>>> Svnsubpub is just ridiculously inefficient and we need to do something
>>>> about it...
>>> 
>>> That remember me old time when pushing maven sites to sourceforge tru
>>> wagon ftp ... :-)
>> 
>> Is the main problem the file-by-file nature, or something else (such as the
>> size)?
> I tried to measure and find facts about this: my conclusion is that it is the 
> file-by-file nature
> exactly like webdav
> notice that we have really a bunch of generated files: did you realize that 
> maven-scm site has more files than maven core? that it represents 131 MB, 
> tens 
> of thousand of files?

Yes, similar size in Continuum which I've been working on migrating to 
svnpubsub recently.

> 
> IMHO, if we stored zip files of documentations (or tar, the archive format 
> can 
> be discussed) and httpd could serve their content on the fly like if they 
> were 
> unzipped, this would be awesome
> Apparently, coding an lua script could do the job: just need to find somebody 
> able to do it :)

I've raised this topic and that thread on #asfinfra and will try and follow up 
- I know this was flagged by them some time ago.

> 
> any other idea?

For now I'm just focusing on reducing the number of changes each site 
deployment to make it a smaller checkin.

See: http://continuum.apache.org/development/publishing-site.html

What I'm doing there is deploying to /docs/latest each time, then copying that 
to the versioned location on the server-side. That way, the next version is 
just the diff against the last, not a whole new one.

There are a few things that cause changes across generation runs:
- the javadocs generate a timestamp, and this is the biggest culprit as it is 
the largest number of files. I'm going to add -notimestamp tomorrow and see if 
I can get sequential deploys to remain unmodified
- most skins incorporated a timestamp that changes each build (I've removed 
that in Continuum's skin). The publish date is still there, but it doesn't seem 
to be a big problem.
- the dependencies report seems to change between runs

For other reports the change more frequently, I've stopped publishing them to 
the main site. There is Sonar for most of it at http://analysis.apache.org/, or 
you can publish them to a filesystem staging in CI and view them through the 
workspace.

With these sorts of improvements, incremental deployments will actually be 
better than shipping up a large ZIP that gets unpacked, even though it is 
mostly unchanged.

- Brett

--
Brett Porter
br...@apache.org
http://brettporter.wordpress.com/
http://au.linkedin.com/in/brettporter
http://twitter.com/brettporter






---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@maven.apache.org
For additional commands, e-mail: dev-h...@maven.apache.org

Reply via email to