Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jon Ciesla

Matthew Booth wrote:
The separate updates directory has been a pain for as long as I've 
been using RHL/Fedora Core/Fedora. It means you have two places to 
look when searching for packages manually, and twice as much to 
configure when you're configuring yum. It has never benefitted me, or 
anybody I know, but it has caught me out on any number of occasions. 
What's more, nobody really seems to know why it's like that: it seems 
it's always been that way, and nobody ever bother to fix it.


So lets fix it. The package set at release time is only interesting to 
historians. If any of them are really that bothered, I'm sure somebody 
can come up with a yum module which finds the oldest available version 
of a package in a repo.


Matt
Would not this also provide the minor added benefit that there could now 
be a drpm for the first update for a package?


-J

--
in your fear, seek only peace
in your fear, seek only love

-d. bowie

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 03:39 PM, Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've been
using RHL/Fedora Core/Fedora. It means you have two places to look when
searching for packages manually, and twice as much to configure when
you're configuring yum. It has never benefitted me, or anybody I know,
but it has caught me out on any number of occasions. What's more, nobody
really seems to know why it's like that: it seems it's always been that
way, and nobody ever bother to fix it.

So lets fix it. The package set at release time is only interesting to
historians. If any of them are really that bothered, I'm sure somebody
can come up with a yum module which finds the oldest available version
of a package in a repo.


So your proposal is to drop updates, but to add updates to "Everything"?

In this case, I am all for it.

Ralf


--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Robert Marcano

On 12/02/2009 10:30 AM, Jon Ciesla wrote:

Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've
been using RHL/Fedora Core/Fedora. It means you have two places to
look when searching for packages manually, and twice as much to
configure when you're configuring yum. It has never benefitted me, or
anybody I know, but it has caught me out on any number of occasions.
What's more, nobody really seems to know why it's like that: it seems
it's always been that way, and nobody ever bother to fix it.

So lets fix it. The package set at release time is only interesting to
historians. If any of them are really that bothered, I'm sure somebody
can come up with a yum module which finds the oldest available version
of a package in a repo.

Matt

Would not this also provide the minor added benefit that there could now
be a drpm for the first update for a package?


I like the proposal, the only drawback I found is that when a new Fedora 
release is out we mirror (local) the updates repository with priority, 
and the big Everything repository is mirrored using a small bandwidth 
setting because it is not needed normally for the first updates (updates 
that add new dependecies are not common the first weeks)




-J



--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Josh Boyer
On Wed, Dec 02, 2009 at 09:00:53AM -0600, Jon Ciesla wrote:
> Matthew Booth wrote:
>> The separate updates directory has been a pain for as long as I've  
>> been using RHL/Fedora Core/Fedora. It means you have two places to  
>> look when searching for packages manually, and twice as much to  
>> configure when you're configuring yum. It has never benefitted me, or  
>> anybody I know, but it has caught me out on any number of occasions.  
>> What's more, nobody really seems to know why it's like that: it seems  
>> it's always been that way, and nobody ever bother to fix it.
>>
>> So lets fix it. The package set at release time is only interesting to  
>> historians. If any of them are really that bothered, I'm sure somebody  
>> can come up with a yum module which finds the oldest available version  
>> of a package in a repo.
>>
>> Matt
> Would not this also provide the minor added benefit that there could now  
> be a drpm for the first update for a package?

We already have that if the update is done after GA.

josh

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Josh Boyer
On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:
> The separate updates directory has been a pain for as long as I've been  
> using RHL/Fedora Core/Fedora. It means you have two places to look when  
> searching for packages manually, and twice as much to configure when  
> you're configuring yum. It has never benefitted me, or anybody I know,  
> but it has caught me out on any number of occasions. What's more, nobody  
> really seems to know why it's like that: it seems it's always been that  
> way, and nobody ever bother to fix it.
>
> So lets fix it. 

And how do you propose to do that?

josh

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matthew Booth

On 02/12/09 15:26, Josh Boyer wrote:

On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've been
using RHL/Fedora Core/Fedora. It means you have two places to look when
searching for packages manually, and twice as much to configure when
you're configuring yum. It has never benefitted me, or anybody I know,
but it has caught me out on any number of occasions. What's more, nobody
really seems to know why it's like that: it seems it's always been that
way, and nobody ever bother to fix it.

So lets fix it.


And how do you propose to do that?


Unfortunately I'm not intimately familiar with Fedora infrastructure 
under-the-hood. However, I would hope that, technically at least, it 
wouldn't be much more than a systematic removal of all updates 
repositories and redirecting files which would have gone into them into 
the main repositories instead. This stems from the observation that if 
you copied everything from the main repository into updates you would 
have created the repository which people unfamiliar with the history 
would expect. The infrastructure side of this, on the face of it, sounds 
very simple.


More involved would be:

1. Updating all packages which expect 2 repos to only expect 1.
2. Communicating the change effectively.

Matt
--
Matthew Booth, RHCA, RHCSS
Red Hat Engineering, Virtualisation Team

M:   +44 (0)7977 267231
GPG ID:  D33C3490
GPG FPR: 3733 612D 2D05 5458 8A8A 1600 3441 EA19 D33C 3490

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jon Ciesla

Josh Boyer wrote:

On Wed, Dec 02, 2009 at 09:00:53AM -0600, Jon Ciesla wrote:
  

Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've  
been using RHL/Fedora Core/Fedora. It means you have two places to  
look when searching for packages manually, and twice as much to  
configure when you're configuring yum. It has never benefitted me, or  
anybody I know, but it has caught me out on any number of occasions.  
What's more, nobody really seems to know why it's like that: it seems  
it's always been that way, and nobody ever bother to fix it.


So lets fix it. The package set at release time is only interesting to  
historians. If any of them are really that bothered, I'm sure somebody  
can come up with a yum module which finds the oldest available version  
of a package in a repo.


Matt
  
Would not this also provide the minor added benefit that there could now  
be a drpm for the first update for a package?



We already have that if the update is done after GA.

josh

  
If that's the case, then good. In that case, I see no huge benefit to 
leaving it or changing it.


--
in your fear, seek only peace
in your fear, seek only love

-d. bowie

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Justin M. Forbes
On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:
> The separate updates directory has been a pain for as long as I've been 
> using RHL/Fedora Core/Fedora. It means you have two places to look when 
> searching for packages manually, and twice as much to configure when you're 
> configuring yum. It has never benefitted me, or anybody I know, but it has 
> caught me out on any number of occasions. What's more, nobody really seems 
> to know why it's like that: it seems it's always been that way, and nobody 
> ever bother to fix it.
>
The only downside to merging updates into the main repository is that
network installs from the repository will no longer install the "release"
they will install the updated release.  QA that goes into that first
impression is no longer there, and your installs are not repeatable because
installing system A on one day and system B on another could end up with
different versions of packages.  Of course you can always install from the
ISOs to avoid these problems. As things are, you have the choice of the
"release" as it was, or the updated release at install time.  I am not
saying that this is a bad idea, in fact I rather like it, but there are
things to consider.

Justin

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Josh Boyer
On Wed, Dec 02, 2009 at 03:44:08PM +, Matthew Booth wrote:
> On 02/12/09 15:26, Josh Boyer wrote:
>> On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:
>>> The separate updates directory has been a pain for as long as I've been
>>> using RHL/Fedora Core/Fedora. It means you have two places to look when
>>> searching for packages manually, and twice as much to configure when
>>> you're configuring yum. It has never benefitted me, or anybody I know,
>>> but it has caught me out on any number of occasions. What's more, nobody
>>> really seems to know why it's like that: it seems it's always been that
>>> way, and nobody ever bother to fix it.
>>>
>>> So lets fix it.
>>
>> And how do you propose to do that?
>
> Unfortunately I'm not intimately familiar with Fedora infrastructure  
> under-the-hood. However, I would hope that, technically at least, it  
> wouldn't be much more than a systematic removal of all updates  
> repositories and redirecting files which would have gone into them into  
> the main repositories instead. This stems from the observation that if  
> you copied everything from the main repository into updates you would  
> have created the repository which people unfamiliar with the history  
> would expect. The infrastructure side of this, on the face of it, sounds  
> very simple.

What you describe is effectively how the development repository is built,
so it's certainly a technical possibility.  It does have implications
though.  Off the top of my head, I can think of:

1) Composing a new everything tree for updates would lead to larger
compose times.  That could possibly mean that getting updates out would
take > 1 day per 'push'.  We've been trying to improve updates push
times so it would be a bit detrimental to that goal.

2) There might be GPL compliance issues

3) You would still need an 'updates-testing' repository given that this
is a supposedly stable release.  So there is still going to be at least
one additional repo regardless.

However, other than 'browsing manually for packages', I'm not really
sure what problem you are trying to solve by getting rid of the
updates repository.  It would seem like this has quite a bit of cost
for relatively little to no real gain?

josh

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Bill Nottingham
Matthew Booth (mbo...@redhat.com) said: 
> The separate updates directory has been a pain for as long as I've
> been using RHL/Fedora Core/Fedora. It means you have two places to
> look when searching for packages manually, and twice as much to
> configure when you're configuring yum. It has never benefitted me,
> or anybody I know, but it has caught me out on any number of
> occasions. What's more, nobody really seems to know why it's like
> that: it seems it's always been that way, and nobody ever bother to
> fix it.
> 
> So lets fix it. The package set at release time is only interesting
> to historians. If any of them are really that bothered, I'm sure
> somebody can come up with a yum module which finds the oldest
> available version of a package in a repo.

The separate Everything tree that does not get obsoleted is required
in some form for GPL compliance, with respect to the ISO images that
we ship. Any new solution would have to preserve this.

Bill

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Naheem Zaffar
2009/12/2 Justin M. Forbes 

> The only downside to merging updates into the main repository...


I would also assume that the repo data will need to be regenerated and often
be much larger than the one that is for the updates only repository, so
there will be acost to end users with this proposed change?
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Daniel P. Berrange
On Wed, Dec 02, 2009 at 10:01:51AM -0600, Justin M. Forbes wrote:
> On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:
> > The separate updates directory has been a pain for as long as I've been 
> > using RHL/Fedora Core/Fedora. It means you have two places to look when 
> > searching for packages manually, and twice as much to configure when you're 
> > configuring yum. It has never benefitted me, or anybody I know, but it has 
> > caught me out on any number of occasions. What's more, nobody really seems 
> > to know why it's like that: it seems it's always been that way, and nobody 
> > ever bother to fix it.
> >
> The only downside to merging updates into the main repository is that
> network installs from the repository will no longer install the "release"
> they will install the updated release.  QA that goes into that first
> impression is no longer there, and your installs are not repeatable because
> installing system A on one day and system B on another could end up with
> different versions of packages.  Of course you can always install from the
> ISOs to avoid these problems. As things are, you have the choice of the
> "release" as it was, or the updated release at install time.  I am not
> saying that this is a bad idea, in fact I rather like it, but there are
> things to consider.

I think this is quite a compelling reason not to touch it. Much as we'd
like the updates repository to always be perfectly stable, there are 
still plenty of occassions when we hit problems with updates. Being able
to reliably install the release at all times is pretty critical & saying
we should install off ISO if we want a reliable install is not really an
acceptable alternative since many people do all their installs straight
off the network.

Daniel
-- 
|: Red Hat, Engineering, London   -o-   http://people.redhat.com/berrange/ :|
|: http://libvirt.org  -o-  http://virt-manager.org  -o-  http://ovirt.org :|
|: http://autobuild.org   -o- http://search.cpan.org/~danberr/ :|
|: GnuPG: 7D3B9505  -o-  F3C9 553F A1DA 4AC2 5648 23C1 B3DF F742 7D3B 9505 :|

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 05:09 PM, Bill Nottingham wrote:

Matthew Booth (mbo...@redhat.com) said:

The separate updates directory has been a pain for as long as I've
been using RHL/Fedora Core/Fedora. It means you have two places to
look when searching for packages manually, and twice as much to
configure when you're configuring yum. It has never benefitted me,
or anybody I know, but it has caught me out on any number of
occasions. What's more, nobody really seems to know why it's like
that: it seems it's always been that way, and nobody ever bother to
fix it.

So lets fix it. The package set at release time is only interesting
to historians. If any of them are really that bothered, I'm sure
somebody can come up with a yum module which finds the oldest
available version of a package in a repo.


The separate Everything tree that does not get obsoleted is required
in some form for GPL compliance, with respect to the ISO images that
we ship.

Isn't this the "Fedora" repo?

To my knowledge the "Fedora" repos corresponds 1:1 to the isos.

Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Paul W. Frields
On Wed, Dec 02, 2009 at 11:09:41AM -0500, Bill Nottingham wrote:
> Matthew Booth (mbo...@redhat.com) said: 
> > The separate updates directory has been a pain for as long as I've
> > been using RHL/Fedora Core/Fedora. It means you have two places to
> > look when searching for packages manually, and twice as much to
> > configure when you're configuring yum. It has never benefitted me,
> > or anybody I know, but it has caught me out on any number of
> > occasions. What's more, nobody really seems to know why it's like
> > that: it seems it's always been that way, and nobody ever bother to
> > fix it.
> > 
> > So lets fix it. The package set at release time is only interesting
> > to historians. If any of them are really that bothered, I'm sure
> > somebody can come up with a yum module which finds the oldest
> > available version of a package in a repo.
> 
> The separate Everything tree that does not get obsoleted is required
> in some form for GPL compliance, with respect to the ISO images that
> we ship. Any new solution would have to preserve this.

Might there also be export compliance implications too?

-- 
Paul W. Frieldshttp://paul.frields.org/
  gpg fingerprint: 3DA6 A0AC 6D58 FEC4 0233  5906 ACDB C937 BD11 3717
  http://redhat.com/   -  -  -  -   http://pfrields.fedorapeople.org/
  irc.freenode.net: stickster @ #fedora-docs, #fedora-devel, #fredlug

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Kevin Fenzi
On Wed, 02 Dec 2009 17:27:17 +0100
Ralf Corsepius  wrote:

> On 12/02/2009 05:09 PM, Bill Nottingham wrote:
> > Matthew Booth (mbo...@redhat.com) said:
> >> The separate updates directory has been a pain for as long as I've
> >> been using RHL/Fedora Core/Fedora. It means you have two places to
> >> look when searching for packages manually, and twice as much to
> >> configure when you're configuring yum. It has never benefitted me,
> >> or anybody I know, but it has caught me out on any number of
> >> occasions. What's more, nobody really seems to know why it's like
> >> that: it seems it's always been that way, and nobody ever bother to
> >> fix it.
> >>
> >> So lets fix it. The package set at release time is only interesting
> >> to historians. If any of them are really that bothered, I'm sure
> >> somebody can come up with a yum module which finds the oldest
> >> available version of a package in a repo.
> >
> > The separate Everything tree that does not get obsoleted is required
> > in some form for GPL compliance, with respect to the ISO images that
> > we ship.
> Isn't this the "Fedora" repo?
> 
> To my knowledge the "Fedora" repos corresponds 1:1 to the isos.

To the DVD iso, yes. 

To any of the spins/desktop/live... nope. They use packages from
Everything/

kevin


signature.asc
Description: PGP signature
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 05:32 PM, Kevin Fenzi wrote:

On Wed, 02 Dec 2009 17:27:17 +0100
Ralf Corsepius  wrote:


On 12/02/2009 05:09 PM, Bill Nottingham wrote:

Matthew Booth (mbo...@redhat.com) said:

The separate updates directory has been a pain for as long as I've
been using RHL/Fedora Core/Fedora. It means you have two places to
look when searching for packages manually, and twice as much to
configure when you're configuring yum. It has never benefitted me,
or anybody I know, but it has caught me out on any number of
occasions. What's more, nobody really seems to know why it's like
that: it seems it's always been that way, and nobody ever bother to
fix it.

So lets fix it. The package set at release time is only interesting
to historians. If any of them are really that bothered, I'm sure
somebody can come up with a yum module which finds the oldest
available version of a package in a repo.


The separate Everything tree that does not get obsoleted is required
in some form for GPL compliance, with respect to the ISO images that
we ship.

Isn't this the "Fedora" repo?

To my knowledge the "Fedora" repos corresponds 1:1 to the isos.


To the DVD iso, yes.

To any of the spins/desktop/live... nope. They use packages from
Everything/


A question however remains:
Does the FSF/GPL demand to keep a repo around for ISOs?
A "rolling Everything" would not touch the ISOs. They would still be around.

Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Bill Nottingham
Ralf Corsepius (rc040...@freenet.de) said: 
> Does the FSF/GPL demand to keep a repo around for ISOs?
> A "rolling Everything" would not touch the ISOs. They would still be around.

The LiveCD/spins satisfy their source requirements via the source
repositories; they do not compose separate live source images.

Bill

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Casey Dahlin
On Wed, Dec 02, 2009 at 11:28:24AM -0500, Paul W. Frields wrote:
> > 
> > The separate Everything tree that does not get obsoleted is required
> > in some form for GPL compliance, with respect to the ISO images that
> > we ship. Any new solution would have to preserve this.
> 
> Might there also be export compliance implications too?

What manner of export issues? This change doesn't, afaik, affect where
export-restricted software appears in any way.

--CJD

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Casey Dahlin
On Wed, Dec 02, 2009 at 11:06:22AM -0500, Josh Boyer wrote:
> 1) Composing a new everything tree for updates would lead to larger
> compose times.  That could possibly mean that getting updates out would
> take > 1 day per 'push'.  We've been trying to improve updates push
> times so it would be a bit detrimental to that goal.
> 

This might be a good opportunity to look at some way of pushing
incremental sets of packages rather than re-building the entire yum repo
each time. Mashing is not a cheap operation.

> 2) There might be GPL compliance issues
> 
> 3) You would still need an 'updates-testing' repository given that this
> is a supposedly stable release.  So there is still going to be at least
> one additional repo regardless.
> 

We have tags for updates that mark them as bugfix/feature/security.
Maybe we could have one for "testing" which would keep yum from
installing the package unless explicitly asked or specially configured.

> However, other than 'browsing manually for packages', I'm not really
> sure what problem you are trying to solve by getting rid of the
> updates repository.  It would seem like this has quite a bit of cost
> for relatively little to no real gain?
> 

This is true. I don't know that manual package browsing is a use case
we've ever been interested in, and if we're going to be interested in it
there's probably a better way (search engine in the Fedora community
portal comes to mind).

> josh

You owe me $5.

--CJD

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Paul W. Frields wrote:


On Wed, Dec 02, 2009 at 11:09:41AM -0500, Bill Nottingham wrote:

we ship. Any new solution would have to preserve this.


Might there also be export compliance implications too?



A larger isssue is constantly having the repodata for the everything repo 
be:


1. growing
2. changing

So now instead of having a 15K pkg repo that never changes you have an 
ever-growing repo that never shrinks in size that changes what? 3 times a 
week?


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 06:01 PM, Casey Dahlin wrote:

On Wed, Dec 02, 2009 at 11:06:22AM -0500, Josh Boyer wrote:



However, other than 'browsing manually for packages', I'm not really
sure what problem you are trying to solve by getting rid of the
updates repository.  It would seem like this has quite a bit of cost
for relatively little to no real gain?



This is true.

It is not true.

* It shifts "costs" from "users" to "vendor"
and from "mirrors" to "master".
* It helps users who are using networked installs to spare bandwidth 
(avoids downloading obsolete packages from "Everything"/"Fedora").


Admitted, for most users, it would change almost nothing.

Ralf


--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matthew Booth

On 02/12/09 16:01, Justin M. Forbes wrote:

On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've been
using RHL/Fedora Core/Fedora. It means you have two places to look when
searching for packages manually, and twice as much to configure when you're
configuring yum. It has never benefitted me, or anybody I know, but it has
caught me out on any number of occasions. What's more, nobody really seems
to know why it's like that: it seems it's always been that way, and nobody
ever bother to fix it.


The only downside to merging updates into the main repository is that
network installs from the repository will no longer install the "release"
they will install the updated release.  QA that goes into that first
impression is no longer there, and your installs are not repeatable because
installing system A on one day and system B on another could end up with
different versions of packages.  Of course you can always install from the
ISOs to avoid these problems. As things are, you have the choice of the
"release" as it was, or the updated release at install time.  I am not
saying that this is a bad idea, in fact I rather like it, but there are
things to consider.


That's not a bug, it's a feature! Seriously, wasn't it a specific F10 or 
F11 feature that anaconda allowed the user to specify that updates be 
installed at installation time? Certainly I used to have Debianites crow 
at me that my installation was vulnerable until I updated it. Not only 
that, but I had to download updated packages twice. These days I always 
install with updates. I find the installation argument dubious at best.


Still, we could rename the main repo the 'installation' repo, and 
nothing but anaconda would ever look at it. Everything else would look 
at the main repo, which would be the same as the current updates repo 
except with everything in it from the start.


Matt
--
Matthew Booth, RHCA, RHCSS
Red Hat Engineering, Virtualisation Team

M:   +44 (0)7977 267231
GPG ID:  D33C3490
GPG FPR: 3733 612D 2D05 5458 8A8A 1600 3441 EA19 D33C 3490

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 14:39 +, Matthew Booth wrote:
> The separate updates directory has been a pain for as long as I've been 
> using RHL/Fedora Core/Fedora. It means you have two places to look when 
> searching for packages manually, and twice as much to configure when 
> you're configuring yum. It has never benefitted me, or anybody I know, 
> but it has caught me out on any number of occasions. What's more, nobody 
> really seems to know why it's like that: it seems it's always been that 
> way, and nobody ever bother to fix it.
> 
> So lets fix it. The package set at release time is only interesting to 
> historians. If any of them are really that bothered, I'm sure somebody 
> can come up with a yum module which finds the oldest available version 
> of a package in a repo.
> 

This runs us into a few different problems.

1) current compose tools do a fresh compose of a tag every time, into a
fresh directory.  It is not easy to just 'add' newer things to a
directory and only keep X number around.

2) the metadata generation step for the Everything repo is long, and
very resource intensive.  Doing it every time we push updates would
cause unreasonable stress on the infrastructure and the people doing the
pushes.

3) the repodata for the Everything repo is huge.  Forcing users to
download the entire thing every day or so as updates are pushed would
cause unreasonable stress on the users's bandwidth, local machine
resources to parse it, and the servers offering it for download.

I think there are more, but those above are enough for me to not persue
this avenue currently.  The pros of this change, which seem small to me,
don't outweigh the above cons, and we have far more other issues we
could be spending time and resources on.


-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matthew Booth

On 02/12/09 16:09, Bill Nottingham wrote:

Matthew Booth (mbo...@redhat.com) said:

The separate updates directory has been a pain for as long as I've
been using RHL/Fedora Core/Fedora. It means you have two places to
look when searching for packages manually, and twice as much to
configure when you're configuring yum. It has never benefitted me,
or anybody I know, but it has caught me out on any number of
occasions. What's more, nobody really seems to know why it's like
that: it seems it's always been that way, and nobody ever bother to
fix it.

So lets fix it. The package set at release time is only interesting
to historians. If any of them are really that bothered, I'm sure
somebody can come up with a yum module which finds the oldest
available version of a package in a repo.


The separate Everything tree that does not get obsoleted is required
in some form for GPL compliance, with respect to the ISO images that
we ship. Any new solution would have to preserve this.


This argument sounds weird to me. However, there no reason you can't 
keep around as many repositories as you like as long as the One True 
Repository exists and people can use it exclusively. Currently it 
doesn't exist.


Matt
--
Matthew Booth, RHCA, RHCSS
Red Hat Engineering, Virtualisation Team

M:   +44 (0)7977 267231
GPG ID:  D33C3490
GPG FPR: 3733 612D 2D05 5458 8A8A 1600 3441 EA19 D33C 3490

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 18:09 +0100, Ralf Corsepius wrote:
> 
> * It shifts "costs" from "users" to "vendor"
> and from "mirrors" to "master".
> * It helps users who are using networked installs to spare bandwidth 
> (avoids downloading obsolete packages from "Everything"/"Fedora").
> 
> Admitted, for most users, it would change almost nothing.
> 
> 

People doing network installs can either add the updates repo to their
kickstart, or check the box in the anaconda UI, so that the updates
repos are considered at install time.  No download of duplicate data.

In fact, having separate repos would likely cost less bandwidth.  If we
only had one combined repo, there would be many duplicate packages,
especially if we went the route of having updates-testing mixed in and
only marked by some update tag.  We'd have to keep sets of what's in
updates-testing, updates, and the GA release set, and all of that would
be in one repodata set.  Everybody doing a network install, whether they
wanted updates, updates-testing, or not would have to download and
consume that larger repodata, introducing a higher cost for them.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Colin Walters
On Wed, Dec 2, 2009 at 5:37 PM, Matthew Booth  wrote:
>
> That's not a bug, it's a feature! Seriously, wasn't it a specific F10 or F11
> feature that anaconda allowed the user to specify that updates be installed
> at installation time? Certainly I used to have Debianites crow at me that my
> installation was vulnerable until I updated it. Not only that, but I had to
> download updated packages twice. These days I always install with updates. I
> find the installation argument dubious at best.

This could be equivalently fixed by have the desktop check on initial
login if there are any updates available before letting you launch say
the web browser.  However - we have bigger issues related to updates
to fix.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matthew Booth

On 02/12/09 16:06, Josh Boyer wrote:

On Wed, Dec 02, 2009 at 03:44:08PM +, Matthew Booth wrote:

On 02/12/09 15:26, Josh Boyer wrote:

On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've been
using RHL/Fedora Core/Fedora. It means you have two places to look when
searching for packages manually, and twice as much to configure when
you're configuring yum. It has never benefitted me, or anybody I know,
but it has caught me out on any number of occasions. What's more, nobody
really seems to know why it's like that: it seems it's always been that
way, and nobody ever bother to fix it.

So lets fix it.


And how do you propose to do that?


Unfortunately I'm not intimately familiar with Fedora infrastructure
under-the-hood. However, I would hope that, technically at least, it
wouldn't be much more than a systematic removal of all updates
repositories and redirecting files which would have gone into them into
the main repositories instead. This stems from the observation that if
you copied everything from the main repository into updates you would
have created the repository which people unfamiliar with the history
would expect. The infrastructure side of this, on the face of it, sounds
very simple.


What you describe is effectively how the development repository is built,
so it's certainly a technical possibility.  It does have implications
though.  Off the top of my head, I can think of:

1) Composing a new everything tree for updates would lead to larger
compose times.  That could possibly mean that getting updates out would
take>  1 day per 'push'.  We've been trying to improve updates push
times so it would be a bit detrimental to that goal.


I can't comment on this.


2) There might be GPL compliance issues


This line of reasoning sounds bizarre to me. You can publish things in 
multiple places.



3) You would still need an 'updates-testing' repository given that this
is a supposedly stable release.  So there is still going to be at least
one additional repo regardless.


Indeed, however that would only be testers. Most people can ignore it.


However, other than 'browsing manually for packages', I'm not really
sure what problem you are trying to solve by getting rid of the
updates repository.  It would seem like this has quite a bit of cost
for relatively little to no real gain?


Any tool which deals with repositories requires the repo to be 
configured twice. Off the top of my head I can think of:


1. yum
separate repo and updates-repo in yum.conf.
2. livecd tools
two repos in kickstart
3. revisor
two repos in kickstart
4. febootstrap
two repos on command line

Given that you almost always want updates, it would an improvement if 
all of the above could be replaced with a single configuration.


Matt
--
Matthew Booth, RHCA, RHCSS
Red Hat Engineering, Virtualisation Team

M:   +44 (0)7977 267231
GPG ID:  D33C3490
GPG FPR: 3733 612D 2D05 5458 8A8A 1600 3441 EA19 D33C 3490

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 14:39 +, Matthew Booth wrote:
> The separate updates directory has been a pain for as long as I've been 
> using RHL/Fedora Core/Fedora. It means you have two places to look when 
> searching for packages manually, and twice as much to configure when 
> you're configuring yum. It has never benefitted me, or anybody I know, 
> but it has caught me out on any number of occasions. What's more, nobody 
> really seems to know why it's like that: it seems it's always been that 
> way, and nobody ever bother to fix it. 

If the real motivation is "I want to manually browse a single directory
for all the packages" I have an alternate proposal.

createrepo has the ability to take a list of packages (as paths) for
input.  I hypothesize that we could place all rpms for a given release
in a single directory (seth will hate this as he wants to split them up
based on first letter of their name for better filesystem performance),
yet still maintain different repo paths for the different logical
separation of rpms.  One path would be the "Everything" repo, which
would have repodata for the GA versions of the packages.  Another path
would be the "updates" repo, which has repodata for the current set of
stable updates, and a third path would be the "updates-testing" repo,
which has repodata for the current set of testing updates.  All the
repodata would reference files in a central directory (or directory
tree).

You could achieve a single place to manually look for packages, whilst
users would still have logically separated repository metadata.

Of course, I'm papering over the amount of work it would take to modify
our compose tools to perform in this way, and the added work mirrors
would have (they don't normally have to scan the Everything tree for
changes, but now they'd have to scan a giant tree of rpms every rsync to
see if anything changed), and the added complexity of trying to keep
track of which packages are active in the repos and which aren't, and
keeping the central directory pruned of obsolete packages.

I'm certainly not signing up to work on this, but I am offering this as
an alternative approach.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread James Cassell
On Wed, 02 Dec 2009 12:47:17 -0500, Matthew Booth   
wrote:




Any tool which deals with repositories requires the repo to be  
configured twice. Off the top of my head I can think of:

 1. yum
separate repo and updates-repo in yum.conf.
2. livecd tools
two repos in kickstart
3. revisor
two repos in kickstart
4. febootstrap
two repos on command line
 Given that you almost always want updates, it would an improvement if  
all of the above could be replaced with a single configuration.

 Matt


So, create a meta-repo that just says "I'm the combination of these  
several repos over here"?  Of course, it'd require modification of all the  
programs that know how to talk to repos...


Personally, I think the current repo setup is just fine as it is.

--
James Cassell

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matthew Booth

On 02/12/09 17:40, Jesse Keating wrote:

On Wed, 2009-12-02 at 18:09 +0100, Ralf Corsepius wrote:


* It shifts "costs" from "users" to "vendor"
and from "mirrors" to "master".
* It helps users who are using networked installs to spare bandwidth
(avoids downloading obsolete packages from "Everything"/"Fedora").

Admitted, for most users, it would change almost nothing.




People doing network installs can either add the updates repo to their
kickstart, or check the box in the anaconda UI, so that the updates
repos are considered at install time.  No download of duplicate data.

In fact, having separate repos would likely cost less bandwidth.  If we
only had one combined repo, there would be many duplicate packages,
especially if we went the route of having updates-testing mixed in and
only marked by some update tag.  We'd have to keep sets of what's in
updates-testing, updates, and the GA release set, and all of that would
be in one repodata set.  Everybody doing a network install, whether they
wanted updates, updates-testing, or not would have to download and
consume that larger repodata, introducing a higher cost for them.


There's an argument for separate updates-testing.

However, my original point was that nobody is interested in the original 
GA release set except historians. People should, and we rightly help 
them, be installing updates frequently.


So, people doing network installs fall into:

People who want to install GA only: Hopefully nobody. The only good 
reason to do this is if the distro is broken.


People who want to install GA+updates: Everybody except testers.

People who want GA+updates+testing: testers.

The GA package could be kept around as a separate, static repo nobody 
uses under normal circumstances. Combining GA+updates into a single repo 
would not consume additional bandwidth for anybody at all, and only 
testers would have to do any additional configuration.


Matt
--
Matthew Booth, RHCA, RHCSS
Red Hat Engineering, Virtualisation Team

M:   +44 (0)7977 267231
GPG ID:  D33C3490
GPG FPR: 3733 612D 2D05 5458 8A8A 1600 3441 EA19 D33C 3490

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Daniel P. Berrange
On Wed, Dec 02, 2009 at 05:47:17PM +, Matthew Booth wrote:
> On 02/12/09 16:06, Josh Boyer wrote:
> >On Wed, Dec 02, 2009 at 03:44:08PM +, Matthew Booth wrote:
> >>On 02/12/09 15:26, Josh Boyer wrote:
> >>>On Wed, Dec 02, 2009 at 02:39:30PM +, Matthew Booth wrote:
> The separate updates directory has been a pain for as long as I've been
> using RHL/Fedora Core/Fedora. It means you have two places to look when
> searching for packages manually, and twice as much to configure when
> you're configuring yum. It has never benefitted me, or anybody I know,
> but it has caught me out on any number of occasions. What's more, nobody
> really seems to know why it's like that: it seems it's always been that
> way, and nobody ever bother to fix it.
> 
> So lets fix it.
> >>>
> >>>And how do you propose to do that?
> >>
> >>Unfortunately I'm not intimately familiar with Fedora infrastructure
> >>under-the-hood. However, I would hope that, technically at least, it
> >>wouldn't be much more than a systematic removal of all updates
> >>repositories and redirecting files which would have gone into them into
> >>the main repositories instead. This stems from the observation that if
> >>you copied everything from the main repository into updates you would
> >>have created the repository which people unfamiliar with the history
> >>would expect. The infrastructure side of this, on the face of it, sounds
> >>very simple.
> >
> >What you describe is effectively how the development repository is built,
> >so it's certainly a technical possibility.  It does have implications
> >though.  Off the top of my head, I can think of:
> >
> >1) Composing a new everything tree for updates would lead to larger
> >compose times.  That could possibly mean that getting updates out would
> >take>  1 day per 'push'.  We've been trying to improve updates push
> >times so it would be a bit detrimental to that goal.
> 
> I can't comment on this.
> 
> >2) There might be GPL compliance issues
> 
> This line of reasoning sounds bizarre to me. You can publish things in 
> multiple places.
> 
> >3) You would still need an 'updates-testing' repository given that this
> >is a supposedly stable release.  So there is still going to be at least
> >one additional repo regardless.
> 
> Indeed, however that would only be testers. Most people can ignore it.
> 
> >However, other than 'browsing manually for packages', I'm not really
> >sure what problem you are trying to solve by getting rid of the
> >updates repository.  It would seem like this has quite a bit of cost
> >for relatively little to no real gain?
> 
> Any tool which deals with repositories requires the repo to be 
> configured twice. Off the top of my head I can think of:
> 
> 1. yum
> separate repo and updates-repo in yum.conf.
> 2. livecd tools
> two repos in kickstart
> 3. revisor
> two repos in kickstart
> 4. febootstrap
> two repos on command line
> 
> Given that you almost always want updates, it would an improvement if 
> all of the above could be replaced with a single configuration.

That sounds like something for a yum plugin to solve. eg

Have a URL you can query to get a list of valid repos for a particular
release. Have a YUM plugin implements a virtual repo, and this repo
simply fetches that URL, and then exposes a package set which is the 
union of all repos listed. That way we still have separate YUM repos &
separate metadata hosted on all the servers, but the clients can all
be configured with this single virtual repo


Daniel
-- 
|: Red Hat, Engineering, London   -o-   http://people.redhat.com/berrange/ :|
|: http://libvirt.org  -o-  http://virt-manager.org  -o-  http://ovirt.org :|
|: http://autobuild.org   -o- http://search.cpan.org/~danberr/ :|
|: GnuPG: 7D3B9505  -o-  F3C9 553F A1DA 4AC2 5648 23C1 B3DF F742 7D3B 9505 :|

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, James Cassell wrote:


On Wed, 02 Dec 2009 12:47:17 -0500, Matthew Booth  wrote:




So, create a meta-repo that just says "I'm the combination of these several 
repos over here"?  Of course, it'd require modification of all the programs 
that know how to talk to repos...




it wouldn't require any code changes at all. createrepo/mergerepos can do 
this now and yum interacts with them fine - EXCEPT it gets into mirroring 
issues. :-/


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Adam Williamson
On Wed, 2009-12-02 at 11:06 -0500, Josh Boyer wrote:

> so it's certainly a technical possibility.  It does have implications
> though.  Off the top of my head, I can think of:
> 
> 1) Composing a new everything tree for updates would lead to larger
> compose times.  That could possibly mean that getting updates out would
> take > 1 day per 'push'.  We've been trying to improve updates push
> times so it would be a bit detrimental to that goal.
> 
> 2) There might be GPL compliance issues
> 
> 3) You would still need an 'updates-testing' repository given that this
> is a supposedly stable release.  So there is still going to be at least
> one additional repo regardless.

4) makes it much harder to recover from broken updates by simply pulling
in the original release version of the package. Yes, the correct fix to
this is not to ship broken updates, but we _do_ ship broken updates, and
at least people have this option available currently. (And we have 'yum
downgrade' which mostly automates the procedure if you're lucky.)

-- 
Adam Williamson
Fedora QA Community Monkey
IRC: adamw | Fedora Talk: adamwill AT fedoraproject DOT org
http://www.happyassassin.net

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 17:58 +, Matthew Booth wrote:
> The GA package could be kept around as a separate, static repo nobody 
> uses under normal circumstances. Combining GA+updates into a single repo 
> would not consume additional bandwidth for anybody at all, and only 
> testers would have to do any additional configuration. 

After initial install, a GA+updates repo would cause consumption of more
bandwidth, as the repodata would change on a nearly daily basis causing
rather large repodata files to be downloaded over and over and over
again.  Instead what we have now is a static GA repo that is downloaded
usually once, and a much much much smaller updates repo that is
downloaded more frequently, but since it is much smaller the impact is
much lower.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Jesse Keating wrote:


On Wed, 2009-12-02 at 14:39 +, Matthew Booth wrote:

The separate updates directory has been a pain for as long as I've been
using RHL/Fedora Core/Fedora. It means you have two places to look when
searching for packages manually, and twice as much to configure when
you're configuring yum. It has never benefitted me, or anybody I know,
but it has caught me out on any number of occasions. What's more, nobody
really seems to know why it's like that: it seems it's always been that
way, and nobody ever bother to fix it.


If the real motivation is "I want to manually browse a single directory
for all the packages" I have an alternate proposal.

createrepo has the ability to take a list of packages (as paths) for
input.  I hypothesize that we could place all rpms for a given release
in a single directory (seth will hate this as he wants to split them up
based on first letter of their name for better filesystem performance),


And better webbrowser and webserver performance since having A GIANT list 
eats my webbrowser often.



yet still maintain different repo paths for the different logical
separation of rpms.  One path would be the "Everything" repo, which
would have repodata for the GA versions of the packages.  Another path
would be the "updates" repo, which has repodata for the current set of
stable updates, and a third path would be the "updates-testing" repo,
which has repodata for the current set of testing updates.  All the
repodata would reference files in a central directory (or directory
tree).


the paths don't matter - you just need to generate the lists and feed that 
into SOME sort of metadata.





Of course, I'm papering over the amount of work it would take to modify
our compose tools to perform in this way, and the added work mirrors
would have (they don't normally have to scan the Everything tree for
changes, but now they'd have to scan a giant tree of rpms every rsync to
see if anything changed), and the added complexity of trying to keep
track of which packages are active in the repos and which aren't, and
keeping the central directory pruned of obsolete packages.


The compose tools would just have to make a list of pkgs in what dirs - 
a list they already have.



I guess what I don't see is what the net benefit is here.

the merger of repos is already happening at the yum layer.

-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 13:09 -0500, Seth Vidal wrote:
> I guess what I don't see is what the net benefit is here.

From what I can gather...

A) a single path to look at when manually looking for rpms.

B) only one config entry to mess with when modifying repos for a release
(my alternate proposal does not solve this one)

> 
> the merger of repos is already happening at the yum layer. 

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Nicolas Mailhot
Since people are posting wishes, here is mine:

1. stop shuffling packages from directory to directory as they get
promoted/demoted from release to release

2. have a single authoritative URL for each package

3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
(make sure it works with web infrastructure instead of fighting it)

4. generate indexes of the kojipkgs.fedoraproject.org tree that
represent Fedora X, Fedora X + updates, Fedora X + testing, etc.

-- 
Nicolas Mailhot


signature.asc
Description: Ceci est une partie de message	numériquement signée
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Chris Adams
Once upon a time, Matthew Booth  said:
> The separate updates directory has been a pain for as long as I've been 
> using RHL/Fedora Core/Fedora. It means you have two places to look when 
> searching for packages manually, and twice as much to configure when 
> you're configuring yum.

Are these really significant issues for a significant number of users?

Not many people go looking manually for packages, since there are many
tools to do it easier (yum, PK, repoquery, etc.).

The repos are automatically configured with fedora-release; how often
are you configuring yum?

The only time I have to care about this is if I'm writing a kickstart
file, but it is one extra line (and then I copy the same kickstart base
over and over).
-- 
Chris Adams 
Systems and Network Administrator - HiWAAY Internet Services
I don't speak for anybody but myself - that's enough trouble.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Jesse Keating wrote:


On Wed, 2009-12-02 at 13:09 -0500, Seth Vidal wrote:

I guess what I don't see is what the net benefit is here.


From what I can gather...

A) a single path to look at when manually looking for rpms.


Isn't this, eventually, what the packagedb is supposed to be able to do?


B) only one config entry to mess with when modifying repos for a release
(my alternate proposal does not solve this one)


This one seems pretty minor - you only need to do this once. and most 
people will do it never.


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Nicolas Mailhot wrote:


Since people are posting wishes, here is mine:

1. stop shuffling packages from directory to directory as they get
promoted/demoted from release to release


we sort of do this now with hardlinks - the problem is when  we have to 
resign the pkgs.




2. have a single authoritative URL for each package


packagedb...




3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
(make sure it works with web infrastructure instead of fighting it)


I don't think that would work fine with a lot of our mirrors.



4. generate indexes of the kojipkgs.fedoraproject.org tree that
represent Fedora X, Fedora X + updates, Fedora X + testing, etc.


this is intriguing but expensive on kojipkgs.

-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 15:52 -0500, Seth Vidal wrote:
> Isn't this, eventually, what the packagedb is supposed to be able to
> do?

I gather it's a "ls in a directory" kind of thing, not an interface to
one tool or another kind of thing.  But I could be wrong.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matěj Cepl

Dne 2.12.2009 17:06, Josh Boyer napsal(a):

However, other than 'browsing manually for packages', I'm not really
sure what problem you are trying to solve by getting rid of the
updates repository.  It would seem like this has quite a bit of cost
for relatively little to no real gain?


I am usually too lazy, so I use yumdownloader anyway. So, I really don't 
see any real use case covered.


Matěj

--
http://www.ceplovi.cz/matej/, Jabber: mceplceplovi.cz
GPG Finger: 89EF 4BC6 288A BF43 1BAB  25C3 E09F EF25 D964 84AC

If Patrick Henry thought that taxation without representation was
bad, he should see how bad it is with representation.

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
(on my on tangent...)

On 12/02/2009 12:48 PM, Jesse Keating wrote:
> I hypothesize that we could place all rpms for a given release
> in a single directory (seth will hate this as he wants to split them up
> based on first letter of their name for better filesystem performance),

Ugh, first letter isn't really a great plan anyway. First (few) letters
of a hash of the filename is much better, but obviously hurts browsability. 
Next best is probably something like how a dead-tree dictionary index works;
list everything, split the list up by starting letters evenly, so the
directories (given a really unlikely hypothetical package set) are 

0/  # contains packages named 0 through 3*
4/  # 4 through 9*
a/  # a through ay*
az/ # az through bw*
bx/ # bx through cz*
da/ # da through whatever's next
...

so that every directory has about the same number of things.

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
On 12/02/2009 05:03 PM, Jesse Keating wrote:
> On Wed, 2009-12-02 at 15:52 -0500, Seth Vidal wrote:
>> Isn't this, eventually, what the packagedb is supposed to be able to
>> do?
> 
> I gather it's a "ls in a directory" kind of thing, not an interface to
> one tool or another kind of thing.  But I could be wrong.

Sure, but the better optimization for this is "don't do it".

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
On 12/02/2009 03:53 PM, Seth Vidal wrote:
> On Wed, 2 Dec 2009, Nicolas Mailhot wrote:
>> 3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
>> (make sure it works with web infrastructure instead of fighting it)
> 
> I don't think that would work fine with a lot of our mirrors.

I think it probably /could/ if we put some (relatively major) work in
on the server side to make it look like the directories it isn't. Not
that I'm endorsing such a plan.

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matt Domsch
On Wed, Dec 02, 2009 at 02:03:51PM -0800, Jesse Keating wrote:
> On Wed, 2009-12-02 at 15:52 -0500, Seth Vidal wrote:
> > Isn't this, eventually, what the packagedb is supposed to be able to
> > do?
> 
> I gather it's a "ls in a directory" kind of thing, not an interface to
> one tool or another kind of thing.  But I could be wrong.

We're at a point where 'ls in a directory' is becoming difficult even;
you can't glob 15k package names in a shell.

find .   is your friend.

-- 
Matt Domsch
Technology Strategist, Dell Office of the CTO
linux.dell.com & www.dell.com/linux

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matt Domsch
On Wed, Dec 02, 2009 at 07:35:03PM +0100, Nicolas Mailhot wrote:
> 3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
> (make sure it works with web infrastructure instead of fighting it)

Sorry, I don't understand this.  Can you elaborate?  That would help
me scope the impact to MirrorManager.

Thanks,
Matt

-- 
Matt Domsch
Technology Strategist, Dell Office of the CTO
linux.dell.com & www.dell.com/linux

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Peter Jones wrote:


(on my on tangent...)

On 12/02/2009 12:48 PM, Jesse Keating wrote:

I hypothesize that we could place all rpms for a given release
in a single directory (seth will hate this as he wants to split them up
based on first letter of their name for better filesystem performance),


Ugh, first letter isn't really a great plan anyway. First (few) letters
of a hash of the filename is much better, but obviously hurts browsability.
Next best is probably something like how a dead-tree dictionary index works;
list everything, split the list up by starting letters evenly, so the
directories (given a really unlikely hypothetical package set) are

0/  # contains packages named 0 through 3*
4/  # 4 through 9*
a/  # a through ay*
az/ # az through bw*
bx/ # bx through cz*
da/ # da through whatever's next
...

so that every directory has about the same number of things.


If you're looking for perfect division, sure - but the reality is this:

19K items in a single dir and ext3 and nfs and many many other things crap 
themselves returning that list.


If you make 36 subdirs (26+10) performance gets DRAMATICALLY better for 
producing the same list of files.


I tested it on our backend to be sure. getting the complete pkglist goes 
from taking 5 minutes to take 30s.


yes, I said 5 minutes.

-sv





--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matt Domsch
On Wed, Dec 02, 2009 at 11:09:41AM -0500, Bill Nottingham wrote:
> The separate Everything tree that does not get obsoleted is required
> in some form for GPL compliance, with respect to the ISO images that
> we ship. Any new solution would have to preserve this.

?  We provide Fedora-*-source-disc*.iso and Fedora-*-source-DVD.iso
images on the mirrors already.

-- 
Matt Domsch
Technology Strategist, Dell Office of the CTO
linux.dell.com & www.dell.com/linux

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Peter Jones wrote:


On 12/02/2009 03:53 PM, Seth Vidal wrote:

On Wed, 2 Dec 2009, Nicolas Mailhot wrote:

3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
(make sure it works with web infrastructure instead of fighting it)


I don't think that would work fine with a lot of our mirrors.


I think it probably /could/ if we put some (relatively major) work in
on the server side to make it look like the directories it isn't. Not
that I'm endorsing such a plan.


we'd have to make all our mirrors use that.

not gonna fly.

-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
On 12/02/2009 05:58 PM, Seth Vidal wrote:
> 
> 
> On Wed, 2 Dec 2009, Peter Jones wrote:
> 
>> On 12/02/2009 03:53 PM, Seth Vidal wrote:
>>> On Wed, 2 Dec 2009, Nicolas Mailhot wrote:
 3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
 (make sure it works with web infrastructure instead of fighting it)
>>>
>>> I don't think that would work fine with a lot of our mirrors.
>>
>> I think it probably /could/ if we put some (relatively major) work in
>> on the server side to make it look like the directories it isn't. Not
>> that I'm endorsing such a plan.
> 
> we'd have to make all our mirrors use that.
> 
> not gonna fly.

Well, you just wouldn't get the benefits of this if you're using a mirror.

Which, yes, is pretty lame.

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
On 12/02/2009 05:58 PM, Seth Vidal wrote:
> 
> 
> On Wed, 2 Dec 2009, Peter Jones wrote:
> 
>> (on my on tangent...)
>>
>> On 12/02/2009 12:48 PM, Jesse Keating wrote:
>>> I hypothesize that we could place all rpms for a given release
>>> in a single directory (seth will hate this as he wants to split them up
>>> based on first letter of their name for better filesystem performance),
>>
>> Ugh, first letter isn't really a great plan anyway. First (few) letters
>> of a hash of the filename is much better, but obviously hurts
>> browsability.
>> Next best is probably something like how a dead-tree dictionary index
>> works;
>> list everything, split the list up by starting letters evenly, so the
>> directories (given a really unlikely hypothetical package set) are
>>
>> 0/# contains packages named 0 through 3*
>> 4/# 4 through 9*
>> a/# a through ay*
>> az/# az through bw*
>> bx/# bx through cz*
>> da/# da through whatever's next
>> ...
>>
>> so that every directory has about the same number of things.
> 
> If you're looking for perfect division, sure - but the reality is this:
> 
> 19K items in a single dir and ext3 and nfs and many many other things
> crap themselves returning that list.
> 
> If you make 36 subdirs (26+10) performance gets DRAMATICALLY better for
> producing the same list of files.
> 
> I tested it on our backend to be sure. getting the complete pkglist goes
> from taking 5 minutes to take 30s.
> 
> yes, I said 5 minutes.

Yeah, I buy that.

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Peter Jones wrote:


On 12/02/2009 05:58 PM, Seth Vidal wrote:



On Wed, 2 Dec 2009, Peter Jones wrote:


On 12/02/2009 03:53 PM, Seth Vidal wrote:

On Wed, 2 Dec 2009, Nicolas Mailhot wrote:

3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
(make sure it works with web infrastructure instead of fighting it)


I don't think that would work fine with a lot of our mirrors.


I think it probably /could/ if we put some (relatively major) work in
on the server side to make it look like the directories it isn't. Not
that I'm endorsing such a plan.


we'd have to make all our mirrors use that.

not gonna fly.


Well, you just wouldn't get the benefits of this if you're using a mirror.

Which, yes, is pretty lame.


we won't get the benefits in fedora then. Our mirror infrastructure is a 
HUGE reason why we can do what we do.


If we can't farm things out to our mirrors then we might as well go home 
b/c our users will NEVER be able to get our bits.


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread James Antill
On Wed, 2009-12-02 at 17:46 -0500, Peter Jones wrote:
> (on my on tangent...)
> 
> On 12/02/2009 12:48 PM, Jesse Keating wrote:
> > I hypothesize that we could place all rpms for a given release
> > in a single directory (seth will hate this as he wants to split them up
> > based on first letter of their name for better filesystem performance),
> 
> Ugh, first letter isn't really a great plan anyway.

 It's not "great" but it's better than the current "all in one dir." and
easy to do, and I haven't heard of any downsides (that aren't applicable
to "all in one dir.").

>  First (few) letters
> of a hash of the filename is much better, but obviously hurts browsability.

 Right, which is important enough to not do that IMO. I'm sure Jesse
wouldn't like finding packages to mean using find.

> Next best is probably something like how a dead-tree dictionary index works;
> list everything, split the list up by starting letters evenly, so the
> directories (given a really unlikely hypothetical package set) are 
> 
> 0/# contains packages named 0 through 3*
> 4/# 4 through 9*
> a/# a through ay*
> az/   # az through bw*
> bx/   # bx through cz*
> da/   # da through whatever's next
> ...
> 
> so that every directory has about the same number of things.

 This should be fairly easy to code, but has a big downside:

 Packages will move directories.

1. This will upset yum (old repo. MD == no packages download).
2. This might upset mirrors.
3. This will probably upset users:
i. URLs will only be safe until the next mash, they
won't even be able to bookmark /y if they just
want to see yum each time.
ii. Users have to look/parse the directory layout each
time they visit to see what "blah" is.

-- 
James Antill - ja...@fedoraproject.org
http://yum.baseurl.org/wiki/releases
http://yum.baseurl.org/wiki/whatsnew/3.2.25
http://yum.baseurl.org/wiki/YumMultipleMachineCaching

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Peter Jones
On 12/02/2009 06:05 PM, James Antill wrote:
> On Wed, 2009-12-02 at 17:46 -0500, Peter Jones wrote:
>> so that every directory has about the same number of things.
> 
>  This should be fairly easy to code, but has a big downside:
> 
>  Packages will move directories.
> 
> 1. This will upset yum (old repo. MD == no packages download).
> 2. This might upset mirrors.
> 3. This will probably upset users:
> i. URLs will only be safe until the next mash, they
> won't even be able to bookmark /y if they just
> want to see yum each time.
> ii. Users have to look/parse the directory layout each
> time they visit to see what "blah" is.

Yeah. Seth makes a good point - first letter is such vast improvement that
we probably don't need to worry about doing better - at least for now.

-- 
Peter

I'd like to start a religion. That's where the money is.
-- L. Ron Hubbard to Lloyd Eshbach, in 1949;
quoted by Eshbach in _Over My Shoulder_.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Jesse Keating
On Wed, 2009-12-02 at 16:58 -0600, Matt Domsch wrote:
> On Wed, Dec 02, 2009 at 11:09:41AM -0500, Bill Nottingham wrote:
> > The separate Everything tree that does not get obsoleted is required
> > in some form for GPL compliance, with respect to the ISO images that
> > we ship. Any new solution would have to preserve this.
> 
> ?  We provide Fedora-*-source-disc*.iso and Fedora-*-source-DVD.iso
> images on the mirrors already.
> 

As stated elsewhere, that doesn't carry the packages found on all the
live images we produce.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Matt Domsch
On Wed, Dec 02, 2009 at 03:08:06PM -0800, Jesse Keating wrote:
> On Wed, 2009-12-02 at 16:58 -0600, Matt Domsch wrote:
> > On Wed, Dec 02, 2009 at 11:09:41AM -0500, Bill Nottingham wrote:
> > > The separate Everything tree that does not get obsoleted is required
> > > in some form for GPL compliance, with respect to the ISO images that
> > > we ship. Any new solution would have to preserve this.
> > 
> > ?  We provide Fedora-*-source-disc*.iso and Fedora-*-source-DVD.iso
> > images on the mirrors already.
> > 
> 
> As stated elsewhere, that doesn't carry the packages found on all the
> live images we produce.

All the more reason to put more effort into 
  http://git.fedorahosted.org/git/correspondingsource.git


-- 
Matt Domsch
Technology Strategist, Dell Office of the CTO
linux.dell.com & www.dell.com/linux

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Bruno Wolff III
On Wed, Dec 02, 2009 at 17:58:24 -0500,
  Seth Vidal  wrote:
> 
> I tested it on our backend to be sure. getting the complete pkglist
> goes from taking 5 minutes to take 30s.
> 
> yes, I said 5 minutes.

Have you tried any of the tunning knobs to have the directory cache be
alotted more space or given higher priority? For example:

vfs_cache_pressure
--

Controls the tendency of the kernel to reclaim the memory which is used for
caching of directory and inode objects.

At the default value of vfs_cache_pressure=100 the kernel will attempt to
reclaim dentries and inodes at a "fair" rate with respect to pagecache and
swapcache reclaim.  Decreasing vfs_cache_pressure causes the kernel to prefer
to retain dentry and inode caches.  Increasing vfs_cache_pressure beyond 100
causes the kernel to prefer to reclaim dentries and inodes.

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Wed, 2 Dec 2009, Bruno Wolff III wrote:


On Wed, Dec 02, 2009 at 17:58:24 -0500,
 Seth Vidal  wrote:


I tested it on our backend to be sure. getting the complete pkglist
goes from taking 5 minutes to take 30s.

yes, I said 5 minutes.


Have you tried any of the tunning knobs to have the directory cache be
alotted more space or given higher priority? For example:

vfs_cache_pressure
--

Controls the tendency of the kernel to reclaim the memory which is used for
caching of directory and inode objects.

At the default value of vfs_cache_pressure=100 the kernel will attempt to
reclaim dentries and inodes at a "fair" rate with respect to pagecache and
swapcache reclaim.  Decreasing vfs_cache_pressure causes the kernel to prefer
to retain dentry and inode caches.  Increasing vfs_cache_pressure beyond 100
causes the kernel to prefer to reclaim dentries and inodes.


Not tried that - worth giving it a shot - but it still won't help the 
'holy crap there are 15K items on this webpage and it won't render' 
problem.


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Kevin Kofler
Seth Vidal wrote:
> If you're looking for perfect division, sure - but the reality is this:
> 
> 19K items in a single dir and ext3 and nfs and many many other things crap
> themselves returning that list.
> 
> If you make 36 subdirs (26+10) performance gets DRAMATICALLY better for
> producing the same list of files.

The problem is, a few of those starting letters still correspond to A LOT of 
packages, e.g. p* matches perl-*, python-* and php-* and that's a lot of 
stuff (especially Perl and Python). And adding python3-* (and perl6-*, or 
are we going to use the rakudo-* namespace there?) won't make it any less.

Kevin Kofler

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Thu, 3 Dec 2009, Kevin Kofler wrote:


Seth Vidal wrote:

If you're looking for perfect division, sure - but the reality is this:

19K items in a single dir and ext3 and nfs and many many other things crap
themselves returning that list.

If you make 36 subdirs (26+10) performance gets DRAMATICALLY better for
producing the same list of files.


The problem is, a few of those starting letters still correspond to A LOT of
packages, e.g. p* matches perl-*, python-* and php-* and that's a lot of
stuff (especially Perl and Python). And adding python3-* (and perl6-*, or
are we going to use the rakudo-* namespace there?) won't make it any less.




And yet, when tested, just making those 36 subdirs made a HUGE difference.

What I've found is getting any one dir down below 3K entries makes things 
faster, by a lot.


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Josh Boyer
On Wed, Dec 02, 2009 at 12:01:49PM -0500, Casey Dahlin wrote:
>
>You owe me $5.

It hasn't been a week and you haven't sent me your address.
I did notice though, so keep up the good work ;)

josh

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Mike McGrath
On Wed, 2 Dec 2009, Bruno Wolff III wrote:

> On Wed, Dec 02, 2009 at 17:58:24 -0500,
>   Seth Vidal  wrote:
> >
> > I tested it on our backend to be sure. getting the complete pkglist
> > goes from taking 5 minutes to take 30s.
> >
> > yes, I said 5 minutes.
>
> Have you tried any of the tunning knobs to have the directory cache be
> alotted more space or given higher priority? For example:
>
> vfs_cache_pressure
> --
>
> Controls the tendency of the kernel to reclaim the memory which is used for
> caching of directory and inode objects.
>
> At the default value of vfs_cache_pressure=100 the kernel will attempt to
> reclaim dentries and inodes at a "fair" rate with respect to pagecache and
> swapcache reclaim.  Decreasing vfs_cache_pressure causes the kernel to prefer
> to retain dentry and inode caches.  Increasing vfs_cache_pressure beyond 100
> causes the kernel to prefer to reclaim dentries and inodes.
>

We have experimented with this for the purposes of rsync on our mirrors
but haven't experimented for this specific issue.

-Mike

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 06:40 PM, Jesse Keating wrote:

On Wed, 2009-12-02 at 18:09 +0100, Ralf Corsepius wrote:


* It shifts "costs" from "users" to "vendor"
and from "mirrors" to "master".
* It helps users who are using networked installs to spare bandwidth
(avoids downloading obsolete packages from "Everything"/"Fedora").

Admitted, for most users, it would change almost nothing.




People doing network installs can either add the updates repo to their
kickstart, or check the box in the anaconda UI, so that the updates
repos are considered at install time.  No download of duplicate data.
Yes, for people who are doing "full featured networked installs" w/ 
custom kickstart files. I've never met such a person.



In fact, having separate repos would likely cost less bandwidth.  If we
only had one combined repo, there would be many duplicate packages,
Where? Unlike now, where you have each package twice (in Everything and 
"updates"), you would have each package only once: In Everything.


It would help people like me, who are locally reusing downloaded 
packages in a networked environment, instead of letting each machine 
accesses the original repos directly.



especially if we went the route of having updates-testing mixed in and
only marked by some update tag.

Updates-testing is an add-on repo to "Everything+updates".

A merged new Everything would not change anything wrt. 
"updates-testing". The only difference would be packages being pushed to 
"Everything" instead of "updates".



We'd have to keep sets of what's in
updates-testing, updates, and the GA release set, and all of that would
be in one repodata set.  Everybody doing a network install, whether they
wanted updates, updates-testing, or not would have to download and
consume that larger repodata, introducing a higher cost for them.

Your concern is the bigger repodata?

Reality check:
# ls -h -s releases/11/Everything/x86_64/os/repodata/*.sqlite.bz2 
updates/11/x86_64/repodata/*.sqlite.bz2


 16M 
releases/11/Everything/x86_64/os/repodata/0301ed1cbf01c00cdf9c42b71df6c74947d9c76a3c9767e8f85a99ef6fdccb86-filelists.sqlite.bz2
 11M 
releases/11/Everything/x86_64/os/repodata/6a8bfab8ebcbc79f9827f5b16bc1bd1573c068f141bf47c6f216e72dd8b60ff0-primary.sqlite.bz2
5.8M 
releases/11/Everything/x86_64/os/repodata/d3405c970a0c27e9dc3fd3ed179af33fee9a72bf704f45f1ed96a9063b40d7c7-other.sqlite.bz2


9.0M 
updates/11/x86_64/repodata/3a725e07adff9d7e56e7fd8bd3091f38107723987b3b614220df72494dc6814d-filelists.sqlite.bz2
6.0M 
updates/11/x86_64/repodata/54ca116c2a00e87eaca6491adb9e077f4d3f0d5b20e7cbab984774aefb042d0f-primary.sqlite.bz2
3.2M 
updates/11/x86_64/repodata/646eb24cfe113de569853a4e05f55773b3ad9531dee646e7d843b8dc4a849780-other.sqlite.bz2


=> An estimate for the increase in downloaded files's sizes you are 
talking about is ca. factor 2, from 18.2M (current "updates")

to 32.8M+ (current "Everything"+"newly introduced packages)


Whether this increase in download-size is "significant" is up to the 
beholder. For me, it gets lost in the noise of accessing a "good" or a 
"bad" connection to a mirror and in the time required to download 
packages from mirrors.



On the other hand, the content (the packages referenced inside) of 
"updates" overlaps with packages in Everything
=> The number of packages to be considered in dep-computation should 
become much (?) smaller. However, I am not knowledgable enough with 
yum's internals to estimate the impact this would have on 
turnaround-times, memory and diskspace requirements this would have on yum.



Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/02/2009 07:09 PM, Seth Vidal wrote:


the merger of repos is already happening at the yum layer.


On the client's side - With a combined Everything+updates, this would 
happen on the server side.


It's one of the aspects which made me said "a combined 
Everything+updates shifts costs from client to server".


Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Seth Vidal



On Thu, 3 Dec 2009, Ralf Corsepius wrote:


On 12/02/2009 07:09 PM, Seth Vidal wrote:


the merger of repos is already happening at the yum layer.


On the client's side - With a combined Everything+updates, this would happen 
on the server side.


It's one of the aspects which made me said "a combined Everything+updates 
shifts costs from client to server".


except they wouldn't be merged on the server side -it would just be 
additive.



We wouldn't be talking about removing the original GA set - just adding 
updated pkgs into the path. So you'd still have the number of pkgs -just 
all in one repo, that you have to download all of the metadata for all of 
the more often, despite that 15K of them don't CHANGE.


this is why it is less good for our users.

-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread Ralf Corsepius

On 12/03/2009 06:32 AM, Seth Vidal wrote:



On Thu, 3 Dec 2009, Ralf Corsepius wrote:


On 12/02/2009 07:09 PM, Seth Vidal wrote:


the merger of repos is already happening at the yum layer.


On the client's side - With a combined Everything+updates, this would
happen on the server side.

It's one of the aspects which made me said "a combined
Everything+updates shifts costs from client to server".


except they wouldn't be merged on the server side -it would just be
additive.


We wouldn't be talking about removing the original GA set - just adding
updated pkgs into the path.


Woa!!! With all due respect, but this would seem an stupid and silly 
plan to me.



So you'd still have the number of pkgs -just
all in one repo, that you have to download all of the metadata for all
of the more often, despite that 15K of them don't CHANGE.

this is why it is less good for our users.


Yes - cf. above. I have nothing to add.

Ralf


--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-02 Thread shmuel siegel

Ralf Corsepius wrote:

Your concern is the bigger repodata?

My download of repodata towards the end of a release, or from rawhide, 
is usually bigger than my download of packages. So yes, this would make 
a difference. On the otherhand, there probably could be a repodiff that 
would alleviate a lot of this problem.


--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Adam Williamson
On Thu, 2009-12-03 at 00:32 -0500, Seth Vidal wrote:

> We wouldn't be talking about removing the original GA set - just adding 
> updated pkgs into the path. So you'd still have the number of pkgs -just 
> all in one repo, that you have to download all of the metadata for all of 
> the more often, despite that 15K of them don't CHANGE.

I don't think that was actually made clear in the initial proposal. I'd
been assuming that the proposal was _exactly_ to remove the GA set.
Usually, when a newer package shows up in any given repository, we don't
keep the previous version of the package, do we? So I assumed the
proposal was expecting that behaviour for the combined repository.

-- 
Adam Williamson
Fedora QA Community Monkey
IRC: adamw | Fedora Talk: adamwill AT fedoraproject DOT org
http://www.happyassassin.net

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Emmanuel Seyman
* Adam Williamson [03/12/2009 10:10] :
>
> I don't think that was actually made clear in the initial proposal. I'd
> been assuming that the proposal was _exactly_ to remove the GA set.

No can do.
People who install from the netinst CD or do PXE installs without adding
the updates repo during the installation would be unable to finish it.

Emmanuel

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Nicolas Mailhot


Le Mer 2 décembre 2009 23:56, Matt Domsch a écrit :
>
> On Wed, Dec 02, 2009 at 07:35:03PM +0100, Nicolas Mailhot wrote:
>> 3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
>> (make sure it works with web infrastructure instead of fighting it)
>
> Sorry, I don't understand this.  Can you elaborate?  That would help
> me scope the impact to MirrorManager.

Right now the same package moves from master URL to master URL as it is pushed
from testing to updates to GA to whatever. That means the same package gets
downloaded many times over because it changed URL (and browsers, proxies, etc
understand new url = new file)

My proposal is to never move a package, put it in a single unchanging place
after a koji build, and just index it or not in the relevant repodata when it
gets promoted or demoted.

-- 
Nicolas Mailhot


-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Andreas Schwab
Matt Domsch  writes:

> We're at a point where 'ls in a directory' is becoming difficult even;
> you can't glob 15k package names in a shell.

Recent kernels have no argv limit any more.

Andreas.

-- 
Andreas Schwab, sch...@redhat.com
GPG Key fingerprint = D4E8 DBE3 3813 BB5D FA84  5EC7 45C6 250E 6F00 984E
"And now for something completely different."

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Seth Vidal



On Thu, 3 Dec 2009, Adam Williamson wrote:


On Thu, 2009-12-03 at 00:32 -0500, Seth Vidal wrote:


We wouldn't be talking about removing the original GA set - just adding
updated pkgs into the path. So you'd still have the number of pkgs -just
all in one repo, that you have to download all of the metadata for all of
the more often, despite that 15K of them don't CHANGE.


I don't think that was actually made clear in the initial proposal. I'd
been assuming that the proposal was _exactly_ to remove the GA set.
Usually, when a newer package shows up in any given repository, we don't
keep the previous version of the package, do we? So I assumed the
proposal was expecting that behaviour for the combined repository.


From a QA standpoint I'd think you'd want at least one known-installable 
set of pkgs. Since everything after the original GA set is a giant 
questionmark.


Not to mention that removing all the old pkgs would more or less make 
deltarpms very difficult.


-sv

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread James Antill
On Thu, 2009-12-03 at 10:29 +0100, Nicolas Mailhot wrote:
> 
> Le Mer 2 décembre 2009 23:56, Matt Domsch a écrit :
> >
> > On Wed, Dec 02, 2009 at 07:35:03PM +0100, Nicolas Mailhot wrote:
> >> 3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
> >> (make sure it works with web infrastructure instead of fighting it)
> >
> > Sorry, I don't understand this.  Can you elaborate?  That would help
> > me scope the impact to MirrorManager.
> 
> Right now the same package moves from master URL to master URL as it is pushed
> from testing to updates to GA to whatever. That means the same package gets
> downloaded many times over because it changed URL (and browsers, proxies, etc
> understand new url = new file)

 It only moves once, at least in the vast majority of cases.

> My proposal is to never move a package, put it in a single unchanging place
> after a koji build, and just index it or not in the relevant repodata when it
> gets promoted or demoted.

 One problem is that atm. when it moves from testing to updates, or
rawhide to GA, it also gets resigned with a different key ... so the
bits are not the same. Having the URL be the same will not be helpful
until that changes.

-- 
James Antill - ja...@fedoraproject.org
http://yum.baseurl.org/wiki/releases
http://yum.baseurl.org/wiki/whatsnew/3.2.25
http://yum.baseurl.org/wiki/YumMultipleMachineCaching

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Liang Suilong
I think that idea maybe isn't benefit with repository.

If updates repository is merged into Everything repository, Will metadata
files become too large? I know that the size of metadatas on updates and
everything are more than 30 megabytes. If these two repositories compose, We
will need download more than 30MB files before updating. I believe that it
will decrease the advantage of delrarpm update. It will increase more time
to download the files.

Is there any way to make createrepo generate delta metadata files? Just like
delta rpm, we just need to get what meta files are changed, then yum
generates the entire metadata files. We exactly not need and want to
download the big files.

Additionally, if an user is far away from repository server, though ISP
provides quite a good 10Mbps connection, it is still quite slow to install
some big applications, such OpenOffice.org, Eclipse, Netbeans, KOffice and
some 3D games. Does yum allow to download more than one rpm files with only
one thread per files after checking dependencies? Or using a yum plugin to
replace yum default downloader by axel to download one rpm files with multi
threads. I ever found out this kind of plugin, but I can assure it can run
in Fedora 12.

-- 
My Page: http://www.liangsuilong.info
Fedora Project Contributor -- Packager && Ambassador
https://fedoraproject.org/wiki/User:Liangsuilong
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Jesse Keating



On Dec 3, 2009, at 5:28, James Antill  wrote:


On Thu, 2009-12-03 at 10:29 +0100, Nicolas Mailhot wrote:


Le Mer 2 décembre 2009 23:56, Matt Domsch a écrit :


On Wed, Dec 02, 2009 at 07:35:03PM +0100, Nicolas Mailhot wrote:

3. replace static mirrors with proxy-ing of kojipkgs.fedoraproject.org
(make sure it works with web infrastructure instead of fighting it)


Sorry, I don't understand this.  Can you elaborate?  That would help
me scope the impact to MirrorManager.


Right now the same package moves from master URL to master URL as  
it is pushed
from testing to updates to GA to whatever. That means the same  
package gets
downloaded many times over because it changed URL (and browsers,  
proxies, etc

understand new url = new file)


It only moves once, at least in the vast majority of cases.

My proposal is to never move a package, put it in a single  
unchanging place
after a koji build, and just index it or not in the relevant  
repodata when it

gets promoted or demoted.


One problem is that atm. when it moves from testing to updates, or
rawhide to GA, it also gets resigned with a different key ... so the
bits are not the same. Having the URL be the same will not be helpful
until that changes.

--



That is no longer true. We used a single key for all of f11 and a  
single key for all of f12.


--
Jes

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Adam Williamson
On Thu, 2009-12-03 at 08:20 -0500, Seth Vidal wrote:

> >> We wouldn't be talking about removing the original GA set - just adding
> >> updated pkgs into the path. So you'd still have the number of pkgs -just
> >> all in one repo, that you have to download all of the metadata for all of
> >> the more often, despite that 15K of them don't CHANGE.
> >
> > I don't think that was actually made clear in the initial proposal. I'd
> > been assuming that the proposal was _exactly_ to remove the GA set.
> > Usually, when a newer package shows up in any given repository, we don't
> > keep the previous version of the package, do we? So I assumed the
> > proposal was expecting that behaviour for the combined repository.
> 
> >From a QA standpoint I'd think you'd want at least one known-installable 
> set of pkgs. Since everything after the original GA set is a giant 
> questionmark.
> 
> Not to mention that removing all the old pkgs would more or less make 
> deltarpms very difficult.

I'm not saying I support the proposal, I don't, I think it's a waste of
effort for no benefit. I was just clarifying the initial
characterization. Actually I think the initial proposer _was_ expecting
to remove initial packages when updates become available, because they
keep saying stuff like 'only historians are interested in the GA
packages'.

-- 
Adam Williamson
Fedora QA Community Monkey
IRC: adamw | Fedora Talk: adamwill AT fedoraproject DOT org
http://www.happyassassin.net

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Peter Jones
On 12/03/2009 12:24 AM, Ralf Corsepius wrote:
> On 12/02/2009 06:40 PM, Jesse Keating wrote:
>> People doing network installs can either add the updates repo to their
>> kickstart, or check the box in the anaconda UI, so that the updates
>> repos are considered at install time.  No download of duplicate data.
> Yes, for people who are doing "full featured networked installs" w/
> custom kickstart files. I've never met such a person.

Really? This very much seems to me like it's a vast majority of
kickstarts that happen.

-- 
Peter

Power corrupts.  Absolute power is kind of neat.
-- John Lehman, Secretary of the Navy, 1981-1987

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Peter Jones
On 12/03/2009 08:20 AM, Seth Vidal wrote:
> 
> 
> On Thu, 3 Dec 2009, Adam Williamson wrote:
> 
>> On Thu, 2009-12-03 at 00:32 -0500, Seth Vidal wrote:
>>
>>> We wouldn't be talking about removing the original GA set - just adding
>>> updated pkgs into the path. So you'd still have the number of pkgs -just
>>> all in one repo, that you have to download all of the metadata for
>>> all of
>>> the more often, despite that 15K of them don't CHANGE.
>>
>> I don't think that was actually made clear in the initial proposal. I'd
>> been assuming that the proposal was _exactly_ to remove the GA set.
>> Usually, when a newer package shows up in any given repository, we don't
>> keep the previous version of the package, do we? So I assumed the
>> proposal was expecting that behaviour for the combined repository.
> 
>> From a QA standpoint I'd think you'd want at least one known-installable 
> set of pkgs. Since everything after the original GA set is a giant
> questionmark.
> 
> Not to mention that removing all the old pkgs would more or less make
> deltarpms very difficult.

Well, if we're talking about removing them from Fedora/ but leaving
them in Everything/ (am I understanding the current form of the proposal
correctly?), then it's not really significantly more difficult,
but it is one more process that needs modification in order to enact
such a plan.

-- 
Peter

If the facts don't fit the theory, change the facts.
-- Einstein

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-03 Thread Peter Jones
On 12/02/2009 09:12 PM, Kevin Kofler wrote:
> Seth Vidal wrote:
>> If you're looking for perfect division, sure - but the reality is this:
>>
>> 19K items in a single dir and ext3 and nfs and many many other things crap
>> themselves returning that list.
>>
>> If you make 36 subdirs (26+10) performance gets DRAMATICALLY better for
>> producing the same list of files.
> 
> The problem is, a few of those starting letters still correspond to A LOT of 
> packages, e.g. p* matches perl-*, python-* and php-* and that's a lot of 
> stuff (especially Perl and Python). And adding python3-* (and perl6-*, or 
> are we going to use the rakudo-* namespace there?) won't make it any less.

Even still, separating "p" out to a subdirectory means that subdirectory
has an order of magnitude fewer files than the previous way. That's a
really big difference.

-- 
Peter

If the facts don't fit the theory, change the facts.
-- Einstein

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-04 Thread Jesse Keating
On Thu, 2009-12-03 at 06:51 +0100, Ralf Corsepius wrote:
> > We wouldn't be talking about removing the original GA set - just adding
> > updated pkgs into the path.
> 
> Woa!!! With all due respect, but this would seem an stupid and silly 
> plan to me. 

The only way not to do that would be to maintain yet another directory
of packages that matches the GA set, for GPL compliance.  Now we're just
getting silly.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-04 Thread Jesse Keating
On Thu, 2009-12-03 at 06:24 +0100, Ralf Corsepius wrote:
> > People doing network installs can either add the updates repo to their
> > kickstart, or check the box in the anaconda UI, so that the updates
> > repos are considered at install time.  No download of duplicate data.
> Yes, for people who are doing "full featured networked installs" w/ 
> custom kickstart files. I've never met such a person.

Really?  I meet people who use kickstart all the time.  Any sizable
deployment of Fedora/RHEL uses or should use kickstart.  And those that
don't aren't afraid to check that little 'updates' box at the software
selection screen.  You seemed to have ignored that part of my point.

> 
> > In fact, having separate repos would likely cost less bandwidth.  If we
> > only had one combined repo, there would be many duplicate packages,
> Where? Unlike now, where you have each package twice (in Everything and 
> "updates"), you would have each package only once: In Everything.

That assumes we purge anything but the latest version of a package,
which as noted in other parts of this thread gets complicated with GPL
compliance.

> 
> It would help people like me, who are locally reusing downloaded 
> packages in a networked environment, instead of letting each machine 
> accesses the original repos directly.

Surely a caching proxy server doesn't care what folder a file comes
from, it'll cache it.  Whether the new file shows up in Everything/ or
the new file shows up in updates/ shouldn't matter.

> 
> > especially if we went the route of having updates-testing mixed in and
> > only marked by some update tag.
> Updates-testing is an add-on repo to "Everything+updates".
> 
> A merged new Everything would not change anything wrt. 
> "updates-testing". The only difference would be packages being pushed to 
> "Everything" instead of "updates".
> 
> > We'd have to keep sets of what's in
> > updates-testing, updates, and the GA release set, and all of that would
> > be in one repodata set.  Everybody doing a network install, whether they
> > wanted updates, updates-testing, or not would have to download and
> > consume that larger repodata, introducing a higher cost for them.
> Your concern is the bigger repodata?
> 
> Reality check:
> # ls -h -s releases/11/Everything/x86_64/os/repodata/*.sqlite.bz2 
> updates/11/x86_64/repodata/*.sqlite.bz2
> 
>   16M 
> releases/11/Everything/x86_64/os/repodata/0301ed1cbf01c00cdf9c42b71df6c74947d9c76a3c9767e8f85a99ef6fdccb86-filelists.sqlite.bz2
>   11M 
> releases/11/Everything/x86_64/os/repodata/6a8bfab8ebcbc79f9827f5b16bc1bd1573c068f141bf47c6f216e72dd8b60ff0-primary.sqlite.bz2
> 5.8M 
> releases/11/Everything/x86_64/os/repodata/d3405c970a0c27e9dc3fd3ed179af33fee9a72bf704f45f1ed96a9063b40d7c7-other.sqlite.bz2
> 
> 9.0M 
> updates/11/x86_64/repodata/3a725e07adff9d7e56e7fd8bd3091f38107723987b3b614220df72494dc6814d-filelists.sqlite.bz2
> 6.0M 
> updates/11/x86_64/repodata/54ca116c2a00e87eaca6491adb9e077f4d3f0d5b20e7cbab984774aefb042d0f-primary.sqlite.bz2
> 3.2M 
> updates/11/x86_64/repodata/646eb24cfe113de569853a4e05f55773b3ad9531dee646e7d843b8dc4a849780-other.sqlite.bz2
> 
> => An estimate for the increase in downloaded files's sizes you are 
> talking about is ca. factor 2, from 18.2M (current "updates")
> to 32.8M+ (current "Everything"+"newly introduced packages)
> 
> 
> Whether this increase in download-size is "significant" is up to the 
> beholder. For me, it gets lost in the noise of accessing a "good" or a 
> "bad" connection to a mirror and in the time required to download 
> packages from mirrors.

33~ megs downloaded every single time an update is pushed is a
significant hit for a fair number of people.  That was why it was
important to make yum not re-fetch that repodata every time, and use a
cached version of it.

-- 
Jesse Keating
Fedora -- Freedom² is a feature!
identi.ca: http://identi.ca/jkeating


signature.asc
Description: This is a digitally signed message part
-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list

Re: Proposed F13 feature: drop separate updates repository

2009-12-04 Thread Ralf Corsepius

On 12/03/2009 07:22 AM, Jesse Keating wrote:

On Thu, 2009-12-03 at 06:24 +0100, Ralf Corsepius wrote:

People doing network installs can either add the updates repo to their
kickstart, or check the box in the anaconda UI, so that the updates
repos are considered at install time.  No download of duplicate data.

Yes, for people who are doing "full featured networked installs" w/
custom kickstart files. I've never met such a person.


Really?  I meet people who use kickstart all the time.

May-be internal at RH?


 Any sizable
deployment of Fedora/RHEL uses or should use kickstart.  And those that
don't aren't afraid to check that little 'updates' box at the software
selection screen.  You seemed to have ignored that part of my point.
No, I didn't. It's just that unless this "little check button" is the 
default, many users will ignore it or as in my case ... I am normally 
"yum-upgrading" between distros and rarely use anaconda.



In fact, having separate repos would likely cost less bandwidth.  If we
only had one combined repo, there would be many duplicate packages,

Where? Unlike now, where you have each package twice (in Everything and
"updates"), you would have each package only once: In Everything.


That assumes we purge anything but the latest version of a package,

Correct.


which as noted in other parts of this thread gets complicated with GPL
compliance.
Not necessarily - E.g. it would be GPL compliant to store "purged 
packages" outside of the "current" repos.


And whether "spins" and the way they currently are implemented is 
"good"/"feasible"/"reasonable" is a different question.



=>  An estimate for the increase in downloaded files's sizes you are
talking about is ca. factor 2, from 18.2M (current "updates")
to 32.8M+ (current "Everything"+"newly introduced packages)


Whether this increase in download-size is "significant" is up to the
beholder. For me, it gets lost in the noise of accessing a "good" or a
"bad" connection to a mirror and in the time required to download
packages from mirrors.


33~ megs downloaded every single time an update is pushed is a
significant hit for a fair number of people.

Yes, but ... some more figures:

The same figures as above for FC10:
=> Everything: 25.8M
=> updates: 18.5M

=> A rough estimate for sizes of repodata for a
"near EOL'ed" Fedora: 70% of the size of "Everything's repodata".


I.e. should this estimate apply to later Fedoras, Fedora 11 users are 
likely to see 70% of 33MB = 23MB near EOL of Fedora 11.



That was why it was
important to make yum not re-fetch that repodata every time, and use a
cached version of it.

Yes, the keys to minimize bandwidth demands would be
* to shink the size of the repodata/-files
* to shink the size of "changes" to them.

Besides obvious solutions, such as using a different compression
(e.g. xz instead of bz2) and minimizing their contents, one could ship 
repodata/-files in "chunks".


What I mean: In theory, one could
* update repodata/-files incrementally by shipping some kind of "deltas".

* split repodata/-files into several, e.g. sorted by "some other 
criteria", i.e. to provide several sets of *-[primary,filelist,other] 
files. One "package push", then would only affect a subset of the files, 
but not all. - This is very similar to what (IIRC) Seth had proposed 
(Split the repo into several repos, alphabetically), except that the 
"split" would happen inside of repodata and thus be transparent to users.

I am not sure how difficult to implement this would be.


Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-04 Thread Orion Poplawski

On Fri, December 4, 2009 9:20 pm, Ralf Corsepius wrote:
> On 12/03/2009 07:22 AM, Jesse Keating wrote:
>> On Thu, 2009-12-03 at 06:24 +0100, Ralf Corsepius wrote:
>>> Yes, for people who are doing "full featured networked installs" w/
>>> custom kickstart files. I've never met such a person.
>>
>> Really?  I meet people who use kickstart all the time.
> May-be internal at RH?

I do every install via kickstart - small company with 30-50 machines. 
Been doing fedora+everything+updates installs for many releases now.

In fact - every "upgrade" is a fresh kickstart install + restore critical
files from backup.

-- 
Orion Poplawski
Technical Manager 303-415-9701 x222
NWRA/CoRA DivisionFAX: 303-415-9702
3380 Mitchell Lane  or...@cora.nwra.com
Boulder, CO 80301  http://www.cora.nwra.com

-- 
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list


Re: Proposed F13 feature: drop separate updates repository

2009-12-04 Thread Ralf Corsepius

On 12/05/2009 06:22 AM, Orion Poplawski wrote:


On Fri, December 4, 2009 9:20 pm, Ralf Corsepius wrote:

On 12/03/2009 07:22 AM, Jesse Keating wrote:

On Thu, 2009-12-03 at 06:24 +0100, Ralf Corsepius wrote:

Yes, for people who are doing "full featured networked installs" w/
custom kickstart files. I've never met such a person.


Really?  I meet people who use kickstart all the time.

May-be internal at RH?


I do every install via kickstart - small company with 30-50 machines.
Been doing fedora+everything+updates installs for many releases now.


OK, then it's likely a "full time" professional admins vs. "part 
time"/"side-job" admins and "home-users" thing.


Ralf

--
fedora-devel-list mailing list
fedora-devel-list@redhat.com
https://www.redhat.com/mailman/listinfo/fedora-devel-list