Re: [PATCH 2.6.25] netns: struct net content re-work

2007-12-10 Thread Kirill Korotaev
Daniel Lezcano wrote: > Denis V. Lunev wrote: > >>Recently David Miller and Herbert Xu pointed out that struct net becomes >>overbloated and un-maintainable. There are two solutions: >>- provide a pointer to a network subsystem definition from struct net. >> This costs an additional dereferrence

Re: [PATCH][net-2.6.25][NETNS] make netns cleanup to run in a separate workqueue

2007-11-19 Thread Kirill Korotaev
Ah, sorry. Didn't notice it's called only on boot. Acked-By: Kirill Korotaev <[EMAIL PROTECTED]> Kirill Korotaev wrote: > imho panic() is too much. > create_singlethread_workqueue() can fail e.g. due to out of memory... > > Thanks, > Kirill > > > Daniel

Re: [PATCH][net-2.6.25][NETNS] make netns cleanup to run in a separate workqueue

2007-11-19 Thread Kirill Korotaev
imho panic() is too much. create_singlethread_workqueue() can fail e.g. due to out of memory... Thanks, Kirill Daniel Lezcano wrote: > Subject: make netns cleanup to run in a separate queue > From: Benjamin Thery <[EMAIL PROTECTED]> > > This patch adds a separate workqueue for cleaning up a net

Re: [RFD] L2 Network namespace infrastructure

2007-06-28 Thread Kirill Korotaev
Eric W. Biederman wrote: > Patrick McHardy <[EMAIL PROTECTED]> writes: > > >>Eric W. Biederman wrote: >> >>>-- The basic design >>> >>>There will be a network namespace structure that holds the global >>>variables for a network namespace, making those global variables >>>per network namespace. >>

Re: [Devel] Re: [RFD] L2 Network namespace infrastructure

2007-06-28 Thread Kirill Korotaev
Ben Greear wrote: > Kirill Korotaev wrote: > >>Patrick McHardy wrote: >> >> >>>I believe OpenVZ stores the current namespace somewhere global, >>>which avoids passing the namespace around. Couldn't you do this >>>as well? >>>

Re: [Devel] Re: [RFD] L2 Network namespace infrastructure

2007-06-27 Thread Kirill Korotaev
Jeff Garzik wrote: > Eric W. Biederman wrote: > >>Jeff Garzik <[EMAIL PROTECTED]> writes: >> >> >>>David Miller wrote: >>> I don't accept that we have to add another function argument to a bunch of core routines just to support this crap, especially since you give no way to turn it off

Re: [Devel] Re: [RFD] L2 Network namespace infrastructure

2007-06-27 Thread Kirill Korotaev
Patrick McHardy wrote: > Eric W. Biederman wrote: > >>-- The basic design >> >>There will be a network namespace structure that holds the global >>variables for a network namespace, making those global variables >>per network namespace. >> >>One of those per network namespace global variables will

Re: [Devel] Re: [RFD] L2 Network namespace infrastructure

2007-06-27 Thread Kirill Korotaev
Ben Greear wrote: > Patrick McHardy wrote: > >>Eric W. Biederman wrote: >> >> >>>-- The basic design >>> >>>There will be a network namespace structure that holds the global >>>variables for a network namespace, making those global variables >>>per network namespace. >>> >>>One of those per netw

Re: [PATCH] Virtual ethernet tunnel

2007-06-07 Thread Kirill Korotaev
Deniel, Daniel Lezcano wrote: > Pavel Emelianov wrote: > I did this at the very first version, but Alexey showed me that this would be wrong. Look. When we create the second device it must be in the other namespace as it is useless to have them in one namespace. But if we have the

Re: [Devel] Re: [PATCH] Virtual ethernet tunnel

2007-06-07 Thread Kirill Korotaev
David Miller wrote: > From: Pavel Emelianov <[EMAIL PROTECTED]> > Date: Wed, 06 Jun 2007 19:11:38 +0400 > > >>Veth stands for Virtual ETHernet. It is a simple tunnel driver >>that works at the link layer and looks like a pair of ethernet >>devices interconnected with each other. > > > I would s

Re: L2 network namespace benchmarking

2007-03-27 Thread Kirill Korotaev
>>The loss of performances is very noticeable inside the container and >>seems to be directly related to the usage of the pair device and the >>specific network configuration needed for the container. When the >>packets are sent by the container, the mac address is for the pair >>device but the IP

Re: [Devel] Re: [PATCH] Copy mac_len in skb_clone() as well

2007-03-15 Thread Kirill Korotaev
David Miller wrote: > From: Alexey Dobriyan <[EMAIL PROTECTED]> > Date: Wed, 14 Mar 2007 16:07:11 +0300 > > >>ANK says: "It is rarely used, that's wy it was not noticed. >>But in the places, where it is used, it should be disaster." >> >>Signed-off-by: Alexey Dobriyan <[EMAIL PROTECTED]> > > >

Re: [PATCH 0/59] Cleanup sysctl

2007-01-17 Thread Kirill Korotaev
Eric, really good job! Patches: 1-13, 15-24, 26-32, 34-44, 46-49, 52-55, 57 (all except below) Acked-By: Kirill Korotaev <[EMAIL PROTECTED]> 14/59 - minor (extra space) 25/59 - minor note 33/59 - not sorted sysctl IDs 45/59 - typo 50/59 - copyright/file note 51/59 - copyright/fil

Re: [PATCH 51/59] sysctl: Move SYSV IPC sysctls to their own file

2007-01-17 Thread Kirill Korotaev
1. I ask for not setting your authorship/copyright on the code which you just copied from other places. Just doesn't look polite IMHO. 2. please don't name files like ipc/ipc_sysctl.c ipc/sysctl.c sounds better IMHO. 3. any reason to introduce CONFIG_SYSVIPC_SYSCTL? why not simply do >

Re: [PATCH 50/59] sysctl: Move utsname sysctls to their own file

2007-01-17 Thread Kirill Korotaev
Eric, though I personally don't care much: 1. I ask for not setting your authorship/copyright on the code which you just copied from other places. Just doesn't look polite IMHO. 2. I would propose to not introduce utsname_sysctl.c. both files are too small and minor that I can't see much reaso

Re: [PATCH 33/59] sysctl: s390 move sysctl definitions to sysctl.h

2007-01-17 Thread Kirill Korotaev
IDs not sorted in enum. see below. > From: Eric W. Biederman <[EMAIL PROTECTED]> - unquoted > > We need to have the the definition of all top level sysctl > directories registers in sysctl.h so we don't conflict by > accident and cause abi problems. > > Signed-off-by: Eric W. Biederman <[EMAIL P

Re: [PATCH 25/59] sysctl: C99 convert arch/frv/kernel/pm.c

2007-01-17 Thread Kirill Korotaev
another small minor note. > From: Eric W. Biederman <[EMAIL PROTECTED]> - unquoted > > Signed-off-by: Eric W. Biederman <[EMAIL PROTECTED]> > --- > arch/frv/kernel/pm.c | 50 > +++--- > 1 files changed, 43 insertions(+), 7 deletions(-) > > diff --g

Re: [PATCH 14/59] sysctl: C99 convert xfs ctl_tables

2007-01-17 Thread Kirill Korotaev
minor extra space in table below... Kirill > From: Eric W. Biederman <[EMAIL PROTECTED]> - unquoted > > Signed-off-by: Eric W. Biederman <[EMAIL PROTECTED]> > --- > fs/xfs/linux-2.6/xfs_sysctl.c | 258 > 1 files changed, 180 insertions(+), 78 deletions(

Re: [Devel] Re: Network virtualization/isolation

2006-12-06 Thread Kirill Korotaev
>>>If there is a better and less intrusive while still being obvious >>>method I am all for it. I do not like the OpenVZ thing of doing the >>>lookup once and then stashing the value in current and the special >>>casing the exceptions. >> >>Why? > > > I like it when things are obvious and not im

Re: [Devel] Re: [RFC] network namespaces

2006-09-07 Thread Kirill Korotaev
Herbert Poetzl wrote: my point (until we have an implementation which clearly shows that performance is equal/better to isolation) is simply this: of course, you can 'simulate' or 'construct' all the isolation scenarios with kernel bridging and routing and tricky injection/marking of packets, b

Re: [RFC] network namespaces

2006-09-06 Thread Kirill Korotaev
On Tue, Sep 05, 2006 at 08:45:39AM -0600, Eric W. Biederman wrote: Daniel Lezcano <[EMAIL PROTECTED]> writes: For HPC if you are interested in migration you need a separate IP per container. If you can take you IP address with you migration of networking state is simple. If you can't take your

Re: [RFC] network namespaces

2006-09-05 Thread Kirill Korotaev
Yes, performance is probably one issue. My concerns was for layer 2 / layer 3 virtualization. I agree a layer 2 isolation/virtualization is the best for the "system container". But there is another family of container called "application container", it is not a system which is run inside a cont

[PATCH] neigh_table_clear() doesn't free stats

2006-09-01 Thread Kirill Korotaev
Signed-Off-By: Kirill Korotaev <[EMAIL PROTECTED]> diff --git a/net/core/neighbour.c b/net/core/neighbour.c index 89b7904..a45bd21 100644 --- a/net/core/neighbour.c +++ b/net/core/neighbour.c @@ -1429,6 +1429,9 @@ int neigh_table_clear(struct neigh_table kfree(tbl->phash_buck

Re: [RFC] network namespaces

2006-08-17 Thread Kirill Korotaev
Basically there are currently 3 approaches that have been proposed. The trivial bsdjail style as implemented by Serge and in a slightly more sophisticated version in vserver. This approach as it does not touch the packets has little to no packet level overhead. Basically this is what I have cal

Re: [PATCH 3/9] network namespaces: playing and debugging

2006-08-17 Thread Kirill Korotaev
Temporary code to play with network namespaces in the simplest way. Do exec 7< /proc/net/net_ns in your bash shell and you'll get a brand new network namespace. There you can, for example, do ip link set lo up ip addr list ip addr add 1.2.3.4 dev lo ping -n 1.2.3

[PATCH] add_timer -> mod_timer() in dst_run_gc()

2006-08-09 Thread Kirill Korotaev
before dst_lock is tried. Meanwhile, someone on CPU1 adds an entry to gc list and starts the timer. If CPU2 was preempted long enough, this timer can expire simultaneously with resuming timer handler on CPU1, arriving exactly to the situation described. Signed-Off-By: Dmitry Mishin <[EMAIL PROTEC

Re: [PATCH] limit rt cache size

2006-08-09 Thread Kirill Korotaev
2) for cases where we haven't implemented dynamic table growth, specifying a proper limit argument to the hash table allocation is a sufficient solution for the time being Agreed, just we don't know what the proper limits are. I guess it would need someone running quite a lot of benchm

Re: [PATCH] limit rt cache size

2006-08-09 Thread Kirill Korotaev
1) dynamic table growth is the only reasonable way to handle this and not waste memory in all cases Definitely that's the ideal way to go. But there's alot of state to update (more or less atomically, too) in the TCP hashes. Looks tricky to do that without hurting performance, especiall

Re: [PATCH] limit rt cache size

2006-08-08 Thread Kirill Korotaev
David Miller wrote: From: Andi Kleen <[EMAIL PROTECTED]> Date: Tue, 8 Aug 2006 08:53:03 +0200 That's still too big. Consider a 2TB machine, with all memory in LOWMEM. Andi I agree with you, route.c should pass in a suitable limit. I'm just suggesting a fix for a seperate problem. So summa

Re: [PATCH] limit rt cache size

2006-08-08 Thread Kirill Korotaev
David Miller wrote: we quickly discover this GIT commit: 424c4b70cc4ff3930ee36a2ef7b204e4d704fd26 [IPV4]: Use the fancy alloc_large_system_hash() function for route hash table - rt hash table allocated using alloc_large_system_hash() function. Signed-off-by: Eric Dumazet <[EMAIL PROTECTED]> S

Re: [patch 3/4] Network namespaces: IPv4 FIB/routing in namespaces

2006-06-28 Thread Kirill Korotaev
Structures related to IPv4 rounting (FIB and routing cache) are made per-namespace. Hi Andrey, if the ressources are private to the namespace, how do you will handle NFS mounted before creating the network namespace ? Do you take care of that or simply assume you can't access NFS anymore ?

Re: [RFC][patch 1/4] Network namespaces: cleanup of dev_base list use

2006-06-27 Thread Kirill Korotaev
Cleanup of dev_base list use, with the aim to make device list per-namespace. In almost every occasion, use of dev_base variable and dev->next pointer could be easily replaced by for_each_netdev loop. A few most complicated places were converted to using first_netdev()/next_netdev(). As a proof

Re: [patch 2/6] [Network namespace] Network device sharing by view

2006-06-27 Thread Kirill Korotaev
My point is that if you make namespace tagging at routing time, and your packets are being routed only once, you lose the ability to have separate routing tables in each namespace. Right. What is the advantage of having separate the routing tables ? it is impossible to have bridged networking,

Re: [RFC][patch 1/4] Network namespaces: cleanup of dev_base list use

2006-06-27 Thread Kirill Korotaev
Cleanup of dev_base list use, with the aim to make device list per-namespace. In almost every occasion, use of dev_base variable and dev->next pointer could be easily replaced by for_each_netdev loop. A few most complicated places were converted to using first_netdev()/next_netdev(). As a proof