Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On 01/29/2015 05:06 PM, Scot Doyle wrote: > On Thu, 29 Jan 2015, Michael Kerrisk (man-pages) wrote: >> On 29 January 2015 at 02:27, Scot Doyle wrote: >>> On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: > On Wed, 28 Jan 2015, Vivek Goyal wrote: >> On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: >>> When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same >>> true for kexec_load? Would it make sense to note this in the man pages >>> along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? >> >> Hmm.., I can't see an explicity dependency between RELOCATABLE and >> KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel >> even if it had RELOCATABLE=n. >> >> Just that kernel will run from the address it has been built for. >> >> Thanks >> Vivek > > Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says > "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to > arch/x86/boot/header.S line 396: > > #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) >/* kernel/boot_param/ramdisk could be loaded above 4g */ > # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G > #else > # define XLF1 0 > #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek >>> >>> Thanks, I agree, and think it would make sense to list them as part of the >>> page's ENOEXEC error. >> >> Scott, could you then phras a couple of sentences that capture thge >> details, so I can add it to the ENOEXEC error? >> >> Thanks, >> >> Michael > > Yes, maybe something like "kernel_fd does not refer to an open file, or > the file type is not supported. Currently, the file must be a bzImage > and contain an x86 kernel loadable above 4G in memory (see > Documentation/x86/boot.txt)."? > > boot.txt explains that loading above 4G implies 64-bit and is specified > via a bit in xloadflags added in Linux 3.8. Added and pushed. Thanks, Scott. Cheers, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On 01/29/2015 05:06 PM, Scot Doyle wrote: On Thu, 29 Jan 2015, Michael Kerrisk (man-pages) wrote: On 29 January 2015 at 02:27, Scot Doyle lkm...@scotdoyle.com wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek Thanks, I agree, and think it would make sense to list them as part of the page's ENOEXEC error. Scott, could you then phras a couple of sentences that capture thge details, so I can add it to the ENOEXEC error? Thanks, Michael Yes, maybe something like kernel_fd does not refer to an open file, or the file type is not supported. Currently, the file must be a bzImage and contain an x86 kernel loadable above 4G in memory (see Documentation/x86/boot.txt).? boot.txt explains that loading above 4G implies 64-bit and is specified via a bit in xloadflags added in Linux 3.8. Added and pushed. Thanks, Scott. Cheers, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Thu, 29 Jan 2015, Michael Kerrisk (man-pages) wrote: > On 29 January 2015 at 02:27, Scot Doyle wrote: > > On Wed, 28 Jan 2015, Vivek Goyal wrote: > >> On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: > >> > On Wed, 28 Jan 2015, Vivek Goyal wrote: > >> > > On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: > >> > > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the > >> > > > same > >> > > > true for kexec_load? Would it make sense to note this in the man > >> > > > pages > >> > > > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error > >> > > > message? > >> > > > >> > > Hmm.., I can't see an explicity dependency between RELOCATABLE and > >> > > KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel > >> > > even if it had RELOCATABLE=n. > >> > > > >> > > Just that kernel will run from the address it has been built for. > >> > > > >> > > Thanks > >> > > Vivek > >> > > >> > Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says > >> > "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to > >> > arch/x86/boot/header.S line 396: > >> > > >> > #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) > >> >/* kernel/boot_param/ramdisk could be loaded above 4g */ > >> > # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G > >> > #else > >> > # define XLF1 0 > >> > #endif > >> > >> Ah, this one. Actually generic kexec file loading implementation does not > >> impose this restriction. It is the image specific loader part which > >> decides what kind of bzImage it can load. > >> > >> Current implementation (kexec-bzimage64.c), is only supporting loading > >> bzImages which are 64bit and can be loaded above 4G. This simplifies > >> the implementation of loader. > >> > >> But there is nothing which prevents one from implementing other image > >> loaders. > >> > >> So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, > >> it might be better to say in man page that currently this system call > >> supports only loading a bzImage which is 64bit and which can be loaded > >> above 4G too. > >> > >> Thanks > >> Vivek > > > > Thanks, I agree, and think it would make sense to list them as part of the > > page's ENOEXEC error. > > Scott, could you then phras a couple of sentences that capture thge > details, so I can add it to the ENOEXEC error? > > Thanks, > > Michael Yes, maybe something like "kernel_fd does not refer to an open file, or the file type is not supported. Currently, the file must be a bzImage and contain an x86 kernel loadable above 4G in memory (see Documentation/x86/boot.txt)."? boot.txt explains that loading above 4G implies 64-bit and is specified via a bit in xloadflags added in Linux 3.8. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Thu, 29 Jan 2015, Michael Kerrisk (man-pages) wrote: On 29 January 2015 at 02:27, Scot Doyle lkm...@scotdoyle.com wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek Thanks, I agree, and think it would make sense to list them as part of the page's ENOEXEC error. Scott, could you then phras a couple of sentences that capture thge details, so I can add it to the ENOEXEC error? Thanks, Michael Yes, maybe something like kernel_fd does not refer to an open file, or the file type is not supported. Currently, the file must be a bzImage and contain an x86 kernel loadable above 4G in memory (see Documentation/x86/boot.txt).? boot.txt explains that loading above 4G implies 64-bit and is specified via a bit in xloadflags added in Linux 3.8. -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On 29 January 2015 at 02:27, Scot Doyle wrote: > On Wed, 28 Jan 2015, Vivek Goyal wrote: >> On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: >> > On Wed, 28 Jan 2015, Vivek Goyal wrote: >> > > On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: >> > > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same >> > > > true for kexec_load? Would it make sense to note this in the man pages >> > > > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? >> > > >> > > Hmm.., I can't see an explicity dependency between RELOCATABLE and >> > > KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel >> > > even if it had RELOCATABLE=n. >> > > >> > > Just that kernel will run from the address it has been built for. >> > > >> > > Thanks >> > > Vivek >> > >> > Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says >> > "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to >> > arch/x86/boot/header.S line 396: >> > >> > #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) >> >/* kernel/boot_param/ramdisk could be loaded above 4g */ >> > # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G >> > #else >> > # define XLF1 0 >> > #endif >> >> Ah, this one. Actually generic kexec file loading implementation does not >> impose this restriction. It is the image specific loader part which >> decides what kind of bzImage it can load. >> >> Current implementation (kexec-bzimage64.c), is only supporting loading >> bzImages which are 64bit and can be loaded above 4G. This simplifies >> the implementation of loader. >> >> But there is nothing which prevents one from implementing other image >> loaders. >> >> So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, >> it might be better to say in man page that currently this system call >> supports only loading a bzImage which is 64bit and which can be loaded >> above 4G too. >> >> Thanks >> Vivek > > Thanks, I agree, and think it would make sense to list them as part of the > page's ENOEXEC error. Scott, could you then phras a couple of sentences that capture thge details, so I can add it to the ENOEXEC error? Thanks, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: > On Wed, 28 Jan 2015, Vivek Goyal wrote: > > On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: > > > On Wed, 28 Jan 2015, Vivek Goyal wrote: > > > > On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) > > > > wrote: > > > > > Hello Vivek, > > > > > > > > > > >> I've made various adjustments to the page in the light of your > > > > > >> comments > > > > > >> above. Thanks! > > > > > > > > > > > > Thank you for following it up and improving kexec man page. > > > > > > > > > > You're welcome. So, by now, I've made quite a lot of changes > > > > > (including adding a number of cases under ERRORS). I think the revised > > > > > kexec_load/kexec_file_load page is pretty much ready to go, but would > > > > > you be willing to give the text below a check over first? > > > > > > > > > > > > > Hi Michael, > > > > > > > > I had a quick look and it looks good to me. > > > > > > > > Thanks > > > > Vivek > > > > > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same > > > true for kexec_load? Would it make sense to note this in the man pages > > > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? > > > > Hmm.., I can't see an explicity dependency between RELOCATABLE and > > KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel > > even if it had RELOCATABLE=n. > > > > Just that kernel will run from the address it has been built for. > > > > Thanks > > Vivek > > Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says > "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to > arch/x86/boot/header.S line 396: > > #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) >/* kernel/boot_param/ramdisk could be loaded above 4g */ > # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G > #else > # define XLF1 0 > #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
[Dropping Andi into CC, which I should have done to start with, since he wrote the original page, and might also have some comments] Hello Vivek, >> I've made various adjustments to the page in the light of your comments >> above. Thanks! > > Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Thanks Michael .\" Copyright (C) 2010 Intel Corporation, Author: Andi Kleen .\" and Copyright 2014, Vivek Goyal .\" and Copyright (c) 2015, Michael Kerrisk .\" .\" %%%LICENSE_START(VERBATIM) .\" Permission is granted to make and distribute verbatim copies of this .\" manual provided the copyright notice and this permission notice are .\" preserved on all copies. .\" .\" Permission is granted to copy and distribute modified versions of this .\" manual under the conditions for verbatim copying, provided that the .\" entire resulting derived work is distributed under the terms of a .\" permission notice identical to this one. .\" .\" Since the Linux kernel and libraries are constantly changing, this .\" manual page may be incorrect or out-of-date. The author(s) assume no .\" responsibility for errors or omissions, or for damages resulting from .\" the use of the information contained herein. The author(s) may not .\" have taken the same level of care in the production of this manual, .\" which is licensed free of charge, as they might when working .\" professionally. .\" .\" Formatted or processed versions of this manual, if unaccompanied by .\" the source, must acknowledge the copyright and authors of this work. .\" %%%LICENSE_END .\" .TH KEXEC_LOAD 2 2014-08-19 "Linux" "Linux Programmer's Manual" .SH NAME kexec_load, kexec_file_load \- load a new kernel for later execution .SH SYNOPSIS .nf .B #include .BI "long kexec_load(unsigned long " entry ", unsigned long " nr_segments "," .BI "struct kexec_segment *" segments \ ", unsigned long " flags ");" .BI "long kexec_file_load(int " kernel_fd ", int " initrd_fd "," .br .BI "unsigned long " cmdline_len \ ", const char *" cmdline "," .BI "unsigned long " flags ");" .fi .IR Note : There are no glibc wrappers for these system calls; see NOTES. .SH DESCRIPTION The .BR kexec_load () system call loads a new kernel that can be executed later by .BR reboot (2). .PP The .I flags argument is a bit mask that controls the operation of the call. The following values can be specified in .IR flags : .TP .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" Execute the new kernel automatically on a system crash. This "crash kernel" is loaded into an area of reserved memory that is determined at boot time using the .I craskkernel kernel command-line parameter. The location of this reserved memory is exported to user space via the .I /proc/iomem file, in an entry labeled "Crash kernel". A user-space application can parse this file and prepare a list of segments (see below) that specify this reserved memory as destination. If this flag is specified, the kernel checks that the target segments specified in .I segments fall within the reserved region. .TP .BR KEXEC_PRESERVE_CONTEXT " (since Linux 2.6.27)" Preserve the system hardware and software states before executing the new kernel. This could be used for system suspend. This flag is available only if the kernel was configured with .BR CONFIG_KEXEC_JUMP , and is effective only if .I nr_segments is greater than 0. .PP The high-order bits (corresponding to the mask 0x) of .I flags contain the architecture of the to-be-executed kernel. Specify (OR) the constant .B KEXEC_ARCH_DEFAULT to use the current architecture, or one of the following architecture constants .BR KEXEC_ARCH_386 , .BR KEXEC_ARCH_68K , .BR KEXEC_ARCH_X86_64 , .BR KEXEC_ARCH_PPC , .BR KEXEC_ARCH_PPC64 , .BR KEXEC_ARCH_IA_64 , .BR KEXEC_ARCH_ARM , .BR KEXEC_ARCH_S390 , .BR KEXEC_ARCH_SH , .BR KEXEC_ARCH_MIPS , and .BR KEXEC_ARCH_MIPS_LE . The architecture must be executable on the CPU of the system. The .I entry argument is the physical entry address in the kernel image. The .I nr_segments argument is the number of segments pointed to by the .I segments pointer; the kernel imposes an (arbitrary) limit of 16 on the number of segments. The .I segments argument is an array of .I kexec_segment structures which define the kernel layout: .in +4n .nf struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP The kernel image defined by .I segments is copied from the calling process into the kernel either in regular memory or in reserved memory (if .BR
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: > On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: > > Hello Vivek, > > > > >> I've made various adjustments to the page in the light of your comments > > >> above. Thanks! > > > > > > Thank you for following it up and improving kexec man page. > > > > You're welcome. So, by now, I've made quite a lot of changes > > (including adding a number of cases under ERRORS). I think the revised > > kexec_load/kexec_file_load page is pretty much ready to go, but would > > you be willing to give the text below a check over first? > > > > Hi Michael, > > I had a quick look and it looks good to me. > > Thanks > Vivek When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Thanks, Scot -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: > On Wed, 28 Jan 2015, Vivek Goyal wrote: > > On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: > > > Hello Vivek, > > > > > > >> I've made various adjustments to the page in the light of your comments > > > >> above. Thanks! > > > > > > > > Thank you for following it up and improving kexec man page. > > > > > > You're welcome. So, by now, I've made quite a lot of changes > > > (including adding a number of cases under ERRORS). I think the revised > > > kexec_load/kexec_file_load page is pretty much ready to go, but would > > > you be willing to give the text below a check over first? > > > > > > > Hi Michael, > > > > I had a quick look and it looks good to me. > > > > Thanks > > Vivek > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same > true for kexec_load? Would it make sense to note this in the man pages > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: > On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: > > On Wed, 28 Jan 2015, Vivek Goyal wrote: > > > On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: > > > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same > > > > true for kexec_load? Would it make sense to note this in the man pages > > > > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? > > > > > > Hmm.., I can't see an explicity dependency between RELOCATABLE and > > > KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel > > > even if it had RELOCATABLE=n. > > > > > > Just that kernel will run from the address it has been built for. > > > > > > Thanks > > > Vivek > > > > Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says > > "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to > > arch/x86/boot/header.S line 396: > > > > #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) > >/* kernel/boot_param/ramdisk could be loaded above 4g */ > > # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G > > #else > > # define XLF1 0 > > #endif > > Ah, this one. Actually generic kexec file loading implementation does not > impose this restriction. It is the image specific loader part which > decides what kind of bzImage it can load. > > Current implementation (kexec-bzimage64.c), is only supporting loading > bzImages which are 64bit and can be loaded above 4G. This simplifies > the implementation of loader. > > But there is nothing which prevents one from implementing other image > loaders. > > So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, > it might be better to say in man page that currently this system call > supports only loading a bzImage which is 64bit and which can be loaded > above 4G too. > > Thanks > Vivek Thanks, I agree, and think it would make sense to list them as part of the page's ENOEXEC error. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: > On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: > > On Wed, 28 Jan 2015, Vivek Goyal wrote: > > > On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) > > > wrote: > > > > Hello Vivek, > > > > > > > > >> I've made various adjustments to the page in the light of your > > > > >> comments > > > > >> above. Thanks! > > > > > > > > > > Thank you for following it up and improving kexec man page. > > > > > > > > You're welcome. So, by now, I've made quite a lot of changes > > > > (including adding a number of cases under ERRORS). I think the revised > > > > kexec_load/kexec_file_load page is pretty much ready to go, but would > > > > you be willing to give the text below a check over first? > > > > > > > > > > Hi Michael, > > > > > > I had a quick look and it looks good to me. > > > > > > Thanks > > > Vivek > > > > When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same > > true for kexec_load? Would it make sense to note this in the man pages > > along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? > > Hmm.., I can't see an explicity dependency between RELOCATABLE and > KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel > even if it had RELOCATABLE=n. > > Just that kernel will run from the address it has been built for. > > Thanks > Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says "kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set." which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) && defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 09:04:38AM +0100, Michael Kerrisk (man-pages) wrote: Hi Michael, [..] > >> * the number of bytes copied from userspace is min(bufsz, memsz) > > > > Yes. bufsz can not be more than memsz. There is a check to validate > > this in kernel. > > > > result = -EINVAL; > > for (i = 0; i < nr_segments; i++) { > > if (image->segment[i].bufsz > image->segment[i].memsz) > > return result; > > } > > Okay. So it's more precise to leave discussion of min(bufz, memsz) > out of the man page just to say: bufsz bytes are transferred; > if bufsz < memsz, then the excess bytes in the target region are > filled with zeros. Right? Sounds good. [..] > > Both mem and memsz need to be page aligned. > > And the error if not is EADDRNOTAVAIL, right? Yes. > > >> And one further question. Other than the fact that they are used with > >> different system calls, what is the difference between KEXEC_ON_CRASH > >> and KEXEC_FILE_ON_CRASH? > > > > Right now I can't think of any other difference. They both tell respective > > system call that this kernel needs to be loaded in reserved memory region > > for crash kernel. > > Okay. > > I've made various adjustments to the page in the light of your comments > above. Thanks! Thank you for following it up and improving kexec man page. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hi Vivek, On 01/27/2015 03:24 PM, Vivek Goyal wrote: > On Fri, Jan 16, 2015 at 02:30:25PM +0100, Michael Kerrisk (man-pages) wrote: > [..] >> > > Hi Michael, > > Please find my responses below. Sorry, I got stuck in other work and > forgot about this thread. > >> So, returning to the kexeec_segment structure: >> >>struct kexec_segment { >>void *buf;/* Buffer in user space */ >>size_t bufsz; /* Buffer length in user space */ >>void *mem;/* Physical address of kernel */ >>size_t memsz; /* Physical address length */ >>}; >> >> Are the following statements correct: >> * buf + bufsz identify a memory region in the caller's virtual >> address space that is the source of the copy > > Yes. Okay. >> * mem + memsz specify the target memory region of the copy > > Yes. Okay. >> * mem is physical memory address, as seen from kernel space > > Yes. Okay. >> * the number of bytes copied from userspace is min(bufsz, memsz) > > Yes. bufsz can not be more than memsz. There is a check to validate > this in kernel. > > result = -EINVAL; > for (i = 0; i < nr_segments; i++) { > if (image->segment[i].bufsz > image->segment[i].memsz) > return result; > } Okay. So it's more precise to leave discussion of min(bufz, memsz) out of the man page just to say: bufsz bytes are transferred; if bufsz < memsz, then the excess bytes in the target region are filled with zeros. Right? >> * if bufsz > memsz, then excess bytes in the user-space buffer >> are ignored. > > You will get -EINVAL. Okay. >> * if memsz > bufsz, then excess bytes in the target kernel buffer >> are filled with zeros. > > Yes. Okay. >> Also, it seems to me that 'mem' need not be page aligned. >> Is that correct? Should the man page say something about that? >> (E.g., is it generally desirable that 'mem' should be page aligned?) > > mem and memsz need to be page aligned. There is a check for that too. > > mstart = image->segment[i].mem; > mend = mstart + image->segment[i].memsz; > if ((mstart & ~PAGE_MASK) || (mend & ~PAGE_MASK)) > return result; > >> >> Likewise, 'memsz' doesn't need to beta page multiple, IIUC. >> Should the man page say anything about this? For example, should >> it note that the initialized kernel segment will be of size: >> >> (mem % PAGE_SIZE + memsz) rounded up to the next multiple of PAGE_SIZE >> >> And should it note that if 'mem' is not a multiple of the page size, then >> the initial bytes (mem % PAGE_SIZE)) in the first page of the kernel segment >> will be zeros? >> >> (Hopefully I have read kimage_load_normal_segment() correctly.) > > Both mem and memsz need to be page aligned. And the error if not is EADDRNOTAVAIL, right? >> And one further question. Other than the fact that they are used with >> different system calls, what is the difference between KEXEC_ON_CRASH >> and KEXEC_FILE_ON_CRASH? > > Right now I can't think of any other difference. They both tell respective > system call that this kernel needs to be loaded in reserved memory region > for crash kernel. Okay. I've made various adjustments to the page in the light of your comments above. Thanks! Cheers, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: > [Dropping Andi into CC, which I should have done to start with, since > he wrote the original page, and might also have some comments] > > Hello Vivek, > > >> I've made various adjustments to the page in the light of your comments > >> above. Thanks! > > > > Thank you for following it up and improving kexec man page. > > You're welcome. So, by now, I've made quite a lot of changes > (including adding a number of cases under ERRORS). I think the revised > kexec_load/kexec_file_load page is pretty much ready to go, but would > you be willing to give the text below a check over first? > Hi Michael, I had a quick look and it looks good to me. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 09:04:38AM +0100, Michael Kerrisk (man-pages) wrote: Hi Michael, [..] * the number of bytes copied from userspace is min(bufsz, memsz) Yes. bufsz can not be more than memsz. There is a check to validate this in kernel. result = -EINVAL; for (i = 0; i nr_segments; i++) { if (image-segment[i].bufsz image-segment[i].memsz) return result; } Okay. So it's more precise to leave discussion of min(bufz, memsz) out of the man page just to say: bufsz bytes are transferred; if bufsz memsz, then the excess bytes in the target region are filled with zeros. Right? Sounds good. [..] Both mem and memsz need to be page aligned. And the error if not is EADDRNOTAVAIL, right? Yes. And one further question. Other than the fact that they are used with different system calls, what is the difference between KEXEC_ON_CRASH and KEXEC_FILE_ON_CRASH? Right now I can't think of any other difference. They both tell respective system call that this kernel needs to be loaded in reserved memory region for crash kernel. Okay. I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hi Vivek, On 01/27/2015 03:24 PM, Vivek Goyal wrote: On Fri, Jan 16, 2015 at 02:30:25PM +0100, Michael Kerrisk (man-pages) wrote: [..] Hi Michael, Please find my responses below. Sorry, I got stuck in other work and forgot about this thread. So, returning to the kexeec_segment structure: struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; Are the following statements correct: * buf + bufsz identify a memory region in the caller's virtual address space that is the source of the copy Yes. Okay. * mem + memsz specify the target memory region of the copy Yes. Okay. * mem is physical memory address, as seen from kernel space Yes. Okay. * the number of bytes copied from userspace is min(bufsz, memsz) Yes. bufsz can not be more than memsz. There is a check to validate this in kernel. result = -EINVAL; for (i = 0; i nr_segments; i++) { if (image-segment[i].bufsz image-segment[i].memsz) return result; } Okay. So it's more precise to leave discussion of min(bufz, memsz) out of the man page just to say: bufsz bytes are transferred; if bufsz memsz, then the excess bytes in the target region are filled with zeros. Right? * if bufsz memsz, then excess bytes in the user-space buffer are ignored. You will get -EINVAL. Okay. * if memsz bufsz, then excess bytes in the target kernel buffer are filled with zeros. Yes. Okay. Also, it seems to me that 'mem' need not be page aligned. Is that correct? Should the man page say something about that? (E.g., is it generally desirable that 'mem' should be page aligned?) mem and memsz need to be page aligned. There is a check for that too. mstart = image-segment[i].mem; mend = mstart + image-segment[i].memsz; if ((mstart ~PAGE_MASK) || (mend ~PAGE_MASK)) return result; Likewise, 'memsz' doesn't need to beta page multiple, IIUC. Should the man page say anything about this? For example, should it note that the initialized kernel segment will be of size: (mem % PAGE_SIZE + memsz) rounded up to the next multiple of PAGE_SIZE And should it note that if 'mem' is not a multiple of the page size, then the initial bytes (mem % PAGE_SIZE)) in the first page of the kernel segment will be zeros? (Hopefully I have read kimage_load_normal_segment() correctly.) Both mem and memsz need to be page aligned. And the error if not is EADDRNOTAVAIL, right? And one further question. Other than the fact that they are used with different system calls, what is the difference between KEXEC_ON_CRASH and KEXEC_FILE_ON_CRASH? Right now I can't think of any other difference. They both tell respective system call that this kernel needs to be loaded in reserved memory region for crash kernel. Okay. I've made various adjustments to the page in the light of your comments above. Thanks! Cheers, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: [Dropping Andi into CC, which I should have done to start with, since he wrote the original page, and might also have some comments] Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Hi Michael, I had a quick look and it looks good to me. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Hi Michael, I had a quick look and it looks good to me. Thanks Vivek When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Hi Michael, I had a quick look and it looks good to me. Thanks Vivek When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
[Dropping Andi into CC, which I should have done to start with, since he wrote the original page, and might also have some comments] Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Thanks Michael .\ Copyright (C) 2010 Intel Corporation, Author: Andi Kleen .\ and Copyright 2014, Vivek Goyal vgo...@redhat.com .\ and Copyright (c) 2015, Michael Kerrisk mtk.manpa...@gmail.com .\ .\ %%%LICENSE_START(VERBATIM) .\ Permission is granted to make and distribute verbatim copies of this .\ manual provided the copyright notice and this permission notice are .\ preserved on all copies. .\ .\ Permission is granted to copy and distribute modified versions of this .\ manual under the conditions for verbatim copying, provided that the .\ entire resulting derived work is distributed under the terms of a .\ permission notice identical to this one. .\ .\ Since the Linux kernel and libraries are constantly changing, this .\ manual page may be incorrect or out-of-date. The author(s) assume no .\ responsibility for errors or omissions, or for damages resulting from .\ the use of the information contained herein. The author(s) may not .\ have taken the same level of care in the production of this manual, .\ which is licensed free of charge, as they might when working .\ professionally. .\ .\ Formatted or processed versions of this manual, if unaccompanied by .\ the source, must acknowledge the copyright and authors of this work. .\ %%%LICENSE_END .\ .TH KEXEC_LOAD 2 2014-08-19 Linux Linux Programmer's Manual .SH NAME kexec_load, kexec_file_load \- load a new kernel for later execution .SH SYNOPSIS .nf .B #include linux/kexec.h .BI long kexec_load(unsigned long entry , unsigned long nr_segments , .BI struct kexec_segment * segments \ , unsigned long flags ); .BI long kexec_file_load(int kernel_fd , int initrd_fd , .br .BI unsigned long cmdline_len \ , const char * cmdline , .BI unsigned long flags ); .fi .IR Note : There are no glibc wrappers for these system calls; see NOTES. .SH DESCRIPTION The .BR kexec_load () system call loads a new kernel that can be executed later by .BR reboot (2). .PP The .I flags argument is a bit mask that controls the operation of the call. The following values can be specified in .IR flags : .TP .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. This crash kernel is loaded into an area of reserved memory that is determined at boot time using the .I craskkernel kernel command-line parameter. The location of this reserved memory is exported to user space via the .I /proc/iomem file, in an entry labeled Crash kernel. A user-space application can parse this file and prepare a list of segments (see below) that specify this reserved memory as destination. If this flag is specified, the kernel checks that the target segments specified in .I segments fall within the reserved region. .TP .BR KEXEC_PRESERVE_CONTEXT (since Linux 2.6.27) Preserve the system hardware and software states before executing the new kernel. This could be used for system suspend. This flag is available only if the kernel was configured with .BR CONFIG_KEXEC_JUMP , and is effective only if .I nr_segments is greater than 0. .PP The high-order bits (corresponding to the mask 0x) of .I flags contain the architecture of the to-be-executed kernel. Specify (OR) the constant .B KEXEC_ARCH_DEFAULT to use the current architecture, or one of the following architecture constants .BR KEXEC_ARCH_386 , .BR KEXEC_ARCH_68K , .BR KEXEC_ARCH_X86_64 , .BR KEXEC_ARCH_PPC , .BR KEXEC_ARCH_PPC64 , .BR KEXEC_ARCH_IA_64 , .BR KEXEC_ARCH_ARM , .BR KEXEC_ARCH_S390 , .BR KEXEC_ARCH_SH , .BR KEXEC_ARCH_MIPS , and .BR KEXEC_ARCH_MIPS_LE . The architecture must be executable on the CPU of the system. The .I entry argument is the physical entry address in the kernel image. The .I nr_segments argument is the number of segments pointed to by the .I segments pointer; the kernel imposes an (arbitrary) limit of 16 on the number of segments. The .I segments argument is an array of .I kexec_segment structures which define the kernel layout: .in +4n .nf struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP The kernel image defined by .I segments is copied from the calling process into the kernel either in regular memory or in reserved memory (if .BR KEXEC_ON_CRASH is
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek Thanks, I agree, and think it would make sense to list them as part of the page's ENOEXEC error. -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Hi Michael, I had a quick look and it looks good to me. Thanks Vivek When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Thanks, Scot -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 04:49:34PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek, I've made various adjustments to the page in the light of your comments above. Thanks! Thank you for following it up and improving kexec man page. You're welcome. So, by now, I've made quite a lot of changes (including adding a number of cases under ERRORS). I think the revised kexec_load/kexec_file_load page is pretty much ready to go, but would you be willing to give the text below a check over first? Hi Michael, I had a quick look and it looks good to me. Thanks Vivek When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On 29 January 2015 at 02:27, Scot Doyle lkm...@scotdoyle.com wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 10:10:59PM +, Scot Doyle wrote: On Wed, 28 Jan 2015, Vivek Goyal wrote: On Wed, Jan 28, 2015 at 09:14:03PM +, Scot Doyle wrote: When I tested, kexec_file_load required CONFIG_RELOCATABLE. Is the same true for kexec_load? Would it make sense to note this in the man pages along with the need for CONFIG_KEXEC_FILE, etc? Or as an error message? Hmm.., I can't see an explicity dependency between RELOCATABLE and KEXEC. Both KEXEC and KEXEC_FILE should be able to load a kernel even if it had RELOCATABLE=n. Just that kernel will run from the address it has been built for. Thanks Vivek Confusing, right? kexec_file_load returns -ENOEXEC and dmesg says kexec-bzImage64: XLF_CAN_BE_LOADED_ABOVE_4G is not set. which leads to arch/x86/boot/header.S line 396: #if defined(CONFIG_RELOCATABLE) defined(CONFIG_X86_64) /* kernel/boot_param/ramdisk could be loaded above 4g */ # define XLF1 XLF_CAN_BE_LOADED_ABOVE_4G #else # define XLF1 0 #endif Ah, this one. Actually generic kexec file loading implementation does not impose this restriction. It is the image specific loader part which decides what kind of bzImage it can load. Current implementation (kexec-bzimage64.c), is only supporting loading bzImages which are 64bit and can be loaded above 4G. This simplifies the implementation of loader. But there is nothing which prevents one from implementing other image loaders. So instead of saying that kexec_file_load() depends on CONFIG_RELOCATABLE, it might be better to say in man page that currently this system call supports only loading a bzImage which is 64bit and which can be loaded above 4G too. Thanks Vivek Thanks, I agree, and think it would make sense to list them as part of the page's ENOEXEC error. Scott, could you then phras a couple of sentences that capture thge details, so I can add it to the ENOEXEC error? Thanks, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Fri, Jan 16, 2015 at 02:30:25PM +0100, Michael Kerrisk (man-pages) wrote: [..] > Hi Michael, Please find my responses below. Sorry, I got stuck in other work and forgot about this thread. > So, returning to the kexeec_segment structure: > >struct kexec_segment { >void *buf;/* Buffer in user space */ >size_t bufsz; /* Buffer length in user space */ >void *mem;/* Physical address of kernel */ >size_t memsz; /* Physical address length */ >}; > > Are the following statements correct: > * buf + bufsz identify a memory region in the caller's virtual > address space that is the source of the copy Yes. > * mem + memsz specify the target memory region of the copy Yes. > * mem is physical memory address, as seen from kernel space Yes. > * the number of bytes copied from userspace is min(bufsz, memsz) Yes. bufsz can not be more than memsz. There is a check to validate this in kernel. result = -EINVAL; for (i = 0; i < nr_segments; i++) { if (image->segment[i].bufsz > image->segment[i].memsz) return result; } > * if bufsz > memsz, then excess bytes in the user-space buffer > are ignored. You will get -EINVAL. > * if memsz > bufsz, then excess bytes in the target kernel buffer > are filled with zeros. Yes. > Also, it seems to me that 'mem' need not be page aligned. > Is that correct? Should the man page say something about that? > (E.g., is it generally desirable that 'mem' should be page aligned?) mem and memsz need to be page aligned. There is a check for that too. mstart = image->segment[i].mem; mend = mstart + image->segment[i].memsz; if ((mstart & ~PAGE_MASK) || (mend & ~PAGE_MASK)) return result; > > Likewise, 'memsz' doesn't need to beta page multiple, IIUC. > Should the man page say anything about this? For example, should > it note that the initialized kernel segment will be of size: > > (mem % PAGE_SIZE + memsz) rounded up to the next multiple of PAGE_SIZE > > And should it note that if 'mem' is not a multiple of the page size, then > the initial bytes (mem % PAGE_SIZE)) in the first page of the kernel segment > will be zeros? > > (Hopefully I have read kimage_load_normal_segment() correctly.) Both mem and memsz need to be page aligned. > > And one further question. Other than the fact that they are used with > different system calls, what is the difference between KEXEC_ON_CRASH > and KEXEC_FILE_ON_CRASH? Right now I can't think of any other difference. They both tell respective system call that this kernel needs to be loaded in reserved memory region for crash kernel. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hello Vivek, Ping! Cheers, Michael On 16 January 2015 at 14:30, Michael Kerrisk (man-pages) wrote: > Hello Vivek, > > Thanks for your comments! I've added some further text to > the page based on those comments. See some follow-up > questions below. > > On 01/12/2015 11:16 PM, Vivek Goyal wrote: >> On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: >> >> [..] > .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" > Execute the new kernel automatically on a system crash. > .\" FIXME Explain in more detail how KEXEC_ON_CRASH is actually used >>> >>> I wasn't expecting that you would respond to the FIXMEs that were >>> not labeled "kexec_file_load", but I was hoping you might ;-). Thanks! >>> I have a few additional questions to your nice notes. >>> Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel=<> command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). >>> >> >> Hi Michael, >> >>> Can I just confirm: is it in all cases only possible to use kexec_load() >>> and kexec_file_load() if the kernel was booted with the 'crashkernel' >>> parameter set? >> >> As of now, only kexec_load() and kexec_file_load() system calls can >> make use of memory reserved by crashkernel=<> kernel parameter. And >> this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH >> flag specified). > > Okay. > Location of this reserved memory is exported to user space through /proc/iomem file. >>> >>> Is that export via an entry labeled "Crash kernel" in the >>> /proc/iomem file? >> >> Yes. > > Okay -- thanks. > User space can parse it and prepare list of segments specifying this reserved memory as destination. >>> >>> I'm not quite clear on "specifying this reserved memory as destination". >>> Is that done by specifying the address in the kexec_segment.mem fields? >> >> You are absolutely right. User space can specify in kexec_segment.mem >> field the memory location where it expecting a particular segment to >> be loaded by kernel. >> >>> Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. >>> >>> Could you point me to where this checking is done? Also, what is the >>> error (errno) that occurs when the load operation fails? (I think the >>> answers to these questions are "at the start of kimage_alloc_init()" >>> and "EADDRNOTAVAIL", but I'd like to confirm.) >> >> This checking happens in sanity_check_segment_list() which is called >> by kimage_alloc_init(). >> >> And yes, error code returned is -EADDRNOTAVAIL. > > Thanks. I added EADDRNOTAVAIL to the ERRORS. > [..] > struct kexec_segment { > void *buf;/* Buffer in user space */ > size_t bufsz; /* Buffer length in user space */ > void *mem;/* Physical address of kernel */ > size_t memsz; /* Physical address length */ > }; > .fi > .in > .PP > .\" FIXME Explain the details of how the kernel image defined by segments > .\" is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory >>> >>> Could you clarify what you mean by "regular memory"? >> >> I meant memory which is not reserved memory. > > Okay. > or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. >>> >>> By "moved to final destination", do you mean "moved from user space to the >>> final kernel-space destination"? >> >> No. Segment data moves from user space to kernel space once kexec_load() >> call finishes successfully. But when user does reboot (kexec -e), at that >> time kernel moves that segment data to its final location. Kernel could >> not place the segment at its final location during kexec_load() time as >> that memory is already in use by running kernel. But once we are about >> to reboot to new kernel, we can overwrite the old kernel's memory. > > Got it. > In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps >>> >>> By "directly", I assume you mean "at the time of the kexec_laod() call", >>> right? >> >> Yes. > > Thanks. > > So, returning to the kexeec_segment structure: > >struct kexec_segment { >void *buf;/* Buffer in user space */ >size_t bufsz; /* Buffer length in user space */ >void
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hello Vivek, Ping! Cheers, Michael On 16 January 2015 at 14:30, Michael Kerrisk (man-pages) mtk.manpa...@gmail.com wrote: Hello Vivek, Thanks for your comments! I've added some further text to the page based on those comments. See some follow-up questions below. On 01/12/2015 11:16 PM, Vivek Goyal wrote: On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: [..] .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. .\ FIXME Explain in more detail how KEXEC_ON_CRASH is actually used I wasn't expecting that you would respond to the FIXMEs that were not labeled kexec_file_load, but I was hoping you might ;-). Thanks! I have a few additional questions to your nice notes. Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel= command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Hi Michael, Can I just confirm: is it in all cases only possible to use kexec_load() and kexec_file_load() if the kernel was booted with the 'crashkernel' parameter set? As of now, only kexec_load() and kexec_file_load() system calls can make use of memory reserved by crashkernel= kernel parameter. And this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH flag specified). Okay. Location of this reserved memory is exported to user space through /proc/iomem file. Is that export via an entry labeled Crash kernel in the /proc/iomem file? Yes. Okay -- thanks. User space can parse it and prepare list of segments specifying this reserved memory as destination. I'm not quite clear on specifying this reserved memory as destination. Is that done by specifying the address in the kexec_segment.mem fields? You are absolutely right. User space can specify in kexec_segment.mem field the memory location where it expecting a particular segment to be loaded by kernel. Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. Could you point me to where this checking is done? Also, what is the error (errno) that occurs when the load operation fails? (I think the answers to these questions are at the start of kimage_alloc_init() and EADDRNOTAVAIL, but I'd like to confirm.) This checking happens in sanity_check_segment_list() which is called by kimage_alloc_init(). And yes, error code returned is -EADDRNOTAVAIL. Thanks. I added EADDRNOTAVAIL to the ERRORS. [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\ FIXME Explain the details of how the kernel image defined by segments .\ is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory Could you clarify what you mean by regular memory? I meant memory which is not reserved memory. Okay. or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. By moved to final destination, do you mean moved from user space to the final kernel-space destination? No. Segment data moves from user space to kernel space once kexec_load() call finishes successfully. But when user does reboot (kexec -e), at that time kernel moves that segment data to its final location. Kernel could not place the segment at its final location during kexec_load() time as that memory is already in use by running kernel. But once we are about to reboot to new kernel, we can overwrite the old kernel's memory. Got it. In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps By directly, I assume you mean at the time of the kexec_laod() call, right? Yes. Thanks. So, returning to the kexeec_segment structure: struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; Are the following statements correct: * buf + bufsz identify a memory region in the caller's virtual address space that is the source of the copy * mem + memsz specify the target memory region of the copy * mem is
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Fri, Jan 16, 2015 at 02:30:25PM +0100, Michael Kerrisk (man-pages) wrote: [..] Hi Michael, Please find my responses below. Sorry, I got stuck in other work and forgot about this thread. So, returning to the kexeec_segment structure: struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; Are the following statements correct: * buf + bufsz identify a memory region in the caller's virtual address space that is the source of the copy Yes. * mem + memsz specify the target memory region of the copy Yes. * mem is physical memory address, as seen from kernel space Yes. * the number of bytes copied from userspace is min(bufsz, memsz) Yes. bufsz can not be more than memsz. There is a check to validate this in kernel. result = -EINVAL; for (i = 0; i nr_segments; i++) { if (image-segment[i].bufsz image-segment[i].memsz) return result; } * if bufsz memsz, then excess bytes in the user-space buffer are ignored. You will get -EINVAL. * if memsz bufsz, then excess bytes in the target kernel buffer are filled with zeros. Yes. Also, it seems to me that 'mem' need not be page aligned. Is that correct? Should the man page say something about that? (E.g., is it generally desirable that 'mem' should be page aligned?) mem and memsz need to be page aligned. There is a check for that too. mstart = image-segment[i].mem; mend = mstart + image-segment[i].memsz; if ((mstart ~PAGE_MASK) || (mend ~PAGE_MASK)) return result; Likewise, 'memsz' doesn't need to beta page multiple, IIUC. Should the man page say anything about this? For example, should it note that the initialized kernel segment will be of size: (mem % PAGE_SIZE + memsz) rounded up to the next multiple of PAGE_SIZE And should it note that if 'mem' is not a multiple of the page size, then the initial bytes (mem % PAGE_SIZE)) in the first page of the kernel segment will be zeros? (Hopefully I have read kimage_load_normal_segment() correctly.) Both mem and memsz need to be page aligned. And one further question. Other than the fact that they are used with different system calls, what is the difference between KEXEC_ON_CRASH and KEXEC_FILE_ON_CRASH? Right now I can't think of any other difference. They both tell respective system call that this kernel needs to be loaded in reserved memory region for crash kernel. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hello Vivek, Thanks for your comments! I've added some further text to the page based on those comments. See some follow-up questions below. On 01/12/2015 11:16 PM, Vivek Goyal wrote: > On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: > > [..] .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" Execute the new kernel automatically on a system crash. .\" FIXME Explain in more detail how KEXEC_ON_CRASH is actually used >> >> I wasn't expecting that you would respond to the FIXMEs that were >> not labeled "kexec_file_load", but I was hoping you might ;-). Thanks! >> I have a few additional questions to your nice notes. >> >>> Upon boot first kernel reserves a chunk of contiguous memory (if >>> crashkernel=<> command line paramter is passed). This memory is >>> is used to load the crash kernel (Kernel which will be booted into >>> if first kernel crashes). >> > > Hi Michael, > >> Can I just confirm: is it in all cases only possible to use kexec_load() >> and kexec_file_load() if the kernel was booted with the 'crashkernel' >> parameter set? > > As of now, only kexec_load() and kexec_file_load() system calls can > make use of memory reserved by crashkernel=<> kernel parameter. And > this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH > flag specified). Okay. >>> Location of this reserved memory is exported to user space through >>> /proc/iomem file. >> >> Is that export via an entry labeled "Crash kernel" in the >> /proc/iomem file? > > Yes. Okay -- thanks. >>> User space can parse it and prepare list of segments >>> specifying this reserved memory as destination. >> >> I'm not quite clear on "specifying this reserved memory as destination". >> Is that done by specifying the address in the kexec_segment.mem fields? > > You are absolutely right. User space can specify in kexec_segment.mem > field the memory location where it expecting a particular segment to > be loaded by kernel. > >> >>> Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the >>> segments are destined for reserved memory otherwise kernel load operation >>> fails. >> >> Could you point me to where this checking is done? Also, what is the >> error (errno) that occurs when the load operation fails? (I think the >> answers to these questions are "at the start of kimage_alloc_init()" >> and "EADDRNOTAVAIL", but I'd like to confirm.) > > This checking happens in sanity_check_segment_list() which is called > by kimage_alloc_init(). > > And yes, error code returned is -EADDRNOTAVAIL. Thanks. I added EADDRNOTAVAIL to the ERRORS. >>> [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\" FIXME Explain the details of how the kernel image defined by segments .\" is copied from the calling process into previously reserved memory. >>> >>> Kernel image defined by segments is copied into kernel either in regular >>> memory >> >> Could you clarify what you mean by "regular memory"? > > I meant memory which is not reserved memory. Okay. >>> or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first >>> copies list of segments in kernel memory and then goes does various >>> sanity checks on the segments. If everything looks line, kernel copies >>> segment data to kernel memory. >>> >>> In case of normal kexec, segment data is loaded in any available memory >>> and segment data is moved to final destination at the kexec reboot time. >> >> By "moved to final destination", do you mean "moved from user space to the >> final kernel-space destination"? > > No. Segment data moves from user space to kernel space once kexec_load() > call finishes successfully. But when user does reboot (kexec -e), at that > time kernel moves that segment data to its final location. Kernel could > not place the segment at its final location during kexec_load() time as > that memory is already in use by running kernel. But once we are about > to reboot to new kernel, we can overwrite the old kernel's memory. Got it. >>> In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is >>> directly loaded to reserved memory and after crash kexec simply jumps >> >> By "directly", I assume you mean "at the time of the kexec_laod() call", >> right? > > Yes. Thanks. So, returning to the kexeec_segment structure: struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; Are the following statements correct: * buf + bufsz identify a memory region in the caller's virtual
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hello Vivek, Thanks for your comments! I've added some further text to the page based on those comments. See some follow-up questions below. On 01/12/2015 11:16 PM, Vivek Goyal wrote: On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: [..] .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. .\ FIXME Explain in more detail how KEXEC_ON_CRASH is actually used I wasn't expecting that you would respond to the FIXMEs that were not labeled kexec_file_load, but I was hoping you might ;-). Thanks! I have a few additional questions to your nice notes. Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel= command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Hi Michael, Can I just confirm: is it in all cases only possible to use kexec_load() and kexec_file_load() if the kernel was booted with the 'crashkernel' parameter set? As of now, only kexec_load() and kexec_file_load() system calls can make use of memory reserved by crashkernel= kernel parameter. And this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH flag specified). Okay. Location of this reserved memory is exported to user space through /proc/iomem file. Is that export via an entry labeled Crash kernel in the /proc/iomem file? Yes. Okay -- thanks. User space can parse it and prepare list of segments specifying this reserved memory as destination. I'm not quite clear on specifying this reserved memory as destination. Is that done by specifying the address in the kexec_segment.mem fields? You are absolutely right. User space can specify in kexec_segment.mem field the memory location where it expecting a particular segment to be loaded by kernel. Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. Could you point me to where this checking is done? Also, what is the error (errno) that occurs when the load operation fails? (I think the answers to these questions are at the start of kimage_alloc_init() and EADDRNOTAVAIL, but I'd like to confirm.) This checking happens in sanity_check_segment_list() which is called by kimage_alloc_init(). And yes, error code returned is -EADDRNOTAVAIL. Thanks. I added EADDRNOTAVAIL to the ERRORS. [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\ FIXME Explain the details of how the kernel image defined by segments .\ is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory Could you clarify what you mean by regular memory? I meant memory which is not reserved memory. Okay. or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. By moved to final destination, do you mean moved from user space to the final kernel-space destination? No. Segment data moves from user space to kernel space once kexec_load() call finishes successfully. But when user does reboot (kexec -e), at that time kernel moves that segment data to its final location. Kernel could not place the segment at its final location during kexec_load() time as that memory is already in use by running kernel. But once we are about to reboot to new kernel, we can overwrite the old kernel's memory. Got it. In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps By directly, I assume you mean at the time of the kexec_laod() call, right? Yes. Thanks. So, returning to the kexeec_segment structure: struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; Are the following statements correct: * buf + bufsz identify a memory region in the caller's virtual address space that is the source of the copy * mem + memsz specify the target memory region of the copy * mem is physical memory address, as seen from kernel space * the number of bytes copied from userspace is min(bufsz, memsz) * if bufsz memsz,
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: [..] > >> .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" > >> Execute the new kernel automatically on a system crash. > >> .\" FIXME Explain in more detail how KEXEC_ON_CRASH is actually used > > I wasn't expecting that you would respond to the FIXMEs that were > not labeled "kexec_file_load", but I was hoping you might ;-). Thanks! > I have a few additional questions to your nice notes. > > > Upon boot first kernel reserves a chunk of contiguous memory (if > > crashkernel=<> command line paramter is passed). This memory is > > is used to load the crash kernel (Kernel which will be booted into > > if first kernel crashes). > Hi Michael, > Can I just confirm: is it in all cases only possible to use kexec_load() > and kexec_file_load() if the kernel was booted with the 'crashkernel' > parameter set? As of now, only kexec_load() and kexec_file_load() system calls can make use of memory reserved by crashkernel=<> kernel parameter. And this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH flag specified). > > > Location of this reserved memory is exported to user space through > > /proc/iomem file. > > Is that export via an entry labeled "Crash kernel" in the > /proc/iomem file? Yes. > > > User space can parse it and prepare list of segments > > specifying this reserved memory as destination. > > I'm not quite clear on "specifying this reserved memory as destination". > Is that done by specifying the address in the kexec_segment.mem fields? You are absolutely right. User space can specify in kexec_segment.mem field the memory location where it expecting a particular segment to be loaded by kernel. > > > Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the > > segments are destined for reserved memory otherwise kernel load operation > > fails. > > Could you point me to where this checking is done? Also, what is the > error (errno) that occurs when the load operation fails? (I think the > answers to these questions are "at the start of kimage_alloc_init()" > and "EADDRNOTAVAIL", but I'd like to confirm.) This checking happens in sanity_check_segment_list() which is called by kimage_alloc_init(). And yes, error code returned is -EADDRNOTAVAIL. > > > [..] > >> struct kexec_segment { > >> void *buf;/* Buffer in user space */ > >> size_t bufsz; /* Buffer length in user space */ > >> void *mem;/* Physical address of kernel */ > >> size_t memsz; /* Physical address length */ > >> }; > >> .fi > >> .in > >> .PP > >> .\" FIXME Explain the details of how the kernel image defined by segments > >> .\" is copied from the calling process into previously reserved memory. > > > > Kernel image defined by segments is copied into kernel either in regular > > memory > > Could you clarify what you mean by "regular memory"? I meant memory which is not reserved memory. > > > or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first > > copies list of segments in kernel memory and then goes does various > > sanity checks on the segments. If everything looks line, kernel copies > > segment data to kernel memory. > > > > In case of normal kexec, segment data is loaded in any available memory > > and segment data is moved to final destination at the kexec reboot time. > > By "moved to final destination", do you mean "moved from user space to the > final kernel-space destination"? No. Segment data moves from user space to kernel space once kexec_load() call finishes successfully. But when user does reboot (kexec -e), at that time kernel moves that segment data to its final location. Kernel could not place the segment at its final location during kexec_load() time as that memory is already in use by running kernel. But once we are about to reboot to new kernel, we can overwrite the old kernel's memory. > > > In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is > > directly loaded to reserved memory and after crash kexec simply jumps > > By "directly", I assume you mean "at the time of the kexec_laod() call", > right? Yes. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Wed, Jan 07, 2015 at 10:17:56PM +0100, Michael Kerrisk (man-pages) wrote: [..] .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. .\ FIXME Explain in more detail how KEXEC_ON_CRASH is actually used I wasn't expecting that you would respond to the FIXMEs that were not labeled kexec_file_load, but I was hoping you might ;-). Thanks! I have a few additional questions to your nice notes. Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel= command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Hi Michael, Can I just confirm: is it in all cases only possible to use kexec_load() and kexec_file_load() if the kernel was booted with the 'crashkernel' parameter set? As of now, only kexec_load() and kexec_file_load() system calls can make use of memory reserved by crashkernel= kernel parameter. And this is used only if we are trying to load a crash kernel (KEXEC_ON_CRASH flag specified). Location of this reserved memory is exported to user space through /proc/iomem file. Is that export via an entry labeled Crash kernel in the /proc/iomem file? Yes. User space can parse it and prepare list of segments specifying this reserved memory as destination. I'm not quite clear on specifying this reserved memory as destination. Is that done by specifying the address in the kexec_segment.mem fields? You are absolutely right. User space can specify in kexec_segment.mem field the memory location where it expecting a particular segment to be loaded by kernel. Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. Could you point me to where this checking is done? Also, what is the error (errno) that occurs when the load operation fails? (I think the answers to these questions are at the start of kimage_alloc_init() and EADDRNOTAVAIL, but I'd like to confirm.) This checking happens in sanity_check_segment_list() which is called by kimage_alloc_init(). And yes, error code returned is -EADDRNOTAVAIL. [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\ FIXME Explain the details of how the kernel image defined by segments .\ is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory Could you clarify what you mean by regular memory? I meant memory which is not reserved memory. or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. By moved to final destination, do you mean moved from user space to the final kernel-space destination? No. Segment data moves from user space to kernel space once kexec_load() call finishes successfully. But when user does reboot (kexec -e), at that time kernel moves that segment data to its final location. Kernel could not place the segment at its final location during kexec_load() time as that memory is already in use by running kernel. But once we are about to reboot to new kernel, we can overwrite the old kernel's memory. In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps By directly, I assume you mean at the time of the kexec_laod() call, right? Yes. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hi Vivek, Thanks for your comments, and my apologies for a the long delayed follow-up. On 11/11/2014 10:30 PM, Vivek Goyal wrote: > On Sun, Nov 09, 2014 at 08:17:49PM +0100, Michael Kerrisk (man-pages) wrote: >> Hello Vivek (and all), >> >> Thanks for the kexec_file_load() patch [for the kexec_load(2) man page] >> that you quite some time ago sent. I have merged it and done some >> substantial editing as well. Could you please take a look at the >> draft below, and check that the kexec_file_load() material is okay. >> Please could you especially pay attention to the pieces marked >> "FIXME(kexec_file_load)", since those are pieces about which i >> had questions or doubts. >> > > Hi Michael, > > Thanks for editing this man page. I have some thoughts inline. > > [..] >> .B #include >> >> .BI "long kexec_load(unsigned long " entry ", unsigned long " nr_segments "," >> .BI "struct kexec_segment *" segments \ >> ", unsigned long " flags ");" >> >> .\" FIXME(kexec_file_load): >> .\" Why are the return types of kexec_load() and kexec_file_load() >> .\" different? >> .BI "int kexec_file_load(int " kernel_fd ", int " initrd_fd "," > > I think this is ignorance on my part. It probably should be "long" as > SYSCALL_DEFINE() seems to expand to. > > asmlinkage long SyS##name(__MAP(x,__SC_LONG,__VA_ARGS__)); Okay -- I've changed to 'long' in the man page. >> .br >> .BI "unsigned long " cmdline_len \ >> ", const char *" cmdline "," >> .BI "unsigned long " flags ");" >> >> .fi >> .IR Note : >> There are no glibc wrappers for these system calls; see NOTES. >> .SH DESCRIPTION >> The >> .BR kexec_load () >> system call loads a new kernel that can be executed later by >> .BR reboot (2). >> .PP >> The >> .I flags >> argument is a bit mask that controls the operation of the call. >> The following values can be specified in >> .IR flags : >> .TP >> .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" >> Execute the new kernel automatically on a system crash. >> .\" FIXME Explain in more detail how KEXEC_ON_CRASH is actually used I wasn't expecting that you would respond to the FIXMEs that were not labeled "kexec_file_load", but I was hoping you might ;-). Thanks! I have a few additional questions to your nice notes. > Upon boot first kernel reserves a chunk of contiguous memory (if > crashkernel=<> command line paramter is passed). This memory is > is used to load the crash kernel (Kernel which will be booted into > if first kernel crashes). Can I just confirm: is it in all cases only possible to use kexec_load() and kexec_file_load() if the kernel was booted with the 'crashkernel' parameter set? > Location of this reserved memory is exported to user space through > /proc/iomem file. Is that export via an entry labeled "Crash kernel" in the /proc/iomem file? > User space can parse it and prepare list of segments > specifying this reserved memory as destination. I'm not quite clear on "specifying this reserved memory as destination". Is that done by specifying the address in the kexec_segment.mem fields? > Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the > segments are destined for reserved memory otherwise kernel load operation > fails. Could you point me to where this checking is done? Also, what is the error (errno) that occurs when the load operation fails? (I think the answers to these questions are "at the start of kimage_alloc_init()" and "EADDRNOTAVAIL", but I'd like to confirm.) > [..] >> struct kexec_segment { >> void *buf;/* Buffer in user space */ >> size_t bufsz; /* Buffer length in user space */ >> void *mem;/* Physical address of kernel */ >> size_t memsz; /* Physical address length */ >> }; >> .fi >> .in >> .PP >> .\" FIXME Explain the details of how the kernel image defined by segments >> .\" is copied from the calling process into previously reserved memory. > > Kernel image defined by segments is copied into kernel either in regular > memory Could you clarify what you mean by "regular memory"? > or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first > copies list of segments in kernel memory and then goes does various > sanity checks on the segments. If everything looks line, kernel copies > segment data to kernel memory. > > In case of normal kexec, segment data is loaded in any available memory > and segment data is moved to final destination at the kexec reboot time. By "moved to final destination", do you mean "moved from user space to the final kernel-space destination"? > In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is > directly loaded to reserved memory and after crash kexec simply jumps By "directly", I assume you mean "at the time of the kexec_laod() call", right? > to starting point. > > [..] >> .\" FIXME(kexec_file_load): >> .\" Is the following rationale accurate? Does it need expanding? >> The >> .BR
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
Hi Vivek, Thanks for your comments, and my apologies for a the long delayed follow-up. On 11/11/2014 10:30 PM, Vivek Goyal wrote: On Sun, Nov 09, 2014 at 08:17:49PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek (and all), Thanks for the kexec_file_load() patch [for the kexec_load(2) man page] that you quite some time ago sent. I have merged it and done some substantial editing as well. Could you please take a look at the draft below, and check that the kexec_file_load() material is okay. Please could you especially pay attention to the pieces marked FIXME(kexec_file_load), since those are pieces about which i had questions or doubts. Hi Michael, Thanks for editing this man page. I have some thoughts inline. [..] .B #include linux/kexec.h .BI long kexec_load(unsigned long entry , unsigned long nr_segments , .BI struct kexec_segment * segments \ , unsigned long flags ); .\ FIXME(kexec_file_load): .\ Why are the return types of kexec_load() and kexec_file_load() .\ different? .BI int kexec_file_load(int kernel_fd , int initrd_fd , I think this is ignorance on my part. It probably should be long as SYSCALL_DEFINE() seems to expand to. asmlinkage long SyS##name(__MAP(x,__SC_LONG,__VA_ARGS__)); Okay -- I've changed to 'long' in the man page. .br .BI unsigned long cmdline_len \ , const char * cmdline , .BI unsigned long flags ); .fi .IR Note : There are no glibc wrappers for these system calls; see NOTES. .SH DESCRIPTION The .BR kexec_load () system call loads a new kernel that can be executed later by .BR reboot (2). .PP The .I flags argument is a bit mask that controls the operation of the call. The following values can be specified in .IR flags : .TP .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. .\ FIXME Explain in more detail how KEXEC_ON_CRASH is actually used I wasn't expecting that you would respond to the FIXMEs that were not labeled kexec_file_load, but I was hoping you might ;-). Thanks! I have a few additional questions to your nice notes. Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel= command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Can I just confirm: is it in all cases only possible to use kexec_load() and kexec_file_load() if the kernel was booted with the 'crashkernel' parameter set? Location of this reserved memory is exported to user space through /proc/iomem file. Is that export via an entry labeled Crash kernel in the /proc/iomem file? User space can parse it and prepare list of segments specifying this reserved memory as destination. I'm not quite clear on specifying this reserved memory as destination. Is that done by specifying the address in the kexec_segment.mem fields? Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. Could you point me to where this checking is done? Also, what is the error (errno) that occurs when the load operation fails? (I think the answers to these questions are at the start of kimage_alloc_init() and EADDRNOTAVAIL, but I'd like to confirm.) [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\ FIXME Explain the details of how the kernel image defined by segments .\ is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory Could you clarify what you mean by regular memory? or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. By moved to final destination, do you mean moved from user space to the final kernel-space destination? In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps By directly, I assume you mean at the time of the kexec_laod() call, right? to starting point. [..] .\ FIXME(kexec_file_load): .\ Is the following rationale accurate? Does it need expanding? The .BR kexec_file_load () .\ See also http://lwn.net/Articles/603116/ system call was added to provide support for systems where kexec loading should be restricted to only kernels that are signed. Yes, this rationale
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Sun, Nov 09, 2014 at 08:17:49PM +0100, Michael Kerrisk (man-pages) wrote: > Hello Vivek (and all), > > Thanks for the kexec_file_load() patch [for the kexec_load(2) man page] > that you quite some time ago sent. I have merged it and done some > substantial editing as well. Could you please take a look at the > draft below, and check that the kexec_file_load() material is okay. > Please could you especially pay attention to the pieces marked > "FIXME(kexec_file_load)", since those are pieces about which i > had questions or doubts. > Hi Michael, Thanks for editing this man page. I have some thoughts inline. [..] > .B #include > > .BI "long kexec_load(unsigned long " entry ", unsigned long " nr_segments "," > .BI "struct kexec_segment *" segments \ > ", unsigned long " flags ");" > > .\" FIXME(kexec_file_load): > .\" Why are the return types of kexec_load() and kexec_file_load() > .\" different? > .BI "int kexec_file_load(int " kernel_fd ", int " initrd_fd "," I think this is ignorance on my part. It probably should be "long" as SYSCALL_DEFINE() seems to expand to. asmlinkage long SyS##name(__MAP(x,__SC_LONG,__VA_ARGS__)); > .br > .BI "unsigned long " cmdline_len \ > ", const char *" cmdline "," > .BI "unsigned long " flags ");" > > .fi > .IR Note : > There are no glibc wrappers for these system calls; see NOTES. > .SH DESCRIPTION > The > .BR kexec_load () > system call loads a new kernel that can be executed later by > .BR reboot (2). > .PP > The > .I flags > argument is a bit mask that controls the operation of the call. > The following values can be specified in > .IR flags : > .TP > .BR KEXEC_ON_CRASH " (since Linux 2.6.13)" > Execute the new kernel automatically on a system crash. > .\" FIXME Explain in more detail how KEXEC_ON_CRASH is actually used Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel=<> command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Location of this reserved memory is exported to user space through /proc/iomem file. User space can parse it and prepare list of segments specifying this reserved memory as destination. Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. [..] > struct kexec_segment { > void *buf;/* Buffer in user space */ > size_t bufsz; /* Buffer length in user space */ > void *mem;/* Physical address of kernel */ > size_t memsz; /* Physical address length */ > }; > .fi > .in > .PP > .\" FIXME Explain the details of how the kernel image defined by segments > .\" is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps to starting point. [..] > .\" FIXME(kexec_file_load): > .\" Is the following rationale accurate? Does it need expanding? > The > .BR kexec_file_load () > .\" See also http://lwn.net/Articles/603116/ > system call was added to provide support for systems > where "kexec" loading should be restricted to > only kernels that are signed. Yes, this rationale looks good. > > The > .BR kexec_load () > system call is available only if the kernel was configured with > .BR CONFIG_KEXEC . > The > .BR kexec_file_load () > system call is available only if the kernel was configured with > .BR CONFIG_KEXEC_FILE . > .\" FIXME(kexec_file_load): > .\" Does kexec_file_load() need any other CONFIG_* options to be defined? Yes, it requires some other config options too. depends on KEXEC depends on X86_64 depends on CRYPTO=y depends on CRYPTO_SHA256=y CONFIG_KEXEC_VERIFY_SIG=y CONFIG_KEXEC_BZIMAGE_VERIFY_SIG=y CONFIG_SIGNED_PE_FILE_VERIFICATION=y CONFIG_PKCS7_MESSAGE_PARSER=y CONFIG_X509_CERTIFICATE_PARSER=y CONFIG_ASYMMETRIC_PUBLIC_KEY_SUBTYPE=y So dependency list seems pretty long. Not sure how many of these should we specify in man page. Thanks Vivek -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: Edited kexec_load(2) [kexec_file_load()] man page for review
On Sun, Nov 09, 2014 at 08:17:49PM +0100, Michael Kerrisk (man-pages) wrote: Hello Vivek (and all), Thanks for the kexec_file_load() patch [for the kexec_load(2) man page] that you quite some time ago sent. I have merged it and done some substantial editing as well. Could you please take a look at the draft below, and check that the kexec_file_load() material is okay. Please could you especially pay attention to the pieces marked FIXME(kexec_file_load), since those are pieces about which i had questions or doubts. Hi Michael, Thanks for editing this man page. I have some thoughts inline. [..] .B #include linux/kexec.h .BI long kexec_load(unsigned long entry , unsigned long nr_segments , .BI struct kexec_segment * segments \ , unsigned long flags ); .\ FIXME(kexec_file_load): .\ Why are the return types of kexec_load() and kexec_file_load() .\ different? .BI int kexec_file_load(int kernel_fd , int initrd_fd , I think this is ignorance on my part. It probably should be long as SYSCALL_DEFINE() seems to expand to. asmlinkage long SyS##name(__MAP(x,__SC_LONG,__VA_ARGS__)); .br .BI unsigned long cmdline_len \ , const char * cmdline , .BI unsigned long flags ); .fi .IR Note : There are no glibc wrappers for these system calls; see NOTES. .SH DESCRIPTION The .BR kexec_load () system call loads a new kernel that can be executed later by .BR reboot (2). .PP The .I flags argument is a bit mask that controls the operation of the call. The following values can be specified in .IR flags : .TP .BR KEXEC_ON_CRASH (since Linux 2.6.13) Execute the new kernel automatically on a system crash. .\ FIXME Explain in more detail how KEXEC_ON_CRASH is actually used Upon boot first kernel reserves a chunk of contiguous memory (if crashkernel= command line paramter is passed). This memory is is used to load the crash kernel (Kernel which will be booted into if first kernel crashes). Location of this reserved memory is exported to user space through /proc/iomem file. User space can parse it and prepare list of segments specifying this reserved memory as destination. Once kernel sees the flag KEXEC_ON_CRASH, it makes sure that all the segments are destined for reserved memory otherwise kernel load operation fails. [..] struct kexec_segment { void *buf;/* Buffer in user space */ size_t bufsz; /* Buffer length in user space */ void *mem;/* Physical address of kernel */ size_t memsz; /* Physical address length */ }; .fi .in .PP .\ FIXME Explain the details of how the kernel image defined by segments .\ is copied from the calling process into previously reserved memory. Kernel image defined by segments is copied into kernel either in regular memory or in reserved memory (if KEXEC_ON_CRASH is set). Kernel first copies list of segments in kernel memory and then goes does various sanity checks on the segments. If everything looks line, kernel copies segment data to kernel memory. In case of normal kexec, segment data is loaded in any available memory and segment data is moved to final destination at the kexec reboot time. In case of kexec on panic (KEXEC_ON_CRASH flag set), segment data is directly loaded to reserved memory and after crash kexec simply jumps to starting point. [..] .\ FIXME(kexec_file_load): .\ Is the following rationale accurate? Does it need expanding? The .BR kexec_file_load () .\ See also http://lwn.net/Articles/603116/ system call was added to provide support for systems where kexec loading should be restricted to only kernels that are signed. Yes, this rationale looks good. The .BR kexec_load () system call is available only if the kernel was configured with .BR CONFIG_KEXEC . The .BR kexec_file_load () system call is available only if the kernel was configured with .BR CONFIG_KEXEC_FILE . .\ FIXME(kexec_file_load): .\ Does kexec_file_load() need any other CONFIG_* options to be defined? Yes, it requires some other config options too. depends on KEXEC depends on X86_64 depends on CRYPTO=y depends on CRYPTO_SHA256=y CONFIG_KEXEC_VERIFY_SIG=y CONFIG_KEXEC_BZIMAGE_VERIFY_SIG=y CONFIG_SIGNED_PE_FILE_VERIFICATION=y CONFIG_PKCS7_MESSAGE_PARSER=y CONFIG_X509_CERTIFICATE_PARSER=y CONFIG_ASYMMETRIC_PUBLIC_KEY_SUBTYPE=y So dependency list seems pretty long. Not sure how many of these should we specify in man page. Thanks Vivek -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/