Re: [PHP-DEV] Re: Improving PRNG implementation.
Hello Nikita. > In addition to the functions you already list, I'd add >function rng_bytes(RNGInterface $rng, int $length): string; >function rng_between(RNGInterface $rng, int $min, int $max): int; Certainly these are useful. I've added this to the RFC in TypeC. > Just to share another possibility, we could also modify existing functions to optionally accept RNGInterface, like >shuffle(array &$array, ?RNGInterface $rng = null): bool; This approach may be appropriate. This way, the user is implicitly aware that he is using a global state RNG. > What do you mean by "use unintended instances" here? For example, if you write the following code, the user will receive an unintended result. ```php $rng1 = new \RNG\XorShift128Plus(1234); $rng2 = new \RNG\MT19937(1234); $arr1 = range(1, 100); $arr2 = range(1, 100); rng_shuffle($rng1, $arr1); rng_shuffle($rng1, $arr2); // oops, this case uses $rng1 RNG instance. ``` I believe that if these features are provided by class methods, such mistakes can be avoided. However, it is a matter of likelihood and similar mistakes can happen anyway. > Regarding implementation complexity, yes, this version is a bit more involved if we want to allow userland implementations of RNGInterface. To be honest I don't particularly care about supporting userland (we could also only allow it to be implemented internally), but I think it shouldn't be too hard either. Yes, it's probably possible. I need to understand the PHP implementation more deeply :) If you know of any implementations that might be helpful, I would like to know about them. As for the implementation of userland, I don't put that much emphasis on it either. I just use classes for clarity, and I'm fine only provided by the core or extensions. > This is generally the part I'm least sure about. I agree that it would be good to leverage 64-bit numbers on systems that support them. On the other hand, I also think that having a consistent random number stream between 32-bit and 64-bit systems may be important. For example, if you use this functionality for predictable "randomness" in unit tests, then it would be a problem if your library could only be tested on 32-bit or only on 64-bit. Certainly, this seems to be a problem. However, the 32bit architecture environment is very limited, and a 64bit RNG can be useful for applications and libraries that are designed to run on 64bit architecture. Also, some of the new RNGs that we are trying to implement, such as XorShift128+, have an internal state of uint64_t. The random numbers they generate are always 64bit, and rounding them to 32bit may prevent their intended use. For these reasons, I think that we should provide a `next64()` method. However, care must be taken in serializing the instance. We are trying to implement 32bit environment-safe serialization in the current PR, but it is not smart to say the least. https://github.com/php/php-src/pull/6568/files#diff-0f1c13e606f11fb5da8dc091109852a6dec328e5aea2389091ff839510cf23faR275 Based on the above discussion, I have added TypeC2 to the RFC. How about this? https://wiki.php.net/rfc/object_scope_prng Regards, Go Kudo 2021年1月6日(水) 19:50 Nikita Popov : > On Tue, Jan 5, 2021 at 6:40 PM Go Kudo wrote: > >> Thanks Nikita. >> >> Thanks for the input. This looks like a very smart solution. >> I've edited the RFC and added a new proposal as TypeC. >> >> https://wiki.php.net/rfc/object_scope_prng >> > > In addition to the functions you already list, I'd add > > function rng_bytes(RNGInterface $rng, int $length): string; > function rng_between(RNGInterface $rng, int $min, int $max): int; > > Similar to your type B proposal. > > Just to share another possibility, we could also modify existing functions > to optionally accept RNGInterface, like > > shuffle(array &$array, ?RNGInterface $rng = null): bool; > > However, I'm not sure this is better than adding a separate family of > functions. > > >> However, I have a couple of concerns. The first is that users will have >> to be careful with RNG instances (for better or worse, TypeA will not allow >> users to use unintended instances), and the second is that it will >> complicate the implementation of PHP functions. >> > > What do you mean by "use unintended instances" here? > > Regarding implementation complexity, yes, this version is a bit more > involved if we want to allow userland implementations of RNGInterface. To > be honest I don't particularly care about supporting userland (we could > also only allow it to be implemented internally), but I think it shouldn't > be too hard either. > > >> Also, given the current environment in which PHP runs, I think that >> random number generation in the 64bit range should be supported. >> How about including the `next64()` method in the interface? >> > > This is generally the part I'm least sure about. I agree that it would be > good to leverage 64-bit numbers on systems that support them. On the other > ha
Re: [PHP-DEV] Re: Improving PRNG implementation.
On Tue, Jan 5, 2021 at 6:40 PM Go Kudo wrote: > Thanks Nikita. > > Thanks for the input. This looks like a very smart solution. > I've edited the RFC and added a new proposal as TypeC. > > https://wiki.php.net/rfc/object_scope_prng > In addition to the functions you already list, I'd add function rng_bytes(RNGInterface $rng, int $length): string; function rng_between(RNGInterface $rng, int $min, int $max): int; Similar to your type B proposal. Just to share another possibility, we could also modify existing functions to optionally accept RNGInterface, like shuffle(array &$array, ?RNGInterface $rng = null): bool; However, I'm not sure this is better than adding a separate family of functions. > However, I have a couple of concerns. The first is that users will have to > be careful with RNG instances (for better or worse, TypeA will not allow > users to use unintended instances), and the second is that it will > complicate the implementation of PHP functions. > What do you mean by "use unintended instances" here? Regarding implementation complexity, yes, this version is a bit more involved if we want to allow userland implementations of RNGInterface. To be honest I don't particularly care about supporting userland (we could also only allow it to be implemented internally), but I think it shouldn't be too hard either. > Also, given the current environment in which PHP runs, I think that random > number generation in the 64bit range should be supported. > How about including the `next64()` method in the interface? > This is generally the part I'm least sure about. I agree that it would be good to leverage 64-bit numbers on systems that support them. On the other hand, I also think that having a consistent random number stream between 32-bit and 64-bit systems may be important. For example, if you use this functionality for predictable "randomness" in unit tests, then it would be a problem if your library could only be tested on 32-bit or only on 64-bit. As long as an API like rng_between($rng, $min, $max) is used, we can always assemble two 32-bit values into a random number that is larger than 32-bit. But of course, fetching a single 64-bit number is more efficient... Nikita > 2021年1月5日(火) 23:53 Nikita Popov : > >> On Thu, Dec 24, 2020 at 5:12 PM zeriyoshi wrote: >> >>> I updated the RFC draft and changed it to a proposal to bifurcate the >>> interface. >>> >>> https://wiki.php.net/rfc/object_scope_prng >>> >>> At the same time, I was looking into the RNG problem in Swoole and found >>> out that the problem was actually occurring. >>> >>> https://www.easyswoole.com/En/Other/random.html >>> >>> The above problem with Swoole is only for child processes and can be >>> fixed >>> by the user, but as mentioned above, I think it will become more serious >>> as >>> PHP becomes more complex in the future. >>> I hadn't thought of this before, but we might want to consider >>> deprecating >>> existing state-dependent RNG functions. >>> >>> I am seeking feedback on this proposal in order to take the RFC to the >>> next >>> step. >>> Thank you in advance. >>> >>> Regards, >>> Go Kudo >>> >> >> I think the general direction of your second proposal (B) is the right >> one, but I'm concerned about some of the details. >> >> First and foremost, I don't think that RNG should extend Iterator. Just >> having a single "next()" method or similar is sufficient. The Iterator >> interface provides many additional things that don't really make sense in >> this context and only complicate the implementation. We should clearly >> specify the value range of next() though. I assume that for portability >> reasons (ensure same sequence on 32-bit and 64-bit) this would have to be a >> sign-extended 32-bit value. >> >> The type B proposal distinguishes between a RNG and a PRNG interface. Is >> this really useful? I don't think the value of the "getSeed()" method is >> high enough to justify the split interface. >> >> The RNGUtil class has methods like: >> >> public static function shuffleArray(int $randomNumber, array $arr): >> array; >> >> This doesn't make sense to me, as a single random number is not enough to >> shuffle a whole array :) Instead this should be accepting the RNG and >> perform an internal call to next() (of course, with a fast by-pass >> mechanism if the RNG is implemented internally): >> >> public static function shuffleArray(RNG $rng, array $arr): array; >> >> I'm also wondering why this is a static class rather than a set of >> free-standing functions. Why RNGUtil::shuffleArray() rather than >> rng_shuffle_array()? >> >> Regards, >> Nikita >> >> 2020年12月16日(水) 23:46 zeriyoshi : >>> >>> > Nice to meet you, internals. >>> > >>> > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to >>> be >>> > extended beyond the web. >>> > >>> > So I'd like to make a few suggestions. >>> > >>> > First , PHP has the historical Mersenne Twister PRNG. However, this >>> > implementati
Re: [PHP-DEV] Re: Improving PRNG implementation.
Thanks Nikita. Thanks for the input. This looks like a very smart solution. I've edited the RFC and added a new proposal as TypeC. https://wiki.php.net/rfc/object_scope_prng However, I have a couple of concerns. The first is that users will have to be careful with RNG instances (for better or worse, TypeA will not allow users to use unintended instances), and the second is that it will complicate the implementation of PHP functions. Also, given the current environment in which PHP runs, I think that random number generation in the 64bit range should be supported. How about including the `next64()` method in the interface? Regards, Go Kudo 2021年1月5日(火) 23:53 Nikita Popov : > On Thu, Dec 24, 2020 at 5:12 PM zeriyoshi wrote: > >> I updated the RFC draft and changed it to a proposal to bifurcate the >> interface. >> >> https://wiki.php.net/rfc/object_scope_prng >> >> At the same time, I was looking into the RNG problem in Swoole and found >> out that the problem was actually occurring. >> >> https://www.easyswoole.com/En/Other/random.html >> >> The above problem with Swoole is only for child processes and can be fixed >> by the user, but as mentioned above, I think it will become more serious >> as >> PHP becomes more complex in the future. >> I hadn't thought of this before, but we might want to consider deprecating >> existing state-dependent RNG functions. >> >> I am seeking feedback on this proposal in order to take the RFC to the >> next >> step. >> Thank you in advance. >> >> Regards, >> Go Kudo >> > > I think the general direction of your second proposal (B) is the right > one, but I'm concerned about some of the details. > > First and foremost, I don't think that RNG should extend Iterator. Just > having a single "next()" method or similar is sufficient. The Iterator > interface provides many additional things that don't really make sense in > this context and only complicate the implementation. We should clearly > specify the value range of next() though. I assume that for portability > reasons (ensure same sequence on 32-bit and 64-bit) this would have to be a > sign-extended 32-bit value. > > The type B proposal distinguishes between a RNG and a PRNG interface. Is > this really useful? I don't think the value of the "getSeed()" method is > high enough to justify the split interface. > > The RNGUtil class has methods like: > > public static function shuffleArray(int $randomNumber, array $arr): > array; > > This doesn't make sense to me, as a single random number is not enough to > shuffle a whole array :) Instead this should be accepting the RNG and > perform an internal call to next() (of course, with a fast by-pass > mechanism if the RNG is implemented internally): > > public static function shuffleArray(RNG $rng, array $arr): array; > > I'm also wondering why this is a static class rather than a set of > free-standing functions. Why RNGUtil::shuffleArray() rather than > rng_shuffle_array()? > > Regards, > Nikita > > 2020年12月16日(水) 23:46 zeriyoshi : >> >> > Nice to meet you, internals. >> > >> > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to be >> > extended beyond the web. >> > >> > So I'd like to make a few suggestions. >> > >> > First , PHP has the historical Mersenne Twister PRNG. However, this >> > implementation keeps its state in a global and cannot be handled as an >> > object like other languages (e.g. Java). >> > >> > So, I created a PHP Extension and proposed it to PECL. >> > >> > https://marc.info/?l=pecl-dev&m=160795415604102&w=2 >> > https://github.com/zeriyoshi/php-ext-orng >> > >> > But, Then I looked at the mailing list archives and noticed that a >> similar >> > proposal had been made before. >> > >> > https://externals.io/message/98021#98130 >> > >> > I feel that this suggestion is needed now to expand PHP beyond the web. >> > >> > Second suggestion is to stop using the Combined LCG as the default seed >> > value for each function. >> > >> > PHP's Combined LCG only uses PID (or ZTS Thread ID) and time as entropy. >> > https://github.com/php/php-src/blob/master/ext/standard/lcg.c#L72 >> > >> > With the development of container technology, this problem seems to be >> > getting more serious. So I think we should use the random numbers >> provided >> > by the OS (getrandom on Linux) if available. >> > >> > I would like to hear your opinions. >> > >> > Regards >> > Go Kudo >> > >> >
Re: [PHP-DEV] Re: Improving PRNG implementation.
On Thu, Dec 24, 2020 at 5:12 PM zeriyoshi wrote: > I updated the RFC draft and changed it to a proposal to bifurcate the > interface. > > https://wiki.php.net/rfc/object_scope_prng > > At the same time, I was looking into the RNG problem in Swoole and found > out that the problem was actually occurring. > > https://www.easyswoole.com/En/Other/random.html > > The above problem with Swoole is only for child processes and can be fixed > by the user, but as mentioned above, I think it will become more serious as > PHP becomes more complex in the future. > I hadn't thought of this before, but we might want to consider deprecating > existing state-dependent RNG functions. > > I am seeking feedback on this proposal in order to take the RFC to the next > step. > Thank you in advance. > > Regards, > Go Kudo > I think the general direction of your second proposal (B) is the right one, but I'm concerned about some of the details. First and foremost, I don't think that RNG should extend Iterator. Just having a single "next()" method or similar is sufficient. The Iterator interface provides many additional things that don't really make sense in this context and only complicate the implementation. We should clearly specify the value range of next() though. I assume that for portability reasons (ensure same sequence on 32-bit and 64-bit) this would have to be a sign-extended 32-bit value. The type B proposal distinguishes between a RNG and a PRNG interface. Is this really useful? I don't think the value of the "getSeed()" method is high enough to justify the split interface. The RNGUtil class has methods like: public static function shuffleArray(int $randomNumber, array $arr): array; This doesn't make sense to me, as a single random number is not enough to shuffle a whole array :) Instead this should be accepting the RNG and perform an internal call to next() (of course, with a fast by-pass mechanism if the RNG is implemented internally): public static function shuffleArray(RNG $rng, array $arr): array; I'm also wondering why this is a static class rather than a set of free-standing functions. Why RNGUtil::shuffleArray() rather than rng_shuffle_array()? Regards, Nikita 2020年12月16日(水) 23:46 zeriyoshi : > > > Nice to meet you, internals. > > > > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to be > > extended beyond the web. > > > > So I'd like to make a few suggestions. > > > > First , PHP has the historical Mersenne Twister PRNG. However, this > > implementation keeps its state in a global and cannot be handled as an > > object like other languages (e.g. Java). > > > > So, I created a PHP Extension and proposed it to PECL. > > > > https://marc.info/?l=pecl-dev&m=160795415604102&w=2 > > https://github.com/zeriyoshi/php-ext-orng > > > > But, Then I looked at the mailing list archives and noticed that a > similar > > proposal had been made before. > > > > https://externals.io/message/98021#98130 > > > > I feel that this suggestion is needed now to expand PHP beyond the web. > > > > Second suggestion is to stop using the Combined LCG as the default seed > > value for each function. > > > > PHP's Combined LCG only uses PID (or ZTS Thread ID) and time as entropy. > > https://github.com/php/php-src/blob/master/ext/standard/lcg.c#L72 > > > > With the development of container technology, this problem seems to be > > getting more serious. So I think we should use the random numbers > provided > > by the OS (getrandom on Linux) if available. > > > > I would like to hear your opinions. > > > > Regards > > Go Kudo > > >
Re: [PHP-DEV] Re: Improving PRNG implementation.
On 01.01.21 20:22, Go Kudo wrote: > Hi Marc, and sorry for the late reply. Email has been marked as spam... > >> I would expect the Random number generator to implement from Iterator > using an integer for current() value >> and providing utility functions (simple functions or methods of another > class) for shuffle, random array entry like this: > > Thank you very much. This suggestion makes sense and looks very structured. > > However, I feel that in the PHP world, this structuring is a bit too much. > Ease of use is very important in PHP. One important reason for a PRNGInterface is (at least how I understand it) to allow implementing PRNG algorithms in userland but there would be no performant way to use this for shuffle and friends that's why I think such functions should consume any PRNGInterface instead of building it directly into the PRNG class. -> If you really want these as PRNG methods - one possibility could be a Trait providing these methods but this is very exotic for core functionalities. Another thing with the proposed `next*` methods is * How to get the current value? retrieving the next value only seems to be out of sync with the current PHP way (Iterator, Generator) * `nextByte(int $length)` How much values does this consume? How much random bytes can be generated with one value (4 bytes, 8 bytes, platform dependent). Does this consume multiple values to generate 1 byte 4/8 times? * same for `nextDouble()` on 32 bit platforms * Btw. it should be called `nextFloat()` as there is no double type in PHP and the float type is equivalent to C double precision > > Nevertheless, I am satisfied with this proposal. I'd like to hear from > someone who is more familiar with the PHP context about the form of the > implementation. > > But, it may be faster to actually hold a vote to ask this question. I'm not > sure how much support this proposal has at this point, but do you think > it's worth a try? I'm don't have voting rights and I'm not very active in discussions so I would be better to get some reasonable thoughts from someone more involved. Cheers Marc > > > 2020年12月29日(火) 18:26 Marc : > >> Hi zeriyoshi, >> On 23.12.20 14:41, zeriyoshi wrote: >> >> Thanks tyson. >> >> >> This would also make it easier to use those generators in brand new >> >> algorithms that weren't in the ionitial RFC. >> >> (or in algorithms written by users in PHP) >> >> This suggestion seems to make sense. Maybe the RNG should only focus on >> generating random numbers and not anything else. >> However, the fact that array and string manipulation functions are no >> longer native to C may have a speed disadvantage. >> >> So I came up with the idea of minimizing the interface definition as RNG. >> >> ``` >> interface PRNGInterface >> { >> public function nextInt(?int $min = null, ?int $max = null): int; >> public function nextDouble(): double; // maybe, non-needed. >> public function nextByte(int $length): string; >> } >> ``` >> >> The methods for array and string operations are defined separately as >> interfaces that inherit from the interface. >> >> ``` >> interface RandomInterface extends PRNGInterface >> { >> public function shuffle(array &$array): bool; >> public function arrayRand(array $array, int $num = 1): int|string|array; >> public function strShuffle(string $string): string; >> } >> ``` >> >> This can be overly structured, but it will serve all purposes. >> >> Personally I feel the interfaces still looking a bit off to me. >> >> I would expect the Random number generator to implement from Iterator >> using an integer for `current()` value >> >> and providing utility functions (simple functions or methods of another >> class) for shuffle, random array entry like this: >> >> ```php >> >> interface RNG extends Iterator { >> public function rewind(); >> public function next(); >> public function current(): int; >> public function key(): int; >> public function valid(); >> } >> >> >> interface PRNG extends RNG { >> public function __current(int $seed); >> public function getSeed(): int; >> } >> >> class RNGUtil { >> public static function shuffleArray(int $randomNumber, array $arr): >> array; >> public static function randomArrayElement(int $randomNumber, array >> $arr): mixed; >> public static function between(int $randomNumber, int $min = >> PHP_INT_MIN, int $max = PHP_INT_MAX): int; >> public static function bytes(RNG $rng, int $length): string; >> } >> >> ``` >> >> >> Regards, >> Go Kudo >> >> >> 2020年12月23日(水) 0:40 tyson andre >> : >> >> >> Hi Go Kudo, >> >> **A possible alternative that is widely used in other programming >> languages is to limit the interface API to only generating bytes/integers,** >> and to provide global functions that would use generic random number >> generator objects (from internal or user-provided code) in their algorithms. >> >> This would also make it easier to use those generators in brand new
[PHP-DEV] Re: Improving PRNG implementation.
Unfortunately, there is not much feedback on this proposal. To remedy the situation, we have updated the RFC and implemented it. https://wiki.php.net/rfc/object_scope_prng https://github.com/php/php-src/pull/6568 Currently, it seems to work generally as expected, although it has failed to be tested on Windows. However, I am not sure if it is correctly with the new serialize / unserialize mechanism of PHP. (Of course we've tested the behavior) 2020年12月16日(水) 23:46 zeriyoshi : > Nice to meet you, internals. > > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to be > extended beyond the web. > > So I'd like to make a few suggestions. > > First , PHP has the historical Mersenne Twister PRNG. However, this > implementation keeps its state in a global and cannot be handled as an > object like other languages (e.g. Java). > > So, I created a PHP Extension and proposed it to PECL. > > https://marc.info/?l=pecl-dev&m=160795415604102&w=2 > https://github.com/zeriyoshi/php-ext-orng > > But, Then I looked at the mailing list archives and noticed that a similar > proposal had been made before. > > https://externals.io/message/98021#98130 > > I feel that this suggestion is needed now to expand PHP beyond the web. > > Second suggestion is to stop using the Combined LCG as the default seed > value for each function. > > PHP's Combined LCG only uses PID (or ZTS Thread ID) and time as entropy. > https://github.com/php/php-src/blob/master/ext/standard/lcg.c#L72 > > With the development of container technology, this problem seems to be > getting more serious. So I think we should use the random numbers provided > by the OS (getrandom on Linux) if available. > > I would like to hear your opinions. > > Regards > Go Kudo >
Re: [PHP-DEV] Re: Improving PRNG implementation.
Hi Marc, and sorry for the late reply. Email has been marked as spam... > I would expect the Random number generator to implement from Iterator using an integer for current() value > and providing utility functions (simple functions or methods of another class) for shuffle, random array entry like this: Thank you very much. This suggestion makes sense and looks very structured. However, I feel that in the PHP world, this structuring is a bit too much. Ease of use is very important in PHP. Nevertheless, I am satisfied with this proposal. I'd like to hear from someone who is more familiar with the PHP context about the form of the implementation. But, it may be faster to actually hold a vote to ask this question. I'm not sure how much support this proposal has at this point, but do you think it's worth a try? 2020年12月29日(火) 18:26 Marc : > Hi zeriyoshi, > On 23.12.20 14:41, zeriyoshi wrote: > > Thanks tyson. > > > This would also make it easier to use those generators in brand new > > algorithms that weren't in the initial RFC. > > (or in algorithms written by users in PHP) > > This suggestion seems to make sense. Maybe the RNG should only focus on > generating random numbers and not anything else. > However, the fact that array and string manipulation functions are no > longer native to C may have a speed disadvantage. > > So I came up with the idea of minimizing the interface definition as RNG. > > ``` > interface PRNGInterface > { > public function nextInt(?int $min = null, ?int $max = null): int; > public function nextDouble(): double; // maybe, non-needed. > public function nextByte(int $length): string; > } > ``` > > The methods for array and string operations are defined separately as > interfaces that inherit from the interface. > > ``` > interface RandomInterface extends PRNGInterface > { > public function shuffle(array &$array): bool; > public function arrayRand(array $array, int $num = 1): int|string|array; > public function strShuffle(string $string): string; > } > ``` > > This can be overly structured, but it will serve all purposes. > > Personally I feel the interfaces still looking a bit off to me. > > I would expect the Random number generator to implement from Iterator > using an integer for `current()` value > > and providing utility functions (simple functions or methods of another > class) for shuffle, random array entry like this: > > ```php > > interface RNG extends Iterator { > public function rewind(); > public function next(); > public function current(): int; > public function key(): int; > public function valid(); > } > > > interface PRNG extends RNG { > public function __current(int $seed); > public function getSeed(): int; > } > > class RNGUtil { > public static function shuffleArray(int $randomNumber, array $arr): array; > public static function randomArrayElement(int $randomNumber, array $arr): > mixed; > public static function between(int $randomNumber, int $min = PHP_INT_MIN, > int $max = PHP_INT_MAX): int; > public static function bytes(RNG $rng, int $length): string; > } > > ``` > > > Regards, > Go Kudo > > > 2020年12月23日(水) 0:40 tyson andre > : > > > Hi Go Kudo, > > **A possible alternative that is widely used in other programming > languages is to limit the interface API to only generating bytes/integers,** > and to provide global functions that would use generic random number > generator objects (from internal or user-provided code) in their algorithms. > > This would also make it easier to use those generators in brand new > algorithms that weren't in the initial RFC. > (or in algorithms written by users in PHP) > > This alternative is similar to > Javahttps://docs.oracle.com/javase/7/docs/api/java/util/Collections.html#shuffle(java.util.List,%20java.util.Random) > and https://docs.oracle.com/javase/7/docs/api/java/util/Random.html > and C++ https://www.cplusplus.com/reference/algorithm/shuffle/ > where the algorithms accept a random number generator conforming to some > interface > and Python https://docs.python.org/3/library/random.html#random.shuffle > > ``` > interface PRNGInterface { > public function random[Signed]Int(): int; // between PHP_INT_MIN and > PHP_INT_MAX (4 bytes or 8 bytes) > public function randomBytes(int $length): string // $length bytes of > raw data > // possibly randomByte(): int // between 0 and 255 > // public function randomIntInRange(int $min, int $max) > // __serialize(), __unserialize() may be provided, but may be > counterproductive for classes that wrap /dev/urandom or random_bytes()? > } > // possibly provide a trait that provides defaults based on abstract > function randomInt > > function whateverprefixornamespace_rand(PRNGInterface $rng, [int $min, int > $max]): int {} > // If this is intended to be secure, whateverprefix_array_rand may need to > avoid the optimizations used by array_rand for sparse arrays > function whateverprefixorna
Re: [PHP-DEV] Re: Improving PRNG implementation.
Hi zeriyoshi, On 23.12.20 14:41, zeriyoshi wrote: > Thanks tyson. > >> This would also make it easier to use those generators in brand new > algorithms that weren't in the initial RFC. >> (or in algorithms written by users in PHP) > This suggestion seems to make sense. Maybe the RNG should only focus on > generating random numbers and not anything else. > However, the fact that array and string manipulation functions are no > longer native to C may have a speed disadvantage. > > So I came up with the idea of minimizing the interface definition as RNG. > > ``` > interface PRNGInterface > { > public function nextInt(?int $min = null, ?int $max = null): int; > public function nextDouble(): double; // maybe, non-needed. > public function nextByte(int $length): string; > } > ``` > > The methods for array and string operations are defined separately as > interfaces that inherit from the interface. > > ``` > interface RandomInterface extends PRNGInterface > { > public function shuffle(array &$array): bool; > public function arrayRand(array $array, int $num = 1): int|string|array; > public function strShuffle(string $string): string; > } > ``` > > This can be overly structured, but it will serve all purposes. Personally I feel the interfaces still looking a bit off to me. I would expect the Random number generator to implement from Iterator using an integer for `current()` value and providing utility functions (simple functions or methods of another class) for shuffle, random array entry like this: ```php interface RNG extends Iterator { public function|rewind();||| public function next(); public function current(): int; public function key(): int; public function valid(); } interface PRNG extends RNG { public function __current(int $seed); public function getSeed(): int; } class RNGUtil { public static function shuffleArray(int $randomNumber, array $arr): array; public static function randomArrayElement(int $randomNumber, array $arr): mixed; public static function between(int $randomNumber, int $min = PHP_INT_MIN, int $max = PHP_INT_MAX): int; public static function bytes(RNG $rng, int $length): string; } ``` > > Regards, > Go Kudo > > > 2020年12月23日(水) 0:40 tyson andre : > >> Hi Go Kudo, >> >> **A possible alternative that is widely used in other programming >> languages is to limit the interface API to only generating bytes/integers,** >> and to provide global functions that would use generic random number >> generator objects (from internal or user-provided code) in their algorithms. >> >> This would also make it easier to use those generators in brand new >> algorithms that weren't in the initial RFC. >> (or in algorithms written by users in PHP) >> >> This alternative is similar to Java >> https://docs.oracle.com/javase/7/docs/api/java/util/Collections.html#shuffle(java.util.List,%20java.util.Random) >> and https://docs.oracle.com/javase/7/docs/api/java/util/Random.html >> and C++ https://www.cplusplus.com/reference/algorithm/shuffle/ >> where the algorithms accept a random number generator conforming to some >> interface >> and Python https://docs.python.org/3/library/random.html#random.shuffle >> >> ``` >> interface PRNGInterface { >> public function random[Signed]Int(): int; // between PHP_INT_MIN and >> PHP_INT_MAX (4 bytes or 8 bytes) >> public function randomBytes(int $length): string // $length bytes of >> raw data >> // possibly randomByte(): int // between 0 and 255 >> // public function randomIntInRange(int $min, int $max) >> // __serialize(), __unserialize() may be provided, but may be >> counterproductive for classes that wrap /dev/urandom or random_bytes()? >> } >> // possibly provide a trait that provides defaults based on abstract >> function randomInt >> >> function whateverprefixornamespace_rand(PRNGInterface $rng, [int $min, int >> $max]): int {} >> // If this is intended to be secure, whateverprefix_array_rand may need to >> avoid the optimizations used by array_rand for sparse arrays >> function whateverprefixornamespace_array_rand(PRNGInterface $rng, array >> $array, ...): int {} >> function whateverprefixornamespace_shuffle(PRNGInterface $rng, array >> &$array): bool; >> // alternately might be possible by extending existing global functions >> with an optional parameter >> ``` >> >> Thanks, >> - Tyson
[PHP-DEV] Re: Improving PRNG implementation.
I updated the RFC draft and changed it to a proposal to bifurcate the interface. https://wiki.php.net/rfc/object_scope_prng At the same time, I was looking into the RNG problem in Swoole and found out that the problem was actually occurring. https://www.easyswoole.com/En/Other/random.html The above problem with Swoole is only for child processes and can be fixed by the user, but as mentioned above, I think it will become more serious as PHP becomes more complex in the future. I hadn't thought of this before, but we might want to consider deprecating existing state-dependent RNG functions. I am seeking feedback on this proposal in order to take the RFC to the next step. Thank you in advance. Regards, Go Kudo 2020年12月16日(水) 23:46 zeriyoshi : > Nice to meet you, internals. > > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to be > extended beyond the web. > > So I'd like to make a few suggestions. > > First , PHP has the historical Mersenne Twister PRNG. However, this > implementation keeps its state in a global and cannot be handled as an > object like other languages (e.g. Java). > > So, I created a PHP Extension and proposed it to PECL. > > https://marc.info/?l=pecl-dev&m=160795415604102&w=2 > https://github.com/zeriyoshi/php-ext-orng > > But, Then I looked at the mailing list archives and noticed that a similar > proposal had been made before. > > https://externals.io/message/98021#98130 > > I feel that this suggestion is needed now to expand PHP beyond the web. > > Second suggestion is to stop using the Combined LCG as the default seed > value for each function. > > PHP's Combined LCG only uses PID (or ZTS Thread ID) and time as entropy. > https://github.com/php/php-src/blob/master/ext/standard/lcg.c#L72 > > With the development of container technology, this problem seems to be > getting more serious. So I think we should use the random numbers provided > by the OS (getrandom on Linux) if available. > > I would like to hear your opinions. > > Regards > Go Kudo >
Re: [PHP-DEV] Re: Improving PRNG implementation.
Thanks tyson. > This would also make it easier to use those generators in brand new algorithms that weren't in the initial RFC. > (or in algorithms written by users in PHP) This suggestion seems to make sense. Maybe the RNG should only focus on generating random numbers and not anything else. However, the fact that array and string manipulation functions are no longer native to C may have a speed disadvantage. So I came up with the idea of minimizing the interface definition as RNG. ``` interface PRNGInterface { public function nextInt(?int $min = null, ?int $max = null): int; public function nextDouble(): double; // maybe, non-needed. public function nextByte(int $length): string; } ``` The methods for array and string operations are defined separately as interfaces that inherit from the interface. ``` interface RandomInterface extends PRNGInterface { public function shuffle(array &$array): bool; public function arrayRand(array $array, int $num = 1): int|string|array; public function strShuffle(string $string): string; } ``` This can be overly structured, but it will serve all purposes. Regards, Go Kudo 2020年12月23日(水) 0:40 tyson andre : > Hi Go Kudo, > > **A possible alternative that is widely used in other programming > languages is to limit the interface API to only generating bytes/integers,** > and to provide global functions that would use generic random number > generator objects (from internal or user-provided code) in their algorithms. > > This would also make it easier to use those generators in brand new > algorithms that weren't in the initial RFC. > (or in algorithms written by users in PHP) > > This alternative is similar to Java > https://docs.oracle.com/javase/7/docs/api/java/util/Collections.html#shuffle(java.util.List,%20java.util.Random) > and https://docs.oracle.com/javase/7/docs/api/java/util/Random.html > and C++ https://www.cplusplus.com/reference/algorithm/shuffle/ > where the algorithms accept a random number generator conforming to some > interface > and Python https://docs.python.org/3/library/random.html#random.shuffle > > ``` > interface PRNGInterface { > public function random[Signed]Int(): int; // between PHP_INT_MIN and > PHP_INT_MAX (4 bytes or 8 bytes) > public function randomBytes(int $length): string // $length bytes of > raw data > // possibly randomByte(): int // between 0 and 255 > // public function randomIntInRange(int $min, int $max) > // __serialize(), __unserialize() may be provided, but may be > counterproductive for classes that wrap /dev/urandom or random_bytes()? > } > // possibly provide a trait that provides defaults based on abstract > function randomInt > > function whateverprefixornamespace_rand(PRNGInterface $rng, [int $min, int > $max]): int {} > // If this is intended to be secure, whateverprefix_array_rand may need to > avoid the optimizations used by array_rand for sparse arrays > function whateverprefixornamespace_array_rand(PRNGInterface $rng, array > $array, ...): int {} > function whateverprefixornamespace_shuffle(PRNGInterface $rng, array > &$array): bool; > // alternately might be possible by extending existing global functions > with an optional parameter > ``` > > Thanks, > - Tyson
Re: [PHP-DEV] Re: Improving PRNG implementation.
Hi Go Kudo, **A possible alternative that is widely used in other programming languages is to limit the interface API to only generating bytes/integers,** and to provide global functions that would use generic random number generator objects (from internal or user-provided code) in their algorithms. This would also make it easier to use those generators in brand new algorithms that weren't in the initial RFC. (or in algorithms written by users in PHP) This alternative is similar to Java https://docs.oracle.com/javase/7/docs/api/java/util/Collections.html#shuffle(java.util.List,%20java.util.Random) and https://docs.oracle.com/javase/7/docs/api/java/util/Random.html and C++ https://www.cplusplus.com/reference/algorithm/shuffle/ where the algorithms accept a random number generator conforming to some interface and Python https://docs.python.org/3/library/random.html#random.shuffle ``` interface PRNGInterface { public function random[Signed]Int(): int; // between PHP_INT_MIN and PHP_INT_MAX (4 bytes or 8 bytes) public function randomBytes(int $length): string // $length bytes of raw data // possibly randomByte(): int // between 0 and 255 // public function randomIntInRange(int $min, int $max) // __serialize(), __unserialize() may be provided, but may be counterproductive for classes that wrap /dev/urandom or random_bytes()? } // possibly provide a trait that provides defaults based on abstract function randomInt function whateverprefixornamespace_rand(PRNGInterface $rng, [int $min, int $max]): int {} // If this is intended to be secure, whateverprefix_array_rand may need to avoid the optimizations used by array_rand for sparse arrays function whateverprefixornamespace_array_rand(PRNGInterface $rng, array $array, ...): int {} function whateverprefixornamespace_shuffle(PRNGInterface $rng, array &$array): bool; // alternately might be possible by extending existing global functions with an optional parameter ``` Thanks, - Tyson -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: https://www.php.net/unsub.php
Re: [PHP-DEV] Re: Improving PRNG implementation.
Thanks. I have edited the RFC based on your question. > Method names: you don't need to emulate the conventions from non-OOP code, e.g. string_shuffle() --> strShuffle and array_rand() --> arrayRand(). Use a more naturally readable convention like shuffleString(). As mentioned in the RFC, it is intended for interoperability. However, you may be right that we should forget the past and aim for a better implementation. > Do you have evidence of widespread demand for this functionality? Usage statistics for the userland implementation you mentioned[3] don't look that hot. A difficult problem. At least for PHP, which is a web-oriented language, there may be little demand for it. In my opinion, an object-scoped implementation will be needed in the future. For example, there are currently proposed implementations of Fiber, which will certainly make global state unpredictable. Swoole extension is also a concern, although I am not familiar with this. Regards, Go Kudo 2020年12月20日(日) 23:09 Max Semenik : > On Sun, Dec 20, 2020 at 6:45 AM zeriyoshi wrote: > >> Thanks cmb. I have created a first draft of an RFC. I think I've covered >> all the necessary requirements, but I'd like to know if there are any >> problems. >> https://wiki.php.net/rfc/object_scope_prng > > > A few IMHO comments: > * Namespaces are controversial. There were multiple discussions and RFCs > about introducing them for new functionality but nothing came out of it. > Still probably worth adding an option for \PHP\PRNG to the vote, or > something like that. > * The "Which set of PRNGs should be provided as standard?" vote seems > vague to me. Are you proposing to add only one algorithm? Why? Also, the > adding just interfaces option makes no sense because there's Composer for > things that don't require a C implementation. > * "One possible solution is to implement the PRNG in pure PHP. There is > actually a userland library [1], but it is not fast enough for PHP at the > moment. (However, this may be improved by JIT)" - this definitely needs a > comparison because implementation in C is only needed for performance > reasons. > * Method names: you don't need to emulate the conventions from non-OOP > code, e.g. string_shuffle() --> strShuffle and array_rand() --> > arrayRand(). Use a more naturally readable convention like shuffleString(). > * The return value in "function shuffle(array &$array): bool" means that > if something goes wrong, callers will have no idea what it is. Just throw > an exception. > * If the classes are namespaced, don't prefix their names, > e.g. PRNGInterface --> RandomGenerator (yay, so many opportunities for > bikeshedding!) > * Perhaps it's worth mentioning other languages using this pattern, e.g. > Java[1] and C#[2]. > * Do you have evidence of widespread demand for this functionality? Usage > statistics for the userland implementation you mentioned[3] don't look that > hot. > > -- > [1] https://docs.oracle.com/javase/8/docs/api/java/util/Random.html > [2] https://docs.microsoft.com/en-us/dotnet/api/system.random?view=net-5.0 > [3] https://packagist.org/packages/savvot/random > >
Re: [PHP-DEV] Re: Improving PRNG implementation.
On Sun, Dec 20, 2020 at 6:45 AM zeriyoshi wrote: > Thanks cmb. I have created a first draft of an RFC. I think I've covered > all the necessary requirements, but I'd like to know if there are any > problems. > https://wiki.php.net/rfc/object_scope_prng A few IMHO comments: * Namespaces are controversial. There were multiple discussions and RFCs about introducing them for new functionality but nothing came out of it. Still probably worth adding an option for \PHP\PRNG to the vote, or something like that. * The "Which set of PRNGs should be provided as standard?" vote seems vague to me. Are you proposing to add only one algorithm? Why? Also, the adding just interfaces option makes no sense because there's Composer for things that don't require a C implementation. * "One possible solution is to implement the PRNG in pure PHP. There is actually a userland library [1], but it is not fast enough for PHP at the moment. (However, this may be improved by JIT)" - this definitely needs a comparison because implementation in C is only needed for performance reasons. * Method names: you don't need to emulate the conventions from non-OOP code, e.g. string_shuffle() --> strShuffle and array_rand() --> arrayRand(). Use a more naturally readable convention like shuffleString(). * The return value in "function shuffle(array &$array): bool" means that if something goes wrong, callers will have no idea what it is. Just throw an exception. * If the classes are namespaced, don't prefix their names, e.g. PRNGInterface --> RandomGenerator (yay, so many opportunities for bikeshedding!) * Perhaps it's worth mentioning other languages using this pattern, e.g. Java[1] and C#[2]. * Do you have evidence of widespread demand for this functionality? Usage statistics for the userland implementation you mentioned[3] don't look that hot. -- [1] https://docs.oracle.com/javase/8/docs/api/java/util/Random.html [2] https://docs.microsoft.com/en-us/dotnet/api/system.random?view=net-5.0 [3] https://packagist.org/packages/savvot/random
[PHP-DEV] Re: Improving PRNG implementation.
On 20.12.2020 at 04:45, zeriyoshi wrote: > On 19.12.2020 at 11:26, Christoph M. Becker wrote: > >> RFC karma granted. Best of luck with the RFC! :) > > Thanks cmb. I have created a first draft of an RFC. I think I've covered > all the necessary requirements, but I'd like to know if there are any > problems. > https://wiki.php.net/rfc/object_scope_prng > > As for the second suggestion, I saw that the RFC was tagged on GH. This is > not a new feature and does not seem to be a bc break. Do we still need an > RFC in this case? > https://github.com/php/php-src/pull/6520 Sorry, my bad. I thought that PR was an implementation of the RFC. I've removed the RFC label now. Christoph > Regards, > Go Kudo > > 2020年12月19日(土) 20:26 Christoph M. Becker : > >> On 19.12.2020 at 07:33, zeriyoshi wrote: >> >>> On Wed, Dec 16, 2020 at 8:46 AM zeriyoshi wrote: >>> >>> I have created an account on the PHP.net wiki to create an RFC about >> this. >>> Can you grant me editing privileges? >> >> RFC karma granted. Best of luck with the RFC! :) >> >> Christoph >> > -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: https://www.php.net/unsub.php
[PHP-DEV] Re: Improving PRNG implementation.
On 19.12.2020 at 11:26, Christoph M. Becker wrote: > RFC karma granted. Best of luck with the RFC! :) Thanks cmb. I have created a first draft of an RFC. I think I've covered all the necessary requirements, but I'd like to know if there are any problems. https://wiki.php.net/rfc/object_scope_prng As for the second suggestion, I saw that the RFC was tagged on GH. This is not a new feature and does not seem to be a bc break. Do we still need an RFC in this case? https://github.com/php/php-src/pull/6520 Regards, Go Kudo 2020年12月19日(土) 20:26 Christoph M. Becker : > On 19.12.2020 at 07:33, zeriyoshi wrote: > > > On Wed, Dec 16, 2020 at 8:46 AM zeriyoshi wrote: > > > > I have created an account on the PHP.net wiki to create an RFC about > this. > > Can you grant me editing privileges? > > RFC karma granted. Best of luck with the RFC! :) > > Christoph >
[PHP-DEV] Re: Improving PRNG implementation.
On 19.12.2020 at 07:33, zeriyoshi wrote: > On Wed, Dec 16, 2020 at 8:46 AM zeriyoshi wrote: > > I have created an account on the PHP.net wiki to create an RFC about this. > Can you grant me editing privileges? RFC karma granted. Best of luck with the RFC! :) Christoph -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: https://www.php.net/unsub.php
[PHP-DEV] Re: Improving PRNG implementation.
On Wed, Dec 16, 2020 at 8:46 AM zeriyoshi wrote: Hello. I have created an account on the PHP.net wiki to create an RFC about this. Can you grant me editing privileges? As for suggestion two, I sent a PR to php-src on GitHub. https://github.com/php/php-src/pull/6520 Fortunately, Nikita seems to be checking it out. Thank you very much. P.S: I had mentioned earlier that php_combined_lcg() was used in the session, but I checked and it seems not to be the case now. I will correct it. Sorry about that. Regards, Go Kudo 2020年12月16日(水) 23:46 zeriyoshi : > Nice to meet you, internals. > > PHP 8.0 has been released. With the inclusion of JIT, PHP is about to be > extended beyond the web. > > So I'd like to make a few suggestions. > > First , PHP has the historical Mersenne Twister PRNG. However, this > implementation keeps its state in a global and cannot be handled as an > object like other languages (e.g. Java). > > So, I created a PHP Extension and proposed it to PECL. > > https://marc.info/?l=pecl-dev&m=160795415604102&w=2 > https://github.com/zeriyoshi/php-ext-orng > > But, Then I looked at the mailing list archives and noticed that a similar > proposal had been made before. > > https://externals.io/message/98021#98130 > > I feel that this suggestion is needed now to expand PHP beyond the web. > > Second suggestion is to stop using the Combined LCG as the default seed > value for each function. > > PHP's Combined LCG only uses PID (or ZTS Thread ID) and time as entropy. > https://github.com/php/php-src/blob/master/ext/standard/lcg.c#L72 > > With the development of container technology, this problem seems to be > getting more serious. So I think we should use the random numbers provided > by the OS (getrandom on Linux) if available. > > I would like to hear your opinions. > > Regards > Go Kudo >