Skip to content

Implementing 64-bit Maximally Equidistributed F2-Linear Generators with Mersenne Prime Period

License

Notifications You must be signed in to change notification settings

sharase/melg-64

Repository files navigation

Implementing 64-bit Maximally Equidistributed F2-Linear Generators with Mersenne Prime Period

What is MELG-64?

The 64-bit Maximally Equidistributed F2-Linear Generators with Mersenne Prime Period (MELG-64) are 64-bit Mersenne-Twister-type pseudorandom number generators developed between 2014 and 2017, and the corresponding paper was published on ACM TOMS in 2018.

S. Harase and T. Kimoto, "Implementing 64-bit maximally equidistributed F2-linear generators with Mersenne prime period", ACM Transactions on Mathematical Software, Volume 44, Issue 3, April 2018, Article No. 30, 11 pp. Artcle

Background

CPUs and operating systems are moving from 32 to 64 bits, and hence it is important to have good 64-bit pseudorandom number generators (PRNGs) designed to fully exploit these word lengths.

The 32-bit Mersenne Twister (MT) MT19937 (Matsumoto and Nishimura, 1998) is one of the most widely used PRNGs, but it is not completely optimized in terms of high-dimensional uniformity, which is a theoretical criterion of PRNGs. The 32-bit WELL generators (Panneton et. al., 2006) was developed in order to overcome this weakness.

However, for 64-bit PRNGs, MT19937-64 (Nishimura, 2000) and SFMT19937 using SIMD (Saito and Matsumoto, 2008), etc., have been proposed, but there exists no 64-bit MT-type long-period linear PRNG completely optimized for high-dimensional uniformity, such as a variant of WELL generators.

In this page, we introduce 64-bit maximally equidistributed F2-linear PRNGs (MELG-64) that are optimal in this respect and have speeds equivalent to 64-bit Mersenne Twisters.

Feature

MELG19937-64 has the following properties:

  • Very long period 219937-1 106000;
  • High-dimensional uniformity completely optimized;
  • Fast generation competitive with MT19937-64;
  • Memory size requiring only 312 words (similarly to MT19937-64).

We provide the codes for MELG-64 with various period lengths from 2607-1 to 244497-1. The jump-ahead algorithm is also implemented in order to obtain disjoint streams in parallel computing. (The default skip size is 2256.)

Usage

Please click the "Code" button in the upper right of the content pane, and clone this page:

git clone https://github.com/sharase/melg-64.git

or download the zip file. If you want to use MELG19937-64, for example,

cd melg-64
cd melg19937-64
gcc melg19937-64.c -o melg19937-64 -O3 -Wall
./melg19937-64

Before using, please initialize the state by the function init_by_array64(init, length) in a similar way as Mersenne Twisters.

High-dimensional uniformity

The high-dimensional uniformity is a theoretical criterion for PRNGs, which is assessed via the dimension of equidistribution with v-bit accuracy as follows:

Let

x0, x1, ..., xP-1, xP = x0, ...

be a unsigned w-bit binary integer sequence with period P, where w is the word size of the intended machine. Let truncv(xi)
denote the number formed by the v most significant bits of xi. Consider the kv-bit vectors for the entire period:

(truncv(xi), truncv(xi+1), ..., truncv(xi+k-1)),   i = 0, ..., P-1 .

A pseudorandom sequence xi of w-bit integers of period P is said to be k-dimensionally equidistributed with v-bit accuracy if each of the 2kv possible combinations of bits occurs the same number of times over the whole period P, except for the all-zero combination that occurs once less often.

The largest value of k with this property is called the dimension of equidistribution with v-bit accuracy, denoted by k(v).

This definition is based on the assumption that the higher digits are large numbers. In particular, the dimension of equidistribution ensures that the output values with the v most significant bits are uniformly distributed up to dimension k(v). Thus, as a criterion of uniformity, larger values of k(v) for each 1 v w is desirable.

Now we have a trivial upper bound

k(v) log2(P+1) / v

for each v = 1, 2, ...,w. Define the sum of the gaps

Δ := log2(P+1) / v - k(v)),

where the sum is over all 1 v w.

If Δ = 0, the generator is said to be maximally equidistributed (ME).

The aim of our study is to design maximally equidistributed F2-linear PRNGs with similar speed as 64-bit Mersenne Twisters.

Performance

We compare the following MT-type PRNGs corresponding to 64-bit output sequences:

  • MELG19937-64: the 64-bit integer output of our proposed generator;
  • MT19937-64: the 64-bit integer output of the 64-bit Mersenne Twister (downloaded from here);
  • MT19937-64 (ID3): the 64-bit integer output of a 64-bit Mersenne Twister based on a five-term recursion (ID3) (Nishimura 2000);
  • SFMT19937-64 (without SIMD): the 64-bit integer output of the SIMD-oriented Fast Mersenne Twister SFMT19937 without SIMD (Saito and Matsumoto 2008);
  • SFMT19937-64 (with SIMD): the 64-bit integer output of the foregoing with SIMD (Saito and Matsumoto 2008).

We measure the CPU time (in seconds) taken to generate 109 64-bit unsigned integers. The following table summarizes the timing and the figures of merit Δ and N1. See the remark below for the definition of N1.

Generators CPU time (Intel) CPU time (AMD) Δ N1
MELG19937-64 4.2123 6.2920 0 9603
MT19937-64 5.1002 6.6490 7820 285
MT19937-64 (ID3) 4.8993 6.7930 7940 5795
SFMT19937-64 (without SIMD) 4.2654 5.6123 14095 6711
SFMT19937-64 (with SIMD) 1.8457 2.8806 14095 6711

Platforms (64-bit CPUs and OSs):
  • CPU time (Intel): Intel Core i7-3770 (3.40GHz) Linux gcc compiler with -O3
  • CPU time (AMD): AMD Phenom II X6 1045T (2.70 GHz) Linux gcc compiler with -O3

SFMT19937 is very fast but Δ for SFMT19937 is large. (In fact, the SFMT generators are optimized under the assumption that one will mainly be using 32-bit output sequences. For double-precision floating-point numbers, dSFMT is faster than SFMT and is also improved from the viewpoint of the dimensions of equidistribution with v-bit accuracy.)

Remark:

For MT-type PRNGs, each bit of the sequence obeys a linear feedback shift register generator with characteristic polynomial P(x) over the two-element field F2. Let N1 be the number of nonzero coefficients of P(x). In addition to the excellent equidistribution, as a secondary criterion, N1 should be large enough. This criterion implies that the PRNG avoids a long-lasting impact for poor initialization, such as 0-excess states (Panneton et al, 2006).

Further reading

For more information, please see the presentation slides here.

  • S. Harase and T. Kimoto, "Implementing 64-bit maximally equidistributed F2-linear generators with Mersenne prime period", ACM Transactions on Mathematical Software, Volume 44, Issue 3, April 2018, Article No. 30, 11 Pages. Artcle
  • S. Harase, "Conversion of Mersenne Twister to double-precision floating-point numbers", Mathematics and Computers in Simulation, Volume 161, July 2019, Pages 76-83. Article

日本語の発表スライドもこちらに公開しました。Youtubeの解説動画も作成しました。

Acknowledgments

This work was partially supported by JSPS KAKENHI Grant Numbers JP18K18016, JP26730015, JP26310211, JP15K13460, JP12J07985.

About

Implementing 64-bit Maximally Equidistributed F2-Linear Generators with Mersenne Prime Period

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages