[WIP] jit/core redesign #7

Open · s-m-e wants to merge 347 commits into main
@s-m-e (Member) commented Feb 24, 2024

Objective

This PR serves as the foundation for both orbit and state arrays. It focuses on the functionality provided by the core module and its invocation. All relevant core functions are designed to work equally on CPUs and GPUs, as either universal functions or generalized universal functions. As a side effect, all relevant core functions allow parallel operation with full broadcasting semantics.

Summary

All dependencies of the Orbit and state classes on core are refactored as follows:

  • Functions decorated by either numba.vectorize or numba.guvectorize serve as the only interface between regular, uncompiled Python code and core.
  • Functions decorated by numba.vectorize or numba.guvectorize only call functions decorated by numba.jit/numba.cuda.jit.
  • Functions decorated by numba.jit/numba.cuda.jit can only call each other.
  • Functions decorated by numba.vectorize, numba.guvectorize and numba.jit/numba.cuda.jit:
    • may only depend on the math module from Python's standard library, not on numpy (except for certain details like enforcing floating point precision)
    • are fully typed, loosely following numba semantics via shortcuts

The above-mentioned hierarchy of decorators is imposed by CUDA compatibility. While functions decorated by numba.jit (targets cpu and parallel) can be called from uncompiled Python code, functions decorated by numba.cuda.jit (target cuda) are considered "device functions" and cannot be called from uncompiled Python code directly. They are supposed to be called by CUDA kernels (or other device functions) only (slightly simplifying the actual situation as implemented by numba). If the target is set to cuda, functions decorated by numba.vectorize and numba.guvectorize become CUDA kernels.
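As a plain-numba illustration of this layering (the function names here are made up for demonstration):

```python
import math
import numba as nb

# Innermost layer: a compiled scalar helper. On the cuda target the
# equivalent decorator would be numba.cuda.jit(device=True), and the
# function could then no longer be called from uncompiled Python code.
@nb.njit("float64(float64, float64)")
def _hypot_hf(x, y):
    return math.sqrt(x * x + y * y)

# Outermost layer: a universal function, the only interface exposed to
# regular Python code. It does nothing but dispatch to compiled helpers
# and provides broadcasting for free.
@nb.vectorize(["float64(float64, float64)"], target="cpu")
def hypot_vf(x, y):
    return _hypot_hf(x, y)
```

Called with numpy arrays, hypot_vf broadcasts element-wise; called with scalars, it behaves like a plain function.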

Eliminating numpy as a dependency serves two purposes: it contributes to CUDA compatibility, and it makes the code significantly faster on CPUs.

New decorators are introduced, wrapping numba.jit, numba.cuda.jit, numba.vectorize and numba.guvectorize. They centralize compiler options and target switching (cpu, parallel or cuda) and simplify typing:
- vjit: Wraps numba.vectorize. Functions decorated by it carry the suffix _vf.
- gjit: Wraps numba.guvectorize. Functions decorated by it carry the suffix _gf.
- hjit: Wraps numba.jit or numba.cuda.jit, depending on the compiler target. Functions decorated by it carry the suffix _hf.
- djit: Variation of hjit with a fixed function signature, for user-provided functions consumed by Cowell.

All mentioned wrappers are found in core.jit.
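Based on the conventions above, usage of the wrappers might look roughly like this (a sketch only; the exact signatures of hjit and vjit as implemented in core.jit may differ):

```python
from hapsira.core.jit import hjit, vjit

# "f" is the type shortcut for the configured floating point precision.
@hjit("f(f,f)")
def add_hf(a, b):
    return a + b

# The ufunc layer merely dispatches to the compiled helper.
@vjit("f(f,f)")
def add_vf(a, b):
    return add_hf(a, b)
```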

As a result of the name suffixes, a number of core module functions have been renamed, making the package intentionally backwards incompatible. Functions not yet using the new infrastructure can be recognized by their lack of a suffix. core functions that dynamically generate (and compile) other functions carry _hb, _vb and _gb suffixes.

Math

The former _math module has become a first-class citizen as core.math, fully compiled by the above-mentioned infrastructure.

All compiled code now enforces a single floating point precision level, which can be configured by users. The default is FP64 / double precision. For simplicity, the type shortcut is f. Additional infrastructure can be found in core.math.ieee754.
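A minimal sketch of what such a precision switch could look like (the variable and environment names here are assumptions for illustration, not the actual core.math.ieee754 API):

```python
import os
import numpy as np
from numba import float32, float64

# Hypothetical environment switch; the real mechanism lives in
# core.math.ieee754 and defaults to FP64 / double precision.
_LEVEL = os.environ.get("PRECISION", "f8")

f = {"f4": float32, "f8": float64}[_LEVEL]              # numba-side shortcut
float_ = {"f4": np.float32, "f8": np.float64}[_LEVEL]   # matching numpy dtype
```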

core.math contains a number of replacements for numpy operations, mostly found in core.math.linalg. None of these functions allocate memory, and all are free of side effects; in particular, they do not modify their parameters. 3D vectors are expressed as tuples (type shortcut V, short for Tuple([f,f,f])). Matrices are expressed as tuples of tuples (type shortcut M, short for Tuple([V,V,V])).
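For illustration, a cross product in this style could look as follows (function names are hypothetical; the signatures spell out the V shortcut explicitly):

```python
import math
from numba import float64, njit
from numba.types import UniTuple

# The V shortcut stands for a homogeneous 3-tuple of floats.
V = UniTuple(float64, 3)

@njit(V(V, V))
def cross_hf(a, b):
    # no numpy, no allocation, arguments untouched
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )

@njit(float64(V))
def norm_hf(v):
    return math.sqrt(v[0] * v[0] + v[1] * v[1] + v[2] * v[2])
```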

core.math also replaces (some) required scipy functions:
- scipy.interpolate.interp1d is replaced by core.math.interpolate.interp_hb. It custom-compiles 1D linear interpolators, embedding the data statically into the compiled functions (see the sketch below).
- scipy.integrate.solve_ivp, scipy.integrate.DOP853 and scipy.optimize.brentq are replaced by core.math.ivp.
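The builder pattern behind interp_hb presumably works along these lines (a sketch under the stated assumptions, not the actual implementation):

```python
from numba import float64, njit

def linear_interp_hb(xs, ys):
    # The data points are closed over as tuples, so they are baked into
    # the compiled function as constants; the interpolator itself takes
    # only the query point.
    xs = tuple(float(x) for x in xs)  # assumed sorted ascending
    ys = tuple(float(y) for y in ys)
    n = len(xs)

    @njit(float64(float64))
    def interp_hf(x):
        if x <= xs[0]:
            return ys[0]
        for i in range(1, n):
            if x <= xs[i]:
                t = (x - xs[i - 1]) / (xs[i] - xs[i - 1])
                return ys[i - 1] + t * (ys[i] - ys[i - 1])
        return ys[n - 1]

    return interp_hf
```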

Style

core modules now explicitly export APIs via __all__.
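For example (names illustrative):

```python
# at the top of a core module
__all__ = ["cross_hf", "norm_hf"]
```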

Settings

This PR introduces a new settings module. It currently serves to control compiler options, e.g. the compiler target (cpu, parallel or cuda). Settings can be changed either by setting environment variables or by importing the settings module before any other (sub-)module is imported.
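Both routes could look roughly like this (the variable and attribute names are assumptions for illustration; see the settings module for the actual API):

```python
# (a) via an environment variable, set before the package is imported
#     (hypothetical variable name):
#
#     TARGET=cuda python script.py

# (b) via the settings module, imported before any other (sub-)module
#     (hypothetical attribute name):
from hapsira import settings
settings.TARGET = "cuda"

import hapsira.core  # compiled for the cuda target from here on
```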

Logging

This PR introduces basic logging functionality. The logger's name equals the package name. The logger can also be imported from the new debug module. At the moment, it only logs compiler status messages and issues.
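Standard logging configuration therefore applies; for example (the logger attribute name in debug is assumed):

```python
import logging
from hapsira.debug import logger  # per this PR; attribute name assumed

# Surface compiler status messages and issues on the console.
logger.setLevel(logging.DEBUG)
logger.addHandler(logging.StreamHandler())
```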

Blocking numba issues for CUDA-compatibility

Non-blocking numba issues for CUDA-compatibility with workaround present

Non-blocking numba issues unrelated to CUDA with workaround present

TODO

  • Code style, docstrings and comments in core.math.ivp
  • Compiled non-linear interpolators
  • Documentation

📚 Documentation preview 📚: https://hapsira--7.org.readthedocs.build/en/7/
