Feature
Currently the ONNX backend in wasmtime-wasi-nn only uses the default CPU execution provider and ignores the ExecutionTarget requested by the WASM caller.
I would like to suggest adding support for additional execution providers (CUDA, TensorRT, ROCm, ...) to wasmtime-wasi-nn.
Benefit
Improved performance for WASM modules using the wasi-nn API.
Implementation
ort already has support for many execution providers, so integrating these into wasmtime-wasi-nn should not be too much work.
I would be interested in looking into this; however, I only have the means to test the DirectML and NVIDIA CUDA / TensorRT EPs.
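As a rough sketch of what the wiring could look like (assuming the wasi-nn ExecutionTarget variants Cpu, Gpu, and Tpu; the provider names and fallback order below are illustrative assumptions, not wasmtime's actual behavior — the real registration would go through ort's session builder):

```rust
// Illustrative only: map a wasi-nn ExecutionTarget to an ordered list of
// ONNX Runtime execution providers to try, most preferred first. The names
// and the fallback order here are assumptions for the sake of the sketch.
#[derive(Debug, Clone, Copy, PartialEq)]
enum ExecutionTarget {
    Cpu,
    Gpu,
    Tpu,
}

fn preferred_providers(target: ExecutionTarget) -> Vec<&'static str> {
    match target {
        // CPU: the default provider, always available.
        ExecutionTarget::Cpu => vec!["CPUExecutionProvider"],
        // GPU: try the more specialized providers first, then fall back
        // to CPU if none are available at runtime.
        ExecutionTarget::Gpu => vec![
            "TensorrtExecutionProvider",
            "CUDAExecutionProvider",
            "ROCMExecutionProvider",
            "DmlExecutionProvider",
            "CPUExecutionProvider",
        ],
        // TPU: ONNX Runtime has no TPU provider; fall back to CPU here
        // (the backend could also return an "unsupported" error instead).
        ExecutionTarget::Tpu => vec!["CPUExecutionProvider"],
    }
}

fn main() {
    let gpu = preferred_providers(ExecutionTarget::Gpu);
    assert_eq!(gpu.first(), Some(&"TensorrtExecutionProvider"));
    assert_eq!(gpu.last(), Some(&"CPUExecutionProvider"));
    assert_eq!(
        preferred_providers(ExecutionTarget::Cpu),
        vec!["CPUExecutionProvider"]
    );
    println!("provider mapping ok");
}
```

Registering providers in preference order like this mirrors how ONNX Runtime itself falls back when a provider is unavailable, so a Gpu request on a machine without a GPU would still run on the CPU rather than fail.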
Alternatives
Leave it to the users to add support for additional execution providers.
I was looking at old issues and ran across this one (sorry for such a late reply!): I completely agree with this idea. I am tempted to say "go for it!" but maybe there is some coordination needed. E.g., I think @jianjunz has started enabling some DirectML bits in #8756. And @devigned may have some opinions on the best way to do this. But from my perspective, this seems like a worthwhile avenue to pursue.
I think this is a great idea! One interesting part will be testing. We may need to spin up some hardware to make sure the functionality stays evergreen.
Referenced code: wasmtime/crates/wasi-nn/src/backend/onnxruntime.rs, lines 21 to 33 in 24c1388.