Test Results #10
-
Test Results: Llama-3.1 70b instruct (fireworks.ai)
=== Test Results Summary ===
Test Case: Arena Bench Hard
Test Case: Big Code Bench
Test Case: Maths Problem
Test Case: GSM8K
-
Test Results: Gemma 2 2b it Q8_0 (local: llama_cpp_python)
=== Test Results Summary ===
Test Case: Arena Bench Hard
Test Case: Big Code Bench
Test Case: Maths Problem
Test Case: GSM8K
-
Test Results: Phi-3.5 Mini 128K Instruct (openrouter.ai)
=== Test Results Summary ===
Test Case: Arena Bench Hard
Test Case: Big Code Bench
Test Case: Maths Problem
Test Case: GSM8K
-
Just a thread to post test results.
I updated test.py to support base_url, e.g. to use another backend like openrouter.ai.
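For anyone wondering what base_url support in a test script might look like, here is a minimal sketch. It is not the actual test.py: the flag names (--base_url, --model), the default values, the OPENAI_API_KEY variable name and the helper names parse_args and client_kwargs are all assumptions made for illustration. The only external fact it relies on is that openrouter.ai exposes an OpenAI-compatible endpoint at https://openrouter.ai/api/v1.

```python
import argparse
import os

# Assumed default endpoint; the real test.py may use a different one.
DEFAULT_BASE_URL = "https://api.openai.com/v1"


def parse_args(argv=None):
    """Parse the (hypothetical) command-line flags of a test runner."""
    parser = argparse.ArgumentParser(
        description="Run test cases against an OpenAI-compatible backend"
    )
    parser.add_argument(
        "--base_url",
        default=DEFAULT_BASE_URL,
        help="Root URL of the OpenAI-compatible API to send requests to",
    )
    parser.add_argument(
        "--model",
        default="gpt-4o-mini",  # placeholder model id, not taken from the thread
        help="Model identifier understood by the chosen backend",
    )
    return parser.parse_args(argv)


def client_kwargs(args, env=os.environ):
    """Build keyword arguments for an OpenAI-compatible client.

    The returned dict can be passed to a client constructor that accepts
    api_key and base_url, e.g. openai.OpenAI(**kwargs).
    """
    return {
        "api_key": env.get("OPENAI_API_KEY", ""),  # assumed variable name
        "base_url": args.base_url,
    }
```

With flags like these, switching backend would be a matter of running something like: python test.py --base_url https://openrouter.ai/api/v1 --model <model-id>, where <model-id> is whatever identifier the backend expects.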