Skip to content

Add Batch=32 / paged attention / and new RoPE module support to all Llama3 demo and tests #32885

Add Batch=32 / paged attention / and new RoPE module support to all Llama3 demo and tests

Add Batch=32 / paged attention / and new RoPE module support to all Llama3 demo and tests #32885

Annotations

2 warnings

Run Pre-commit Hooks

succeeded Nov 26, 2024 in 27s