This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 38
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
1 Restore first token configuration 2 Bug fixes for arch_config 3 Bug fixes for fmha 4 Synchronized part of the code with innersource 5 cmake compilation parameters are the same as ipex 6 FP16 UT bugfix dtype_mma_a and dtype_mma_b should be fp16 7 Updated policy for int4 and default FPU 8 FP16 gemm MatB col_major bugfix --------- Co-authored-by: Ding, Yi1 <yi1.ding@intel.com>
- Loading branch information
1 parent
b13e02f
commit e5510c6
Showing
54 changed files
with
2,147 additions
and
1,257 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,25 +1,6 @@ | ||
set(TARGET stream_k_gemm) | ||
|
||
set(XETLA_KERNEL_FLAGS ${XETLA_KERNEL_FLAGS} -fsycl) | ||
set(XETLA_KERNEL_FLAGS ${XETLA_KERNEL_FLAGS} -fsycl-targets=spir64_gen) | ||
|
||
# disable loop invariance optimization, this is for performance | ||
set(XETLA_OFFLINE_OPTIONS "${XETLA_OFFLINE_OPTIONS} -vc-disable-indvars-opt") | ||
# For large registers mode, enable 256 registers for kernels | ||
set(XETLA_OFFLINE_OPTIONS "${XETLA_OFFLINE_OPTIONS} -doubleGRF") | ||
# For registers usage verbose at AOT | ||
set(XETLA_OFFLINE_OPTIONS "${XETLA_OFFLINE_OPTIONS} -Xfinalizer -printregusage") | ||
# Enable bank conflict reduction. | ||
set(XETLA_OFFLINE_OPTIONS "${XETLA_OFFLINE_OPTIONS} -Xfinalizer -enableBCR") | ||
# Optimization to reduce the tokens used for DPAS instruction. | ||
set(XETLA_OFFLINE_OPTIONS "${XETLA_OFFLINE_OPTIONS} -Xfinalizer -DPASTokenReduction") | ||
|
||
set(XETLA_KERNEL_FLAGS ${XETLA_KERNEL_FLAGS} -Xs) | ||
set(XETLA_KERNEL_FLAGS ${XETLA_KERNEL_FLAGS} "-device pvc -options '${XETLA_OFFLINE_OPTIONS} ' ") | ||
|
||
#build test | ||
add_executable(${TARGET} stream_k_gemm.cpp) | ||
target_link_options(${TARGET} PRIVATE ${XETLA_KERNEL_FLAGS}) | ||
# Disable vector combine, to remove redundant loads and stores | ||
#target_compile_options(${TARGET} PRIVATE -mllvm -disable-vector-combine -fsycl -fsycl-targets=spir64_gen) | ||
|
||
# target_compile_options(${TARGET} PRIVATE -mllvm -disable-vector-combine -fsycl -fsycl-targets=spir64_gen) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.