Commit History

ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)
fea8f94

taronaeo slaren commited on

ggml-cpu : split arch-specific implementations (llama/13892)
8c833e9

xctan ggerganov HF Staff commited on

ggml : refactor online repacking (llama/10446)
163128e

Djip007 ggerganov HF Staff commited on

ggml : move AMX to the CPU backend (llama/10570)
3732429

Diego Devesa commited on