Commit History

OpenCL: add initial FA support (llama/14987)
8ece1ee

mrfatso commited on

opencl: add initial mxfp4 support via mv (llama/15270)
1a0281c

lhez shawngu-quic commited on

opencl: allow mixed f16/f32 `add` (llama/15140)
345810b

mrfatso commited on

opencl: support sink in `soft_max` (attn sinks) (llama/15152)
d8664e4

lhez commited on

opencl: add `swiglu_oai` and `add_id` (llama/15121)
1c97db6

lhez commited on

opencl: add f16 for `add`, `sub`, `mul`, `div` (llama/14984)
4dc1834

lhez commited on

opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` (llama/14809)
05577c3

lhez commited on

opencl: add fused `rms_norm_mul` (llama/14841)
5629961

lhez commited on

opencl: fix `im2col` when `KW!=KH` (llama/14803)
2fdd2df

Sigbjørn Skjæret commited on

opencl: add conv2d kernel (llama/14403)
d579f20

mrfatso commited on

opencl: add tiled mul_mat_f16_f32 (llama/14535)
398dc49

mrfatso commited on

opencl: add `set_rows` for `f16` and `f32` (llama/14547)
5e203ec

lhez commited on

ggml : add ggml_scale_bias (llama/14417)
573d50a

ngxson HF Staff commited on

opencl: add GELU_ERF (llama/14476)
b19d736

Sigbjørn Skjæret commited on

ggml : implement GEGLU_ERF and GEGLU_QUICK ops (llama/14445)
f798922

Sigbjørn Skjæret commited on

opencl : broadcast for soft_max (llama/14510)
4434043

lhez commited on

opencl : update upscale to support align corners (llama/14488)
2b95b05

lhez commited on

opencl : add GEGLU, REGLU, SWIGLU (llama/14456)
d70ff9f

lhez commited on

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)
d0a458b

lhez commited on

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)
5ff8785

rmatif commited on

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787)
1ab0f23

lhez commited on

opencl : remove obsolete files (skip) (ggml/1200)
adc6542

ggerganov commited on

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886)
291a5b7

lhez Shangqing Gu commited on

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600)
3261fcd

lhez commited on

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)
94449e3

lhez commited on

opencl: fix for small models (llama/11950)
4532dc6

lhez Shawn Gu Skyler Szot commited on

opencl: Fix rope and softmax (llama/11833)
bf3b6f8

lhez commited on

ggml : add opencl backend (skip) (llama/10693)
226358f

lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky commited on