Eval bug: Vulkan/HIP is way too slow in comparison to llama.cpp #59

### Name and Version

0.3.2 / 0.3.1 / 0.3.0

### Operating systems

Linux

### GGML backends

Vulkan

### Hardware

7900 XTX (24 GB VRAM), 64 GB DDR5

### Models

Qwen 27B Q4KXL / Qwen 35B A3B Q4KM

### Problem description & steps to reproduce

Vulkan is far too slow: about 40 t/s with MTP even at very low context, while llama.cpp gives 85 t/s at the start of a context.
HIP is even worse, at 25 t/s with MTP.
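
To put the gap in relative terms, here is a minimal sketch that only restates the three figures above as fractions of the llama.cpp number. Nothing here is measured by the script. The dictionary keys and the `relative_throughput` helper are illustrative names, not part of either project, and the llama.cpp backend is left unlabeled because it is not stated above.

```python
# Throughput figures as reported in this issue, in tokens per second (t/s).
# These are copied from the description above, not measured by this script.
REPORTED_TPS = {
    "llama.cpp": 85.0,      # baseline; backend not stated in the report
    "Vulkan + MTP": 40.0,
    "HIP + MTP": 25.0,
}


def relative_throughput(tps, baseline):
    """Return each entry's throughput as a fraction of the baseline entry."""
    base = tps[baseline]
    return {name: value / base for name, value in tps.items()}


if __name__ == "__main__":
    fractions = relative_throughput(REPORTED_TPS, "llama.cpp")
    for name, frac in fractions.items():
        print(f"{name:>14}: {REPORTED_TPS[name]:5.1f} t/s ({frac:.0%} of llama.cpp)")
```

By these numbers Vulkan reaches roughly 47% of the llama.cpp figure and HIP roughly 29%.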

### First Bad Commit

_No response_

### Relevant log output

.
