Tags · CodeLinaro/llama.cpp

b6029

embeddings: fix extraction of CLS pooling results (ggml-org#14927)

* embeddings: fix extraction of CLS pooling results

* merge RANK pooling into CLS case for inputs

Jul 30, 2025
a118d80
zip
tar.gz
Downloads

b5797

ci : disable fast-math for Metal GHA CI (ggml-org#14478)

* ci : disable fast-math for Metal GHA CI

ggml-ci

* cont : remove -g flag

ggml-ci

Jul 1, 2025
de56944
zip
tar.gz
Downloads

b5752

batch : fix check for empty sequences in memory (ggml-org#14364)

* batch : fix check for empty sequences in memory

ggml-ci

* cont : reuse the var

ggml-ci

Jun 24, 2025
62af464
zip
tar.gz
Downloads

b5689

cmake: remove shader-gen step-targets from ggml-vulkan (ggml-org#14226)

* Remove step-targets from vulkan-shaders-gen

* Unset DESTDIR when building vulkan-shaders-gen

Jun 17, 2025
c465030
zip
tar.gz
Downloads

b5686

common : suggest --jinja when autodetection fails (ggml-org#14222)

Jun 16, 2025
e434e69
zip
tar.gz
Downloads

b5627

llama : support GEGLU for jina-bert-v2 (ggml-org#14090)

Jun 10, 2025
3678b83
zip
tar.gz
Downloads

b5548

CUDA: fix typo in FlashAttention code (ggml-org#13926)

May 30, 2025
e562eec
zip
tar.gz
Downloads

b5460

release : fix windows hip release (ggml-org#13707)

* release : fix windows hip release

* make single hip release with multiple targets

May 22, 2025
3079e9a
zip
tar.gz
Downloads

b5255

ci: fix cross-compile sync issues (ggml-org#12804)

May 1, 2025
d24d592
zip
tar.gz
Downloads

b5098

convert : ability to lazy-load safetensors remotely without downloadi…

…ng to disk (ggml-org#12820)

* gguf util : add SafetensorRemote

* fix style

* convert: add --remote option

* convert : allow using lazy remote tensors

It's a bit slow for now since everything is blocking and single-threaded.

* correct metadata.name

* small style fix

* support HF_TOKEN

* convert : use writeable buffer for remote lazy tensors

* convert : fix flake8 lint regarding lamdba assigment

* multithreaded download

* multithread: print debug

* fix style

* Revert "multithreaded download"

This reverts commit 42fc895.

* bring back _get_request_headers

---------

Co-authored-by: Francis Couture-Harpin <[email protected]>

Apr 10, 2025
64eda5d
zip
tar.gz
Downloads

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b6029

b5797

b5752

b5689

b5686

b5627

b5548

b5460

b5255

b5098

Tags: CodeLinaro/llama.cpp