Actions: ggml-org/llama.cpp

Pull Request Labeler

12,806 workflow runs

SvelteKit-based WebUI
Pull Request Labeler #14151: Pull request #14839 synchronize by allozaur
July 30, 2025 15:49 Queued
SvelteKit-based WebUI
Pull Request Labeler #14150: Pull request #14839 synchronize by allozaur
July 30, 2025 15:46 Queued
llama-server : implement universal assisted decoding
Pull Request Labeler #14149: Pull request #12635 synchronize by g2mt
July 30, 2025 15:42 Queued
llama-server : implement universal assisted decoding
Pull Request Labeler #14148: Pull request #12635 synchronize by g2mt
July 30, 2025 15:42 Queued
llama-server : implement universal assisted decoding
Pull Request Labeler #14147: Pull request #12635 synchronize by g2mt
July 30, 2025 15:42 Queued
Optimize l2_norm_f32 op with SIMD
Pull Request Labeler #14146: Pull request #14970 opened by TIKki43
July 30, 2025 15:41 Queued
Implementation of GGML_NUMA_MIRROR for 64% inferencing performance gain on numa systems
Pull Request Labeler #14145: Pull request #14969 opened by dbsanfte
July 30, 2025 14:38 Queued
vulkan: optimizations for direct convolution
Pull Request Labeler #14144: Pull request #14933 synchronize by jeffbolznv
July 30, 2025 14:13 50m 25s
Refactor: Merge build_moe_ffn_from_probs function into build_moe_ffn
Pull Request Labeler #14143: Pull request #14968 opened by wdl339
July 30, 2025 14:08 43m 49s
Q2k interleaving implementation - x86/x64 SIMD
Pull Request Labeler #14142: Pull request #14373 synchronize by Srihari-mcw
July 30, 2025 13:34 22m 9s
sync : ggml
Pull Request Labeler #14141: Pull request #14967 opened by ggerganov
July 30, 2025 13:03 19m 23s
Improve Mistral models integration with llama.cpp
Pull Request Labeler #14140: Pull request #14737 synchronize by juliendenize
July 30, 2025 13:01 18m 39s
Improve Mistral models integration with llama.cpp
Pull Request Labeler #14139: Pull request #14737 synchronize by juliendenize
July 30, 2025 12:56 3m 50s
Improve Mistral models integration with llama.cpp
Pull Request Labeler #14138: Pull request #14737 synchronize by juliendenize
July 30, 2025 12:55 4m 31s
HIP: enable mfma mmq on gfx908 and gfx90a for select datatypes and shapes
Pull Request Labeler #14137: Pull request #14949 synchronize by IMbackK
July 30, 2025 12:54 11s
ggml : repack block_iq4_nlx8 (AVX)
Pull Request Labeler #14136: Pull request #14904 synchronize by ggerganov
July 30, 2025 12:36 11s
Added dynamic context size. This is perfect for servers running llama models as a service.
Pull Request Labeler #14135: Pull request #13295 synchronize by J4e6eR
July 30, 2025 12:32 11s
Support intern-s1
Pull Request Labeler #14134: Pull request #14875 synchronize by RunningLeon
July 30, 2025 12:28 52s
Added dynamic context size. This is perfect for servers running llama models as a service.
Pull Request Labeler #14133: Pull request #13295 synchronize by J4e6eR
July 30, 2025 12:17 4m 31s
Added dynamic context size. This is perfect for servers running llama models as a service.
Pull Request Labeler #14132: Pull request #13295 synchronize by J4e6eR
July 30, 2025 11:51 13m 5s
tests : update for LLAMA_SET_ROWS=1
Pull Request Labeler #14131: Pull request #14961 synchronize by ggerganov
July 30, 2025 11:46 24s
SYCL: experimental gemv kernel for q4_K
Pull Request Labeler #14130: Pull request #14947 synchronize by Alcpz
July 30, 2025 11:13 12m 39s
Q2k interleaving implementation - x86/x64 SIMD
Pull Request Labeler #14129: Pull request #14373 synchronize by Srihari-mcw
July 30, 2025 11:07 7m 47s
tests : update for LLAMA_SET_ROWS=1
Pull Request Labeler #14128: Pull request #14961 synchronize by ggerganov
July 30, 2025 10:53 6m 8s
Added dynamic context size. This is perfect for servers running llama models as a service.
Pull Request Labeler #14127: Pull request #13295 synchronize by J4e6eR
July 30, 2025 10:30 11s