为了安全，强烈建议开启2FA双因子认证：User Settings -> Account -> Enable two-factor authentication！！！

Tags

Tags give the ability to mark specific points in history as being important

b3644

42c76d13 · Threadpool: take 2 (#8672) · Aug 30, 2024
b3643

9f7d4bcf · server : fix crash when error handler dumps invalid utf-8 json (#9195) · Aug 30, 2024
b3639

20f1789d · vulkan : fix build (#0) · Aug 27, 2024
b3636

78eb487b · llama : fix qs.n_attention_wv for DeepSeek-V2 (#9156) · Aug 27, 2024
b3635

a77feb5d · server : add some missing env variables (#9116) · Aug 27, 2024
b3634

2e59d61c · llama : fix ChatGLM4 wrong shape (#9194) · Aug 27, 2024
b3633

75e1dbba · llama : fix llama3.1 rope_freqs not respecting custom head_dim (#9141) · Aug 27, 2024
b3632

ad76569f · common : Update stb_image.h to latest version (#9161) · Aug 27, 2024
b3631

7d787ed9 · ggml : do not crash when quantizing q4_x_x with an imatrix (#9192) · Aug 26, 2024
b3630

06658ad7 · metal : separate scale and mask from QKT in FA kernel (#9189) · Aug 26, 2024
b3629

fc18425b · ggml : add SSM Metal kernels (#8546) · Aug 26, 2024
b3628

879275ac · tests : fix compile warnings for unreachable code (#9185) · Aug 26, 2024
b3627

7a3df798 · ci : add VULKAN support to ggml-ci (#9055) · Aug 26, 2024
b3625

0c41e03c · metal : gemma2 flash attention support (#9159) · Aug 26, 2024
b3623

436787f1 · llama : fix time complexity of string replacement (#9163) · Aug 26, 2024
b3622

93bc3839 · common: fixed not working find argument --n-gpu-layers-draft (#9175) · Aug 26, 2024
b3621

f91fc563 · CUDA: fix Gemma 2 numerical issues for FA (#9166) · Aug 25, 2024
b3620

e11bd856 · CPU/CUDA: Gemma 2 FlashAttention support (#8542) · Aug 24, 2024
b3619

8f824ffe · quantize : fix typo in usage help of `quantize.cpp` (#9145) · Aug 24, 2024
b3618

3ba780e2 · lora : fix llama conversion script with ROPE_FREQS (#9117) · Aug 23, 2024

1
…
56
57
58
59
60
61
62
63
64
…
178