为了安全，强烈建议开启2FA双因子认证：User Settings -> Account -> Enable two-factor authentication！！！

Tags

Tags give the ability to mark specific points in history as being important

b3088

b90dc566 · Allow number of nodes in CUDA graph to change (#7738) · Jun 04, 2024
b3087

1442677f · common : refactor cli arg parsing (#7675) · Jun 04, 2024
b3086

554c247c · ggml : remove OpenCL (#7735) · Jun 04, 2024
b3085

0cd6bd34 · llama : remove beam search (#7736) · Jun 04, 2024
b3084

5ca0944a · readme : remove obsolete Zig instructions (#7471) · Jun 04, 2024
b3083

adc9ff38 · llama-bench : allow using a different printer for stderr with -oe (#7722) · Jun 04, 2024
b3082

987d743d · Improve hipBLAS support in CMake (#7696) · Jun 04, 2024
b3080

3b38d486 · Per token attributes (#7685) · Jun 04, 2024
b3079

6d161694 · ggml : prevent builds with -ffinite-math-only (#7726) · Jun 04, 2024
b3078

bde7cd3c · llama : offload to RPC in addition to other backends (#7640) · Jun 03, 2024
b3077

a5735e44 · ggml : use OpenMP as a thread pool (#7606) · Jun 03, 2024
b3076

0b832d53 · make: fix debug options not being applied to NVCC (#7714) · Jun 03, 2024
b3075

3d7ebf63 · Vulkan Mixture of Experts (MoE) support (#7628) · Jun 03, 2024
b3074

a10cda58 · cmake : add pkg-config spec file for llama.cpp (#7702) · Jun 03, 2024
b3073

6f28a333 · llama : MiniCPM support tied embeddings (#7664) · Jun 03, 2024
b3072

549279d8 · llama : avoid double token-to-piece cache (#7654) · Jun 03, 2024
b3071

9e405b6e · kompute : implement op_getrows_f32 (#6403) · Jun 03, 2024
b3070

3413ae21 · fix bug introduced in using calloc (#7701) · Jun 02, 2024
b3067

9422c5e3 · [SYCL] Update rpc-server.cpp to include SYCL backend (#7682) · Jun 02, 2024
b3066

e141ce62 · Fix FlashAttention debug test, FP32 assert (#7684) · Jun 01, 2024

1
…
74
75
76
77
78
79
80
81
82
…
178