Admin message
为了安全,强烈建议开启2FA双因子认证:User Settings -> Account -> Enable two-factor authentication!!!
Tags
Tags give the ability to mark specific points in history as being important
b3088
b90dc566
·
Allow number of nodes in CUDA graph to change (#7738)
·
Jun 04, 2024
b3087
1442677f
·
common : refactor cli arg parsing (#7675)
·
Jun 04, 2024
b3086
554c247c
·
ggml : remove OpenCL (#7735)
·
Jun 04, 2024
b3085
0cd6bd34
·
llama : remove beam search (#7736)
·
Jun 04, 2024
b3084
5ca0944a
·
readme : remove obsolete Zig instructions (#7471)
·
Jun 04, 2024
b3083
adc9ff38
·
llama-bench : allow using a different printer for stderr with -oe (#7722)
·
Jun 04, 2024
b3082
987d743d
·
Improve hipBLAS support in CMake (#7696)
·
Jun 04, 2024
b3080
3b38d486
·
Per token attributes (#7685)
·
Jun 04, 2024
b3079
6d161694
·
ggml : prevent builds with -ffinite-math-only (#7726)
·
Jun 04, 2024
b3078
bde7cd3c
·
llama : offload to RPC in addition to other backends (#7640)
·
Jun 03, 2024
b3077
a5735e44
·
ggml : use OpenMP as a thread pool (#7606)
·
Jun 03, 2024
b3076
0b832d53
·
make: fix debug options not being applied to NVCC (#7714)
·
Jun 03, 2024
b3075
3d7ebf63
·
Vulkan Mixture of Experts (MoE) support (#7628)
·
Jun 03, 2024
b3074
a10cda58
·
cmake : add pkg-config spec file for llama.cpp (#7702)
·
Jun 03, 2024
b3073
6f28a333
·
llama : MiniCPM support tied embeddings (#7664)
·
Jun 03, 2024
b3072
549279d8
·
llama : avoid double token-to-piece cache (#7654)
·
Jun 03, 2024
b3071
9e405b6e
·
kompute : implement op_getrows_f32 (#6403)
·
Jun 03, 2024
b3070
3413ae21
·
fix bug introduced in using calloc (#7701)
·
Jun 02, 2024
b3067
9422c5e3
·
[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)
·
Jun 02, 2024
b3066
e141ce62
·
Fix FlashAttention debug test, FP32 assert (#7684)
·
Jun 01, 2024
1
…
74
75
76
77
78
79
80
81
82
…
178