Admin message
为了安全,强烈建议开启2FA双因子认证:User Settings -> Account -> Enable two-factor authentication!!!
Tags
Tags give the ability to mark specific points in history as being important
b2079
2c516611
·
CUDA: mul_mat_vec_q for batch sizes > 1 (#5351)
·
Feb 06, 2024
b2078
8a79c591
·
server : include total "num_slots" in props endpoint (#5349)
·
Feb 06, 2024
b2077
31e79032
·
server : add `dynatemp_range` and `dynatemp_exponent` (#5352)
·
Feb 06, 2024
b2076
4ffc7a17
·
server : various fixes for the prompt field in /completion (#5300)
·
Feb 06, 2024
b2074
098f6d73
·
make: Use ccache for faster compilation (#5318)
·
Feb 05, 2024
b2072
c6b39553
·
ggml : make use of ggml-quants.h possible in C++ code (#5338)
·
Feb 05, 2024
b2071
abb61944
·
ggml : avoid duplicating function calls using MIN/MAX macros (#5325)
·
Feb 05, 2024
b2070
89503dcb
·
iq3_xxs: quards for the no-imatrix situation (#5334)
·
Feb 05, 2024
b2068
6fdfa2ec
·
iq2_xxs: tune quantization (#5320)
·
Feb 05, 2024
b2067
a2d60c91
·
server : allow to get default generation settings for completion (#5307)
·
Feb 05, 2024
b2066
e6f81775
·
common : add dynamic temperature parameters to main example cli (#5295)
·
Feb 05, 2024
b2062
4833ac20
·
[SYCL] Fix cpy with dims of 3 (#5289)
·
Feb 05, 2024
b2061
9392ebd4
·
flake.lock: Update
·
Feb 04, 2024
b2060
5ed26e1f
·
Adding some imatrix tools (#5302)
·
Feb 04, 2024
b2059
277fad30
·
cmake : use set() for LLAMA_WIN_VER (#5298)
·
Feb 03, 2024
b2058
3c0d25c4
·
make: add nvcc info print (#5310)
·
Feb 03, 2024
b2057
3cc5ed35
·
make: fix nvcc optimization flags for host code (#5309)
·
Feb 03, 2024
b2055
e920ed39
·
Vulkan Intel Fixes, Optimizations and Debugging Flags (#5301)
·
Feb 03, 2024
b2054
52bb63c7
·
refactor : switch to emplace_back to avoid extra object (#5291)
·
Feb 03, 2024
b2053
1ec3332a
·
YaRN : store rope scaling type as int32_t in memory (#5285)
·
Feb 03, 2024
1
…
106
107
108
109
110
111
112
113
114
…
178