Tags
Tags give the ability to mark specific points in history as being important
b2293 · 08c5ee87 · llama : remove deprecated API (#5770) · Feb 28, 2024
b2291 · 8c0e8f4e · sync : ggml · Feb 28, 2024
b2288 · a693bea1 · server : hit Ctrl+C twice to exit (#5734) · Feb 28, 2024
b2287 · adcb12a9 · llama : fix non-quantization of expert gating tensors (#5754) · Feb 28, 2024
b2286 · 177628bf · llama : improve BERT tokenization (#5740) · Feb 28, 2024
b2284 · efc72253 · server : add "/chat/completions" alias for "/v1/...` (#5722) · Feb 28, 2024
b2283 · 7c4263d4 · ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (#5760) · Feb 28, 2024
b2282 · cb49e0f8 · Attempt to fix android build (#5752) · Feb 27, 2024
b2281 · 0becb22a · IQ4_XS: a 4.25 bpw quantization (#5747) · Feb 27, 2024
b2280 · c24a2a6e · cuda : replace remaining shfl_xor with calls to warp_reduce functions (#5744) · Feb 27, 2024
b2279 · 1f30b7a9 · ggml-quants : fix avx2 iq1_s vec_dot when compiled with gcc (#5742) · Feb 27, 2024
b2278 · 9d533a77 · llama : fix defrag bugs + add parameter (#5735) · Feb 27, 2024
b2277 · cbbd1efa · Makefile: use variables for cublas (#5689) · Feb 27, 2024
b2276 · b11a93df · fix server hangs on empty prompt (#5733) · Feb 26, 2024
b2275 · a33e6a0d · Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (#5721) · Feb 26, 2024
b2274 · 47bb7b48 · CUDA: fix DEBUG_CUDA_MALLOC (#5729) · Feb 26, 2024
b2272 · e849078c · [SYCL] Add support for soft_max ALiBi (#5639) · Feb 26, 2024
b2271 · 67fd3313 · unicode : reuse iterator (#5726) · Feb 26, 2024
b2270 · 4804215c · server: CI fix trailing space (#5728) · Feb 26, 2024
b2269 · 8a533f0d · server: CI tests reduce build matrix (#5725) · Feb 26, 2024