为了安全，强烈建议开启2FA双因子认证：User Settings -> Account -> Enable two-factor authentication！！！

Tags

Tags give the ability to mark specific points in history as being important

b2293

08c5ee87 · llama : remove deprecated API (#5770) · Feb 28, 2024
b2291

8c0e8f4e · sync : ggml · Feb 28, 2024
b2288

a693bea1 · server : hit Ctrl+C twice to exit (#5734) · Feb 28, 2024
b2287

adcb12a9 · llama : fix non-quantization of expert gating tensors (#5754) · Feb 28, 2024
b2286

177628bf · llama : improve BERT tokenization (#5740) · Feb 28, 2024
b2284

efc72253 · server : add "/chat/completions" alias for "/v1/...` (#5722) · Feb 28, 2024
b2283

7c4263d4 · ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (#5760) · Feb 28, 2024
b2282

cb49e0f8 · Attempt to fix android build (#5752) · Feb 27, 2024
b2281

0becb22a · IQ4_XS: a 4.25 bpw quantization (#5747) · Feb 27, 2024
b2280

c24a2a6e · cuda : replace remaining shfl_xor with calls to warp_reduce functions (#5744) · Feb 27, 2024
b2279

1f30b7a9 · ggml-quants : fix avx2 iq1_s vec_dot when compiled with gcc (#5742) · Feb 27, 2024
b2278

9d533a77 · llama : fix defrag bugs + add parameter (#5735) · Feb 27, 2024
b2277

cbbd1efa · Makefile: use variables for cublas (#5689) · Feb 27, 2024
b2276

b11a93df · fix server hangs on empty prompt (#5733) · Feb 26, 2024
b2275

a33e6a0d · Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (#5721) · Feb 26, 2024
b2274

47bb7b48 · CUDA: fix DEBUG_CUDA_MALLOC (#5729) · Feb 26, 2024
b2272

e849078c · [SYCL] Add support for soft_max ALiBi (#5639) · Feb 26, 2024
b2271

67fd3313 · unicode : reuse iterator (#5726) · Feb 26, 2024
b2270

4804215c · server: CI fix trailing space (#5728) · Feb 26, 2024
b2269

8a533f0d · server: CI tests reduce build matrix (#5725) · Feb 26, 2024

1
…
99
100
101
102
103
104
105
106
107
…
178