Admin message
为了安全,强烈建议开启2FA双因子认证:User Settings -> Account -> Enable two-factor authentication!!!
Tags
Tags give the ability to mark specific points in history as being important
b1093
463173a6
·
llama : speedup tokenization (#2831)
·
Aug 27, 2023
b1092
eaa13a48
·
falcon : fix CUDA inference by making K and Q contiguous (#2830)
·
Aug 27, 2023
b1089
a6d1189f
·
k_quants tuning for Falcon-7b (#2816)
·
Aug 27, 2023
b1087
d0cee0d3
·
gguf : add 64-bit support (GGUF v2) (#2821)
·
Aug 27, 2023
b1086
edd4c148
·
llama : more tokenizer fixes (#2810)
·
Aug 27, 2023
b1085
1591e2e5
·
ggml : detect SSSE3 (#2825)
·
Aug 27, 2023
b1083
c1ac54b7
·
server : add `/detokenize` endpoint (#2802)
·
Aug 27, 2023
b1081
c7d92e6d
·
llama : use Unicode Escape Sequence to replace encoded characters (#2814)
·
Aug 26, 2023
b1079
741ca7dd
·
llama : move #includes out of _GNU_SOURCE conditional (#2817)
·
Aug 26, 2023
b1078
72f895c9
·
main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (#1528)
·
Aug 26, 2023
b1077
50526f37
·
llama : use std::abs in llama_sample_tail_free (#2800)
·
Aug 26, 2023
b1076
04f4b1eb
·
k-quants : remove unnecessary tensor shape restrictions (#2811)
·
Aug 26, 2023
b1075
75923754
·
Better perplexity for 2- and 3-bit quantization for LLaMA-v2-70B (#2807)
·
Aug 26, 2023
b1074
771551a7
·
Fix HellaSwag (#2805)
·
Aug 26, 2023
b1071
2ba83c86
·
Fix spm whitespaces (#2806)
·
Aug 26, 2023
ci_cublas_linux-b1071-5562e3e
5562e3e6
·
temporarily disable broken 512 build
·
Aug 26, 2023
b1069
232caf3c
·
llama : fix struct decl (#2790)
·
Aug 25, 2023
b1068
d046dcee
·
Faster perplexity computation (#2786)
·
Aug 25, 2023
b1067
c82742ac
·
llama : add llama_beam_search() (#2267)
·
Aug 25, 2023
b1065
154725c5
·
llama-bench : add model sizes (#2771)
·
Aug 25, 2023
1
…
140
141
142
143
144
145
146
147
148
…
178