Tags
Tags mark specific points in history as important.
b3592 2a24c8ca · Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922) · Aug 16, 2024
b3591 e3f6fd56 · ggml : dynamic ggml_sched_max_splits based on graph_size (#9047) · Aug 16, 2024
b3590 4b9afbbe · retrieval : fix memory leak in retrieval query handling (#8955) · Aug 15, 2024
b3589 37501d9c · server : fix duplicated n_predict key in the generation_settings (#8994) · Aug 15, 2024
b3588 4af8420a · common : remove duplicate function llama_should_add_bos_token (#8778) · Aug 15, 2024
b3587 6bda7ce6 · llama : add pre-tokenizer regexes for BLOOM and gpt3-finnish (#8850) · Aug 15, 2024
b3585 234b3067 · server : init stop and error fields of the result struct (#9026) · Aug 15, 2024
b3584 5fd89a70 · Vulkan Optimizations and Fixes (#8959) · Aug 14, 2024
b3583 98a532d4 · server : fix segfault on long system prompt (#8987) · Aug 14, 2024
b3582 43bdd3ce · cmake : remove unused option GGML_CURL (#9011) · Aug 14, 2024
b3581 06943a69 · ggml : move rope type enum to ggml.h (#8949) · Aug 13, 2024
b3580 828d6ff7 · export-lora : throw error if lora is quantized (#9002) · Aug 13, 2024
b3579 fc4ca27b · ci : fix github workflow vulnerable to script injection (#9008) · Aug 12, 2024
b3578 1f67436c · ci : enable RPC in all of the released builds (#9006) · Aug 12, 2024
b3577 0fd93cde · llama : model-based max number of graph nodes calculation (#8970) · Aug 12, 2024
b3576 84eb2f4f · docs: introduce gpustack and gguf-parser (#8873) · Aug 12, 2024
b3575 1262e7ed · grammar-parser : fix possible null-deref (#9004) · Aug 12, 2024
b3574 df5478fb · ggml: fix div-by-zero (#9003) · Aug 12, 2024
b3573 2589292c · Fix a spelling mistake (#9001) · Aug 12, 2024
b3571 5ef07e25 · server : handle models with missing EOS token (#8997) · Aug 12, 2024