为了安全，强烈建议开启2FA双因子认证：User Settings -> Account -> Enable two-factor authentication！！！

Tags

Tags give the ability to mark specific points in history as being important

b3592

2a24c8ca · Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922) · Aug 16, 2024
b3591

e3f6fd56 · ggml : dynamic ggml_sched_max_splits based on graph_size (#9047) · Aug 16, 2024
b3590

4b9afbbe · retrieval : fix memory leak in retrieval query handling (#8955) · Aug 15, 2024
b3589

37501d9c · server : fix duplicated n_predict key in the generation_settings (#8994) · Aug 15, 2024
b3588

4af8420a · common : remove duplicate function llama_should_add_bos_token (#8778) · Aug 15, 2024
b3587

6bda7ce6 · llama : add pre-tokenizer regexes for BLOOM and gpt3-finnish (#8850) · Aug 15, 2024
b3585

234b3067 · server : init stop and error fields of the result struct (#9026) · Aug 15, 2024
b3584

5fd89a70 · Vulkan Optimizations and Fixes (#8959) · Aug 14, 2024
b3583

98a532d4 · server : fix segfault on long system prompt (#8987) · Aug 14, 2024
b3582

43bdd3ce · cmake : remove unused option GGML_CURL (#9011) · Aug 14, 2024
b3581

06943a69 · ggml : move rope type enum to ggml.h (#8949) · Aug 13, 2024
b3580

828d6ff7 · export-lora : throw error if lora is quantized (#9002) · Aug 13, 2024
b3579

fc4ca27b · ci : fix github workflow vulnerable to script injection (#9008) · Aug 12, 2024
b3578

1f67436c · ci : enable RPC in all of the released builds (#9006) · Aug 12, 2024
b3577

0fd93cde · llama : model-based max number of graph nodes calculation (#8970) · Aug 12, 2024
b3576

84eb2f4f · docs: introduce gpustack and gguf-parser (#8873) · Aug 12, 2024
b3575

1262e7ed · grammar-parser : fix possible null-deref (#9004) · Aug 12, 2024
b3574

df5478fb · ggml: fix div-by-zero (#9003) · Aug 12, 2024
b3573

2589292c · Fix a spelling mistake (#9001) · Aug 12, 2024
b3571

5ef07e25 · server : handle models with missing EOS token (#8997) · Aug 12, 2024

1
…
58
59
60
61
62
63
64
65
66
…
178