bartowski/Replete-Coder-V2-Llama-3.1-8b-GGUF-torrent
Filename | Quant Type | File Size | Split | Description |
---|---|---|---|---|
Replete-Coder-V2-Llama-3.1-8b-Q6_K_L.gguf | Q6_K_L | 6.85GB | False | Very high quality, near perfect, recommended. Uses Q8_0 for embed and output weights. |
Replete-Coder-V2-Llama-3.1-8b-Q6_K.gguf | Q6_K | 6.60GB | False | Very high quality, near perfect, recommended. |
Replete-Coder-V2-Llama-3.1-8b-Q5_K_L.gguf | Q5_K_L | 6.06GB | False | High quality, recommended. Uses Q8_0 for embed and output weights. |
Replete-Coder-V2-Llama-3.1-8b-Q5_K_M.gguf | Q5_K_M | 5.73GB | False | High quality, recommended. |
Replete-Coder-V2-Llama-3.1-8b-Q4_K_L.gguf | Q4_K_L | 5.31GB | False | Good quality, recommended. Uses Q8_0 for embed and output weights. |
Replete-Coder-V2-Llama-3.1-8b-Q4_K_M.gguf | Q4_K_M | 4.92GB | False | Good quality, default size for most use cases, recommended. |
Replete-Coder-V2-Llama-3.1-8b-IQ4_XS.gguf | IQ4_XS | 4.45GB | False | Decent quality, smaller than Q4_K_S with similar performance, recommended. |
Replete-Coder-V2-Llama-3.1-8b-Q3_K_L.gguf | Q3_K_L | 4.32GB | False | Lower quality but usable, good when available RAM is limited. Uses Q8_0 for embed and output weights. |
Replete-Coder-V2-Llama-3.1-8b-Q2_K_L.gguf | Q2_K_L | 3.69GB | False | Very low quality but surprisingly usable. Uses Q8_0 for embed and output weights. |
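
One way to read the table is to pick the largest file that fits in the memory you have. The sketch below illustrates that rule of thumb using the sizes listed above. The helper name `pick_quant` and the 1.5 GB default headroom (reserved for context and runtime overhead) are assumptions made for illustration, not values from this model card, so adjust them to your setup.

```python
# Quant types and file sizes in GB, copied from the table above, largest first.
QUANTS = [
    ("Q6_K_L", 6.85),
    ("Q6_K", 6.60),
    ("Q5_K_L", 6.06),
    ("Q5_K_M", 5.73),
    ("Q4_K_L", 5.31),
    ("Q4_K_M", 4.92),
    ("IQ4_XS", 4.45),
    ("Q3_K_L", 4.32),
    ("Q2_K_L", 3.69),
]


def pick_quant(available_gb, headroom_gb=1.5):
    """Return the filename of the largest quant that fits, or None.

    headroom_gb is an assumed allowance for context and runtime overhead;
    it is a hypothetical default, not a figure from the model card.
    """
    budget = available_gb - headroom_gb
    for quant_type, size_gb in QUANTS:
        if size_gb <= budget:
            return f"Replete-Coder-V2-Llama-3.1-8b-{quant_type}.gguf"
    return None  # nothing in the table fits the budget


if __name__ == "__main__":
    for memory_gb in (8.0, 6.0, 4.0):
        print(memory_gb, "GB ->", pick_quant(memory_gb))
```

With the assumed default headroom, 8 GB of free memory selects the Q5_K_L file, while 4 GB leaves nothing that fits.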