
bartowski/Yi-Coder-1.5B-GGUF · Hugging Face
| Filename | Quant type | File Size | Split | Description |
| -------- | ---------- | --------- | ----- | ----------- |
| Yi-Coder-1.5B-f16.gguf | f16 | 2.95GB | false | Full F16 weights. |
| Yi-Coder-1.5B-Q8_0.gguf | Q8_0 | 1.57GB | false | Extremely high quality, generally unneeded but max available quant. |
| Yi-Coder-1.5B-Q6_K_L.gguf | Q6_K_L | 1.34GB | false | Uses Q8_0 for embed and output weights. Very high quality, near perfect, recommended. |
| Yi-Coder-1.5B-Q6_K.gguf | Q6_K | 1.28GB | false | Very high quality, near perfect, recommended. |
| Yi-Coder-1.5B-Q5_K_L.gguf | Q5_K_L | 1.18GB | false | Uses Q8_0 for embed and output weights. High quality, recommended. |
| Yi-Coder-1.5B-Q5_K_M.gguf | Q5_K_M | 1.10GB | false | High quality, recommended. |
| Yi-Coder-1.5B-Q4_K_L.gguf | Q4_K_L | 1.06GB | false | Uses Q8_0 for embed and output weights. Good quality, recommended. |
| Yi-Coder-1.5B-Q5_K_S.gguf | Q5_K_S | 1.05GB | false | High quality, recommended. |
| Yi-Coder-1.5B-Q4_K_M.gguf | Q4_K_M | 0.96GB | false | Good quality, default size for most use cases, recommended. |
| Yi-Coder-1.5B-Q3_K_XL.gguf | Q3_K_XL | 0.94GB | false | Uses Q8_0 for embed and output weights. Lower quality but usable, good for low RAM availability. |
| Yi-Coder-1.5B-Q4_K_S.gguf | Q4_K_S | 0.90GB | false | Slightly lower quality with more space savings, recommended. |
| Yi-Coder-1.5B-Q4_0_8_8.gguf | Q4_0_8_8 | 0.87GB | false | Optimized for ARM inference. Requires 'sve' support (see link below). |
| Yi-Coder-1.5B-Q4_0_4_8.gguf | Q4_0_4_8 | 0.87GB | false | Optimized for ARM inference. Requires 'i8mm' support (see link below). |
| Yi-Coder-1.5B-Q4_0_4_4.gguf | Q4_0_4_4 | 0.87GB | false | Optimized for ARM inference. Should work well on all ARM chips, pick this if you're unsure. |
| Yi-Coder-1.5B-Q4_0.gguf | Q4_0 | 0.87GB | false | Legacy format, generally not worth using over similarly sized formats. |
| Yi-Coder-1.5B-IQ4_XS.gguf | IQ4_XS | 0.83GB | false | Decent quality, smaller than Q4_K_S with similar performance, recommended. |
| Yi-Coder-1.5B-Q3_K_L.gguf | Q3_K_L | 0.83GB | false | Lower quality but usable, good for low RAM availability. |
| Yi-Coder-1.5B-IQ3_M.gguf | IQ3_M | 0.75GB | false | Medium-low quality, new method with decent performance comparable to Q3_K_M. |
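Since each row in the table above is a separate file in the repo, you can fetch just the quant you want rather than cloning everything. A minimal sketch using the `huggingface-cli` downloader, with the Q4_K_M file picked as an example (any filename from the table works the same way):

```shell
# Requires the Hugging Face Hub CLI: pip install -U "huggingface_hub[cli]"
# Downloads a single ~0.96GB quant file into the current directory.
huggingface-cli download bartowski/Yi-Coder-1.5B-GGUF \
  --include "Yi-Coder-1.5B-Q4_K_M.gguf" \
  --local-dir ./
```

The `--include` glob keeps the download limited to the one named file; without it, the command would pull every quant in the repo.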