Lucy-128k-GGUF

Lucy is a compact but capable 1.7B model focused on agentic web search and lightweight browsing. Built on Qwen3-1.7B, Lucy inherits deep research capabilities from larger models while being optimized to run efficiently on mobile devices, even in CPU-only configurations.

Model files

| File | Size | Format |
|------|------|--------|
| Lucy-128k.BF16.gguf | 3.45 GB | BF16 |
| Lucy-128k.F16.gguf | 3.45 GB | F16 |
| Lucy-128k.F32.gguf | 6.89 GB | F32 |
| Lucy-128k.Q2_K.gguf | 778 MB | Q2_K |
| Lucy-128k.Q3_K_L.gguf | 1.00 GB | Q3_K_L |
| Lucy-128k.Q3_K_M.gguf | 940 MB | Q3_K_M |
| Lucy-128k.Q3_K_S.gguf | 867 MB | Q3_K_S |
| Lucy-128k.Q4_K_M.gguf | 1.11 GB | Q4_K_M |
| Lucy-128k.Q4_K_S.gguf | 1.06 GB | Q4_K_S |
| Lucy-128k.Q5_K_M.gguf | 1.26 GB | Q5_K_M |
| Lucy-128k.Q5_K_S.gguf | 1.23 GB | Q5_K_S |
| Lucy-128k.Q6_K.gguf | 1.42 GB | Q6_K |
| Lucy-128k.Q8_0.gguf | 1.83 GB | Q8_0 |
| .gitattributes | 2.4 kB | - |
| README.md | 57 B | - |
| config.json | 29 B | - |

Quants Usage

(Sorted by size, which does not necessarily reflect quality; IQ-quants are often preferable to similar-sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

[Graph: ikawrakow's comparison of lower-bit quant types; lower is better]
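
If you are unsure which quant fits your machine, the repo's file sizes can be queried programmatically and the largest file under a memory budget picked as a starting point. This is a minimal sketch using huggingface_hub; the 2 GB budget, and the rule of thumb that runtime RAM use somewhat exceeds file size, are assumptions.

```python
from huggingface_hub import HfApi

api = HfApi()
info = api.model_info("prithivMLmods/Lucy-128k-GGUF", files_metadata=True)

budget_bytes = 2 * 1024**3  # assumed budget: ~2 GB of free RAM

# Keep only .gguf files that fit within the budget.
candidates = [
    s for s in info.siblings
    if s.rfilename.endswith(".gguf") and s.size is not None and s.size <= budget_bytes
]

# Within the budget, the largest quant is usually the highest quality.
best = max(candidates, key=lambda s: s.size)
print(f"{best.rfilename}: {best.size / 1024**3:.2f} GB")
```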

Model details

Format: GGUF
Architecture: qwen3
Model size: 2B params
Available precisions: 2-, 3-, 4-, 5-, 6-, 8-, 16-, and 32-bit


Model tree for prithivMLmods/Lucy-128k-GGUF

Base model: Qwen/Qwen3-1.7B
Finetuned: Menlo/Lucy-128k
Quantized: this model (one of 18 quantizations)