Lucy-128k-GGUF

Lucy is a compact but capable 1.7B model focused on agentic web search and lightweight browsing. Built on Qwen3-1.7B, Lucy inherits deep research capabilities from larger models while being optimized to run efficiently on mobile devices, even in CPU-only configurations.

Model files

| File | Size | Format |
|------|------|--------|
| Lucy-128k.BF16.gguf | 3.45 GB | BF16 |
| Lucy-128k.F16.gguf | 3.45 GB | F16 |
| Lucy-128k.F32.gguf | 6.89 GB | F32 |
| Lucy-128k.Q2_K.gguf | 778 MB | Q2_K |
| Lucy-128k.Q3_K_L.gguf | 1.00 GB | Q3_K_L |
| Lucy-128k.Q3_K_M.gguf | 940 MB | Q3_K_M |
| Lucy-128k.Q3_K_S.gguf | 867 MB | Q3_K_S |
| Lucy-128k.Q4_K_M.gguf | 1.11 GB | Q4_K_M |
| Lucy-128k.Q4_K_S.gguf | 1.06 GB | Q4_K_S |
| Lucy-128k.Q5_K_M.gguf | 1.26 GB | Q5_K_M |
| Lucy-128k.Q5_K_S.gguf | 1.23 GB | Q5_K_S |
| Lucy-128k.Q6_K.gguf | 1.42 GB | Q6_K |
| Lucy-128k.Q8_0.gguf | 1.83 GB | Q8_0 |
| .gitattributes | 2.4 kB | - |
| README.md | 57 B | - |
| config.json | 29 B | - |

Quants Usage

(Sorted by size, which does not necessarily reflect quality; IQ-quants are often preferable to similar-sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

[Graph: ikawrakow's comparison of lower-bit quant types; lower is better]
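
If you are unsure which quant fits your machine, the repo's file sizes can be queried programmatically and the largest file under a memory budget picked as a starting point. This is a minimal sketch using huggingface_hub; the 2 GB budget, and the rule of thumb that runtime RAM use somewhat exceeds file size, are assumptions.

```python
from huggingface_hub import HfApi

api = HfApi()
info = api.model_info("prithivMLmods/Lucy-128k-GGUF", files_metadata=True)

budget_bytes = 2 * 1024**3  # assumed budget: ~2 GB of free RAM

# Keep only .gguf files that fit within the budget.
candidates = [
    s for s in info.siblings
    if s.rfilename.endswith(".gguf") and s.size is not None and s.size <= budget_bytes
]

# Within the budget, the largest quant is usually the highest quality.
best = max(candidates, key=lambda s: s.size)
print(f"{best.rfilename}: {best.size / 1024**3:.2f} GB")
```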

Model details

Format: GGUF
Architecture: qwen3
Model size: 2B params
Available precisions: 2-, 3-, 4-, 5-, 6-, 8-, 16-, and 32-bit


Model tree for prithivMLmods/Lucy-128k-GGUF

Base model: Qwen/Qwen3-1.7B
Finetuned: Menlo/Lucy-128k
Quantized: this model (one of 18 quantizations)