update tech report
Browse files- Nex-N1-TechReport.pdf +3 -0
- README.md +41 -1
- figures/coding-eval.png +3 -0
- figures/html-eval.png +3 -0
Nex-N1-TechReport.pdf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b32a420ccd3f74452e9932d04dc334b791c710f29c0d9d274745ea95572bedae
|
| 3 |
+
size 6464309
|
README.md
CHANGED
|
@@ -6,6 +6,14 @@ license: apache-2.0
|
|
| 6 |
<img src="./figures/NEX_logo.svg" width="20%"/>
|
| 7 |
</div>
|
| 8 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
|
| 10 |
# Nex-N1
|
| 11 |
|
|
@@ -14,6 +22,7 @@ DeepSeek-V3.1-Nex-N1 is the flagship release of the Nex-N1 series — a post-tra
|
|
| 14 |
We are committed to making it easier than ever to build and deploy AI agents by offering researchers and entrepreneurs a high-performance, reliable, and cost-effective "out-of-the-box" agent system.
|
| 15 |
|
| 16 |
## Highlights
|
|
|
|
| 17 |
- **Full spectrum model matrix:** From 8B to 671B parameters, the Nex series covers everything from edge-friendly setups to frontier-scale deployments.
|
| 18 |
- **Agent-focused performance:** Demonstrates industry-leading results on programming, tool-use, web-search, and other multi-hop reasoning tasks.
|
| 19 |
- **Production-ready utility:** Excels at mini-app development, website authoring, slide creation, and immersive role-play—delivering immediate productivity
|
|
@@ -23,6 +32,7 @@ gains.
|
|
| 23 |
training services are all openly available.
|
| 24 |
|
| 25 |
## Performance
|
|
|
|
| 26 |
Nex-N1 is evaluated on six representative agentic benchmarks (general + professional). The model consistently ranks at or near the top across tool-using, web-search, and coding-heavy evaluations, showing strong readiness for real-world agent workflows.
|
| 27 |
|
| 28 |

|
|
@@ -43,16 +53,46 @@ Nex-N1 provides various size models from 8B to 671B for different usage scenario
|
|
| 43 |
| [Qwen3-30B-A3B-Nex-N1](https://huggingface.co/nex-agi/Qwen3-30B-A3B-Nex-N1) | 11.3 | 65.3 | 29.7 | 8.3 | 13.6 | 51.9 |
|
| 44 |
| [internlm3-8B-Nex-N1](https://huggingface.co/nex-agi/internlm3-8B-Nex-N1) | 8.6 | 63.0 | 20.3 | - | - | 44.5 |
|
| 45 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
## Usage
|
| 47 |
|
| 48 |
### Local Deployment
|
|
|
|
| 49 |
We recommend `sglang` for serving Nex-series models locally:
|
|
|
|
| 50 |
```bash
|
| 51 |
python -m sglang.launch_server --model-path /path/to/your/model
|
| 52 |
```
|
| 53 |
|
| 54 |
### Function Calling
|
| 55 |
-
|
|
|
|
|
|
|
| 56 |
```bash
|
| 57 |
python -m sglang.launch_server --model-path /path/to/your/model --tool-call-parser qwen3_coder
|
| 58 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
<img src="./figures/NEX_logo.svg" width="20%"/>
|
| 7 |
</div>
|
| 8 |
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
<div align="center">
|
| 12 |
+
🏠 <a href="https://nex.sii.edu.cn"><b>Home Page</b></a>   |   
|
| 13 |
+
🤗 <a href="https://hf.co/collections/nex-agi/nex-n1"><b>Model</b></a>   |   
|
| 14 |
+
🤗 <a href="https://huggingface.co/datasets/nex-agi/agent-sft"><b>Data</b></a>   |   
|
| 15 |
+
📑 <a href="https://github.com/nex-agi/Nex-N1/blob/main/Nex-N1-TechReport.pdf"><b>Tech Report</b></a>  
|
| 16 |
+
</div>
|
| 17 |
|
| 18 |
# Nex-N1
|
| 19 |
|
|
|
|
| 22 |
We are committed to making it easier than ever to build and deploy AI agents by offering researchers and entrepreneurs a high-performance, reliable, and cost-effective "out-of-the-box" agent system.
|
| 23 |
|
| 24 |
## Highlights
|
| 25 |
+
|
| 26 |
- **Full spectrum model matrix:** From 8B to 671B parameters, the Nex series covers everything from edge-friendly setups to frontier-scale deployments.
|
| 27 |
- **Agent-focused performance:** Demonstrates industry-leading results on programming, tool-use, web-search, and other multi-hop reasoning tasks.
|
| 28 |
- **Production-ready utility:** Excels at mini-app development, website authoring, slide creation, and immersive role-play—delivering immediate productivity
|
|
|
|
| 32 |
training services are all openly available.
|
| 33 |
|
| 34 |
## Performance
|
| 35 |
+
|
| 36 |
Nex-N1 is evaluated on six representative agentic benchmarks (general + professional). The model consistently ranks at or near the top across tool-using, web-search, and coding-heavy evaluations, showing strong readiness for real-world agent workflows.
|
| 37 |
|
| 38 |

|
|
|
|
| 53 |
| [Qwen3-30B-A3B-Nex-N1](https://huggingface.co/nex-agi/Qwen3-30B-A3B-Nex-N1) | 11.3 | 65.3 | 29.7 | 8.3 | 13.6 | 51.9 |
|
| 54 |
| [internlm3-8B-Nex-N1](https://huggingface.co/nex-agi/internlm3-8B-Nex-N1) | 8.6 | 63.0 | 20.3 | - | - | 44.5 |
|
| 55 |
|
| 56 |
+
Nex-N1 demonstrates competitive performance across all evaluation scenarios, showing particularly strong results in practical coding and HTML generation tasks.
|
| 57 |
+
|
| 58 |
+
<div align="center">
|
| 59 |
+
<img src="./figures/coding-eval.png" width="80%"/>
|
| 60 |
+
<div>Practical Coding Evaluation</div>
|
| 61 |
+
</div>
|
| 62 |
+
|
| 63 |
+
<div align="center">
|
| 64 |
+
<img src="./figures/html-eval.png" width="80%"/>
|
| 65 |
+
<div>HTML Generation Evaluation</div>
|
| 66 |
+
</div>
|
| 67 |
+
|
| 68 |
+
Refer to <https://huggingface.co/datasets/nex-agi/coding-eval> and <https://huggingface.co/datasets/nex-agi/html-eval> for more details.
|
| 69 |
+
|
| 70 |
## Usage
|
| 71 |
|
| 72 |
### Local Deployment
|
| 73 |
+
|
| 74 |
We recommend `sglang` for serving Nex-series models locally:
|
| 75 |
+
|
| 76 |
```bash
|
| 77 |
python -m sglang.launch_server --model-path /path/to/your/model
|
| 78 |
```
|
| 79 |
|
| 80 |
### Function Calling
|
| 81 |
+
|
| 82 |
+
Nex-series models support robust function-calling capabilities. To maximize the function-calling capabilities of the Nex-series models, we modified the tool parser of `qwen3_coder`, see: <https://github.com/sgl-project/sglang/pull/13411>. To enable this feature, simply add the `--tool-call-parser qwen3_coder` flag when launching the server:
|
| 83 |
+
|
| 84 |
```bash
|
| 85 |
python -m sglang.launch_server --model-path /path/to/your/model --tool-call-parser qwen3_coder
|
| 86 |
```
|
| 87 |
+
|
| 88 |
+
### Mini Program Development
|
| 89 |
+
|
| 90 |
+
Nex-N1 is optimized for mini program development. For optimal performance, we recommend using Claude Code configured with both `context7` and a search MCP.
|
| 91 |
+
|
| 92 |
+
```shell
|
| 93 |
+
claude mcp add --transport http context7 https://mcp.context7.com/mcp --header "CONTEXT7_API_KEY: [CONTEXT7_API_KEY]"
|
| 94 |
+
|
| 95 |
+
claude mcp add --transport stdio serper-search --env SERPER_API_KEY=[SERPER_API_KEY] -- npx -y serper-search-scrape-mcp-server
|
| 96 |
+
```
|
| 97 |
+
|
| 98 |
+
Refer to <https://github.com/upstash/context7> for more details on setting up `context7`.
|
figures/coding-eval.png
ADDED
|
Git LFS Details
|
figures/html-eval.png
ADDED
|
Git LFS Details
|