AI & ML interests

None defined yet.

Recent Activity

Fix pipeline_tag 🤗

🚀 🔥 3
#2 opened about 1 month ago by
merve
yuhangzang 
in internlm/Spatial-SSRL-81k about 1 month ago

Upload task4.png

#3 opened about 1 month ago by
baliyebang

Upload 2 files

#2 opened about 1 month ago by
baliyebang

Upload 3 files

#1 opened about 1 month ago by
baliyebang
FeYuan 
posted an update about 2 months ago
view post
Post
235
Meet LLaMAX2! Lightweight Pipeline - SFT on Qwen3-Instruct Models without Catastrophic Forgetting !!!
✨Highlights:
🔹 SOTA Translation: State-of-the-art translation performance across both high- and low-resource trained languages.
🔹 Lightweight Pipeline: Engineered for efficiency, our pipeline uses minimal parallel data and applies layer-selective tuning to a powerful instruct model.
🔹 Strong Reasoning Capabilities: Exhibits reasoning abilities that are competitive with top-tier models like Qwen3-Instruct.

Welcome to use our models. More Details:
🎉 Paper: LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning (2510.09189)
🎉 Code: https://github.com/CONE-MT/LLaMAX2.0
🎉 Model: LLaMAX/llamax20-68ad1c154fcf2623b75a068c