71 8 325

Adam PRO

adamo1139

AI & ML interests

Local training and inference.

Recent Activity

upvoted a paper about 4 hours ago

Scalable-Softmax Is Superior for Attention

new activity about 5 hours ago

mistralai/Devstral-Small-2-24B-Instruct-2512:Typo in Benchmark Results

updated a model 3 days ago

adamo1139/Poziomka-Instruct-Alpha-2-GGUF

View all activity

Organizations

None yet

New activity in mistralai/Devstral-Small-2-24B-Instruct-2512 about 5 hours ago

Typo in Benchmark Results

#5 opened about 5 hours ago by

adamo1139

New activity in adamo1139/Poziomka-SFT-v1-mix 2 months ago

🚩 Report: Spam

#1 opened 2 months ago by

adgw

New activity in inclusionAI/Ring-1T-preview 2 months ago

Is WSM strategy used here for RLVR?

#3 opened 2 months ago by

adamo1139

what is the active parameters of the model??

#2 opened 2 months ago by

ct-2

New activity in ByteDance-Seed/Seed-OSS-36B-Instruct 4 months ago

vram Requirements for full size

#14 opened 4 months ago by

tazomatalax

New activity in adamo1139/DeepSeek-R1-0528-AWQ 6 months ago

running in vllm gives error

#1 opened 6 months ago by

GrigoriiA

New activity in deepseek-ai/DeepSeek-R1-0528 6 months ago

Do you have deepseek-r1-0528-awq plan?

#68 opened 6 months ago by

oliver0102

New activity in unsloth/Qwen3-32B 7 months ago

Base Model?

#2 opened 8 months ago by

Downtown-Case

New activity in adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF 11 months ago

Failed to regenerate message

#1 opened about 1 year ago by

PeterCastler

New activity in rhymes-ai/Aria 12 months ago

Base model not released

👍 3

#2 opened about 1 year ago by

adamo1139

New activity in adamo1139/Yi-1.5-34B-32K-rebased-1406 about 1 year ago

Still active?

#1 opened about 1 year ago by

DazzlingXeno

New activity in adamo1139/magpie-ultra-v0.1-shareGPT-Conversations about 1 year ago

Librarian Bot: Add language metadata for dataset

#1 opened about 1 year ago by

librarian-bot

New activity in RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic about 1 year ago

Can you please add Nemotron 70B static?

#1 opened about 1 year ago by

nickandbro

New activity in allenai/Molmo-7B-D-0924 about 1 year ago

batch inference supported?

👍 1

#7 opened about 1 year ago by

chenkq

commented a paper about 1 year ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 56 •

New activity in adamo1139/Yi-34B-200K-AEZAKMI-v2 over 1 year ago

Adding Evaluation Results

#4 opened over 1 year ago by

leaderboard-pr-bot

New activity in teknium/OpenHermes-2.5-Mistral-7B over 1 year ago

How to do batch inference?

#34 opened over 1 year ago by

abhijeet-ta

New activity in LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF over 1 year ago

How good is the gguf?

#3 opened over 1 year ago by

Tom-Neverwinter

New activity in deepseek-ai/DeepSeek-V2-Lite over 1 year ago

mixtral format?

#1 opened over 1 year ago by

KnutJaegersberg

New activity in LLM360/K2 over 1 year ago

huggyllama/llama-65b

👀 1

#1 opened over 1 year ago by

KnutJaegersberg

Adam PRO

AI & ML interests

Recent Activity

Organizations

adamo1139's activity

Typo in Benchmark Results

🚩 Report: Spam

Is WSM strategy used here for RLVR?

what is the active parameters of the model??

vram Requirements for full size

running in vllm gives error

Do you have deepseek-r1-0528-awq plan?

Base Model?

Failed to regenerate message

Base model not released

Still active?

Librarian Bot: Add language metadata for dataset

Can you please add Nemotron 70B static?

batch inference supported?

Adding Evaluation Results

How to do batch inference?

How good is the gguf?

mixtral format?

huggyllama/llama-65b