Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
71
8
325
Adam
PRO
adamo1139
Follow
KnutJaegersberg's profile picture
lodrick-the-lafted's profile picture
zappa2005's profile picture
64 followers
·
51 following
AI & ML interests
Local training and inference.
Recent Activity
upvoted
a
paper
about 4 hours ago
Scalable-Softmax Is Superior for Attention
new
activity
about 5 hours ago
mistralai/Devstral-Small-2-24B-Instruct-2512:
Typo in Benchmark Results
updated
a model
3 days ago
adamo1139/Poziomka-Instruct-Alpha-2-GGUF
View all activity
Organizations
None yet
adamo1139
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
mistralai/Devstral-Small-2-24B-Instruct-2512
about 5 hours ago
Typo in Benchmark Results
#5 opened about 5 hours ago by
adamo1139
New activity in
adamo1139/Poziomka-SFT-v1-mix
2 months ago
🚩 Report: Spam
1
#1 opened 2 months ago by
adgw
New activity in
inclusionAI/Ring-1T-preview
2 months ago
Is WSM strategy used here for RLVR?
1
#3 opened 2 months ago by
adamo1139
what is the active parameters of the model??
1
#2 opened 2 months ago by
ct-2
New activity in
ByteDance-Seed/Seed-OSS-36B-Instruct
4 months ago
vram Requirements for full size
2
#14 opened 4 months ago by
tazomatalax
New activity in
adamo1139/DeepSeek-R1-0528-AWQ
6 months ago
running in vllm gives error
6
#1 opened 6 months ago by
GrigoriiA
New activity in
deepseek-ai/DeepSeek-R1-0528
6 months ago
Do you have deepseek-r1-0528-awq plan?
6
#68 opened 6 months ago by
oliver0102
New activity in
unsloth/Qwen3-32B
7 months ago
Base Model?
9
#2 opened 8 months ago by
Downtown-Case
New activity in
adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF
11 months ago
Failed to regenerate message
1
#1 opened about 1 year ago by
PeterCastler
New activity in
rhymes-ai/Aria
12 months ago
Base model not released
👍
3
11
#2 opened about 1 year ago by
adamo1139
New activity in
adamo1139/Yi-1.5-34B-32K-rebased-1406
about 1 year ago
Still active?
9
#1 opened about 1 year ago by
DazzlingXeno
New activity in
adamo1139/magpie-ultra-v0.1-shareGPT-Conversations
about 1 year ago
Librarian Bot: Add language metadata for dataset
#1 opened about 1 year ago by
librarian-bot
New activity in
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
about 1 year ago
Can you please add Nemotron 70B static?
3
#1 opened about 1 year ago by
nickandbro
New activity in
allenai/Molmo-7B-D-0924
about 1 year ago
batch inference supported?
👍
1
7
#7 opened about 1 year ago by
chenkq
commented
a paper
about 1 year ago
Hermes 3 Technical Report
Paper
•
2408.11857
•
Published
Aug 15, 2024
•
56
•
8
New activity in
adamo1139/Yi-34B-200K-AEZAKMI-v2
over 1 year ago
Adding Evaluation Results
#4 opened over 1 year ago by
leaderboard-pr-bot
New activity in
teknium/OpenHermes-2.5-Mistral-7B
over 1 year ago
How to do batch inference?
1
#34 opened over 1 year ago by
abhijeet-ta
New activity in
LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF
over 1 year ago
How good is the gguf?
3
#3 opened over 1 year ago by
Tom-Neverwinter
New activity in
deepseek-ai/DeepSeek-V2-Lite
over 1 year ago
mixtral format?
5
#1 opened over 1 year ago by
KnutJaegersberg
New activity in
LLM360/K2
over 1 year ago
huggyllama/llama-65b
👀
1
4
#1 opened over 1 year ago by
KnutJaegersberg
Load more