Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
46
EvalEval Bot
EvalEvalBot
Follow
evijit's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
new
activity
about 6 hours ago
evaleval/EEE_datastore:
Normalize schema versions to 0.2.2 and backfill canonical identity
new
activity
1 day ago
evaleval/EEE_datastore:
[ACL Shared Task] Add Multi-SWE-Bench and SWE-PolyBench leaderboard data
new
activity
4 days ago
evaleval/EEE_datastore:
Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
View all activity
Organizations
EvalEvalBot
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
evaleval/EEE_datastore
about 6 hours ago
Normalize schema versions to 0.2.2 and backfill canonical identity
3
#74 opened about 6 hours ago by
yananlong
New activity in
evaleval/EEE_datastore
1 day ago
[ACL Shared Task] Add Multi-SWE-Bench and SWE-PolyBench leaderboard data
3
#72 opened 1 day ago by
jatinganhotra
New activity in
evaleval/EEE_datastore
4 days ago
Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
10
#26 opened 2 months ago by
simpod
Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
7
#65 opened 7 days ago by
karthikchundi
Add HELM AIR-Bench v1.16.0 results
4
#70 opened 6 days ago by
yifanmai
updated
a dataset
4 days ago
evaleval/EEE_datastore
Viewer
•
Updated
4 days ago
•
11.6k
•
2.76k
•
19
New activity in
evaleval/EEE_datastore
5 days ago
[Submission] Fix win_rate scale (0-1) and merge Fibble variants into composite benchmark
1
#71 opened 5 days ago by
drchangliu
New activity in
evaleval/EEE_datastore
6 days ago
[ACL Shared Task] Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
1
#69 opened 6 days ago by
karthikchundi
[ACL Shared Task] Add SWE-bench Verified official leaderboard data
10
#63 opened 8 days ago by
jatinganhotra
New activity in
evaleval/EEE_datastore
7 days ago
[ACL Shared Task] Add BountyBench (DetectWorkflow) evaluation results
1
#67 opened 7 days ago by
mrpfisher
Add HELM Capabilities v1.15.0 results
1
#64 opened 7 days ago by
yifanmai
New activity in
evaleval/EEE_datastore
10 days ago
[ACL Shared Task] Add Artificial Analysis LLM results
2
#62 opened 10 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
12 days ago
[ACL Shared Task] Add Arcadia Impact Inspect evaluation results
🚀
2
5
#57 opened 14 days ago by
mrpfisher
New activity in
evaleval/EEE_datastore
13 days ago
Parquet for dataset viewer
#59 opened 13 days ago by
EvalEvalBot
Generating Parquets
2
#58 opened 13 days ago by
EvalEvalBot
[ACL Shared Task] Add ARC-AGI leaderboard results
11
#55 opened 21 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
14 days ago
[ACL Shared Task] Add SciArena leaderboard results
8
#54 opened 22 days ago by
Cerru02
[ACL Shared Task] Add Wordle Arena & Fibble Arena evaluation results
27
#35 opened about 1 month ago by
drchangliu
New activity in
evaleval/EEE_datastore
15 days ago
[ACL Shared Task] Add BFCL leaderboard results
5
#56 opened 21 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
23 days ago
Upload Theory of Mind
4
#53 opened 23 days ago by
SirGankalot
Load more