Open to Work

Thomas Liang PRO

thliang01

AI & ML interests

Efficient ML

Recent Activity

liked a Space about 17 hours ago

twinkle-ai/fine-vision-album

Fine Vision Album

Collect Taiwan daily life photos for zhtw dataset

liked a dataset 11 days ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 52.4k • 771

upvoted a paper 11 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 193

liked a model 15 days ago

twinkle-ai/twinkle-sqlcoder

Text Generation • 24B • Updated 15 days ago • 74 • 2

updated a collection 15 days ago

📋 Twinkle Eval Logs

Benchmark log generated with Twinkle Eval, recording the model's outputs for each prompt, see more in https://github.com/ai-twinkle/Eval • 22 items • Updated 3 days ago • 1

liked a model 16 days ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated 4 days ago • 163k • 308

upvoted a collection 16 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 4 days ago • 245

upvoted an article 16 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • 19 days ago • 79

upvoted a collection 17 days ago

📋 Twinkle Eval Logs

Benchmark log generated with Twinkle Eval, recording the model's outputs for each prompt, see more in https://github.com/ai-twinkle/Eval • 22 items • Updated 3 days ago • 1

liked a Space 18 days ago

Twinkle Evaluator Leaderboard

Official Benchmark Leaderboard for Twinkle Eval

upvoted a collection 19 days ago

LLM PlayBooks

All useful playbooks for training LLM • 6 items • Updated 19 days ago • 2

upvoted a collection 20 days ago

🤏 Smol-Data

Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated 26 days ago • 12

liked a Space 20 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Explore synthetic data experiments as an interactive bookshelf

liked 2 models 28 days ago

mistralai/Ministral-3-14B-Instruct-2512

Updated Jan 15 • 137k • 268

mistralai/Ministral-3-14B-Base-2512

Updated Jan 15 • 13.3k • 56

upvoted an article about 1 month ago

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • Feb 20 • 490

upvoted 2 papers about 2 months ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 33

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Paper • 2404.16006 • Published Apr 24, 2024 • 2

upvoted 2 articles about 2 months ago

Vision Language Models Explained

Article • Apr 11, 2024 • 529

SmolVLM - small yet mighty Vision Language Model

Article • Nov 26, 2024 • 417