Moshe Wasserblat

moshew

AI & ML interests

NLP Efficiency, Sentiment Analysis, Few-shot learning, Distillation

Recent Activity

upvoted an article 2 days ago

Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face Kernel Hub

published an article 4 days ago

Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face Kernel Hub

upvoted a paper about 2 months ago

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

Organizations

published an article 4 days ago

Article

Intel XPU Kernel Skill: LLM-driven Triton kernel optimization for the Hugging Face Kernel Hub

danf

•

4 days ago

• 9

published an article 7 months ago

Article

DeepMath: A lightweight math reasoning Agent with smolagents

danf, mber, moshew

•

Dec 4, 2025

• 40

published an article about 1 year ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131

•

Apr 16, 2025

• 42

published an article about 1 year ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

jmamou

•

Mar 24, 2025

• 20

published an article over 1 year ago

Article

Universal Assisted Generation: Faster Decoding with Any Assistant Model

danielkorat, orenpereg, mber, jmamou, joaogante, lewtun, Nadav-Timor, moshew

•

Oct 29, 2024

• 61

published an article over 1 year ago

Article

Faster Assisted Generation with Dynamic Speculation

jmamou, orenpereg, joaogante, lewtun, danielkorat, Nadav-Timor, moshew

•

Oct 8, 2024

• 52

published an article about 2 years ago

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

danielkorat, tomaarsen, orenpereg, moshew, echarlaix, aprabh2

•

Apr 3, 2024

• 11

published an article over 2 years ago

Article

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

juliensimon, echarlaix, ofirzaf, imargulis, guybd, moshew

•

Mar 20, 2024

• 7

published an article over 2 years ago

Article

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

peterizsak, mber, danf, echarlaix, mfuntowicz, moshew

•

Mar 15, 2024

• 14

published an article over 2 years ago

Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

ofirzaf, echarlaix, imargulis, danielkorat, jmamou, guybd, orenpereg, moshew, Haihao, aayasin, FanZhao

•

Jan 30, 2024

• 9

published an article over 2 years ago

Article

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

ronenlap, tomaarsen, lewtun, danielkorat, orenpereg, moshew

•

Dec 6, 2023

• 15

published an article over 3 years ago

Article

SetFit: Efficient Few-Shot Learning Without Prompts

Unso, lewtun, luketheduke, danielkorat, orenpereg, moshew

•

Sep 26, 2022

• 41
