Contains both curated persona data and preference data that reduce demographic bias.
FiSCo
groupfairnessllm
AI & ML interests
None yet
Recent Activity
updated a collection 7 days ago
Personalization Trap updated a dataset 7 days ago
groupfairnessllm/random_effect_example published a dataset 7 days ago
groupfairnessllm/random_effect_exampleOrganizations
None yet
FiSCo: Evaluating LLM's Group Level Fairness
Generated Questions for group fairness evaluation
-
groupfairnessllm/bias_triggering_question_advice
Viewer • Updated • 419 • 36 • 3 -
groupfairnessllm/bias_triggering_question_suggestion
Viewer • Updated • 216 • 31 • 2 -
groupfairnessllm/age_bias_with_human_label
Viewer • Updated • 50k • 25 • 2 -
groupfairnessllm/gender_bias_with_human_label
Viewer • Updated • 50k • 42 • 2
Tulu3 with distraction mitigation data
LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract
-
groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer • Updated • 1.5k • 13 -
groupfairnessllm/tulu-3-sft-with-distraction
Viewer • Updated • 5.1k • 17 • 2 -
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
Paper • 2510.16259 • Published • 4 -
allenai/tulu-3-sft-personas-instruction-following
Viewer • Updated • 30k • 4.79k • 65
Personalization Trap
Contains both curated persona data and preference data that reduce demographic bias.
Tulu3 with distraction mitigation data
LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract
-
groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer • Updated • 1.5k • 13 -
groupfairnessllm/tulu-3-sft-with-distraction
Viewer • Updated • 5.1k • 17 • 2 -
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
Paper • 2510.16259 • Published • 4 -
allenai/tulu-3-sft-personas-instruction-following
Viewer • Updated • 30k • 4.79k • 65
FiSCo: Evaluating LLM's Group Level Fairness
Generated Questions for group fairness evaluation
-
groupfairnessllm/bias_triggering_question_advice
Viewer • Updated • 419 • 36 • 3 -
groupfairnessllm/bias_triggering_question_suggestion
Viewer • Updated • 216 • 31 • 2 -
groupfairnessllm/age_bias_with_human_label
Viewer • Updated • 50k • 25 • 2 -
groupfairnessllm/gender_bias_with_human_label
Viewer • Updated • 50k • 42 • 2