Palisade Research (Collaborators)

Team

non-profit

Verified

https://palisaderesearch.org

palisadeai

PalisadeResearch

AI & ML interests

None defined yet.

lindapetrini

authored a paper 5 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

Reworr-R

updated a dataset 10 months ago

palisaderesearch/LLM-Honeypot-Logs

Updated about 2 hours ago • 45 • 5

dmitrii-palisaderesearch

authored a paper about 1 year ago

Badllama 3: removing safety finetuning from Llama 3 in minutes

Paper • 2407.01376 • Published Jul 1, 2024

AdamGleave

authored a paper almost 2 years ago

Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14

jladish

authored a paper about 2 years ago

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Paper • 2310.20624 • Published Oct 31, 2023 • 13

oserikov

authored a paper about 2 years ago

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Paper • 2309.11197 • Published Sep 20, 2023 • 5

AdamGleave

authored 2 papers over 2 years ago

Adversarial Policies Beat Superhuman Go AIs

Paper • 2211.00241 • Published Nov 1, 2022

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Paper • 2203.07475 • Published Mar 14, 2022