LLMs Are Better At Jailbreaking Themselves Than Us...
Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs
paper: https://arxiv.org/abs/2603.24511
Check out my latest project: Intuitive AI Academy
We just wrote a new piece on Distillation, breaking down its earliest form up to the latest techniques!
https://intuitiveai.academy/
limited time code "EASY" for 20% off yearly plan!
Watch on YouTube ↗
(saves to browser)
DeepCamp AI