While the shortest distance between two points is a straight line, a straight-line attack on a large language model isn't always the most efficient — and least noisy — way to get the LLM to do bad ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results