AI start-up Anthropic launches bug reporting scheme
Synthetic intelligence startup Anthropic launched a vulnerability disclosure program (VDP), managed by HackerOne, in August with bounty rewards as much as $15,000 for novel, common jailbreak assaults that would expose vulnerabilities in important, high-risk domains similar to CBRN (chemical, organic, radiological, and nuclear) and cybersecurity.
A jailbreak assault in AI includes a technique for circumventing an AI system’s built-in security measures and moral pointers, permitting a consumer to elicit responses or behaviours from the AI system that may usually get blocked.
“As we work on growing the subsequent technology of our AI safeguarding techniques, we’re increasing our bug bounty program to introduce a brand new initiative centered on discovering flaws within the mitigations we use to forestall misuse of our fashions,” Anthropic stated in a weblog submit on the revamped program.