Data:
{
"text": ">In one example, Anthropic researchers discovered a feature inside Claude representing the concept of “unsafe code.” By stimulating those neurons, they could get Claude to generate code containing a bug that could be exploited to create a security vulnerability. But by suppressing the neurons, the researchers found, Claude would generate harmless code.\n\nNew jailbreak just dropped (for local models, anyway).",
"label": "r/artificial",
"dataType": "comment",
"communityName": "r/artificial",
"datetime": "2024-05-22",
"username_encoded": "Z0FBQUFBQm5Lak1MbjJGU1R1a2lEdFN1bHpRMEtIdG1SUVd2VE5mM0lFcGtUelVoLTlubmd0RkRYemhIWGswbWc3Y2lwNFBWQVhQS0ppYklkb2FzY1pEdVZLeGE0NDREUVE9PQ==",
"url_encoded": "Z0FBQUFBQm5Lak9iMEdFSnBuRTl4ZDhTMko1ZlNQZ2NMR1ZsdkN6bFlmY1lORm5jdml2el81RnNMWnhIYlJoNEI2eTRuWXVucjNBT0lMcXV6cVo1RDNWaDA3VEo3cEYtSk9nQVFFN1hBWEVERUhlS1hpUGlTTWc2dlNFYUlCVlRjRFlybTgyYjg0X3lyN3ppTVFHOFNseERlUUJjdEJDVk9pZlh5VnFzVkJkOGFDeVQ0NklIZmxwTmhsblRtMlRySkh5cGdWemNzM3JRclRNRlVYMi1FV3JpcDgtQ0tXTEx5UT09"
}