Row 39774

Content Data

This page contains data entry 39774 from the Axioma AXP content repository. The structured data below represents the complete record for this entry.

>In one example, Anthropic researchers discovered a feature inside Claude representing the concept of “unsafe code.” By stimulating those neurons, they could get Claude to generate code containing a bug that could be exploited to create a security vulnerability. But by suppressing the neurons, the researchers found, Claude would generate harmless code.

New jailbreak just dropped (for local models, anyway).

Field	Value
text	>In one example, Anthropic researchers discovered a feature inside Claude representing the concept of “unsafe code.” By stimulating those neurons, they could get Claude to generate code containing a bug that could be exploited to create a security vulnerability. But by suppressing the neurons, the researchers found, Claude would generate harmless code. New jailbreak just dropped (for local models, anyway).
label	r/artificial
dataType	comment
communityName	r/artificial
datetime	2024-05-22
username_encoded	Z0FBQUFBQm5Lak1MbjJGU1R1a2lEdFN1bHpRMEtIdG1SUVd2VE5mM0lFcGtUelVoLTlubmd0RkRYemhIWGswbWc3Y2lwNFBWQVhQS0ppYklkb2FzY1pEdVZLeGE0NDREUVE9PQ==
url_encoded	Z0FBQUFBQm5Lak9iMEdFSnBuRTl4ZDhTMko1ZlNQZ2NMR1ZsdkN6bFlmY1lORm5jdml2el81RnNMWnhIYlJoNEI2eTRuWXVucjNBT0lMcXV6cVo1RDNWaDA3VEo3cEYtSk9nQVFFN1hBWEVERUhlS1hpUGlTTWc2dlNFYUlCVlRjRFlybTgyYjg0X3lyN3ppTVFHOFNseERlUUJjdEJDVk9pZlh5VnFzVkJkOGFDeVQ0NklIZmxwTmhsblRtMlRySkh5cGdWemNzM3JRclRNRlVYMi1FV3JpcDgtQ0tXTEx5UT09

Raw Record

{
  "text": ">In one example, Anthropic researchers discovered a feature inside Claude representing the concept of “unsafe code.” By stimulating those neurons, they could get Claude to generate code containing a bug that could be exploited to create a security vulnerability. But by suppressing the neurons, the researchers found, Claude would generate harmless code.\n\nNew jailbreak just dropped (for local models, anyway).",
  "label": "r/artificial",
  "dataType": "comment",
  "communityName": "r/artificial",
  "datetime": "2024-05-22",
  "username_encoded": "Z0FBQUFBQm5Lak1MbjJGU1R1a2lEdFN1bHpRMEtIdG1SUVd2VE5mM0lFcGtUelVoLTlubmd0RkRYemhIWGswbWc3Y2lwNFBWQVhQS0ppYklkb2FzY1pEdVZLeGE0NDREUVE9PQ==",
  "url_encoded": "Z0FBQUFBQm5Lak9iMEdFSnBuRTl4ZDhTMko1ZlNQZ2NMR1ZsdkN6bFlmY1lORm5jdml2el81RnNMWnhIYlJoNEI2eTRuWXVucjNBT0lMcXV6cVo1RDNWaDA3VEo3cEYtSk9nQVFFN1hBWEVERUhlS1hpUGlTTWc2dlNFYUlCVlRjRFlybTgyYjg0X3lyN3ppTVFHOFNseERlUUJjdEJDVk9pZlh5VnFzVkJkOGFDeVQ0NklIZmxwTmhsblRtMlRySkh5cGdWemNzM3JRclRNRlVYMi1FV3JpcDgtQ0tXTEx5UT09"
}

Entry Information

Entry ID: 39774
Repository: Axioma AXP
Dataset: arrmlet/reddit_dataset_36
Total Entries: 100,000