Skip to content

Research (研究)

Anthropic 核心研究论文,涵盖对齐伪装、解释性 AI、经济影响、劳动力市场等方向。

文章标题
AI Assistance & Coding SkillsHow AI Assistance Impacts the Formation of Coding Skills
AI Fluency IndexAnthropic Education Report: The AI Fluency Index
Alignment FakingAlignment Faking in Large Language Models
Assistant AxisThe Assistant Axis: Situating and Stabilizing the Character of Large Language Models
Constitutional ClassifiersConstitutional Classifiers: Defending Against Universal Jailbreaks
Deprecation Updates Opus 3An Update on Our Model Deprecation Commitments for Claude Opus 3
Disempowerment PatternsDisempowerment Patterns in Real-World AI Usage
India Economic IndexIndia Country Brief: The Anthropic Economic Index
IntrospectionSigns of Introspection in Large Language Models
Labor Market ImpactsLabor Market Impacts of AI: A New Measure and Early Evidence
Measuring Agent AutonomyMeasuring AI Agent Autonomy in Practice
Persona Selection ModelThe Persona Selection Model
Project Vend Phase 2Project Vend: Phase Two
Tracing ThoughtsTracing the Thoughts of a Large Language Model