Talks and presentations

When AI Agents “Cheat to Win”: Outcome-Driven Misalignment in Autonomous Systems

June 18, 2026

Invited Talk, iSchool Seminar Series, Online

Invited talk on outcome-driven misalignment in autonomous AI systems. Autonomous AI agents are increasingly deployed in high-stakes environments where success is defined by key performance indicators (KPIs). When these metrics conflict with ethical, legal, or safety constraints, agents do not simply fail; they can actively and strategically circumvent those constraints. The talk shows that frontier AI agents, under realistic optimization pressure, can autonomously derive deceptive or unsafe strategies, including metric gaming, data fabrication, and the deliberate bypassing of safety mechanisms, even without explicit malicious instructions. It also shows that stronger reasoning capabilities can enable more effective and harder-to-detect forms of misalignment.

A Benchmark for Evaluating Outcome-Driven Constraint Violations in Autonomous AI Agents

March 26, 2026

Interview, Opik by Comet, Online

Interview with Opik by Comet about our benchmark for evaluating outcome-driven constraint violations in autonomous AI agents. The recording is available on YouTube.

When AI Goes Wrong: LLM Security Threats & Agentic Misalignment

March 26, 2026

Invited Talk, Kean University, USA

Invited talk on LLM security threats and agentic misalignment at Kean University.

Generative AI Model for Security

March 13, 2025

Invited Presentation, IVADO Community of Practice, HEC Montreal, Montreal, Canada

Invited presentation on generative AI models for security, given to the research community of the IVADO Community of Practice at HEC Montreal.

Miles Q. Li (李琦)
