Safety constraints for intelligent agents

Inherent AI Alignment vs Learned AI Alignment (Prompt Engineering Secrets)