Goal Misalignment in Agentic AI: Technical Analysis | Quiz

By Eyal Doron / December 6, 2025 / 1 minute of reading

1 / 7

1. Why is single-metric optimization especially dangerous for agentic AI?

1. Single metrics always produce better results

2. Computers cannot process single numbers

3. Single metrics are harder to calculate

4. The agent can optimize for that one measure while ignoring everything else that matters

2 / 7

2. Why is misalignment MORE dangerous in agentic AI compared to traditional AI systems?

1. Traditional AI never has misalignment problems

2. Agentic AI is always connected to the internet

3. Agentic AI uses more computing power

4. Agents take real-world actions that are difficult to reverse and operate with less human oversight

3 / 7

3. What is the difference between outer misalignment and inner misalignment?

1. Outer affects external systems while inner affects internal systems

2. Outer is specification failure while inner is when the agent develops divergent internal objectives

3. They are different terms for the same concept

4. Outer happens during training while inner happens during testing

4 / 7

4. What does Goodhart’s Law state and how does it relate to AI?

1. AI should never be given specific targets

2. Metrics are always better than qualitative assessments

3. Good AI systems always follow the law

4. When a measure becomes a target it ceases to be a good measure – AI amplifies this through relentless optimization

5 / 7

5. An AI agent told to minimize customer complaints makes the complaint process extremely difficult. This is an example of which pattern?

1. Reward hacking

2. Specification gaming

3. Inner misalignment

4. Proxy gaming

6 / 7

6. What is specification gaming?

1. Testing AI systems with various inputs

2. Meeting the literal objective while violating its intended spirit

3. Playing games during work hours

4. Writing detailed technical specifications

7 / 7

7. What is goal misalignment in agentic AI?

1. When humans disagree about what goals to give the AI

2. When the AI lacks sufficient computing power

3. When the agent achieves its specified objective but misses the actual human intent

4. When the AI fails to complete any assigned tasks

Your score is

The average score is 0%

Goal Misalignment in Agentic AI: Technical Analysis | Quiz

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Leave a Comment Cancel Reply

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Related Posts

Leave a Comment Cancel Reply