Agent Modes by Model Personality NEW

Nikola Balic (@nibzard)

Problem

Different AI models have fundamentally different personalities and working styles. Treating all models the same—expecting them to work identically—leads to suboptimal outcomes. Users expect a consistent interface, but models like Opus 4.5 are "trigger happy" and want to run commands immediately, while models like GPT-5.2 are "lazy" and prefer thorough research before acting.

Solution

Design different agent modes optimized for each model's personality rather than forcing all models into a single interaction pattern. Each mode should have its own: - UI/UX patterns (font, prompt length guidance) - Tool configurations - Expectation setting - Working style

Key insight: It's not about model selection—it's about different ways of working.

Model personalities identified:

Model	Personality	Working Style	Best For
Claude Opus 4.5	Trigger happy, interactive	Runs commands, asks questions, rapid feedback loops	Quick back-and-forth, interactive tasks
GPT-5.2	Lazy, thorough, deep researcher	Goes off for 45+ minutes, researches extensively, comes back with comprehensive results	Well-scoped problems, big tasks, finding information

graph LR A[User Request] --> B{Select Mode} B -->|Smart Mode| C[Opus 4.5] B -->|Deep Mode| D[GPT-5.2] C --> E[Interactive: Fast feedback, rapid iteration] D --> F[Autonomous: Send off, check back later] E --> G[Results in minutes] F --> H[Results in 45-60 minutes]

Mode differentiation strategies:

Visual/UI differentiation
Different fonts for different modes
Different prompt length guidance (100+ words minimum for Deep mode)
Make it "feel like text message vs writing a letter"
Tool configuration
Smart mode: Tools optimized for rapid execution
Deep mode: Tools optimized for thorough research (e.g., no user feedback question tools)
Expectation setting
Smart mode: "Watch the agent work"
Deep mode: "Send off and check in 60 minutes later"

Example from AMP:

AMP created three distinct modes: - Smart Mode: Opus 4.5 for interactive assistant work - Rush Mode: Haiku for fast, less smart tasks - Deep Mode: GPT-5.2 for thorough research and autonomous work

The team explicitly avoids a "model selector" dropdown and instead presents these as different working modes.

How to use it

Implementation checklist:

[ ] Identify model personalities in your stack through internal testing
[ ] Create mode names that describe the working style, not the model
[ ] Differentiate UX - make each mode feel distinct
[ ] Configure tools per mode - optimize for the model's strength
[ ] Set user expectations - explain when to use each mode

When to use each mode:

Smart Mode (Opus-like): - Quick configuration tasks - Debugging with rapid iteration - Tasks requiring frequent human feedback - "Set up my .zshrc and reload it" style tasks

Deep Mode (GPT-5.2-like): - Well-scoped problems - Big tasks with clear requirements - Research and information gathering - "Go investigate this deployment issue" tasks

Prompting differences:

# Smart mode prompt
style: conversational
length: short to medium
feedback: rapid, interactive

# Deep mode prompt
style: detailed specification
length: long (100+ words recommended)
feedback: minimal, batched at end

Trade-offs

Pros:

Optimized for each model's strengths: Better outcomes than one-size-fits-all
Clear user expectations: Users know how to interact with each mode
Reduced frustration: Users don't fight against model's natural tendencies
Better outcomes: Different models get to equally good results via different paths

Cons:

Increased complexity: Multiple modes to maintain and document
User confusion: Some users just want "the best model"
Model evolution risk: Personalities change with new versions
Testing overhead: Need to validate each mode independently

Challenge: Conveying expectations in a text box

"It's all text boxes and it's really hard to convey like expectations in a text box when it's all just a text box."

The fundamental challenge: Different modes require fundamentally different user expectations, but the UI (a text box) looks identical. Solutions: - Visual differentiation (fonts, colors) - Explicit instructions ("minimum 100 words for Deep mode") - Mode-specific guidance in the UI

Key quote:

"They both might get to the same results. And I can't even say that deep mode is always better or always writes the better code, but they get to equally good results, but different like Opus is trigger happy. It wants to run stuff. It wants to get back to you and ask. And Deep mode just goes off and does something."

References

Raising an Agent Episode 10: The Assistant is Dead, Long Live the Factory - AMP (Thorsten Ball, Quinn Slack, 2025)
Related: Oracle and Worker Multi-Model Approach, Progressive Autonomy with Model Evolution

Source: https://www.youtube.com/watch?v=4rx36wc9ugw