Documentation Index
Fetch the complete documentation index at: https://docs.kontext21.com/llms.txt
Use this file to discover all available pages before exploring further.
Performance Issues
If X21 is responding slowly or experiencing performance issues, this guide helps identify and resolve the cause.Slow AI Responses
Symptoms
- Long wait for response to start
- Slow streaming of text
- Delayed tool execution
- Timeouts
Common Causes & Solutions
1. Token Limit Approaching Cause: Conversation near 200,000 token limit Check: Look at token counter in status bar Solution:- Start a new conversation
- X21 will auto-compact, but new chat is faster
2. Large Attachments Cause: Processing big PDFs or many images Check: File size and page count Solution:
- Reduce PDF to relevant pages only
- Compress images
- Attach fewer files per request
- Split large documents
3. Complex Operations Cause: Operation requires extensive thinking Check: Thinking block duration Solution:
- Break into smaller steps
- Simplify prompt
- Provide more specific instructions
- Use slash commands for common tasks
4. Network Issues Cause: Slow internet connection to API Check: Network speed, other apps Solution:
- Check internet connection
- Close bandwidth-heavy applications
- Wait for better connectivity
- Try smaller requests
When to Start New Conversation
Signs You Should Start Fresh
- Token usage high (over 150,000 tokens)
- Responses getting slower over time
- Unrelated to previous conversation
- Context no longer relevant
- Performance degraded
Benefits of New Conversation
- Faster responses: Less history to process
- Better focus: Clear context for AI
- Token efficiency: Start with low usage
- Organized history: Easier to find later
How to Start New
- Click New Chat button
- Confirms if ongoing requests
- Previous conversation auto-saved
- Fresh start with full token budget
Conversation Compacting
What It Does
When conversation reaches 200,000 tokens:- Summarizes old messages
- Preserves recent context
- Reduces token usage
- Continues seamlessly
When It Happens
- Automatic at token limit
- No user action required
- Brief pause during compacting
- Notice appears in conversation
Impact
Benefits:- Continue same conversation
- Maintain context
- No data loss
- Old details summarized
- Some nuance may be lost
- Still slower than new conversation
Tool Execution Speed
Slow Tool Execution
Causes:- Large data ranges
- Many concurrent tools
- Complex VBA operations
- Excel busy with other tasks
- Batch operations more efficiently
- Smaller ranges per operation
- Sequential execution rather than parallel
- Close other Excel workbooks
- Disable Excel add-ins temporarily
Tool Optimization
Efficient patterns:Workbook Performance
Large Workbooks
Impact: Slower Excel operations Solutions:- Work on smaller worksheets
- Clear unused data/formatting
- Disable automatic calculations temporarily
- Close other workbooks
- Archive old data
External References
Impact: Slower when referencing other workbooks Solutions:- Break external links if not needed
- Copy data into current workbook
- Close source workbooks
System Performance
Resource Usage
Check:- Task Manager (Windows)
- Activity Monitor (Mac)
- Look for high CPU/memory usage
- Close unused applications
- Restart Excel if memory high
- Restart Deno server
- Reboot computer if needed
Disk Space
Impact: Low disk space slows everything Check: Available disk space Solution:- Ensure 5+ GB free space
- Clear temp files
- Archive old log files
- Clean up downloads
Network Performance
Checking Connection
Test:- Visit Anthropic status
- Check general internet speed
- Test other online services
- Wait for better connectivity
- Work offline where possible
- Schedule large operations for better network times
Port Issues
Check ports:- Port 8000: Deno server
- Port 8080: VSTO add-in
- Ensure not blocked by firewall
Monitoring Performance
Token Usage
Watch the token counter:- fewer than 50,000: Excellent
- 50,000-100,000: Good
- 100,000-150,000: Consider new chat soon
- 150,000-200,000: Start new chat for best performance
- over 200,000: Auto-compacting occurs
Response Times
Expected:- First response: 2-5 seconds
- Streaming: Continuous flow
- Tool execution: 1-3 seconds per tool
- Thinking: 1-10 seconds depending on complexity
- First response: >10 seconds
- Streaming: Stuttering or stops
- Tool execution: >10 seconds
- Thinking: >30 seconds
Best Practices
Proactive Management
- Start new conversations for new tasks
- Monitor token usage regularly
- Batch similar operations
- Optimize attachments before sending
- Close unused workbooks
Efficient Workflows
Good:When Performance Matters
For time-sensitive work:- Start fresh conversation
- Use slash commands
- Enable auto-approve
- Prepare data in advance
- Work during off-peak hours
Related Topics
- Rate Limits & Quotas - Understanding limits
- Best Practices: Performance Optimization - Optimization strategies
- Error Types - Timeout and overload errors
- Connection Status - Network troubleshooting

