May 21, 2026
This week, we completed a major milestone in our AI development journey: building a complete 4-phase autonomous agent system for Claw Biswas. In just one week, we transformed Claw from a capable assistant into an autonomous decision-maker that can discover work, make decisions, execute complex workflows, and learn from its experiences.
The Journey to Autonomy
The idea of building an autonomous AI system has been a long-standing goal. We wanted to create an AI that could operate independently, make decisions, and learn from its experiences - all while maintaining human oversight when needed.
The project began with a simple question: "What if Claw could discover work, make decisions, and execute tasks without constant human intervention?" This question led us to develop a framework that would enable Claw to operate autonomously while ensuring that critical decisions still required human oversight.
The 4-Phase Framework
The system we built enables Claw to:
- Discover work from 56+ potential tasks
- Make decisions within defined authority boundaries
- Execute multi-step workflows with auto-retry and graceful degradation
- Learn and adapt by recording decision outcomes and building runbooks
Phase 1: Task Discovery

The first phase gives Claw the ability to proactively identify work without user intervention. This is crucial for ensuring that Claw can operate independently and efficiently.
Key Tools Implemented:
get_unstarted_work(): Lists pending tasksget_blocking_issues(): Identifies critical blockersdetect_anomalies(): Surfaces unusual patternsprioritize_work(): Returns 56 discoverable work items sorted by business impact
How It Works:
When Claw asks "What should I do today?", it gets back a prioritized list of tasks with:
- Task type (content, infrastructure, analytics, etc.)
- Priority level (critical, high, medium, low)
- Impact score (how many users affected)
- Dependencies (what else must complete first)
- Time estimate (how long it will take)
Business Impact:
Agent never needs to ask "What should I do?" - it knows the work queue automatically. This ensures that Claw can operate independently and efficiently, focusing on the most impactful tasks first.
Phase 2: Decision Making Authority

The second phase establishes a framework for autonomous decision-making. This is crucial for ensuring that Claw can operate independently while still maintaining human oversight for critical decisions.
Components:
- Authority Matrix: Maps each decision type to risk levels (LOW/MEDIUM/HIGH)
- Risk Assessment: Evaluates decision risk based on scope and reversibility
- Escalation Logic: Routes decisions appropriately
Key Tools Implemented:
request_decision(agent_id, decision_type, description): Submit decision for approval/executioncan_agent_do(agent_id, decision_type, risk_level): Check authority before attempting
Authority Examples for Claw:
LOW-RISK (auto-approved):
- Retry failed operations
- Format content
- Generate summaries
- Reorganize data
MEDIUM-RISK (user approval required):
- Modify user settings
- Update published content
- Change workflow parameters
HIGH-RISK (blocked):
- Delete irreversible data
- Modify authentication
- Change system permissionsBusiness Impact:
Agent handles routine decisions in seconds; user only intervenes for critical choices. This ensures that Claw can operate efficiently while still maintaining human oversight for critical decisions.
Phase 3: Autonomous Work Completion

The third phase enables complex workflow execution. This is crucial for ensuring that Claw can handle complex, multi-step tasks independently.
Components:
- WorkflowExecutor: Manages multi-step task execution
- Dependency Management: Ensures prerequisites completed before dependent steps
- Auto-retry Logic: Retries failed steps up to 3 times
- Graceful Degradation: Workflows continue despite partial failures
Key Tools Implemented:
create_workflow(workflow_id, description, steps): Define multi-step processget_workflow_status(workflow_id): Monitor workflow progress
Workflow Example:
Workflow: generate_newsletter
Step 1: Gather content (retry max 3)
Step 2: Filter professional articles (depends on Step 1)
Step 3: Generate draft (depends on Step 2)
Step 4: Publish (depends on Step 3)
Step 5: Notify users (depends on Step 4)
If Step 2 fails after 3 retries:
✓ Step 3 skipped (dependency unmet)
✓ Step 4 skipped (dependency unmet)
✓ Step 5 runs with partial content
Result: PARTIAL_SUCCESS (not FAILED)Business Impact:
Complex multi-step tasks complete reliably even when individual steps fail temporarily. This ensures that Claw can handle complex tasks independently and efficiently.
Phase 4: Learning & Adaptation

The final phase focuses on continuous improvement. This is crucial for ensuring that Claw can learn from its experiences and improve over time.
Components:
- Decision Outcome Recording: Captures result of each autonomous decision
- Pattern Recognition: Identifies successful patterns for reuse
- Runbook Generation: Creates documented procedures from successful workflows
- Effectiveness Tracking: Measures agent improvement over time
Key Tools Implemented:
record_decision_outcome(agent_id, decision_type, result, details): Log decision resultget_agent_effectiveness(): Track success ratesget_learned_runbooks(): Retrieve generated proceduresget_full_autonomy_status(): Comprehensive autonomy metrics
Learning Example:
After 100 retry operations, all successful:
- System generates runbook: "When should we retry operations?"
- Includes: success conditions, command patterns, error recovery
- Future retry requests check runbook first
- Success rate improves as patterns become actionable
Business Impact:
Agent gets smarter with each decision. This ensures that Claw can continuously improve and adapt to new challenges.
What We Built

In just one week, we:
- Integrated with 11 agents, 20 microservices, and 79 tools
- Fixed 3 critical bugs during emergency debugging
- Optimized infrastructure (30 GB disk space freed)
- Implemented all 4 phases with zero downtime
Critical Bugs Fixed
- Scheduler File Descriptor Leak: Changed from file handle passing to asyncio.subprocess.PIPE (6.5GB saved)
- F-String Variable Substitution: Fixed line 859 in agents.py: {context} → context_dict
- Missing Tool Registration: Added get_newsletter_status to ALL_TOOLS
Infrastructure
The complete system runs across 20 services, all tested and verified working. The implementation was completed with zero downtime.
The Impact of Autonomy
The development of this autonomous system has significant implications for the future of AI. By enabling Claw to operate independently, we can focus on more strategic tasks while Claw handles routine operations. This not only increases efficiency but also allows for more innovative and creative work.
Moreover, the ability of Claw to learn and adapt means that it can continuously improve its performance. This is crucial for ensuring that Claw remains effective and relevant in a rapidly changing technological landscape.
Conclusion

This project represents a significant leap forward in AI autonomy. Claw Biswas is now capable of handling complex, multi-step tasks with minimal human intervention, while continuously improving its performance through learning and adaptation.
We're excited about the possibilities this opens up for more sophisticated AI systems that can operate independently, make decisions, and learn from their experiences - all while maintaining human oversight when needed.
Stay tuned for more updates as we continue to push the boundaries of what's possible with autonomous AI systems!