# Market-Making System: Design Review & Implementation Roadmap

**Date:** 2026-01-31
**Reviewer:** AI Code Analysis
**Status:** Phase 1 In Progress (11% Complete)
**Related Workspace:** `.builders/0013-market-maker-mvp`
## Implementation Progress (Session: 2026-01-31)

### Phase 1: Data Quality (In Progress)

Overall Progress: 11% (1/9 tasks complete, 4/44 hours invested)
### ✅ Completed Tasks

**Task 1: ADX (Average Directional Index) - COMPLETE**
- Files: `src/regime/metrics/adx.py`, `tests/regime/metrics/test_adx.py`
- Tests: 8/8 passing
- Time: ~4 hours
- Implementation: Welles Wilder's ADX with double smoothing, safe division, proper error handling
- Note: Requires 28+ bars for period=14 (double Wilder's smoothing)
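For reference when reviewing the completed task, the double-smoothing structure that drives the 28-bar requirement can be sketched as below. This is a minimal illustration of Wilder's algorithm, not the contents of `adx.py`; function names and structure are illustrative.

```python
def wilder_smooth(values, period):
    """Wilder's smoothing: seed with the mean of the first `period`
    values, then blend each new value in with weight 1/period."""
    smoothed = [sum(values[:period]) / period]
    for v in values[period:]:
        smoothed.append(smoothed[-1] + (v - smoothed[-1]) / period)
    return smoothed

def adx(highs, lows, closes, period=14):
    """Return the latest ADX value. Needs roughly 2*period bars,
    because DX is itself Wilder-smoothed a second time."""
    plus_dm, minus_dm, tr = [], [], []
    for i in range(1, len(closes)):
        up = highs[i] - highs[i - 1]
        down = lows[i - 1] - lows[i]
        plus_dm.append(up if up > down and up > 0 else 0.0)
        minus_dm.append(down if down > up and down > 0 else 0.0)
        tr.append(max(highs[i] - lows[i],
                      abs(highs[i] - closes[i - 1]),
                      abs(lows[i] - closes[i - 1])))
    atr = wilder_smooth(tr, period)
    pdi = [100 * p / a if a else 0.0
           for p, a in zip(wilder_smooth(plus_dm, period), atr)]
    mdi = [100 * m / a if a else 0.0
           for m, a in zip(wilder_smooth(minus_dm, period), atr)]
    dx = [100 * abs(p - m) / (p + m) if (p + m) else 0.0
          for p, m in zip(pdi, mdi)]
    return wilder_smooth(dx, period)[-1]
```

On a perfect monotonic uptrend this returns 100 (maximal trend strength), which is a useful smoke test.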
### ⏳ Remaining Phase 1 Tasks (40/44 hours)
| # | Task | Files | Est. Time | Status |
|---|---|---|---|---|
| 2 | Efficiency Ratio | efficiency_ratio.py + tests | 4h | TODO |
| 3 | Autocorrelation | autocorrelation.py + tests | 3h | TODO |
| 4 | OU Half-Life | ou_process.py + tests | 9h | TODO |
| 5 | Normalized Slope | slope.py + tests | 3h | TODO |
| 6 | BB Bandwidth | bollinger.py + tests | 3h | TODO |
| 7 | Integration | Modify regime/engine.py | 8h | TODO |
| 8 | Validation | schema_validator.py | 6h | TODO |
| 9 | Dashboard | quality/dashboard.py | 4h | TODO |
### Resume Instructions

Environment Setup:

```bash
cd /home/coder/src/repos/market-making/metrics-service
source .venv/bin/activate                              # numpy, pytest installed
python -m pytest tests/regime/metrics/test_adx.py -v   # Verify: 8 passed
```

Next Task: Implement Efficiency Ratio (Task 2)
- Reference: `.ai/projects/market-making/phase-1-plan.md` (Day 3-4)
- Pattern: Write tests first (RED), implement (GREEN), refactor
- Formula: `ER = |Price[0] - Price[n]| / Σ|Price[i] - Price[i-1]|`
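The ER formula translates almost directly into code. A minimal sketch (the function name and default `period` are illustrative, not taken from the codebase):

```python
def efficiency_ratio(prices, period=10):
    """Kaufman Efficiency Ratio over the last `period` bars:
    net directional move divided by total path length.
    1.0 = perfectly efficient trend, ~0.0 = pure noise."""
    window = prices[-(period + 1):]
    net_change = abs(window[-1] - window[0])
    volatility = sum(abs(window[i] - window[i - 1])
                     for i in range(1, len(window)))
    return net_change / volatility if volatility else 0.0
```

A strictly monotonic series yields 1.0; a series that oscillates back to its start yields 0.0.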
Files Modified This Session:
- `repos/market-making/metrics-service/src/regime/metrics/__init__.py` (new)
- `repos/market-making/metrics-service/src/regime/metrics/adx.py` (new, 155 lines)
- `repos/market-making/metrics-service/tests/regime/metrics/__init__.py` (new)
- `repos/market-making/metrics-service/tests/regime/metrics/test_adx.py` (new, 125 lines, 8 tests)
## Executive Summary

The market-making tool has a solid technical foundation but is approximately 40-50% complete toward the stated MVP goals. The core regime detection engine works well, but the Grid Exit Strategy (the primary value proposition) is only partially implemented.

**CRITICAL ISSUE IDENTIFIED:** Hardcoded dummy values in the restart-gates evaluation (`repos/market-making/metrics-service/src/regime/engine.py`, lines 268-280 and 349-359) make the metrics YAMLs untrustworthy.
### Completion Estimate
- Total Effort: 180-250 hours
- Timeline: 9-12 weeks at 20h/week, or 4.5-6 weeks at 40h/week
- Priority Phases: P0 (Data Quality, Exit Strategy, Testing)
## Current State Analysis

### What's Working ✅

1. Regime Detection Engine (`src/regime/`)
   - ✅ Hourly OHLCV analysis from the KuCoin API
   - ✅ 4 regime classifications: RANGE_OK, RANGE_WEAK, TRANSITION, TREND
   - ✅ 15+ metrics per analysis (Bollinger Bands, mean reversion, volatility, trend)
   - ✅ Git-backed storage in the `market-maker-data` repository

Assessment: Core regime detection logic is well implemented.
2. Infrastructure (`infra/`)
   - ✅ Kubernetes CronJob running hourly at :01
   - ✅ ExternalSecrets integration for KuCoin API keys
   - ✅ Docker image build/deploy workflow
   - ✅ ArgoCD deployment patterns
   - ✅ Git-based persistence (no database required)

Assessment: Infrastructure is production-ready.
3. Notification System (Partial)
   - ✅ Pushover integration working
   - ✅ Basic regime alerts functional
   - ✅ Entry evaluator module created (`src/exit_strategy/entry_evaluator.py`)
   - ✅ Rate limiting (4h minimum between same-state notifications)

Assessment: Working but incomplete.
4. Grid Configuration Management
   - ✅ YAML-based grid configurations
   - ✅ History tracking via Git
   - ✅ Configuration versioning
   - ✅ Grid state determination from the history array

Assessment: Config management is functional.
## Critical Gaps Identified

### Gap 1: Data Quality - Hardcoded Dummy Values (P0 - CRITICAL)

Location: `repos/market-making/metrics-service/src/regime/engine.py`

The Problem (lines 268-280, duplicated at 349-359):
```python
# Mock values for now - these should come from actual analysis
# TODO: Extract these from the detailed_analysis once refined classification is implemented
trend_score = regime_state.trend_score or 50.0
mean_rev_score = regime_state.mean_rev_score or 50.0
adx = 25.0                            # TODO: Extract from analysis
adx_history = [25.0] * 10             # TODO: Extract from analysis
normalized_slope = 0.1                # TODO: Extract from analysis
efficiency_ratio = 0.4                # TODO: Extract from analysis
lag1_autocorr = -0.1                  # TODO: Extract from analysis
ou_half_life = 24.0                   # TODO: Extract from analysis
atr = 1500.0                          # TODO: Extract from analysis
atr_history = [1500.0] * 100          # TODO: Extract from analysis
bb_bandwidth = 0.02                   # TODO: Extract from analysis
bb_bandwidth_history = [0.02] * 10    # TODO: Extract from analysis
```

Impact:
- 🔴 Restart gates evaluation uses fake data
- 🔴 Grid creation recommendations are unreliable
- 🔴 Metrics YAMLs contain static values
- 🔴 Historical analysis cannot be trusted
- 🔴 No way to validate regime classifications
- 🔴 User quote: "I often don't trust the generated metrics yamls"
Missing Calculations:
- ADX (Average Directional Index) - Trend strength measurement
- Efficiency Ratio - Trend efficiency (Perry Kaufman formula)
- Lag-1 Autocorrelation - Mean reversion indicator
- OU Half-Life - Time for price to revert halfway to mean
- Normalized Slope - Price slope normalized by ATR
- Bollinger Band Bandwidth - Volatility measurement
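The two mean-reversion metrics are worth sketching because they are easy to get subtly wrong. The sketch below assumes an OLS fit of an AR(1)-style model for the OU half-life, one common estimation approach; the project may choose a different estimator, and all names are illustrative:

```python
import math

def lag1_autocorr(returns):
    """Lag-1 autocorrelation; negative values suggest mean reversion."""
    n = len(returns)
    mean = sum(returns) / n
    num = sum((returns[i] - mean) * (returns[i - 1] - mean)
              for i in range(1, n))
    den = sum((r - mean) ** 2 for r in returns)
    return num / den if den else 0.0

def ou_half_life(prices):
    """Fit dp_t = a + b * p_{t-1} via OLS; half-life = -ln(2)/b bars.
    Returns infinity when b >= 0 (no mean reversion detected)."""
    x = prices[:-1]
    y = [prices[i] - prices[i - 1] for i in range(1, len(prices))]
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    var = sum((xi - mx) ** 2 for xi in x)
    b = cov / var if var else 0.0
    return -math.log(2) / b if b < 0 else float("inf")
```

A deterministic series that decays 10% of the way to its mean each bar recovers a half-life of ln(2)/0.1 ≈ 6.93 bars, which makes a convenient unit-test fixture.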
Effort: 40-60 hours (Phase 1)
### Gap 2: Grid Exit Strategy - NOT IMPLEMENTED (P0)

Location: `src/exit_strategy/evaluator.py`

Current Status: ~30% complete (stub implementation only)

What Exists:
- Basic `ExitState` enum (NORMAL, WARNING, LATEST_ACCEPTABLE_EXIT, MANDATORY_EXIT)
- Simple MANDATORY_EXIT trigger for the TREND regime
- Basic boundary-violation check (≥2 consecutive closes outside the range)
What's Missing:

**LATEST_ACCEPTABLE_EXIT Triggers**

Per the spec (`.ai/projects/market-making/grid-exit-strategy/spec.md`):
- ❌ TRANSITION persistence tracking (≥2 consecutive 4h bars OR ≥4 consecutive 1h bars)
- ❌ Mean reversion degradation (OU half-life ≥ 2× baseline)
- ❌ Volatility expansion ratio > 1.25 threshold
- ❌ Z-score reversion failure tracking
**WARNING Triggers**
- ❌ TRANSITION probability ≥ 40% (configurable)
- ❌ Regime confidence declining over 3 bars
- ❌ Efficiency Ratio rising above the range threshold
- ❌ Mean reversion speed slowing
- ❌ Volatility expansion 1.1-1.25× the range
- ❌ Require 2+ conditions to trigger (critical logic)
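The 2-of-N gating in the last bullet might look like the sketch below. All field names, config keys, and the 1.5× half-life multiplier are illustrative assumptions, not values from the spec:

```python
def check_warning_conditions(metrics, cfg):
    """Evaluate the five WARNING checks; return (fired, names of hits).
    A single noisy indicator can never trigger an alert on its own."""
    conf = metrics["confidence_history"][-3:]
    conditions = {
        "transition_prob": metrics["transition_probability"]
                           >= cfg["transition_prob_threshold"],
        "confidence_declining": all(conf[i] > conf[i + 1]
                                    for i in range(len(conf) - 1)),
        "er_rising": metrics["efficiency_ratio"] > cfg["er_range_threshold"],
        "reversion_slowing": metrics["ou_half_life"]
                             > 1.5 * cfg["half_life_baseline"],
        "vol_expanding": 1.1 <= metrics["vol_expansion_ratio"] <= 1.25,
    }
    hits = [name for name, hit in conditions.items() if hit]
    return len(hits) >= 2, hits
```

Keeping each check as a named boolean makes the notification self-explanatory: the alert can list exactly which conditions fired.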
**State Transition Tracking**
- ❌ Store previous exit states in Git
- ❌ Track state durations
- ❌ Prevent notification spam for the same state
**Historical Data Loading**
- ❌ Load the last N metrics files for persistence checks
- ❌ Cache recent history for performance
- ❌ Multi-timeframe analysis (1h + 4h bars)
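The persistence rule from the spec (≥2 consecutive 4h bars OR ≥4 consecutive 1h bars) reduces to a trailing-run count over loaded history. A sketch, assuming regime labels ordered oldest to newest:

```python
def transition_persisted(regimes_1h, regimes_4h):
    """Spec rule: TRANSITION has persisted when the most recent bars show
    >=4 consecutive 1h TRANSITION bars OR >=2 consecutive 4h bars."""
    def trailing_run(regimes):
        # Count how many of the newest bars are TRANSITION.
        run = 0
        for regime in reversed(regimes):
            if regime != "TRANSITION":
                break
            run += 1
        return run
    return trailing_run(regimes_1h) >= 4 or trailing_run(regimes_4h) >= 2
```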
Impact: The system cannot be trusted to alert when to exit grids, which defeats its entire purpose.
Effort: 50-70 hours (Phase 2)
### Gap 3: Position Risk Quantification - MISSING (P1)

Current Status: NOT IMPLEMENTED

What's Missing:

**Position Tracking**
```python
# DOES NOT EXIST - Need to implement:
class PositionTracker:
    def get_active_positions(self, grid_id: str) -> List[Position]:
        """Fetch actual open orders from the KuCoin API"""

    def calculate_unrealized_pnl(self, positions: List[Position]) -> float:
        """Current unrealized P&L"""
```

**Capital Risk Calculator**

```python
# DOES NOT EXIST - Need to implement:
class CapitalRiskCalculator:
    def calculate_capital_at_risk(self, inventory, current_price, stop_loss) -> float:
        """Inventory value × (current_price - stop_loss) / current_price"""

    def estimate_profit_giveback(self, peak_pnl, current_pnl, delay_hours) -> Tuple[float, float]:
        """Range estimate: [min_giveback, max_giveback]"""
```

Current Behavior: The risk assessment in `history.py` only looks at the config, not actual positions.
Impact: Notifications cannot show:
- "Capital at risk: $120.50"
- "Expected give-back if delayed 12h: $4-7"
- "Stop-loss distance: 0.85 ATR"
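To make the first figure concrete, here is the capital-at-risk formula from Gap 3 with made-up numbers chosen to reproduce the $120.50 example (a 5% gap between price and stop on $2,410 of inventory):

```python
def capital_at_risk(inventory_value, current_price, stop_loss):
    """Loss realized on the inventory if price falls from the current
    level to the stop: inventory_value * (price - stop) / price."""
    return inventory_value * (current_price - stop_loss) / current_price

# Illustrative numbers only: $2,410 inventory, price $96,400, stop $91,580.
risk = capital_at_risk(2410.0, 96400.0, 91580.0)  # -> 120.50
```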
Effort: 30-40 hours (Phase 3)
### Gap 4: Testing - NEARLY ZERO (P0)

Current Status: Minimal test coverage

Required:
- Unit tests for all metric calculations
- Unit tests for exit trigger logic
- Integration tests (regime → exit → notification flow)
- Backtesting framework to validate signal quality

Impact: The code cannot be refactored, and changes cannot be trusted, without tests.
Effort: 40-50 hours (Phase 4)
### Gap 5: Operational Improvements (P2)

Issues:
- ⚠️ Evaluation cadence: hourly (should be 15 minutes)
- ❌ Audit logging: not implemented
- ❌ KPI tracking: not implemented
- ⚠️ Documentation: minimal
Effort: 20-30 hours (Phase 5)
## Implementation Roadmap

### Phase 1: Data Trust & Quality (P0 - CRITICAL)

Duration: 2-3 weeks (40-60 hours)

Objective: Remove all hardcoded dummy values and implement real metric calculations
Tasks:

1. Implement missing metric calculations (20-30h)
   - ADX (Average Directional Index)
   - Efficiency Ratio
   - Lag-1 Autocorrelation
   - OU Half-Life
   - Normalized Slope
   - Bollinger Band Bandwidth

2. Extract metrics from regime analysis (8-12h)
   - Modify `src/regime/engine.py` lines 268-280, 349-359
   - Remove all hardcoded values
   - Extract from the `detailed_analysis` dict

3. Add data validation (8-12h)
   - Schema validator for metrics YAMLs
   - Sanity checks per metric type
   - Automated validation on Git commit

4. Create data quality dashboard (4-6h)
   - Visual indicators: real vs. dummy data
   - Historical trend validation
   - Anomaly detection

5. Unit tests (8-12h)
   - 22+ test cases covering all metrics
   - Validation against TA-Lib/TradingView
   - 90%+ code coverage
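The sanity checks in task 3 could be as simple as a per-metric range table plus a flat-history check, which directly catches the old hardcoded dummies. Ranges and field names below are illustrative, not a defined schema:

```python
# Illustrative sanity ranges per metric; a value outside its range
# (or a suspiciously constant history) fails validation.
SANITY_RANGES = {
    "adx": (0.0, 100.0),
    "efficiency_ratio": (0.0, 1.0),
    "lag1_autocorr": (-1.0, 1.0),
    "bb_bandwidth": (0.0, 1.0),
}

def validate_metrics(doc):
    """Return a list of validation errors for one parsed metrics YAML."""
    errors = []
    for name, (lo, hi) in SANITY_RANGES.items():
        value = doc.get(name)
        if value is None:
            errors.append(f"{name}: missing")
        elif not lo <= value <= hi:
            errors.append(f"{name}: {value} outside [{lo}, {hi}]")
    # A perfectly flat history is the signature of the old dummies.
    history = doc.get("adx_history", [])
    if len(history) > 3 and len(set(history)) == 1:
        errors.append("adx_history: constant values look like dummy data")
    return errors
```

Running this in a pre-commit hook on the metrics repository would stop bad data from ever being committed.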
Success Criteria:
- ✅ All TODOs removed from `regime/engine.py`
- ✅ All metrics calculated from real data
- ✅ Data validation prevents bad data from being committed
- ✅ Quality dashboard shows 100% real data
- ✅ User confirms: "I trust the metrics YAMLs now"
### Phase 2: Complete Grid Exit Strategy (P0)

Duration: 2-3 weeks (50-70 hours)

Objective: Implement all missing exit triggers and state tracking

Tasks:

1. LATEST_ACCEPTABLE_EXIT triggers (8-12h)
   - TRANSITION persistence tracking
   - Mean reversion degradation checks
   - Volatility expansion detection
   - Z-score reversion failure

2. WARNING triggers (4-6h)
   - 5 condition checks
   - Require 2+ conditions to trigger
   - Configurable thresholds

3. State transition tracking (4-6h)
   - Store state history in Git
   - Prevent notification spam
   - Track state durations

4. Historical data loading (4-6h)
   - Load the last 12-24 hours of metrics
   - Multi-timeframe analysis (1h + 4h)
   - Caching for performance

5. Integration & testing (8-12h)
   - Wire up all triggers
   - End-to-end tests
   - Real-data validation

6. Configuration & documentation (4-6h)
   - `exit_strategy_config.yaml`
   - Trigger logic documentation
Success Criteria:
- ✅ All exit triggers implemented and tested
- ✅ State tracking working
- ✅ Historical data loading functional
- ✅ Integration validated with real data
### Phase 3: Position Risk Quantification (P1)

Duration: 1-2 weeks (30-40 hours)

Objective: Add real position tracking and capital risk calculations

Tasks:

1. KuCoin Position Tracker (8-12h)
   - Fetch active positions from the API
   - Calculate unrealized PnL
   - Inventory imbalance tracking

2. Capital Risk Calculator (6-8h)
   - Capital-at-risk calculation
   - Profit give-back estimation
   - Stop-loss distance in ATR units

3. Enhance notifications (4-6h)
   - Add risk metrics to all alerts
   - Update notification templates

4. Error handling (4-6h)
   - Graceful degradation on API failures
   - Circuit-breaker pattern
   - Clear error messaging

5. Testing (8-12h)
   - Unit tests with a mocked KuCoin client
   - Integration tests
   - Manual validation against the KuCoin UI
Success Criteria:
- ✅ Position tracking working
- ✅ Risk calculations accurate
- ✅ Notifications enhanced with risk data
### Phase 4: Testing & Validation (P0)

Duration: 1 week (40-50 hours)

Objective: Comprehensive test coverage and backtesting validation

Tasks:

1. Unit tests for metrics (8-10h)
   - 22+ test cases for all calculations
   - Edge cases covered
   - 90%+ coverage

2. Unit tests for exit triggers (10-12h)
   - 26+ test cases
   - Boundary conditions tested

3. Integration tests (10-12h)
   - End-to-end flow tests
   - Multi-timeframe analysis
   - Git integration
   - Notification delivery

4. Backtesting framework (12-16h)
   - Replay historical metrics
   - Evaluate exit quality
   - Profit preservation analysis
   - KPI validation

5. CI/CD integration (4-6h)
   - GitHub Actions workflow
   - Quality gates
   - Coverage reports
Success Criteria:
- ✅ 80%+ code coverage
- ✅ All critical paths tested
- ✅ Backtesting shows the system would have worked on historical data
- ✅ CI/CD pipeline running
### Phase 5: Operational Improvements (P2)

Duration: 1 week (20-30 hours)

Objective: Production readiness and observability

Tasks:

1. 15-minute evaluation cadence (0.5h)
   - Update the CronJob schedule
   - Or create a separate exit-evaluation job

2. Audit logging (3-4h)
   - Log all exit state transitions
   - Track notification delivery
   - Record operator actions

3. KPI tracking (4-6h)
   - Implement KPI calculations
   - Monthly report generation
   - Trend analysis

4. Documentation (4-6h)
   - Operational runbook
   - Troubleshooting guide
   - Metrics interpretation guide

5. Monitoring & alerting (4-6h)
   - Prometheus metrics
   - Grafana dashboard
   - Alert configuration

6. Performance optimization (4-6h)
   - Caching
   - Async processing
   - Benchmarking
Success Criteria:
- ✅ 15-min cadence running
- ✅ Audit logging complete
- ✅ KPIs tracked
- ✅ Production-ready
## Risk Assessment

### Technical Risks
| Risk | Impact | Likelihood | Mitigation |
|---|---|---|---|
| Metric calculations incorrect | High | Medium | Extensive unit tests, validation against known indicators |
| False MANDATORY_EXIT signals | High | Medium | Require multiple confirming indicators, tune thresholds |
| Missed regime transitions | High | Low | 15-min cadence, multi-timeframe confirmation |
| KuCoin API rate limiting | Medium | Low | Cache data, backoff strategy |
| Git push failures | Medium | Low | Retry logic, local backup |
### Operational Risks
| Risk | Impact | Likelihood | Mitigation |
|---|---|---|---|
| Operator misses notification | High | Medium | Multi-channel delivery, escalating urgency |
| Notification fatigue | Medium | High | Smart rate limiting, clear urgency indicators |
| Grid stopped unnecessarily | Medium | Medium | Backtesting, tunable thresholds, track False Exit Rate |
## Success Metrics (KPIs)

Per `.ai/projects/market-making/new-instructions.md`:

### Exit Quality KPIs

1. Exit Within Acceptable Window (EAW%) ≥ 90%
   - Formula: `ExitsBeforeMandatory / TotalExitEvents`

2. Profit Retention Ratio (PRR) ≥ 0.75
   - Formula: `RealizedProfitAtExit / MaxUnrealizedProfitBeforeExit`

3. Stop-Loss Avoidance Rate (SLAR) ≥ 95%
   - Formula: `ExitsBeforeStop / TotalGridsStopped`

4. True Transition Detection Rate (TTDR) ≥ 70%
   - Formula: `TransitionExitsWithFollowThrough / TotalTransitionExits`

5. Mandatory Exit Compliance (MEC%) = 100%
   - Formula: `CompliedMandatoryExits / MandatoryExitSignals`
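Each KPI is a plain ratio, so a tracker needs little more than the sketch below. Event field names are illustrative, and PRR is aggregated across events here (a per-exit average would be an equally valid reading of the formula):

```python
def exit_quality_kpis(events):
    """Compute KPI ratios from a list of closed grid episodes.
    Boolean fields count as 0/1 when summed."""
    total = len(events)
    if total == 0:
        return {}
    mandatory_signals = sum(e["mandatory_signaled"] for e in events)
    return {
        "EAW": sum(e["exited_before_mandatory"] for e in events) / total,
        "PRR": (sum(e["realized_profit"] for e in events)
                / sum(e["max_unrealized_profit"] for e in events)),
        "MEC": (sum(e["complied_with_mandatory"] for e in events)
                / mandatory_signals if mandatory_signals else 1.0),
    }
```

SLAR and TTDR follow the same pattern once stop-outs and follow-through are recorded per episode.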
## Architecture Recommendations

### 1. Implement the Exit State Engine First

```python
# src/exit_strategy/state_engine.py
class ExitStateEngine:
    def evaluate(self, regime: Dict, grid: Dict) -> ExitState:
        """
        Main entry point. Returns one of:
        - NORMAL
        - WARNING
        - LATEST_ACCEPTABLE_EXIT
        - MANDATORY_EXIT
        """
        if self._check_mandatory_exit(regime, grid):
            return ExitState.MANDATORY_EXIT
        elif self._check_latest_acceptable_exit(regime, grid):
            return ExitState.LATEST_ACCEPTABLE_EXIT
        elif self._check_warning(regime, grid):
            return ExitState.WARNING
        else:
            return ExitState.NORMAL
```

### 2. Refactor the Notification System
Current: Monolithic script with mixed concerns

Proposed:

```
send_regime_notifications.py (orchestrator)
├── src/exit_strategy/state_engine.py     (exit state classification)
├── src/exit_strategy/message_builder.py  (notification content)
└── src/exit_strategy/pushover_client.py  (delivery)
```
### 3. Add a Position Tracker

```python
# src/position/tracker.py
class PositionTracker:
    def get_active_positions(self, grid_id: str) -> List[Position]:
        """Fetch actual open orders from the KuCoin API"""

    def calculate_pnl(self, positions: List[Position]) -> PnLSummary:
        """Calculate unrealized PnL"""
```

### 4. Separate Risk Assessment from Metrics Collection
Current: `metrics/history.py` does both

Proposed:

```
src/
  metrics/
    collector.py     # Fetch & store metrics
  risk/
    assessor.py      # Analyze metrics → risk level
  exit_strategy/
    state_engine.py  # Risk + regime → exit state
```
## File Structure After Completion

```
repos/market-making/metrics-service/
├── src/
│   ├── regime/
│   │   ├── metrics/                  # NEW
│   │   │   ├── adx.py
│   │   │   ├── efficiency_ratio.py
│   │   │   ├── autocorrelation.py
│   │   │   ├── ou_process.py
│   │   │   ├── slope.py
│   │   │   └── bollinger.py
│   │   ├── validation/               # NEW
│   │   │   └── schema_validator.py
│   │   ├── quality/                  # NEW
│   │   │   └── dashboard.py
│   │   └── engine.py                 # MODIFIED (TODOs removed)
│   ├── exit_strategy/
│   │   ├── triggers/                 # NEW
│   │   │   ├── mandatory.py
│   │   │   ├── latest_acceptable.py
│   │   │   └── warning.py
│   │   ├── evaluator.py              # ENHANCED
│   │   ├── state_tracker.py          # ENHANCED
│   │   ├── history_loader.py         # NEW
│   │   ├── audit_logger.py           # NEW
│   │   └── kpis.py                   # NEW
│   ├── position/                     # NEW
│   │   ├── tracker.py
│   │   └── risk_calculator.py
│   └── ...
├── tests/
│   ├── regime/metrics/               # NEW (22 test cases)
│   ├── exit_strategy/triggers/       # NEW (26 test cases)
│   ├── position/                     # NEW
│   └── integration/                  # ENHANCED
├── backtest/                         # NEW
│   └── regime_exit_backtest.py
├── config/
│   └── exit_strategy_config.yaml     # NEW
└── docs/                             # NEW
    ├── ops/
    │   ├── runbook.md
    │   └── troubleshooting.md
    ├── metrics_guide.md
    └── configuration.md
```
## Immediate Next Steps

### Week 1: Start Phase 1

1. Setup (Day 1-2)
   - Review this document
   - Set up the development environment
   - Create a feature branch: `feature/phase-1-data-quality`

2. Implement Metrics (Day 3-5)
   - ADX calculation
   - Efficiency Ratio
   - Unit tests (10 test cases)
   - Validate against TradingView

3. Continue (Week 2-3)
   - Remaining metrics
   - Integration with the regime engine
   - Data validation
   - Quality dashboard
## Related Documentation

- Detailed SOW: `.builders/0013-market-maker-mvp/SYSTEM_ANALYSIS.md`
- Phase 1 Plan: `.builders/0013-market-maker-mvp/PHASE_1_PLAN.md`
- Original Review: `.ai/projects/market-making/SYSTEM_REVIEW.md`
- Exit Strategy Spec: `.ai/projects/market-making/grid-exit-strategy/spec.md`
- Requirements: `.ai/projects/market-making/regime-management/requirements.md`
- New Instructions: `.ai/projects/market-making/new-instructions.md`
## Conclusion

The market-making system has a strong technical foundation but requires focused effort to complete:
- 🔴 Phase 1 (Data Quality) - CRITICAL: fix the dummy data immediately
- 🔴 Phase 2 (Exit Strategy) - CRITICAL: the core value proposition
- 🟡 Phase 3 (Position Risk) - IMPORTANT: enhances notifications
- 🔴 Phase 4 (Testing) - CRITICAL: cannot deploy without it
- 🟢 Phase 5 (Operational) - NICE TO HAVE: polish and monitoring

Total: 180-250 hours over 7-10 weeks

Recommendation: Start Phase 1 immediately. It blocks everything else and addresses the root cause of the trust issues with the system.
Document Version: 1.0
Last Updated: 2026-01-31
Next Review: After Phase 1 completion