Architecture Drift in AI-Generated Code: The Root Cause Explained
Architecture drift is the primary structural root cause in AI-generated codebases — the mechanism by which a coherent initial design degrades, one prompt session at a time, into a system where boundaries are invisible, logic is misplaced, and every change carries an unpredictable blast radius.
It is the root cause with the highest weight in the AI Chaos Index scoring model: 25%. This reflects its role not just as one of five structural problems, but as the precondition that enables the others. Dependency graph corruption accelerates when boundaries erode. Structural entropy accumulates when there is no enforced naming or layer standard. Test infrastructure failure becomes self-reinforcing when the architecture is too unstable to test safely.
This page explains the mechanism of architecture drift in technical depth — how it forms, how it propagates, how to detect it with precision, and what structural intervention is required to stop it.
Who This Is For
Developers, architects, and technical leads working with AI-generated codebases who need a precise technical understanding of:
- Why prompt-driven development produces architecture drift as a structural consequence, not an accident
- How to detect architecture drift with concrete, reproducible metrics
- The difference between addressing symptoms (fixing individual violations) and addressing the root cause (establishing enforced boundaries)
- How enforced boundary isolation stabilizes a drifted codebase without requiring a rewrite
This page assumes technical familiarity with software architecture concepts. For the founder-facing explanation of what architecture drift feels like in practice, see Architecture Drift — Pain Page.
The Mechanism: Why Prompt-Driven Development Produces Architecture Drift
Architecture drift is not a bug in AI tools. It is a structural consequence of how prompt-driven development works at scale.
The Local Optimization Problem
Every AI prompt session operates with a bounded context window. The AI sees:
- The files explicitly provided in the prompt
- The immediate task to be accomplished
- The code that needs to be generated or modified
The AI does not see:
- The architectural decisions made in previous sessions
- The layer boundaries that were established at the start of the project
- The import graph across the entire codebase
- The naming conventions that were agreed upon but not enforced
The result is local optimization without global structural awareness. Each prompt session produces code that is locally coherent — it solves the immediate problem, it compiles, it passes the immediate test. But it is globally erosive: it places logic in the most convenient location rather than the architecturally correct one.
The Accumulation Mechanism
Architecture drift accumulates through a specific pattern:
Session 1: Developer prompts "add user authentication."
AI generates: auth logic in a dedicated auth module. ✅
Session 12: Developer prompts "show user tier in the dashboard."
AI generates: user tier lookup directly in the Dashboard component.
Reason: Dashboard.tsx is in context. auth module is not. ⚠️
Session 31: Developer prompts "apply discount based on user tier."
AI generates: discount calculation in the checkout route handler.
Reason: route handler is in context. business logic layer is not. ⚠️
Session 47: Developer prompts "fix the discount calculation bug."
AI generates: a second discount calculation in a utility file.
Reason: the original calculation in the route handler is not in context. ❌
Session 63: Developer prompts "add enterprise pricing."
AI generates: pricing logic in three different files.
Reason: three different files are in context. None of them is the canonical location. ❌
By session 63, the pricing logic exists in four places. The user tier lookup exists in six places. The auth logic has leaked into five components. The architecture has drifted — not through a single catastrophic decision, but through 63 locally reasonable ones.
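The end state can be made concrete with a minimal sketch. File and function names here are hypothetical, but the shape is the one the sessions above produce: two independently generated copies of the same business rule that can silently disagree — which is exactly why session 47's "bug fix" created a second calculation instead of fixing the first.

```python
# Hypothetical end state after ~60 prompt sessions: the same business rule
# generated twice, in two files, because neither was in context for the other.

# checkout/routes.py -- session 31's version, embedded in the route handler
def checkout_total(price: float, tier: str) -> float:
    discount = 0.20 if tier == "enterprise" else 0.0  # the rule lives here...
    return price * (1 - discount)

# utils/pricing.py -- session 47's "fix": a second copy with a drifted rule
def apply_discount(price: float, tier: str) -> float:
    discount = 0.25 if tier == "enterprise" else 0.0  # ...and also here
    return price * (1 - discount)

# The two code paths now disagree for the same input -- the classic drift symptom:
print(checkout_total(100.0, "enterprise"))  # 80.0
print(apply_discount(100.0, "enterprise"))  # 75.0
```

Neither function is wrong in isolation; the defect only exists at the codebase level, which is why diff-level review does not catch it.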
Why Enforcement Is the Only Solution
The pattern above reveals why code review alone cannot prevent architecture drift. Each individual change — user tier in Dashboard, discount in route handler — is locally defensible. A reviewer who sees only the diff cannot easily determine that the change is architecturally wrong without a complete understanding of the intended layer structure.
Enforcement requires automated tooling that operates at the codebase level, not the diff level:
- A boundary linter that detects cross-layer imports regardless of how they were introduced
- A naming linter that enforces consistent naming conventions across all files
- A CI/CD pipeline that runs these checks on every commit, before merge
Without automated enforcement, architecture drift is structurally inevitable in any codebase that uses prompt-driven development at scale.
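A minimal sketch of such a codebase-level check, assuming a hypothetical convention where the top-level package name encodes the layer, can be built on Python's `ast` module. The layer names and rule table below are illustrative, not a standard:

```python
import ast

# Hypothetical directional rules: each layer may import only from the
# layers listed as its allowed targets. Layer names are illustrative.
ALLOWED = {
    "ui": {"hooks"},
    "hooks": {"services"},
    "services": {"repositories"},
    "repositories": set(),
}

def boundary_violations(source: str, layer: str) -> list[str]:
    """Return imports in `source` that cross a forbidden layer boundary."""
    violations = []
    for node in ast.walk(ast.parse(source)):
        targets = []
        if isinstance(node, ast.Import):
            targets = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module:
            targets = [node.module]
        for target in targets:
            top = target.split(".")[0]  # top-level package encodes the layer
            if top in ALLOWED and top != layer and top not in ALLOWED[layer]:
                violations.append(target)
    return violations

# A UI module reaching straight into the repository layer is flagged:
print(boundary_violations("from repositories.users import UserRepo", "ui"))
# A UI module importing from the hooks layer passes:
print(boundary_violations("import hooks.use_user", "ui"))
```

Wired into CI as a failing check, a tool of this shape enforces the boundary regardless of which session, developer, or model introduced the import.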
Technical Depth: The Three Failure Patterns of Architecture Drift
Architecture drift manifests through three specific, detectable failure patterns.
FP001: Oversized Files — The Accumulation Signal
Oversized files are the most reliable early signal of architecture drift. A file that started as a focused module and has grown to 600, 800, or 1000+ lines is accumulating logic it should not own.
The mechanism: when a developer prompts "add X to the checkout flow," the AI adds X to the most convenient file in context — often the largest file, because it already contains the most related logic. The file grows. The next prompt adds more logic to the same file. The file grows further. By the time the file reaches 1000 lines, it contains logic from three different architectural layers.
Quantitative thresholds (from scoring model):
| Files >500 LOC ratio | RC01 base severity |
|---|---|
| 0–5% | 2 |
| 5–15% | 4 |
| 15–30% | 6 |
| >30% | 8 |
Secondary signals that increase severity: cross-layer imports, business logic in wrong layer, missing code separation, git full-file rewrites.
FP002: Business Logic in Wrong Layer — The Boundary Violation Signal
Business logic in the wrong layer is the direct evidence of architectural boundary erosion. The canonical violations in AI-generated codebases:
Backend violations:
- Database queries in route handlers (should be in repository layer)
- Business calculations in API endpoints (should be in service layer)
- Validation logic in database models (should be in service or domain layer)
- External API calls in service layer (should be in adapter/infrastructure layer)
Frontend violations:
- Direct database calls in React components (should be in API layer)
- Business calculations in UI components (should be in hook or service layer)
- State management logic in presentational components (should be in container or store)
- Pricing, discount, or tax logic in render functions (should be in business logic layer)
Each violation is individually small. In aggregate, they represent the dissolution of the layer architecture that makes the system testable, maintainable, and safe to change.
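One backend violation and its structural fix can be sketched in a few lines (module and function names hypothetical). The point of the fix is not cosmetic: the calculation moves from the route handler into a service function that is testable with plain data, without standing up the HTTP layer:

```python
# BEFORE (drifted): the business calculation is inline in the route handler.
# Testing it requires exercising the whole HTTP layer.
def checkout_handler_drifted(request: dict) -> dict:
    subtotal = sum(item["price"] * item["qty"] for item in request["items"])
    tax = subtotal * 0.08          # business rule trapped in the handler
    return {"total": round(subtotal + tax, 2)}

# AFTER (layered): the rule lives in a service function; the handler only
# translates between the HTTP shape and the service layer.
def compute_total(items: list[dict], tax_rate: float = 0.08) -> float:
    subtotal = sum(item["price"] * item["qty"] for item in items)
    return round(subtotal * (1 + tax_rate), 2)

def checkout_handler(request: dict) -> dict:
    return {"total": compute_total(request["items"])}

# The service is now unit-testable with plain data:
print(compute_total([{"price": 10.0, "qty": 2}]))  # 21.6
```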
FP006: Circular Dependencies — The Graph Corruption Signal
Circular dependencies are the structural consequence of architecture drift at the import graph level. When boundaries erode, modules begin importing each other's internals. The import graph develops cycles.
The mechanism: module A imports from module B. Module B, in a later prompt session, imports from module A. Neither the developer nor the AI notices — the import resolves, the code compiles, the feature ships. The circular dependency is now present in the graph.
The consequence: isolation becomes impossible. A change in module A propagates to module B, which propagates back to module A. The blast radius of any change is no longer bounded by the change itself — it is bounded by the entire circular subgraph.
Circular dependencies are the structural reason why architecture drift produces fragile systems: once the import graph has cycles, every change has an unpredictable blast radius.
Detection: Measuring Architecture Drift with Precision
The following detection methodology maps directly to the AI Chaos Index scoring model for RC01.
Step 1: File Size Distribution (primary signal)
# Complete file size distribution — the architecture drift fingerprint
find . \( -name "*.py" -o -name "*.ts" -o -name "*.tsx" \) \
-not -path "*/node_modules/*" -not -path "*/.git/*" \
-not -path "*/__pycache__/*" -not -name "*.test.*" -not -name "*.spec.*" | \
xargs wc -l 2>/dev/null | grep -v ' total$' | \
awk '{
if ($1 < 100) s++
else if ($1 < 300) m++
else if ($1 < 500) l++
else { c++; files[c]=$2 }
total++
}
END {
print "< 100 LOC:", s, "files"
print "100-300 LOC:", m, "files"
print "300-500 LOC:", l, "files"
print "> 500 LOC:", c, "files (drift signal)"
printf "Drift ratio: %.1f%%\n", (total ? c/total*100 : 0)
print "\nDrift hotspots:"
for (i=1; i<=c && i<=10; i++) print " ", files[i]
}'
Step 2: Layer Boundary Violations (secondary signal)
# Python: business logic in route handlers
echo "=== DB queries in route/handler files ==="
find . \( -name "routes.py" -o -name "views.py" -o -name "handlers.py" \
-o -path "*/api/*.py" -o -path "*/routes/*.py" \) \
-not -path "*/test*" 2>/dev/null | \
xargs grep -lE "\.(query|filter|execute)\(|session\." 2>/dev/null
# TypeScript: direct DB/API calls in React components
echo "=== Direct data access in UI components ==="
find src -name "*.tsx" -not -name "*.test.tsx" 2>/dev/null | \
xargs grep -lE "fetch\(|axios\.|supabase\.|prisma\.|\.from\(" 2>/dev/null | \
grep -v "api\|service\|hook\|use[A-Z]"
# Business calculations in UI
echo "=== Business logic keywords in UI layer ==="
find src/components src/pages src/app -name "*.tsx" 2>/dev/null | \
xargs grep -c "price\|discount\|tax\|invoice\|billing" 2>/dev/null | \
grep -v ":0$" | sort -t: -k2 -rn | head -10
Step 3: Circular Dependency Detection (tertiary signal)
# Python
pip install pydeps --quiet 2>/dev/null
pydeps . --max-bacon=3 --show-cycles 2>/dev/null
# TypeScript/JavaScript
npx madge --circular --extensions ts,tsx,js src/ 2>/dev/null
# Count circular chains
# Count circular chains (grep -c already prints 0 when no cycles are found)
CYCLES=$(npx madge --circular --extensions ts,tsx src/ 2>/dev/null | \
  grep -c "→")
echo "Circular dependency chains: $CYCLES"
Step 4: RC01 Severity Calculation
Using the AI Chaos Index scoring model:
primary_signal = files_over_500loc / total_files × 100
secondary_signals = [
cross_layer_imports_present, # boolean
business_logic_in_wrong_layer, # boolean
missing_code_separation, # boolean
git_full_file_rewrites # boolean
]
secondary_bonus = count(secondary_signals_present) × 0.75
RC01_severity = min(lookup(primary_signal) + secondary_bonus, 10)
Example calculation:
Codebase: 187 files, 7 files >500 LOC = 3.7% → base score: 2
Secondary: cross_layer(✓) + biz_wrong_layer(✓) + missing_sep(✓) = 3 × 0.75 = 2.25
RC01_severity = 2 + 2.25 = 4.25 → rounded: 4
RC01 contribution to ACI = 4 × 0.25 = 1.0 (out of 2.5 max)
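The worked example can be reproduced with a short sketch of the scoring formula. The threshold bands come from the FP001 table above; the function names are illustrative:

```python
def rc01_base(drift_ratio_pct: float) -> int:
    """Map the >500 LOC file ratio to the RC01 base severity band."""
    if drift_ratio_pct <= 5:
        return 2
    if drift_ratio_pct <= 15:
        return 4
    if drift_ratio_pct <= 30:
        return 6
    return 8

def rc01_severity(files_over_500: int, total_files: int,
                  secondary_signals: int) -> float:
    """Base severity from the ratio, +0.75 per secondary signal, capped at 10."""
    ratio = files_over_500 / total_files * 100
    return min(rc01_base(ratio) + secondary_signals * 0.75, 10)

# Worked example from above: 7 of 187 files oversized, 3 secondary signals.
severity = rc01_severity(7, 187, 3)
print(severity)                 # 4.25
print(round(severity) * 0.25)   # ACI contribution: 1.0
```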
The Boundary Enforcement Model
The structural intervention required to stop architecture drift is enforced layer boundaries. The enforcement model has three components:
1. Layer Definition
A documented, explicit layer architecture that defines:
- Which layers exist (e.g., UI → Hook → Service → Repository → Database)
- Which imports are allowed between layers (directional, never circular)
- Which types of logic belong in which layer
Without an explicit layer definition, enforcement is impossible — the linter has nothing to enforce against.
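A minimal sketch of what "explicit" means in practice: the layer order and the allowed import direction encoded as data that a linter can check against. The layer names follow the UI → Hook → Service → Repository → Database example above; the helper is illustrative:

```python
# Directional layer order, top to bottom. An import is legal only if it
# points downward in this stack (or stays within the same layer).
LAYERS = ["ui", "hook", "service", "repository", "database"]

def import_allowed(importer: str, imported: str) -> bool:
    """Downward or same-layer imports are legal; upward imports are not."""
    return LAYERS.index(importer) <= LAYERS.index(imported)

print(import_allowed("ui", "hook"))        # True: downward, allowed
print(import_allowed("repository", "ui"))  # False: upward, forbidden
```

Because the rule is directional, any graph that respects it is acyclic by construction — which is precisely the property that rules out the FP006 pattern.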
2. Boundary Linter
An automated tool that detects cross-layer imports and fails the build when they are present:
# Example: ASA boundary linter rule
# domains/billing/calculate_overage/repository.py
from domains.user.get_profile import UserProfileService # ❌ FORBIDDEN
# Cross-domain import detected: billing cannot import from user domain
# Fix: use shared interface or event-based communication
# Run boundary linter
asa lint billing/calculate_overage
# ❌ [LINT FAIL] Boundary violation in repository.py:
# Line 3: Illegal import 'domains.user.get_profile'
# -> Cannot import from other domains.
3. CI/CD Enforcement
The boundary linter runs on every commit, before merge. A commit that introduces a cross-layer import fails the CI pipeline and cannot be merged. This is the structural enforcement layer — it operates regardless of whether the developer noticed the violation, regardless of whether the reviewer caught it, and regardless of whether the AI generated it.
The Cap & Grow Stabilization Model
For codebases where architecture drift has already accumulated, the Cap & Grow methodology provides a zero-rewrite stabilization path:
Cap: The existing drifted code is frozen. No new features are added to the legacy zone. The legacy code is treated as read-only — it can be read for context, but not modified.
Bridge: An adapter layer is established between the legacy code and the new clean zone. The bridge translates between the legacy data structures and the clean domain model.
Grow: All new development happens in the clean zone — isolated, independently testable modules with explicit layer boundaries enforced by the boundary linter and CI/CD pipeline.
The result: the legacy architecture drift is contained. It does not spread. New development happens in a zone where drift is structurally impossible — not because developers are more careful, but because the tooling makes architectural violations impossible to merge.
Before Cap & Grow:
[Legacy drifted code] ←→ [New features added to legacy zone] → drift accelerates
After Cap & Grow:
[Legacy drifted code] → [Bridge adapter] → [Clean ASA zone]
↑
All new development here
Boundary linter enforced
CI/CD enforced