CI/CD Integration

Run Specflow contracts and journey tests in your CI/CD pipeline.

The Fail-Fast Pipeline Pattern
Contract Completeness Gate (NEW)
The Key: needs: contract-tests
Recommended GitHub Actions Pipeline
Hooks vs CI: Two Enforcement Layers
1. Why Both?
2. The Combined Workflow
Journey Test Criticality
1. Mark Criticality in Contracts
2. CI Job for Criticality
Branch Protection Setup
Debugging CI Failures
1. Contract Test Failure
2. Journey Test Failure
GitLab CI Example
npm Scripts Integration
Agent Teams CI Pipeline (5 Gates)
1. Key Differences from Standard Pipeline
Summary

The Fail-Fast Pipeline Pattern

The most important CI pattern for Specflow is fail-fast:

contracts → unit-tests → build → e2e-tests → journey-tests
    ↓           ↓         ↓          ↓            ↓
  FAIL?      SKIP      SKIP       SKIP         SKIP

Why fail fast?

Contract tests are FAST (pattern matching, no browser)
E2E tests are SLOW (browser automation)
If contracts fail, skip expensive tests (save CI minutes)

Contract Completeness Gate (NEW)

Before the contract pattern tests run, a completeness check verifies that CONTRACT_INDEX.yml is in sync with the contract files actually on disk. This closes the gap where tickets are created but the matching contract artifacts are not.

# .github/workflows/specflow-ci.yml
contract-completeness:
  name: Contract Completeness
  runs-on: ubuntu-latest
  steps:
    - uses: actions/checkout@v4
    - uses: pnpm/action-setup@v4  # install pnpm first; setup-node's pnpm cache needs it on PATH
      with: { version: 9 }
    - uses: actions/setup-node@v4
      with: { node-version: '20', cache: 'pnpm' }
    - run: pnpm install --frozen-lockfile
    - name: Check contract completeness
      run: node scripts/check-contract-completeness.mjs

What it checks:

Every journey_*.yml on disk has a CONTRACT_INDEX entry
Every CONTRACT_INDEX journey entry has a .yml file on disk
Every feature_*.yml on disk has a CONTRACT_INDEX entry
Metadata counts (total_contracts, total_journeys) match reality
Every feature contract's `journeys:` list references real journey entries

When it fails, the output tells you exactly what to fix:

✗ Found 2 completeness issue(s):

  1. [ORPHAN_FILE] Journey file exists but is NOT in CONTRACT_INDEX: journey_user_login.yml

     HOW TO FIX:
     Add this entry to docs/contracts/CONTRACT_INDEX.yml ...

  2. [COUNT_MISMATCH] metadata.total_journeys = 12 but found 14

     HOW TO FIX:
     Open docs/contracts/CONTRACT_INDEX.yml
     Change: total_journeys: 14

There is also a Jest test (contract_completeness.test.ts) that performs the same checks locally via pnpm test -- contracts.
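The checks listed above can be sketched as a small pure function. This is only an illustration, not the actual `check-contract-completeness.mjs`: the `ContractIndex` shape, the `checkCompleteness` name, and the `MISSING_FILE` label are assumptions made for the example (only `ORPHAN_FILE` and `COUNT_MISMATCH` appear in the sample output), and it covers journey files only.

```typescript
// Hypothetical, simplified view of CONTRACT_INDEX.yml; the real schema may differ.
interface ContractIndex {
  journeys: string[];     // journey file names listed in the index
  totalJourneys: number;  // stands in for metadata.total_journeys
}

interface Issue {
  kind: "ORPHAN_FILE" | "MISSING_FILE" | "COUNT_MISMATCH"; // MISSING_FILE is an assumed label
  detail: string;
}

function checkCompleteness(filesOnDisk: string[], index: ContractIndex): Issue[] {
  const issues: Issue[] = [];
  const journeyFiles = filesOnDisk.filter((f) => /^journey_.*\.yml$/.test(f));

  // Every journey file on disk must have an index entry.
  for (const file of journeyFiles) {
    if (index.journeys.indexOf(file) === -1) {
      issues.push({ kind: "ORPHAN_FILE", detail: file });
    }
  }
  // Every index entry must have a file on disk.
  for (const entry of index.journeys) {
    if (journeyFiles.indexOf(entry) === -1) {
      issues.push({ kind: "MISSING_FILE", detail: entry });
    }
  }
  // The metadata count must match what is actually on disk.
  if (index.totalJourneys !== journeyFiles.length) {
    issues.push({
      kind: "COUNT_MISMATCH",
      detail: `total_journeys = ${index.totalJourneys} but found ${journeyFiles.length}`,
    });
  }
  return issues;
}

// Example input: one orphan file, one dangling index entry, and a stale count.
const completenessIssues = checkCompleteness(
  ["journey_user_login.yml", "journey_export_report.yml", "feature_auth.yml"],
  { journeys: ["journey_export_report.yml", "journey_checkout.yml"], totalJourneys: 3 },
);
```

A CI wrapper around a function like this would print each issue and exit non-zero when the list is not empty.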

The Key: `needs: contract-tests`

This single line creates the fail-fast behavior:

e2e-tests:
  needs: contract-tests  # ← E2E waits for contracts

If contract-tests fails:

e2e-tests is SKIPPED
journey-tests is SKIPPED
You save 10-20 minutes of CI time
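To make the skip behavior concrete, here is a rough model of how `needs` propagates a failure. It is a sketch, not how GitHub Actions is implemented: `resolveStatuses` and the `Job` type are hypothetical names, and it assumes the default rule that a job is skipped when any job it needs did not succeed (no `if: always()` overrides).

```typescript
interface Job {
  name: string;
  needs: string[];
}

type Status = "passed" | "failed" | "skipped";

// Jobs are assumed to be listed in dependency order (needed jobs come first).
function resolveStatuses(jobs: Job[], failing: string[]): { [name: string]: Status } {
  const status: { [name: string]: Status } = {};
  for (const job of jobs) {
    // A job is blocked as soon as one of its dependencies failed or was skipped.
    const blocked = job.needs.some((dep) => status[dep] !== "passed");
    if (blocked) {
      status[job.name] = "skipped";
    } else {
      status[job.name] = failing.indexOf(job.name) !== -1 ? "failed" : "passed";
    }
  }
  return status;
}

// Job graph mirroring the recommended pipeline in this guide.
const pipeline: Job[] = [
  { name: "contract-tests", needs: [] },
  { name: "unit-tests", needs: [] },
  { name: "e2e-tests", needs: ["contract-tests"] },
  { name: "journey-tests", needs: ["contract-tests"] },
  { name: "build", needs: ["unit-tests", "contract-tests"] },
];

const pipelineResult = resolveStatuses(pipeline, ["contract-tests"]);
```

In this model a contract failure skips `e2e-tests`, `journey-tests`, and `build`, while `unit-tests` still runs because it declares no `needs`.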

Recommended GitHub Actions Pipeline

# .github/workflows/ci.yml
name: CI

on:
  push:
    branches: [main]
  pull_request:
    branches: [main]

env:
  NODE_VERSION: '20'

jobs:
  # ============================================
  # STEP 1: Contract Tests (FAIL FAST)
  # ============================================
  contract-tests:
    name: Contract Tests (Pattern Enforcement)
    runs-on: ubuntu-latest
    timeout-minutes: 10

    steps:
      - uses: actions/checkout@v4

      - uses: pnpm/action-setup@v3
        with:
          version: 9

      - uses: actions/setup-node@v4
        with:
          node-version: ${{ env.NODE_VERSION }}
          cache: 'pnpm'

      - name: Install dependencies
        run: pnpm install --frozen-lockfile

      - name: Run contract tests
        run: pnpm test -- contracts --passWithNoTests

  # ============================================
  # STEP 2: Unit Tests (parallel with contracts)
  # ============================================
  unit-tests:
    name: Unit Tests
    runs-on: ubuntu-latest
    timeout-minutes: 15

    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with:
          version: 9
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ env.NODE_VERSION }}
          cache: 'pnpm'
      - run: pnpm install --frozen-lockfile
      - run: pnpm test:coverage

  # ============================================
  # STEP 3: E2E Tests (WAITS FOR CONTRACTS)
  # ============================================
  e2e-tests:
    name: E2E Tests
    runs-on: ubuntu-latest
    timeout-minutes: 20
    needs: contract-tests  # ← KEY: Wait for contracts!

    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with:
          version: 9
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ env.NODE_VERSION }}
          cache: 'pnpm'
      - run: pnpm install --frozen-lockfile
      - run: pnpm exec playwright install --with-deps chromium
      - run: pnpm test:e2e

      - name: Upload test results
        if: always()
        uses: actions/upload-artifact@v4
        with:
          name: playwright-report
          path: playwright-report/

  # ============================================
  # STEP 4: Journey Tests (RELEASE GATING)
  # ============================================
  journey-tests:
    name: Journey Tests (RELEASE GATING)
    runs-on: ubuntu-latest
    timeout-minutes: 15
    needs: contract-tests  # ← Wait for contracts

    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with:
          version: 9
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ env.NODE_VERSION }}
          cache: 'pnpm'
      - run: pnpm install --frozen-lockfile
      - run: pnpm exec playwright install --with-deps chromium
      - run: pnpm test:e2e tests/e2e/journey_*.spec.ts

  # ============================================
  # STEP 5: Build
  # ============================================
  build:
    name: Build Application
    runs-on: ubuntu-latest
    timeout-minutes: 10
    needs: [unit-tests, contract-tests]

    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with:
          version: 9
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ env.NODE_VERSION }}
          cache: 'pnpm'
      - run: pnpm install --frozen-lockfile
      - run: pnpm build

Hooks vs CI: Two Enforcement Layers

Specflow enforces contracts in two places:

Layer	Where	When	Speed
Hooks	Local (your machine)	Build, commit	Seconds
CI	Remote (GitHub)	PR, merge	Minutes

Why Both?

Hooks catch problems before you push:

Instant feedback
No wasted CI minutes
You see issues immediately

CI catches problems after you push:

Authoritative source of truth
Clean environment
Required for branch protection

The Combined Workflow

Your Machine                           GitHub
────────────                           ──────

pnpm build
    ↓
[post-build HOOK]
    ↓
Runs journey tests locally ← Fast feedback
    ↓
git commit (#375)
    ↓
[post-commit HOOK]
    ↓
Extracts #375, runs journey
    ↓
git push
    └──────────────────────────────→ [contract-tests]
                                          ↓ needs:
                                     [e2e-tests]
                                          ↓ needs:
                                     [journey-tests]
                                          ↓
                                     Branch protection ✓
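The post-commit step in the diagram pulls the issue number out of the commit message. How the real hook does this is not shown here, so the following is only a sketch of that idea: `extractIssueNumbers` is a hypothetical helper, and mapping an issue number to its journey test is left out because that lookup is project-specific.

```typescript
// Collect unique issue references such as "#375" from a commit message.
function extractIssueNumbers(commitMessage: string): number[] {
  const pattern = /#(\d+)/g;
  const found: number[] = [];
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(commitMessage)) !== null) {
    const issue = parseInt(match[1], 10);
    if (found.indexOf(issue) === -1) {
      found.push(issue); // keep first-seen order, drop duplicates
    }
  }
  return found;
}

const singleIssue = extractIssueNumbers("fix: leave request form (#375)");
const severalIssues = extractIssueNumbers("refs #12, closes #375 and #12");
const noIssue = extractIssueNumbers("chore: bump dependencies");
```

A hook built on this would run only the journeys linked to the returned issue numbers and do nothing when the list is empty.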

Journey Test Criticality

Not all journeys are equally important:

Criticality	Example	CI Behavior
Critical	`J-USER-LOGIN`	BLOCK merge
Important	`J-EXPORT-REPORT`	WARN only
Future	`J-AI-ASSISTANT`	Skip

Mark Criticality in Contracts

# docs/contracts/journey_user_login.yml
name: User Login Journey
type: journey
criticality: critical  # ← Affects CI gating

scenarios:
  - name: User logs in with email
    # ...

CI Job for Criticality

journey-tests:
  steps:
    - name: Run critical journeys
      run: pnpm test:e2e -- --grep "@critical"

    - name: Run important journeys (warn only)
      continue-on-error: true  # ← Doesn't block
      run: pnpm test:e2e -- --grep "@important"
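The gating rules in the criticality table can be summarized as a small decision function. This is an illustrative sketch rather than part of Specflow: the `JourneyResult` and `GateDecision` types and the `gate` function are hypothetical names, and in the CI job above the same effect comes from `--grep` plus `continue-on-error`.

```typescript
type Criticality = "critical" | "important" | "future";

interface JourneyResult {
  id: string;
  criticality: Criticality;
  passed: boolean;
}

interface GateDecision {
  blockMerge: boolean;
  blocking: string[]; // failed critical journeys
  warnings: string[]; // failed important journeys
}

function gate(results: JourneyResult[]): GateDecision {
  const blocking: string[] = [];
  const warnings: string[] = [];
  for (const r of results) {
    // Passing journeys need no action; "future" journeys are skipped entirely.
    if (r.passed || r.criticality === "future") continue;
    if (r.criticality === "critical") {
      blocking.push(r.id);
    } else {
      warnings.push(r.id);
    }
  }
  return { blockMerge: blocking.length > 0, blocking, warnings };
}

// Journey IDs taken from the criticality table above.
const decision = gate([
  { id: "J-USER-LOGIN", criticality: "critical", passed: true },
  { id: "J-EXPORT-REPORT", criticality: "important", passed: false },
  { id: "J-AI-ASSISTANT", criticality: "future", passed: false },
]);
```

Here the merge is not blocked: the only failures are an important journey (reported as a warning) and a future journey (ignored).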

Branch Protection Setup

Configure GitHub to require passing checks:

Settings → Branches → main → Branch protection rules

Required status checks:
✅ contract-tests     ← Must pass
✅ journey-tests      ← Must pass
◻️ e2e-tests         ← Optional

✅ Require branches to be up to date

Debugging CI Failures

Contract Test Failure

❌ contract-tests failed

Error: CONTRACT VIOLATION: ARCH-001
File: src/routes/AdminPage.tsx
Issue: Protected route missing ProtectedRoute wrapper

Fix: Read the contract, add the wrapper.

Journey Test Failure

❌ journey-tests failed

Error: J-STAFF-REQUEST-LEAVE scenario 2
File: tests/e2e/journey_staff_request_leave.spec.ts:45
Issue: Expected "Pending" but got "Error"

Debug locally:

pnpm test:e2e:ui tests/e2e/journey_staff_request_leave.spec.ts

GitLab CI Example

# .gitlab-ci.yml
stages:
  - contracts
  - test
  - build

contract-tests:
  stage: contracts
  script:
    - pnpm install --frozen-lockfile
    - pnpm test -- contracts

e2e-tests:
  stage: test
  needs: [contract-tests]  # ← Wait for contracts
  script:
    - pnpm install --frozen-lockfile
    - pnpm exec playwright install --with-deps
    - pnpm test:e2e

build:
  stage: build
  needs: [contract-tests]
  script:
    - pnpm install --frozen-lockfile  # node_modules is not shared between jobs by default
    - pnpm build

npm Scripts Integration

Add these to your package.json:

{
  "scripts": {
    "test": "jest",
    "test:contracts": "jest src/__tests__/contracts/",
    "test:e2e": "playwright test",
    "test:e2e:journeys": "playwright test tests/e2e/journey_*.spec.ts",
    "ci:contracts": "npm run test:contracts -- --passWithNoTests",
    "ci:full": "npm run ci:contracts && npm test && npm run build"
  }
}

Agent Teams CI Pipeline (5 Gates)

When using Agent Teams mode, a more comprehensive CI pipeline is available with three-tier journey enforcement.

File: .github/workflows/specflow-ci.yml

# .github/workflows/specflow-ci.yml
name: Specflow CI (Agent Teams)

on:
  push:
    branches: [main]
  pull_request:
    branches: [main]

jobs:
  # Gate 1: Contract Tests (pattern enforcement)
  gate-1-contracts:
    name: "Gate 1: Contracts"
    runs-on: ubuntu-latest
    timeout-minutes: 10
    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with: { version: 9 }
      - uses: actions/setup-node@v4
        with: { node-version: '20', cache: 'pnpm' }
      - run: pnpm install --frozen-lockfile
      - run: pnpm test -- contracts --passWithNoTests

  # Gate 2: Build
  gate-2-build:
    name: "Gate 2: Build"
    needs: gate-1-contracts
    runs-on: ubuntu-latest
    timeout-minutes: 10
    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with: { version: 9 }
      - uses: actions/setup-node@v4
        with: { node-version: '20', cache: 'pnpm' }
      - run: pnpm install --frozen-lockfile
      - run: pnpm build

  # Gate 3: Tier 2 Journey Tests (wave-level)
  gate-3-tier2:
    name: "Gate 3: Tier 2 Journey Tests"
    needs: gate-2-build
    runs-on: ubuntu-latest
    timeout-minutes: 20
    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with: { version: 9 }
      - uses: actions/setup-node@v4
        with: { node-version: '20', cache: 'pnpm' }
      - run: pnpm install --frozen-lockfile
      - run: pnpm exec playwright install --with-deps chromium
      - run: pnpm test:e2e tests/e2e/journey_*.spec.ts

  # Gate 4: Tier 3 Regression (baseline comparison)
  gate-4-tier3:
    name: "Gate 4: Tier 3 Regression"
    needs: gate-3-tier2
    runs-on: ubuntu-latest
    timeout-minutes: 20
    steps:
      - uses: actions/checkout@v4
      - uses: pnpm/action-setup@v3
        with: { version: 9 }
      - uses: actions/setup-node@v4
        with: { node-version: '20', cache: 'pnpm' }
      - run: pnpm install --frozen-lockfile
      - run: pnpm exec playwright install --with-deps chromium
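      - name: Run full E2E suite
        env:
          PLAYWRIGHT_JSON_OUTPUT_NAME: test-results.json  # report goes straight to a file, so pnpm and stderr output cannot corrupt it
        run: pnpm test:e2e --reporter=json || true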
      - name: Compare against baseline
        run: |
          if [ -f .specflow/baseline.json ]; then
            node -e "
              const baseline = require('./.specflow/baseline.json');
              const results = require('./test-results.json');
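              // Compare and fail on regressions.
              // Assumes both files expose a top-level passing array of test names;
              // a raw Playwright JSON report nests results under suites and needs converting first.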
              const regressions = baseline.passing.filter(t => !results.passing?.includes(t));
              if (regressions.length > 0) {
                console.error('REGRESSIONS:', regressions);
                process.exit(1);
              }
            "
          fi

  # Gate 5: Deploy (only on main)
  gate-5-deploy:
    name: "Gate 5: Deploy"
    needs: gate-4-tier3
    if: github.ref == 'refs/heads/main'
    runs-on: ubuntu-latest
    steps:
      - run: echo "All gates passed. Deployment proceeds via platform (Vercel, etc.)"
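The baseline comparison in Gate 4 reads a `passing` array from both files, but Playwright's JSON reporter nests results under `suites` and `specs`. One way to bridge that is sketched below. The `Report` types cover only the fields used here and are an assumed subset of the reporter output, `collectPassing` and `findRegressions` are hypothetical helpers, and identifying a test by its joined title path is an assumption about how `.specflow/baseline.json` names tests.

```typescript
// Assumed subset of the Playwright JSON report.
interface Spec {
  title: string;
  ok: boolean;
}
interface Suite {
  title: string;
  specs?: Spec[];
  suites?: Suite[];
}
interface Report {
  suites: Suite[];
}

// Walk nested suites and return "file > describe > test" for every passing spec.
function collectPassing(suites: Suite[], prefix: string[] = []): string[] {
  let passing: string[] = [];
  for (const suite of suites) {
    const path = prefix.concat(suite.title);
    for (const spec of suite.specs || []) {
      if (spec.ok) passing.push(path.concat(spec.title).join(" > "));
    }
    passing = passing.concat(collectPassing(suite.suites || [], path));
  }
  return passing;
}

// A regression is a test that passed in the baseline but does not pass now.
function findRegressions(baselinePassing: string[], report: Report): string[] {
  const nowPassing = collectPassing(report.suites);
  return baselinePassing.filter((t) => nowPassing.indexOf(t) === -1);
}

const sampleReport: Report = {
  suites: [{
    title: "journey_user_login.spec.ts",
    suites: [{
      title: "User Login Journey",
      specs: [
        { title: "User logs in with email", ok: true },
        { title: "User sees error on bad password", ok: false },
      ],
    }],
  }],
};

const regressions = findRegressions(
  [
    "journey_user_login.spec.ts > User Login Journey > User logs in with email",
    "journey_user_login.spec.ts > User Login Journey > User sees error on bad password",
  ],
  sampleReport,
);
```

A gate script would exit with a non-zero code whenever `regressions` is non-empty, which is what the inline `node -e` step does with its own data shape.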

Key Differences from Standard Pipeline

Standard Pipeline	Agent Teams Pipeline
5 jobs, run in parallel where `needs` allows	5 sequential gates
Journey tests run once	Tier 2 (wave) + Tier 3 (regression)
No baseline comparison	`.specflow/baseline.json` regression detection
Manual criticality filtering	Automatic tier enforcement

Note: The standard pipeline above works for most projects. The Agent Teams pipeline is for projects using persistent teammates and three-tier journey gates. See Agent Teams for details.

Summary

Principle	Implementation
Fail fast	`needs: contract-tests`
Local enforcement	Journey verification hooks
Remote enforcement	CI pipeline with branch protection
Criticality gating	`--grep "@critical"`
Save CI minutes	Skip E2E if contracts fail
Three-tier gates	Agent Teams mode with `.specflow/baseline.json`

The key insight: Contracts are cheap to check. Check them first, skip expensive tests on failure.