"Shift left" means running security checks earlier in the development lifecycle — during coding and code review rather than after deployment. The economic argument is straightforward: a vulnerability found during a pull request review costs a code change; the same vulnerability found in production costs an incident response, a patch, a deployment, customer communication, and potentially regulatory notification.
This post covers six categories of security scanning, where each fits in your CI/CD pipeline, how to manage false positives without training developers to ignore findings, and how to measure whether your security pipeline is actually working.
## The Six Scanning Categories

### 1. SAST — Static Application Security Testing
SAST tools analyze source code without executing it. They identify patterns that match known vulnerability classes: SQL injection, cross-site scripting, insecure deserialization, hardcoded credentials, path traversal.
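Most of those classes reduce to untrusted input reaching a sensitive sink. A minimal, runnable Python sketch of the SQL injection case, showing the string-concatenation pattern a SAST rule flags alongside the parameterized fix:

```python
import sqlite3

def find_user_unsafe(conn, username):
    # Vulnerable: user input is concatenated into the SQL string.
    # This is the shape a SAST rule matches ("query built from untrusted input").
    query = "SELECT id FROM users WHERE name = '" + username + "'"
    return conn.execute(query).fetchall()

def find_user_safe(conn, username):
    # Fixed: parameterized query; the driver handles escaping.
    return conn.execute("SELECT id FROM users WHERE name = ?", (username,)).fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER, name TEXT)")
conn.execute("INSERT INTO users VALUES (1, 'alice')")

# The classic injection payload returns every row from the unsafe query...
print(find_user_unsafe(conn, "' OR '1'='1"))  # [(1,)]
# ...but matches nothing when passed as a bound parameter.
print(find_user_safe(conn, "' OR '1'='1"))    # []
```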
Tool comparison:
| Tool | Language Support | CI Integration | Custom Rules | License |
|---|---|---|---|---|
| Semgrep | 30+ languages | GitHub Actions, GitLab CI, CLI | YAML-based (accessible) | OSS + commercial |
| SonarQube | 25+ languages | Plugins for most CI systems | Java-based | Community + commercial |
| CodeQL | 10+ languages | Native GitHub integration | QL query language (steep learning curve) | Free for OSS, paid for private |
Semgrep has become a common default choice, largely because of its approachable rule syntax. A custom Semgrep rule is a YAML file that most developers can read and write:
```yaml
rules:
  - id: hardcoded-database-password
    patterns:
      - pattern: |
          $DB_CONFIG = {
            ...,
            password: "...",
            ...
          }
    message: "Database password is hardcoded. Use environment variables or a secrets manager."
    severity: ERROR
    languages: [javascript, typescript]
    metadata:
      category: security
      cwe: "CWE-798"
      compliance: [pci-dss, soc2]
```
**False positive rates:** SAST tools produce the highest false positive rates of any scanning category. Semgrep averages 10–20% false positives with its default rulesets; SonarQube can reach 30–40% without tuning. This is the single biggest factor in developer adoption — if developers learn that most findings are noise, they stop reading the reports.
### 2. DAST — Dynamic Application Security Testing
DAST tools test running applications by sending crafted HTTP requests and analyzing responses. They find vulnerabilities that SAST cannot: server misconfigurations, authentication flaws, runtime injection vulnerabilities.
Key tools:
- OWASP ZAP: open-source, scriptable, headless mode for CI. The baseline scan runs in 2–5 minutes; the full active scan can take 30+ minutes.
- Burp Suite: commercial, more comprehensive scanning engine, CI integration via Burp Suite Enterprise.
DAST requires a running application, which means it fits later in the pipeline — typically against a staging or preview environment after deployment.
```yaml
# ZAP baseline scan in GitHub Actions
- name: DAST scan with ZAP
  uses: zaproxy/[email protected]
  with:
    target: "https://staging.example.com"
    rules_file_name: "zap-rules.tsv"
    fail_action: "warn"
    allow_issue_writing: false
```
The `rules_file_name` parameter points to a TSV file that configures which alerts cause failures vs. warnings. This is how you manage false positives in DAST — by tuning rule severities rather than suppressing individual findings.
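The rules file itself is plain TSV: one alert ID per line mapped to an action (for ZAP's baseline scan the recognized actions are IGNORE, WARN, and FAIL). The alert IDs and notes below are illustrative; take real IDs from ZAP's alert documentation or from a previous scan report.

```
10016	WARN	(Web Browser XSS Protection Not Enabled)
10020	FAIL	(Missing Anti-clickjacking Header)
10035	IGNORE	(Strict-Transport-Security header, handled at the CDN)
```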
### 3. Secret Scanning
Secret scanning detects credentials, API keys, tokens, and private keys committed to version control.
Tools:
- Gitleaks: scans git history and current files, configurable via TOML, works as a pre-commit hook and CI check.
- TruffleHog: scans git history with entropy-based detection plus regex patterns, supports scanning multiple VCS providers.
- GitHub Secret Scanning: native integration for GitHub repositories, automatic alerts for partner patterns (AWS keys, Stripe keys, etc.).
The pre-commit hook is the first line of defense — catching secrets before they enter git history:
```yaml
# .pre-commit-config.yaml
repos:
  - repo: https://github.com/gitleaks/gitleaks
    rev: v8.18.0
    hooks:
      - id: gitleaks
```
But pre-commit hooks run locally and can be bypassed (developers can pass `--no-verify`). The CI check is the enforcement layer:
```yaml
# GitHub Actions secret scanning
- name: Gitleaks scan
  uses: gitleaks/gitleaks-action@v2
  with:
    args: "--verbose --redact"
  env:
    GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
    GITLEAKS_ENABLE_COMMENTS: true
```
**Important:** if a secret is detected in git history, rotating the secret is mandatory — removing the commit (via git filter-branch or BFG Repo-Cleaner) is not sufficient because the secret may have been cloned, cached, or logged.
### 4. SCA — Software Composition Analysis
SCA tools analyze your dependency tree for known vulnerabilities (CVEs). The critical nuance is transitive dependencies — your application might directly depend on 50 packages, but the full dependency tree includes 500+ packages, and vulnerabilities in transitive dependencies are just as exploitable.
Tools:
- Snyk: commercial, covers npm, pip, Maven, Go, container images. Good at suggesting fix versions.
- Dependabot: native GitHub integration, automatic PRs for vulnerable dependencies.
- npm audit / pip-audit: built-in package manager tools, limited to their respective ecosystems.
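The transitive-dependency point is easy to see with a toy resolver. Given a dependency graph (a hand-written dict here; in practice you would parse a lockfile such as package-lock.json), a breadth-first search answers the triage question an SCA report raises: how does my app actually reach the vulnerable package? The package names below are purely illustrative.

```python
from collections import deque

# Illustrative dependency graph: package -> direct dependencies.
# In a real pipeline this would be parsed from a lockfile.
graph = {
    "my-app": ["express", "lodash"],
    "express": ["body-parser", "cookie"],
    "body-parser": ["qs"],
    "lodash": [],
    "cookie": [],
    "qs": [],
}

def dependency_path(graph, root, target):
    """Return the shortest chain from root to target, or None if unreachable."""
    queue = deque([[root]])
    seen = {root}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for dep in graph.get(path[-1], []):
            if dep not in seen:
                seen.add(dep)
                queue.append(path + [dep])
    return None

# "qs" is two hops away; a vulnerability there is still reachable from my-app.
print(" -> ".join(dependency_path(graph, "my-app", "qs")))
```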
```yaml
# Snyk in CI with severity threshold
- name: Snyk dependency test
  uses: snyk/actions/node@master
  with:
    args: --severity-threshold=high --fail-on=upgradable
  env:
    SNYK_TOKEN: ${{ secrets.SNYK_TOKEN }}
```
The `--fail-on=upgradable` flag is significant: it only fails the build when there is a known fix available. Failing on vulnerabilities with no available patch creates build failures that developers cannot resolve, which degrades trust in the pipeline.
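The same policy is easy to replicate for tools that lack such a flag. A sketch of a gate over a generic findings list (the record shape here is invented for illustration, not Snyk's actual report schema): fail only when a finding is both severe and actionable.

```python
def should_fail(findings, threshold=("high", "critical")):
    """Fail the build only for severe findings that the team can act on."""
    blocking = [
        f for f in findings
        if f["severity"] in threshold and f.get("fix_version")  # a fix exists
    ]
    for f in blocking:
        print(f"BLOCK: {f['package']} {f['id']} -> upgrade to {f['fix_version']}")
    return len(blocking) > 0

findings = [
    {"id": "CVE-2024-0001", "package": "libfoo", "severity": "high",
     "fix_version": "2.1.4"},   # blocks: high severity, fixable
    {"id": "CVE-2024-0002", "package": "libbar", "severity": "high",
     "fix_version": None},      # reported but not blocking: no patch yet
    {"id": "CVE-2024-0003", "package": "libbaz", "severity": "low",
     "fix_version": "0.9.1"},   # below the severity threshold
]
print(should_fail(findings))  # True: exactly one actionable finding
```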
### 5. Container Image Scanning
Container scanners analyze OS packages and application dependencies within container images.
Tools:
- Trivy: fast, covers OS packages + application dependencies, supports multiple output formats.
- Grype: from Anchore, similar coverage to Trivy, good SBOM (Software Bill of Materials) integration.
```yaml
# Trivy scan in CI
- name: Build Docker image
  run: docker build -t app:${{ github.sha }} .
- name: Trivy vulnerability scan
  uses: aquasecurity/trivy-action@master
  with:
    image-ref: "app:${{ github.sha }}"
    exit-code: 1
    severity: "CRITICAL,HIGH"
    format: "sarif"
    output: "trivy-results.sarif"
- name: Upload Trivy scan results
  uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: "trivy-results.sarif"
```
Uploading results in SARIF format integrates findings directly into GitHub's Security tab, providing a unified view of vulnerabilities alongside code scanning results.
### 6. Infrastructure as Code Scanning
IaC scanning catches cloud misconfigurations before `terraform apply` or `kubectl apply` runs.
Tools:
- Checkov: covers Terraform, CloudFormation, Kubernetes, Helm, Dockerfile. 1000+ built-in checks.
- tfsec (now part of Trivy): Terraform-focused, fast, good IDE integration.
```yaml
# Checkov in CI
- name: Checkov IaC scan
  uses: bridgecrewio/checkov-action@master
  with:
    directory: ./terraform
    framework: terraform
    check: CKV_AWS_18,CKV_AWS_19,CKV_AWS_145
    soft_fail: false
```
## Pipeline Architecture: Where Each Scan Fits
Not every scan belongs at every stage. The goal is to catch issues as early as possible while keeping the pipeline fast enough that developers don't bypass it.
```
┌──────────────────────────────────────────────────────────┐
│ PRE-COMMIT (developer machine)                           │
│ • Gitleaks (secret scanning) — 2-5 seconds               │
│ • Semgrep quick rules (top 20 patterns) — 5-10 seconds   │
├──────────────────────────────────────────────────────────┤
│ PR CHECK (CI, runs on every push to PR)                  │
│ • Full Semgrep SAST scan — 30-90 seconds                 │
│ • Snyk/npm audit dependency scan — 20-60 seconds         │
│ • Gitleaks (full repo scan) — 10-30 seconds              │
│ • Checkov/tfsec IaC scan — 15-45 seconds                 │
│ • Unit/integration tests — varies                        │
├──────────────────────────────────────────────────────────┤
│ MERGE GATE (CI, runs before merge to main)               │
│ • All PR checks must pass                                │
│ • Container image build + Trivy scan — 2-5 minutes       │
│ • DAST baseline scan against preview env — 3-5 minutes   │
├──────────────────────────────────────────────────────────┤
│ PRE-DEPLOY (CI, after merge to main)                     │
│ • Container image signing (cosign) — 30 seconds          │
│ • Image digest verification — 10 seconds                 │
│ • Final vulnerability gate check — 30 seconds            │
├──────────────────────────────────────────────────────────┤
│ POST-DEPLOY (production)                                 │
│ • Runtime security monitoring (Falco/Sysdig)             │
│ • Continuous DAST scanning (scheduled)                   │
│ • Log-based anomaly detection                            │
└──────────────────────────────────────────────────────────┘
```
**Key principle:** pre-commit and PR checks must be fast. If your security checks add more than 3 minutes to a PR build, developers will find ways to avoid them. Move heavyweight scans (full DAST, comprehensive container scanning) to merge gates where they run less frequently.
## False Positive Management
False positives are the primary reason security pipelines fail in practice. A scan that produces 50 findings where 40 are false positives trains developers to ignore all findings.
Strategies:
1. **Severity thresholds.** Only fail builds on HIGH and CRITICAL findings. Report MEDIUM and LOW as informational.
2. **Suppression files with expiration dates.** When a finding is reviewed and determined to be a false positive or accepted risk, document the decision:
```yaml
# .semgrep/ignore.yml
rules:
  - id: javascript.express.security.audit.xss.mustache-escape
    paths:
      - src/templates/admin-panel.js
    reason: "Admin panel is internal-only, behind VPN and SSO. Risk accepted by security team on 2025-09-15."
    expires: 2026-03-15
    approved_by: "security-team"
```
3. **Incremental scanning.** On PR checks, only scan changed files. This reduces both scan time and noise. Semgrep supports this with `--baseline-commit`:
```bash
semgrep --config=auto --baseline-commit=$(git merge-base HEAD origin/main)
```
4. **Centralized triage.** Route all findings to a security channel (Slack, Jira) rather than blocking individual PRs. Reserve build-breaking for CRITICAL severity findings only.
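Expiration dates only help if something enforces them. A small CI step can reject suppressions whose expiry has passed; this sketch assumes the suppression file has already been parsed (e.g. with PyYAML) into a list of dicts matching the format above.

```python
from datetime import date

def expired_suppressions(rules, today=None):
    """Return ids of suppressions whose expiry date has passed."""
    today = today or date.today()
    return [r["id"] for r in rules
            if date.fromisoformat(str(r["expires"])) < today]

# Illustrative parsed suppressions (same fields as the YAML example above).
suppressions = [
    {"id": "xss.mustache-escape", "expires": "2026-03-15",
     "approved_by": "security-team"},
    {"id": "old-crypto-exception", "expires": "2024-01-01",
     "approved_by": "security-team"},
]

stale = expired_suppressions(suppressions, today=date(2025, 9, 15))
print(stale)  # ['old-crypto-exception']; CI should fail until it is re-reviewed
```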
## Developer Experience
The security pipeline must be designed for the developer workflow, not against it.
**Speed:** total security check time per PR should be under 3 minutes. Parallelize scans that don't depend on each other:
```yaml
jobs:
  sast:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Semgrep
        uses: returntocorp/semgrep-action@v1
  sca:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Snyk test
        uses: snyk/actions/node@master
  secrets:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Gitleaks
        uses: gitleaks/gitleaks-action@v2
  iac:
    runs-on: ubuntu-latest
    if: contains(github.event.pull_request.labels.*.name, 'infrastructure')
    steps:
      - uses: actions/checkout@v4
      - name: Checkov
        uses: bridgecrewio/checkov-action@master
```
**Clarity:** when a scan blocks a PR, the developer must understand what the finding is, why it matters, and how to fix it. Semgrep and Snyk both provide fix suggestions in their PR comments. If your tool doesn't, configure it to link to remediation documentation.
**Escape hatches:** provide a documented process for overriding a finding when there's a legitimate business reason. Require security team approval for overrides and track them centrally.
## Metrics: Measuring Security Pipeline Effectiveness
Track these metrics monthly:
| Metric | What It Measures | Target |
|---|---|---|
| Mean Time to Remediate (MTTR) | Average time from vulnerability detection to fix merged | Critical: < 7 days, High: < 30 days |
| Vulnerability Escape Rate | Percentage of production vulnerabilities not caught by CI scans | < 5% |
| Scan Coverage | Percentage of repositories with security scanning enabled | 100% |
| False Positive Rate | Percentage of findings marked as false positive after triage | < 15% |
| Developer Bypass Rate | Percentage of deployments that skipped security checks | 0% |
| Detection by Stage | Distribution of findings across pipeline stages | 70%+ caught at PR stage |
The detection-by-stage metric is the most informative. If most vulnerabilities are caught at the DAST or post-deploy stage, your SAST rules need improvement. If most are caught by pre-commit hooks, your pipeline is working as designed.
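Computing the distribution is trivial once each finding is tagged with the stage that caught it; the stage names and counts below are illustrative, not real data.

```python
def detection_by_stage(counts):
    """Percentage of findings caught at each pipeline stage."""
    total = sum(counts.values())
    return {stage: round(100 * n / total, 1) for stage, n in counts.items()}

# Illustrative monthly counts, keyed by the stage that caught each finding.
counts = {"pre-commit": 10, "pr-check": 60, "merge-gate": 20, "post-deploy": 10}
dist = detection_by_stage(counts)
print(dist)  # {'pre-commit': 10.0, 'pr-check': 60.0, 'merge-gate': 20.0, 'post-deploy': 10.0}

# The target from the table above: 70%+ caught at or before the PR stage.
early = round(dist["pre-commit"] + dist["pr-check"], 1)
print(f"caught by PR stage: {early}%")  # caught by PR stage: 70.0%
```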
## Case Study: Fintech API Platform — 5-Stage Security Pipeline

### Background
A fintech API platform processing card transactions (PCI-DSS scope) needed to demonstrate a comprehensive security pipeline to their QSA (Qualified Security Assessor). The platform comprised 12 microservices in a Kubernetes cluster, with 40+ deployments per week. The Stripe Systems team designed and implemented a 5-stage security pipeline.
### Pipeline Architecture

#### Stage 1: Pre-Commit
Gitleaks and Semgrep quick rules ran on the developer's machine. The Semgrep configuration targeted the 15 highest-confidence PCI-relevant patterns; three representative rules are shown here:
```yaml
# .semgrep/pci-quick.yml
rules:
  - id: pci-hardcoded-card-number
    patterns:
      - pattern-regex: '\b(?:4[0-9]{12}(?:[0-9]{3})?|5[1-5][0-9]{14}|3[47][0-9]{13})\b'
    message: "Possible hardcoded card number detected. Card data must never be stored in source code."
    severity: ERROR
    languages: [generic]
  - id: pci-unencrypted-sensitive-field
    patterns:
      - pattern: |
          $MODEL = {
            ...,
            $FIELD: { type: String, ... },
            ...
          }
      - metavariable-regex:
          metavariable: $FIELD
          regex: (cardNumber|cvv|pan|accountNumber|ssn)
    message: "Sensitive field '$FIELD' stored without encryption. Use application-level encryption for PCI-scoped data."
    severity: ERROR
    languages: [javascript, typescript]
  - id: pci-logging-sensitive-data
    patterns:
      - pattern: |
          console.log(..., $DATA, ...)
      - metavariable-regex:
          metavariable: $DATA
          regex: .*(card|pan|cvv|ssn|account).*
    message: "Potentially logging sensitive data. PCI-DSS Requirement 3.4 prohibits displaying full PAN in logs."
    severity: WARNING
    languages: [javascript, typescript]
```
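The card-number regex matches on digit shape alone, so test fixtures and random 16-digit identifiers can trigger it. A common refinement (not part of the team's rules above) is to post-process regex hits with a Luhn checksum, which real PANs satisfy; a sketch in Python, using a well-known test card number rather than real data:

```python
def luhn_valid(number: str) -> bool:
    """Luhn checksum: double every second digit from the right, sum, mod 10."""
    digits = [int(d) for d in number]
    checksum = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:   # every second digit from the right
            d *= 2
            if d > 9:
                d -= 9   # equivalent to summing the two digits of d
        checksum += d
    return checksum % 10 == 0

print(luhn_valid("4111111111111111"))  # True: a well-known Visa test number
print(luhn_valid("4111111111111112"))  # False: likely a random ID, not a card
```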
#### Stage 2: PR Check
The PR check ran the full Semgrep ruleset (OWASP Top 10 + PCI-specific rules + custom rules), Trivy for any Dockerfile changes, Snyk, and Gitleaks, with all four scans in parallel:
```yaml
# .github/workflows/pr-security.yml
name: PR Security Checks
on:
  pull_request:
    branches: [main, release/*]
jobs:
  semgrep:
    runs-on: ubuntu-latest
    container:
      image: returntocorp/semgrep
    steps:
      - uses: actions/checkout@v4
      - run: |
          semgrep --config=p/owasp-top-ten \
            --config=p/secrets \
            --config=.semgrep/ \
            --baseline-commit=${{ github.event.pull_request.base.sha }} \
            --sarif --output=semgrep.sarif
      - uses: github/codeql-action/upload-sarif@v3
        with:
          sarif_file: semgrep.sarif
  trivy-fs:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: aquasecurity/trivy-action@master
        with:
          scan-type: "fs"
          scan-ref: "."
          exit-code: 1
          severity: "CRITICAL,HIGH"
  snyk:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20
      - run: npm ci
      - uses: snyk/actions/node@master
        with:
          args: --severity-threshold=high
        env:
          SNYK_TOKEN: ${{ secrets.SNYK_TOKEN }}
  gitleaks:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0
      - uses: gitleaks/gitleaks-action@v2
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
```
#### Stage 3: Merge Gate (DAST)
After PR approval, a merge-triggered workflow deployed the branch to a preview environment and ran a ZAP baseline scan:
```yaml
dast:
  runs-on: ubuntu-latest
  needs: [deploy-preview]
  steps:
    - uses: zaproxy/[email protected]
      with:
        target: "https://preview-${{ github.event.pull_request.number }}.staging.example.com"
        rules_file_name: ".zap/rules.tsv"
        cmd_options: >-
          -j
          -z "-config api.disablekey=true
          -config spider.maxDuration=2
          -config scanner.maxScanDuration=5"
        fail_action: "warn"
    - name: Parse ZAP results
      run: |
        HIGHS=$(jq '[.site[].alerts[] | select(.riskcode == "3")] | length' zap-report.json)
        if [ "$HIGHS" -gt 0 ]; then
          echo "::error::ZAP found $HIGHS high-severity alerts"
          exit 1
        fi
```
#### Stage 4: Pre-Deploy (Image Signing)
After merge, the production image was built, scanned, and signed with cosign:
```yaml
sign-image:
  runs-on: ubuntu-latest
  needs: [build, trivy-image]
  steps:
    - name: Install cosign
      uses: sigstore/cosign-installer@v3
    - name: Sign container image
      run: |
        cosign sign --yes \
          --key env://COSIGN_PRIVATE_KEY \
          ${{ env.REGISTRY }}/${{ env.IMAGE }}@${{ steps.build.outputs.digest }}
      env:
        COSIGN_PRIVATE_KEY: ${{ secrets.COSIGN_PRIVATE_KEY }}
```
The Kubernetes deployment was configured to verify image signatures via a policy controller, preventing unsigned images from running in production.
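One way to express the admission-control side is a Kyverno verifyImages policy. The sketch below is an illustration under assumptions (policy name, registry path, and the inline public key are placeholders), not the team's actual configuration:

```yaml
apiVersion: kyverno.io/v1
kind: ClusterPolicy
metadata:
  name: require-signed-images    # placeholder name
spec:
  validationFailureAction: Enforce
  rules:
    - name: verify-cosign-signature
      match:
        any:
          - resources:
              kinds: [Pod]
      verifyImages:
        - imageReferences:
            - "registry.example.com/payments/*"   # placeholder registry path
          attestors:
            - entries:
                - keys:
                    publicKeys: |-
                      -----BEGIN PUBLIC KEY-----
                      ...cosign public key...
                      -----END PUBLIC KEY-----
```

With a policy like this in place, the admission controller rejects any Pod whose image signature cannot be verified against the pinned public key.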
#### Stage 5: Post-Deploy (Runtime Scanning)
Falco monitored runtime behavior in the production cluster:
```yaml
# falco-rules.yaml
- rule: PCI Sensitive File Access
  desc: Detect access to files containing card data
  condition: >
    open_read and
    (fd.name startswith /data/cards or
     fd.name startswith /data/transactions) and
    not proc.name in (payment-service, encryption-service)
  output: >
    Unauthorized access to PCI-scoped file
    (user=%user.name command=%proc.cmdline file=%fd.name container=%container.name)
  priority: CRITICAL
  tags: [pci, filesystem]
```
### Metrics Dashboard
After 3 months of operation, the security pipeline produced these results:
| Metric | Value |
|---|---|
| Total vulnerabilities detected | 247 |
| Caught at pre-commit | 31 (12.5%) |
| Caught at PR check | 168 (68.0%) |
| Caught at merge gate (DAST) | 29 (11.7%) |
| Caught at pre-deploy | 12 (4.9%) |
| Caught post-deploy | 7 (2.8%) |
| False positive rate | 11.3% |
| Mean time to remediate (Critical) | 3.2 days |
| Mean time to remediate (High) | 18.7 days |
| Developer bypass rate | 0% |
The 68% detection rate at the PR check stage validated the pipeline architecture — developers received findings in their PR context where remediation was cheapest. The 2.8% escape rate to post-deploy (7 findings) consisted of 4 runtime-specific issues that SAST/DAST couldn't detect and 3 findings in third-party components disclosed after the scan.
The QSA accepted the pipeline documentation, scan results, and metrics dashboard as evidence satisfying PCI-DSS Requirements 6.3 (security vulnerabilities management), 6.5 (secure development), and 11.3 (penetration testing and vulnerability scanning).
### Key Implementation Decisions
**Why Semgrep over SonarQube:** the PCI-specific custom rules were faster to write in Semgrep's YAML format. The team had 15 custom rules operational within 2 days. Equivalent SonarQube custom rules would have required Java development and plugin packaging.
**Why baseline scanning for PRs:** running Semgrep with `--baseline-commit` limited findings to code changed in the PR. Without this, every PR in a legacy codebase would inherit hundreds of pre-existing findings, making the reports unusable.
**Why ZAP at merge gate, not PR check:** the ZAP baseline scan added 3–5 minutes. At the PR stage, this delay would apply to every push. At the merge gate, it runs once per PR — an acceptable tradeoff for DAST coverage.
**Why image signing:** the fintech platform's PCI assessor specifically asked how they prevent unauthorized container images from running in production. Cosign with a Kubernetes admission controller provided a verifiable chain: the image was built by CI, scanned by Trivy, and signed before deployment. No unsigned image could be scheduled.
## Summary
A security pipeline is a layered detection system. No single tool catches every vulnerability. The goal is to construct overlapping layers where each tool's blind spots are covered by another, and the pipeline is fast enough that developers treat it as a natural part of their workflow rather than an obstacle to work around.
Start with secret scanning and SAST at the PR stage — these have the best effort-to-value ratio. Add SCA and container scanning once the first two are stable. DAST and runtime scanning are the final layers, providing defense against vulnerability classes that static analysis cannot reach.