Traffic Management - FluxGate

Open-source API gateway for microservices

fluxctl apply -f traffic-policy.yaml --namespace production
Read Routing Docs View Live Metrics

Architecture & Control

Intelligent Routing & Load Distribution

FluxGate replaces static round-robin routing with weight-aware, health-check-driven load balancing that adapts to real-time service latency and error rates. Combined with granular rate limiting and progressive traffic shifting, your microservices stay resilient under peak loads.

Our traffic engine evaluates upstream health every 200ms, automatically draining nodes that exceed a 95th percentile latency of 450ms or return 5xx errors above 2.5%. Rate limiting operates at both the client IP and API key level, supporting sliding-window counters with Redis-backed synchronization across gateway replicas. Canary deployments leverage weighted routing rules to safely direct 5–15% of production traffic to new service versions while monitoring error budgets and response times.

Configuration

Kubernetes Traffic Policy Definition

Define routing weights, rate limits, and canary rules directly in your cluster manifests. FluxGate reconciles these policies in under 300ms without requiring pod restarts.

apiVersion: fluxgate.io/v1beta1
kind: TrafficPolicy
metadata:
  name: checkout-service-policy
  namespace: payments
spec:
  targetRef:
    apiGroup: apps/v1
    kind: Deployment
    name: checkout-service
  loadBalancing:
    algorithm: weighted-round-robin
    healthCheck:
      interval: 200ms
      unhealthyThreshold: 3
  rateLimit:
    requestsPerSecond: 1200
    burstSize: 150
    keyExtractor: header.X-Client-Id
  canary:
    enabled: true
    weight: 10
    match:
      - header:
          name: x-canary
          value: "true"
    metrics:
      targetErrorRate: 0.01
      targetP99Latency: 300ms

Operational Impact

Why Teams Choose FluxGate

Zero-Downtime Rollouts

Shift traffic incrementally using header, cookie, or random-weight matching. Automatically revert canaries when error budgets drop below 98.5% availability thresholds.

Distributed Rate Limiting

Enforce consistent quotas across multi-region deployments with Redis-backed sliding windows. Protects downstream databases from query spikes during flash sales or API abuse.

Adaptive Load Balancing

Dynamically adjusts routing weights based on real-time CPU, memory, and request latency metrics. Reduces tail latency by up to 40% in heterogeneous cluster environments.