Service Mesh Basics: What It Is and When You Need It

Service mesh is either the solution to all your microservices problems or unnecessary complexity you don’t need. Here’s how to tell which.

What a Service Mesh Does

A service mesh handles cross-cutting concerns for service-to-service communication:

Traffic management — Load balancing, routing, retries
Security — mTLS, authorization policies
Observability — Metrics, tracing, logging
Resilience — Circuit breakers, timeouts, fault injection

Instead of implementing these in every service, the mesh handles them at the infrastructure layer.

How It Works

Every service gets a sidecar proxy (usually Envoy). The proxy intercepts all traffic and applies policies. A control plane configures all the proxies.

Istio Quick Start

Install

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
# Download istioctl
curl -L https://istio.io/downloadIstio | sh -
cd istio-*
export PATH=$PWD/bin:$PATH

# Install with demo profile
istioctl install --set profile=demo -y

# Enable sidecar injection for namespace
kubectl label namespace default istio-injection=enabled

Deploy an App

1
2
3
4
5
6
7
# Deploy normally - sidecars are injected automatically
kubectl apply -f my-app.yaml

# Verify sidecars
kubectl get pods
# NAME                    READY   STATUS
# my-app-xxx              2/2     Running  # 2 containers = app + sidecar

Traffic Management

Virtual Service (Routing Rules)

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: reviews
spec:
  hosts:
    - reviews
  http:
    - match:
        - headers:
            end-user:
              exact: jason
      route:
        - destination:
            host: reviews
            subset: v2
    - route:
        - destination:
            host: reviews
            subset: v1

User “jason” sees v2, everyone else sees v1.

Destination Rule (Load Balancing)

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: reviews
spec:
  host: reviews
  trafficPolicy:
    loadBalancer:
      simple: ROUND_ROBIN
  subsets:
    - name: v1
      labels:
        version: v1
    - name: v2
      labels:
        version: v2

Canary Deployments

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: my-app
spec:
  hosts:
    - my-app
  http:
    - route:
        - destination:
            host: my-app
            subset: v1
          weight: 90
        - destination:
            host: my-app
            subset: v2
          weight: 10

10% of traffic goes to v2.

Retries and Timeouts

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: ratings
spec:
  hosts:
    - ratings
  http:
    - route:
        - destination:
            host: ratings
      timeout: 10s
      retries:
        attempts: 3
        perTryTimeout: 2s
        retryOn: 5xx,reset,connect-failure

Security

mTLS (Automatic Encryption)

1
2
3
4
5
6
7
8
apiVersion: security.istio.io/v1beta1
kind: PeerAuthentication
metadata:
  name: default
  namespace: istio-system
spec:
  mtls:
    mode: STRICT  # All traffic must be mTLS

All service-to-service traffic is now encrypted and authenticated.

Authorization Policies

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
apiVersion: security.istio.io/v1beta1
kind: AuthorizationPolicy
metadata:
  name: api-access
  namespace: production
spec:
  selector:
    matchLabels:
      app: api
  rules:
    - from:
        - source:
            principals: ["cluster.local/ns/production/sa/web"]
      to:
        - operation:
            methods: ["GET", "POST"]
            paths: ["/api/*"]

Only the web service account can call the API.

Observability

Metrics

Istio automatically collects:

Request count, duration, size
Response codes
Connection metrics

Access via Prometheus:

1
kubectl port-forward svc/prometheus -n istio-system 9090:9090

Distributed Tracing

1
2
3
4
5
# Install Jaeger
kubectl apply -f https://raw.githubusercontent.com/istio/istio/release-1.20/samples/addons/jaeger.yaml

# Access UI
kubectl port-forward svc/tracing -n istio-system 16686:80

Traces show the full path of requests across services.

Kiali Dashboard

1
2
kubectl apply -f https://raw.githubusercontent.com/istio/istio/release-1.20/samples/addons/kiali.yaml
kubectl port-forward svc/kiali -n istio-system 20001:20001

Visual service graph, traffic flow, health status.

Circuit Breakers

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: reviews
spec:
  host: reviews
  trafficPolicy:
    connectionPool:
      tcp:
        maxConnections: 100
      http:
        h2UpgradePolicy: UPGRADE
        http1MaxPendingRequests: 100
        http2MaxRequests: 1000
    outlierDetection:
      consecutive5xxErrors: 5
      interval: 30s
      baseEjectionTime: 30s
      maxEjectionPercent: 50

Ejects unhealthy instances after 5 consecutive 5xx errors.

When You Don’t Need a Service Mesh

Skip the mesh if:

You have < 10 services
Traffic patterns are simple
You can handle retries/timeouts in code
You don’t need mTLS between services
Observability tools already work

The overhead:

Increased latency (small, but exists)
More resource usage (sidecars need CPU/memory)
Operational complexity
Learning curve

Alternatives

Linkerd (Simpler)

1
2
3
4
5
6
7
# Install
curl --proto '=https' -sL https://run.linkerd.io/install | sh
linkerd install --crds | kubectl apply -f -
linkerd install | kubectl apply -f -

# Inject sidecars
kubectl get deploy -o yaml | linkerd inject - | kubectl apply -f -

Linkerd is lighter weight, easier to operate, fewer features.

No Mesh (Libraries)

Handle concerns in application code:

Retries: tenacity (Python), resilience4j (Java)
mTLS: Application-level certificates
Tracing: OpenTelemetry SDK
Metrics: Prometheus client libraries

More work per service, but no infrastructure overhead.

The Service Mesh Decision

Need	Without Mesh	With Mesh
Retries	Code in each service	Config once
mTLS	Manual cert management	Automatic
Traffic splitting	Complex routing	Simple YAML
Observability	Instrument each service	Automatic
Authorization	Each service checks	Centralized policies

Start without a mesh. Add one when you genuinely need:

Zero-trust security (mTLS everywhere)
Fine-grained traffic control
Consistent observability across many services
Policy enforcement at the infrastructure layer

The Service Mesh Checklist

Before adopting:

Have 10+ services that communicate
Need mTLS between all services
Want canary/blue-green without code changes
Need consistent retry/timeout policies
Team has capacity to learn and operate

After adopting:

Sidecar injection enabled
mTLS mode configured
Basic routing rules in place
Observability dashboards accessible
Team trained on troubleshooting

A service mesh is powerful, but power has a price. Make sure you’re buying something you’ll actually use.

What a Service Mesh Does#

How It Works#

Istio Quick Start#

Install#

Deploy an App#

Traffic Management#

Virtual Service (Routing Rules)#

Destination Rule (Load Balancing)#

Canary Deployments#

Retries and Timeouts#

Security#

mTLS (Automatic Encryption)#

Authorization Policies#

Observability#

Metrics#

Distributed Tracing#

Kiali Dashboard#

Circuit Breakers#

When You Don’t Need a Service Mesh#

Alternatives#

Linkerd (Simpler)#

No Mesh (Libraries)#

The Service Mesh Decision#

The Service Mesh Checklist#

📬 Get the Newsletter