Kubernetes

When to use a StatefulSet in Kubernetes

Justin VanWinkle

Jan 3, 2021 — 2 min read

To put it simply, a StatefulSet should be used to govern one or more related pods that need to track state in some way. A StatefulSet would be what you use instead of a Deployment.

StatefulSets are a particularly cool implementation detail among Kubernetes workload APIs. It works very similarly to a Deployment, but it has additional constraints that it must apply to ensure that its pods are both stable and ordered. In other words, the pods in a StatefulSet must be consistently accessible via unique network identifiers instead of IP addresses and they must have a graceful and ordered way of deploying and scaling. Let's take a look at how this is all accomplished.

Network Identifiers

StatefulSets depend on the implementation of a Service, currently, to map network identities to pods. The service definition itself is, unfortunately, not created by the StatefulSet, but it isn't too difficult to add into your manifest. Here's an example of a service definition that can be used for a StatefulSet:

apiVersion: v1
kind: Service
metadata:
  name: nginx
  labels:
    app: nginx
spec:
  ports:
  - port: 80
    name: web
  clusterIP: None
  selector:
    app: nginx

This service will be referenced by the Stateful set under .spec.selector.matchLabels and .spec.template.metadata.labels. This will allow the StatefulSet to make use of the service for mapping to the pods. But the main reason I wanted to share it is to express how similar it is to the services you are already familiar with.

Deploying and Scaling

While there are some caveats, these are pretty cut and dry. Here are the highlights:

Pods are created in a sequential order. (Think of a stack)
Pods are terminated in reverse order. (Like popping off the stack)
Scaling up or down can only happen if all previous pods are in the Running and Ready states. (We can't push onto nor pop off of the stack unless everything is completely healthy and capable of serving requests – i.e. all data has been cloned in the case of a database)
A pod cannot be terminated unless it is the oldest one (Again, like popping off a stack)

Happy Kuberneting!

AI in Retail: Transformative Use Cases, Success Stories, and Challenges

The retail industry is witnessing a profound transformation through the integration of Artificial Intelligence (AI). From personalized shopping experiences to supply chain optimization, AI is redefining how retailers operate and interact with customers. In this blog post, we’ll explore various use cases of AI in retail, share some success

Mastering Customer Interviews: Best Practices and Real-World Insights for Product Managers

In the dynamic world of product management, knowing your market and your customers is crucial. This involves in-depth research, data analysis, and most importantly, conducting effective customer interviews. Customer interviews provide invaluable insights into your users' needs, pain points, and the overall product experience. In this blog post, we

Streamlining AI Workflows with Apache Airflow: A Comprehensive Technical Guide

In the burgeoning field of artificial intelligence (AI), the challenge of integrating various machine learning (ML) libraries and frameworks into a cohesive pipeline often emerges. This is where Apache Airflow shines. Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. Originally developed by Airbnb, it has

Getting Started with Terraform: Managing Cloud Infrastructure as Code

In the rapidly evolving landscape of cloud-native technologies, infrastructure as code (IaC) has become a cornerstone for managing and provisioning cloud infrastructure. One of the most popular IaC tools is HashiCorp's Terraform. In this blog post, we will explore Terraform's capabilities, provide a step-by-step guide to