Making sense of Kubernetes (Credit: everything here is from Boot.Dev)

Node: the Kubernetes word for a computer (can be a VM, physical hardware, etc.)
Pod: the smallest and simplest unit in the Kubernetes object model that you create or deploy. It represents one (or sometimes more) running container(s) in a cluster.
Ephemeral: a fancy word for "temporary". Pods are designed to be spun up, torn down, and restarted at a moment's notice, which promotes immutability as well.
ReplicaSet: Maintains a stable set of replica Pods running at any given time. It's the thing that makes sure that the number of Pods you want running is the same as the number of Pods that are actually running. (You will probably never use ReplicaSets directly)
Thrashing Pods: pods that keep crashing and being restarted.

Usually caused by:
- bug in the image
- misconfigured app
- dependency of app is misconfigured
- app using too much memory
CrashLoopBackOff: means a container is crashing repeatedly. Kubernetes is all about building self-healing systems, so it will automatically restart the container. However, each time the restarted container crashes again, Kubernetes waits longer and longer before the next restart. That's why it's called a "backoff".
ConfigMap: stores configuration as key-value pairs. ConfigMaps are not cryptographically secure, so use Kubernetes Secrets for sensitive data.
Services: provide a stable endpoint for pods (the service stays available at the same endpoint even if pods are destroyed and recreated) and load balance traffic across a group of pods.

Service Type (spec/type)
- ClusterIP: Exposes the Service on a cluster-internal IP. Choosing this value makes the Service only reachable from within the cluster
- NodePort: Exposes the Service on each Node's IP at a static port
- LoadBalancer: Exposes the Service externally using an external load balancer (if supported, e.g. AWS, GCP, Azure, or your own)
- ExternalName: Maps the Service to the contents of the externalName field (for example, to the hostname api.foo.bar.example). The mapping configures your cluster's DNS server to return a CNAME record with that external hostname value. No proxying of any kind is set up (DNS level redirect, can be used to redirect traffic from one service to another)
Interesting thing: they are built on top of each other.

NodePort = ClusterIP + exposing the service on each node's IP at a static port

LoadBalancer = NodePort + external load balancer

ClusterIP is the usual go-to. Use NodePort and LoadBalancer when you want to expose a service to the outside world, and ExternalName for DNS redirects.
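
To make the layering concrete, here is a minimal sketch of a NodePort Service. The names `web-service` and `web` and all port numbers are made up for illustration. Changing `type` to `ClusterIP` (and dropping `nodePort`) keeps the Service internal to the cluster.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: web-service        # hypothetical name
spec:
  type: NodePort           # still gets a cluster-internal IP, plus a port on every node
  selector:
    app: web               # hypothetical label; must match the label on the pods
  ports:
    - protocol: TCP
      port: 80             # port the Service listens on inside the cluster
      targetPort: 8080     # port the container listens on (assumed)
      nodePort: 30080      # static port on each node; the default allowed range is 30000-32767
```
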
Ingress: exposes services to the outside world by routing external HTTP(S) traffic to them based on rules such as host and path.
It's important to remember that while it's common for a pod to run just a single container, multiple containers can run in a single pod. This is useful when you have containers that need to share resources. In other words, we can scale up the instances of an application either at the container level or at the pod level.
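
As a rough sketch of two containers sharing a resource, the Pod below mounts the same `emptyDir` volume into both containers. The pod name, container names, and file path are hypothetical, and `busybox` is just a small image picked for the example.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: shared-files-demo          # hypothetical name
spec:
  volumes:
    - name: shared-data
      emptyDir: {}                 # scratch volume that lives as long as the pod does
  containers:
    - name: writer
      image: busybox:1.36
      # writes the current date into the shared volume every 5 seconds
      command: ["sh", "-c", "while true; do date > /data/now.txt; sleep 5; done"]
      volumeMounts:
        - name: shared-data
          mountPath: /data
    - name: reader
      image: busybox:1.36
      # prints whatever the writer last wrote
      command: ["sh", "-c", "while true; do cat /data/now.txt; sleep 5; done"]
      volumeMounts:
        - name: shared-data
          mountPath: /data
```

Both containers are scheduled together on the same node and are started and stopped as a unit.
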
Persistent Volumes: Instead of simply adding a volume to a deployment, a persistent volume is a cluster-level resource that is created separately from the pod and then attached to the pod. PVs can be created statically or dynamically.
- Static PVs are created manually by a cluster admin
- Dynamic PVs are created automatically when a pod requests a volume that doesn't exist yet
Generally speaking, and especially in the cloud-native world, we want to use dynamic PVs. It's less work and more flexible.
Persistent Volume Claim: A persistent volume claim is a request for a persistent volume. When using dynamic provisioning, a PVC will automatically create a PV if one doesn't exist that matches the claim.
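
A minimal PVC sketch, assuming the cluster has a default StorageClass that supports dynamic provisioning. The name `api-pvc` and the 1Gi size are placeholders.

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: api-pvc              # hypothetical name
spec:
  accessModes:
    - ReadWriteOnce          # mountable read-write by a single node
  resources:
    requests:
      storage: 1Gi           # how much storage to ask for
  # no storageClassName given, so the cluster's default StorageClass is used
```

A pod then refers to the claim by name under `volumes` with `persistentVolumeClaim.claimName`, and mounts it with `volumeMounts` like any other volume.
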
Namespace: a way to isolate cluster resources into groups. Names of resources need to be unique within a namespace, but not across namespaces. Namespaces cannot be nested inside one another, and each Kubernetes resource can only be in one namespace.
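
A small sketch of the uniqueness rule: two ConfigMaps with the same name can coexist because they live in different namespaces. The ConfigMap name `app-config` and its keys are invented for the example; `crawler` is used as the namespace name.

```yaml
apiVersion: v1
kind: Namespace
metadata:
  name: crawler
---
apiVersion: v1
kind: ConfigMap
metadata:
  name: app-config           # hypothetical name
  namespace: crawler
data:
  MODE: "crawl"
---
apiVersion: v1
kind: ConfigMap
metadata:
  name: app-config           # same name, allowed because the namespace differs
  namespace: default
data:
  MODE: "serve"
```
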

Kubernetes makes it really easy for pods to communicate with each other. It does this by automatically creating DNS entries for each service. The format is:
```
<service-name>.<namespace>.svc.cluster.local
```
In reality, the .svc.cluster.local isn't needed in most scenarios.
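
For illustration, here is how those names might show up as env variables on a pod in the `default` namespace. The service names `api-service` and `crawler-service` and the variable names are hypothetical.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: dns-demo                   # hypothetical name
  namespace: default
spec:
  containers:
    - name: web
      image: docker.io/username/some-docker-image:latest   # placeholder image
      env:
        - name: API_URL
          # same namespace as this pod: the bare service name is enough
          value: "http://api-service"
        - name: CRAWLER_URL
          # different namespace: add the namespace
          value: "http://crawler-service.crawler"
        - name: CRAWLER_URL_FULL
          # fully qualified form, works from any namespace
          value: "http://crawler-service.crawler.svc.cluster.local"
```
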

Unless a service really needs to be made available to the outside world, it's better to keep it internal to the cluster. Internal communications are great because:
1. It's faster (assuming nodes are close to each other physically)
2. No public DNS is required
3. Communication is inherently more secure because it runs on an internal network (usually don't even need HTTPS)
Horizontal Pod Autoscaler (HPA): automatically scales the number of Pods in a Deployment based on observed CPU utilization or other custom metrics. It's very common in a Kubernetes environment to start with a low number of pods in a deployment, and then scale up the number of pods automatically as CPU usage increases.
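
A minimal HPA sketch using the `autoscaling/v2` API. The target deployment name `web`, the replica bounds, and the 50% threshold are assumptions chosen for the example. CPU-based scaling also assumes a metrics source such as metrics-server is running, and that the pods set CPU requests.

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: web-hpa                    # hypothetical name
spec:
  scaleTargetRef:                  # the Deployment to scale
    apiVersion: apps/v1
    kind: Deployment
    name: web                      # hypothetical deployment name
  minReplicas: 1
  maxReplicas: 4
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 50   # percent of the pods' CPU request
```
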
Node Types: there are two types:

Control Plane: The control plane is responsible for managing the cluster. It's where the API server, scheduler, and controller manager live. The control plane used to be called "master nodes", but that term is deprecated now.

Worker Nodes: They're the machines that are actually running our containers.

We're usually concerned with scaling out worker nodes and making sure they're healthy. The control plane is fairly static.
Resource requests: allow us to tell Kubernetes up front how much of a resource a pod requires. If we try to schedule a new pod with resource requests that exceed what a node has available, k8s will either place the pod on another node in the cluster that has at least that amount available, or gracefully tell us it doesn't have enough resources to do so.

General tips:
- Set memory requests ~10% higher than the average memory usage of your pods
- Set CPU requests to 50% of the average CPU usage of your pods
- Set memory limits ~100% higher than the average memory usage of your pods
- Set CPU limits ~100% higher than the average CPU usage of your pods
Because:
- Memory is the scariest resource to run out of. If you run out of CPU, your pods will just slow down. If you run out of memory, your pods will crash. For that reason, it's more important to add a buffer to your memory requests than your CPU requests.
- Limits should only take effect when a pod is using more resources than it should. Limits are like a safety net. If your limits are constantly being hit, you should either increase them or fix your application code so that it uses fewer resources.
  
  As such, limits should generally be set higher than requests.
- Because requests are used to schedule pods, you want to make sure that your requests are high enough that once scheduled, your pods will have the resources, but not so high that you're wasting resources. If you set your requests too high, you'll end up with a situation where you can't schedule pods because k8s thinks it doesn't have enough resources, even though it does.
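
A worked example of the tips above, assuming a pod that averages 200Mi of memory and 100m of CPU (numbers invented for the arithmetic). The pod and container names are hypothetical.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: sizing-demo                # hypothetical name
spec:
  containers:
    - name: app
      image: docker.io/username/some-docker-image:latest   # placeholder image
      resources:
        requests:
          memory: 220Mi            # ~10% above the 200Mi average
          cpu: 50m                 # 50% of the 100m average
        limits:
          memory: 400Mi            # ~100% above the 200Mi average
          cpu: 200m                # ~100% above the 100m average
```
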

YAML stuff, because why not

apiVersion: apps/v1 - Specifies the version of the Kubernetes API you're using to create the object (e.g., apps/v1 for Deployments).
kind: Deployment - Specifies the type of object you're configuring
metadata: Metadata about the deployment, like when it was created, its name, and its ID
- kubectl.kubernetes.io/last-applied-configuration: will not be there if we do kubectl create deployment, but will be there after we do kubectl apply
- annotations: The core Kubernetes API is intentionally kept small. Instead of adding a bunch of new fields to the core API, Kubernetes allows you to add arbitrary annotations to your resources, and then various extensions can read those annotations and do things with them. In most production deployments you'll be using annotations specific to your cloud provider. Each major cloud provider has its own products, so you need to use the k8s annotations and extensions specific to that provider.

Deployments

spec: The desired state of the deployment. Things like how many replicas you want are set here.
- replicas: Number of replicas
- selector/matchLabels/app: must match template/metadata/labels/app, so the deployment can find its pods
- template/metadata/labels/app: the label put on each pod, usually the same value as metadata/labels/app
- template/spec/containers: stuff like name, image, env or envFrom
  - env: stuff like name, valueFrom
    - valueFrom/configMapKeyRef: specifies where in a ConfigMap to get the value from, includes name, key
  - envFrom/configMapRef: loads every key in the ConfigMap as an env variable, so compared to env we don't have to list each one
  - resources
    - requests: stuff like cpu, memory
    - limits: stuff like cpu, memory
status: The current state of the deployment. You won't edit this directly, it's just for you to see what's going on with your deployment.
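
Putting those fields together, a minimal Deployment might look roughly like this. The names `web` and `web-config`, the key `PORT`, and the resource numbers are all placeholders, and the ConfigMap is assumed to already exist.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: web                        # hypothetical name
  labels:
    app: web
spec:
  replicas: 2
  selector:
    matchLabels:
      app: web                     # must match the pod template labels below
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: docker.io/username/some-docker-image:latest   # placeholder image
          env:
            - name: PORT           # one variable, pulled from a single ConfigMap key
              valueFrom:
                configMapKeyRef:
                  name: web-config
                  key: PORT
          envFrom:
            - configMapRef:        # every key in the ConfigMap becomes an env variable
                name: web-config
          resources:
            limits:
              cpu: 200m
              memory: 256Mi
```

`env` and `envFrom` are shown side by side only for comparison. In practice you would normally pick one of them for a given ConfigMap.
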

ConfigMap

data: where you specify any key-value pairs
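
A minimal ConfigMap sketch. The name and keys are made up. Values under `data` must be strings, which is why the number is quoted.

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: web-config           # hypothetical name
data:
  PORT: "8080"               # quoted so YAML treats it as a string
  LOG_LEVEL: "info"
```
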

Services

spec: stuff like ports, selector/app
- selector/app: This should match the metadata/labels/app in Deployments
- ports: stuff like protocol
  - port: the service will listen on this port
  - targetPort: traffic will be forwarded to this port in the pods
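
A minimal Service sketch showing how `port` and `targetPort` relate. The names and port numbers are hypothetical. With no `type` given, the Service defaults to ClusterIP.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: api-service          # hypothetical name
spec:
  selector:
    app: api                 # hypothetical label; must match the pods' label
  ports:
    - protocol: TCP
      port: 80               # other pods call the service on this port
      targetPort: 8080       # traffic is forwarded to this port on the pods
```
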

Ingress

spec
- rules
  - host: specifies the host this rule is for
  - http
    - paths: like path or pathType
      - backend: specifies the backend this host should resolve to, stuff like service.name, service.port.number
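
A minimal Ingress sketch using the `networking.k8s.io/v1` API. The host `example.internal`, the service name, and the port are placeholders, and an ingress controller is assumed to be running in the cluster.

```yaml
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: app-ingress                  # hypothetical name
spec:
  rules:
    - host: example.internal         # hypothetical host this rule applies to
      http:
        paths:
          - path: /
            pathType: Prefix         # match every path starting with "/"
            backend:
              service:
                name: web-service    # hypothetical Service to route to
                port:
                  number: 80
```
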

Kubectl

kubectl get deployments: list the deployments

kubectl create deployment {some-deployment-name-web} --image={docker.io/username/some-docker-image:latest}: create a deployment, needs a name and the ID of a Docker image

kubectl get pods

use -o wide to get a wide output, including ip address
```
kubectl get pods -o wide
```
kubectl port-forward {pod-name OR service/{service-name}} 8080:8080
kubectl edit deployment {deployment-name}
kubectl delete pod {pod-name}

kubectl logs {pod-name}

 `--all-containers`: if there are multiple containers running in the same pod, show the logs for all of them

kubectl proxy: start a proxy server on your local machine
kubectl get {replicasets, svc OR service, pvc, pv, namespace OR ns, hpa}
kubectl apply -f {configuration}.yaml
kubectl port-forward service/{service-name} 8080:8080
kubectl create ns crawler: create a namespace named crawler
minikube addons enable {stuff}
1. ingress: enable the ingress service
2. metrics-server: enable the metrics-server
  1. kubectl top pod: see the resource that each pod is using
kubectl describe pods

Minikube

Minikube runs a single node cluster, compared to production kubernetes clusters which are multi-node and distributed

minikube start
minikube stop
minikube delete
minikube dashboard: Open a browser window with a locally hosted dashboard for your cluster. You can use this dashboard to view and manage your cluster.
minikube tunnel -c: Tunnel creates a route to services deployed with type LoadBalancer and sets their Ingress to their ClusterIP.
