Questions tagged [kubernetes]
Kubernetes is an open-source orchestration system for Docker containers. It handles scheduling onto nodes in a compute cluster and actively manages workloads to ensure that their state matches the user's declared intentions. Using the concepts of "labels" and "pods", it groups the containers that make up an application into logical units for easy management and discovery.
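As a rough illustration of how labels group pods into logical units, here is a minimal sketch of equality-based label matching. It does not call the Kubernetes API; the pod records, the names `matches` and `select_pods`, and the sample labels are all hypothetical and chosen only for the example.

```python
# Minimal sketch of equality-based label selection (not the Kubernetes client API).
# PODS, matches, and select_pods are hypothetical names used for illustration only.

PODS = [
    {"name": "web-1", "labels": {"app": "shop", "tier": "frontend"}},
    {"name": "web-2", "labels": {"app": "shop", "tier": "frontend"}},
    {"name": "db-1", "labels": {"app": "shop", "tier": "backend"}},
    {"name": "cron-1", "labels": {"app": "billing"}},
]


def matches(labels: dict, selector: dict) -> bool:
    """Return True if every key in the selector is present in labels with the same value."""
    return all(labels.get(key) == value for key, value in selector.items())


def select_pods(pods: list, selector: dict) -> list:
    """Return the names of the pods whose labels satisfy the selector, in input order."""
    return [pod["name"] for pod in pods if matches(pod["labels"], selector)]


if __name__ == "__main__":
    # Pods carrying both labels form one logical unit (here, the shop frontend).
    print(select_pods(PODS, {"app": "shop", "tier": "frontend"}))
```

In a real cluster the same idea is expressed with a label selector on the command line, for example `kubectl get pods -l app=shop,tier=frontend`, where the label keys and values are again assumptions.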
2,860 questions
1
vote
1
answer
13
views
Talos OS and TrueCharts - Failed to create network namespace for sandbox error
I have a Kubernetes deployment using Talos OS v1.10.3 with three masters deployed to ARM-based hosts (two Turing RK1 and one Radxa CM5).
I used to run my cluster on my TrueNAS server using TrueCharts (...
1
vote
0
answers
23
views
Is there a way to check why a Kubernetes cluster restarted on Google Cloud?
Is there a way to check why a Kubernetes cluster restarted on Google Cloud? I see that the clusters in qa have restarted 100 times as often as the ones on dev and prod, but I don't see a place where we ...
1
vote
1
answer
83
views
Unit kube-apiserver.service could not be found
I'm encountering an issue setting up a Kubernetes cluster (v1.33.2) on three Ubuntu 24.04.2 LTS VMs (master_1 at 1.2.3.4, worker_1 and worker_2).
When running kubectl get nodes on worker_2, I get a ...
0
votes
0
answers
18
views
Karpenter not consolidating nodes
I'm using Karpenter in my EKS cluster, and I noticed that I have at least two underutilized t3.2xlarge nodes running only kube-system pods, which I expect Karpenter to consolidate. I've attached my ...
0
votes
0
answers
37
views
MinIO in Kubernetes fails with "Rename across devices" error when using PersistentVolumeClaim (works with hostPath)
I'm setting up MinIO and MLflow in a Minikube Kubernetes cluster and encountering persistent volume issues. The deployment works correctly when using hostPath, but fails with a "Rename across ...
0
votes
1
answer
51
views
Vulnerabilities in ingress-nginx controller v1.12.2
I was tasked with using ingress-nginx controller v1.12.2 on a Kubernetes cluster for my org. I scanned it for vulnerabilities with Trivy (GitHub Action) and got this report:
┌───────────────────────...
1
vote
1
answer
112
views
Kubernetes Cluster Crashing Post-Upgrade: API Server and etcd in CrashLoopBackOff
My Kubernetes cluster, self-hosted on an Ubuntu virtual machine running on Proxmox VE, has been crashing repeatedly since a version upgrade.
The primary issue is that the API server (kube-apiserver) ...
0
votes
1
answer
43
views
Kubernetes: Helm deploys only part of the defined templates
I use a minikube Kubernetes cluster installed on a Rocky Linux 9.6 VM.
I am trying to create a Helm chart for deploying an application, but some of the defined templates are not being deployed inside my ...
0
votes
0
answers
39
views
Load Kubernetes secret file at application pod boot time
I have deployed the application in Kubernetes, and I have a couple of secret files for the application. I also want the secret files to be loaded when the application starts. Is there any ...
0
votes
0
answers
30
views
[Help] How to implement the failover of the same application Pod between different clusters in an Istio non-flat network multi-control plane environment
How do I implement the following requirement?
The Pod test-source-869888dfdc-9k6bt in the Nanjing cluster sends a request to http://helloworld.sample.svc.cluster.local:5000/hello
Dispatch to different subsets based on ...
0
votes
0
answers
58
views
How to implement the failover of the same application Pod between different clusters in an Istio non-flat network multi-control plane environment
Description of operating environment:
【Nanjing cluster】
k8s version
Client Version: v1.32.3
Kustomize Version: v5.5.0
Server Version: v1.32.3
k8s host address segment: 192.168.110.0/24
service ...
-1
votes
0
answers
86
views
NVIDIA_VISIBLE_DEVICES not being respected by the nvidia-container-runtime
Summary
Multiple GPUs are visible in a container despite setting limits in the Kubernetes manifest
Each GPU should be exclusively allocated to each container that requests a GPU and it should run on that ...
0
votes
0
answers
55
views
Pod not receiving SIGTERM on delete
My application is not receiving (or Kubernetes is not sending) a SIGTERM signal when I delete the pod. I've prepared a simple entrypoint script to test it:
sleep 10000 &
pid=$!
# Trap the SIGTERM ...
0
votes
0
answers
27
views
Kubernetes Ingress behaviour when pod is unavailable
I have an application where the frontend (Angular SPA, nginx) and backend services are defined as separate deployments in the same Kubernetes namespace. Each application has corresponding services. ...
0
votes
0
answers
46
views
calico/node is not ready: bird/confd is not live: exit status 1
After a fresh kubespray install, my worker node has a calico pod in the Init:CrashLoopBackOff state.
(We deployed a cluster this way a few months ago and all was good.)
Error log from journalctl:
...