Skip to main content

Questions tagged [kubernetes]

Kubernetes is an open source orchestration system for Docker containers. It handles scheduling onto nodes in a compute cluster and actively manages workloads to ensure that their state matches the users declared intentions. Using the concepts of "labels" and "pods", it groups the containers which make up an application into logical units for easy management and discovery.

1 vote
1 answer
13 views

Talos OS and TrueCharts - Failed to create network namespace for sandbox error

I have a Kubernetes deployment using Talos OS v1.10.3 with three masters deployed to ARM based hosts (two Turing RK1 and one Raxda CM5). I used to run my cluster on my TrueNAS server using TrueCharts (...
Sean Dawson's user avatar
1 vote
0 answers
23 views

Is there a way to check why a kubernetes cluster restarted on google cloud?

Is there a way to check why a kubernetes cluster restarted on google cloud? I see that the clusters in qa have restarted 100 times as much as the ones on dev and prod, but I don't see a place where we ...
John Bush's user avatar
1 vote
1 answer
83 views

Unit kube-apiserver.service could not be found

I'm encountering an issue setting up a Kubernetes cluster (v1.33.2) on three Ubuntu 24.04.2 LTS VMs (master_1 at 1.2.3.4, worker_1 and worker_2). When running kubectl get nodes on worker_2, I get a ...
Casper LI's user avatar
  • 141
0 votes
0 answers
18 views

Karpenter not consolidating nodes

I'm using Karpenter in my EKS, and I noticed that I have at least two underutilized t3.2xlarge nodes that have only kube-system pods, that I expect Karpenter to consolidate them, I've attached my ...
Hamza Zaafrane's user avatar
0 votes
0 answers
37 views

MinIO in Kubernetes fails with "Rename across devices" error when using PersistentVolumeClaim (works with hostPath)

I'm setting up MinIO and MLflow in a Minikube Kubernetes cluster and encountering persistent volume issues. The deployment works correctly when using hostPath, but fails with a "Rename across ...
Serdar Akyol's user avatar
0 votes
1 answer
51 views

Vulnerabilities in ingress-nginx controller v1.12.2

I was tasked with using ingress-nginx controller v1.12.2 on a Kubernetes Cluster for my org. I scanned it for vulnerabilities with Trivy (github action), and got this report - ┌───────────────────────...
Munish-Soni's user avatar
1 vote
1 answer
112 views

Kubernetes Cluster Crashing Post-Upgrade: API Server and etcd in CrashLoopBackOff

My Kubernetes cluster, self-hosted on an Ubuntu virtual machine running on Proxmox VE, has been crashing repeatedly since a version upgrade. The primary issue is that the API server (kube-apiserver) ...
Casper LI's user avatar
  • 141
0 votes
1 answer
43 views

Kubernetes: helm deploys only part of the defined templates

I use minikube kubernetes cluster installed on a Rocky linux 9.6 VM. I am trying to create a Helm chart for deploying an application, but some of the defined templates are not being deployed inside my ...
David Jones's user avatar
0 votes
0 answers
39 views

Load kubernetes secret file at application pod boot time

I have deployed the application in Kubernetes and I have couple of secret files for the application. Further, I want to implement the secret file to load at the boot of application. Is there any ...
iamdevops's user avatar
0 votes
0 answers
30 views

【help】How to implement the failover of the same application Pod between different clusters in istio non-flat network multi-control plane environment

How do I implement such a requirement From nanjing cluster test-source-869888dfdc-9k6bt Pod a request on http://helloworld.sample.svc.cluster.local:5000/hello Dispatch to different subsets based on ...
15051810258139com's user avatar
0 votes
0 answers
58 views

How to implement the failover of the same application Pod between different clusters in istio non-flat network multi-control plane environment

Description of operating environment: 【Nanjing cluster】 k8s version Client Version: v1.32.3 Kustomize Version: v5.5.0 Server Version: v1.32.3 k8s each host address segment:192.168.110.0/24 service ...
15051810258139com's user avatar
-1 votes
0 answers
86 views

NVIDIA_VISIBLE_DEVICES not being respected by the nvidia-container-runtime

Summary Multiple GPUs are visible in a container despite setting limits in Kubernetes manifest Each GPU should be exclusively allocated to each container that requests a GPU and it should run on that ...
tintin98's user avatar
0 votes
0 answers
55 views

Pod not receiving SIGTERM on delete

My application is not receiving (or kubernetes is not sending) SIGTERM signal when doing delete pod. I've prepared simple entrypoint script to test it: sleep 10000 & pid=$! # Trap the SIGTERM ...
Furgas's user avatar
  • 163
0 votes
0 answers
27 views

Kubernetes Ingress behaviour when pod is unavailable

I have an application where the frontend (Angular SPA, nginx) and backend services are defined as separate deployments in the same Kubernetes namespace. Each application has corresponding services. ...
bowling's user avatar
0 votes
0 answers
46 views

calico/node is not ready: bird/confd is not live: exit status 1

After fresh kubespray install my worker node has calico pod in Init:CrashLoopBackOff state. (We have deployed cluster this way before - a few months ago and all was good) Error log from journalctl: ...
Quies's user avatar
  • 1

15 30 50 per page
1
2 3 4 5
191