ETCD Backup Cronjob Fails and Job Pods Report the Status as 'NotReady'
Problem
- The etcd-backup-with-interval cronjob in the kube-system namespace fails, and the job pod created during the cron execution reports the status 'NotReady'.
- ETCD debug logs show the message below:
transport: loopyWriter.run returning. connection error: desc = "transport is closing"
Environment
- Platform9 Managed Kubernetes - v5.6.8 and higher.
Cause
- ETCD uses gRPC, and this error message means that the connection the RPC was using was closed.
- This can happen for any of the following reasons:
  - Misconfigured transport credentials, so the connection failed during the handshake.
  - Bytes disrupted in transit, possibly by a proxy in between.
  - Server shutdown.
  - Keepalive parameters caused the connection to shut down, for example if the server is configured to terminate connections regularly to trigger DNS lookups. In that case, consider increasing MaxConnectionAgeGrace to allow longer RPC calls to finish.
  - ETCD leader elections, which can cause transient failures.
  - ETCD took too long to process the request and eventually hit a timeout.
Resolution
- List the jobs (not cronjobs) in the kube-system namespace.
List the jobs
$ kubectl get jobs -n kube-system
- Delete all the jobs that show "0/1" in the COMPLETIONS column and are not Completed.
Delete failed jobs
$ kubectl delete job <job-name> -n kube-system
Additional Information
- Currently, Catapult monitoring triggers an alert when the job fails. It does not trigger an alert in this case, because the job never reaches the failed state; it stays running while repeatedly failing.
- An internal bug, PMK-6340, has been reported for Catapult monitoring to send an alert in this case as well.
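When several jobs are stuck, the two Resolution steps can be combined. The following is a minimal sketch, not a Platform9-provided tool: the helper names `stuck_jobs` and `delete_stuck_jobs` are hypothetical. It assumes the table output of `kubectl get jobs`, locates the COMPLETIONS column from the header row, and prints the names of jobs that show "0/1". A job that has only just started also shows "0/1", so review the list before deleting anything.

```shell
#!/bin/sh
# Hypothetical helper: reads `kubectl get jobs` table output on stdin and
# prints the names of jobs whose COMPLETIONS column reads "0/1".
stuck_jobs() {
  awk '
    # Header row: remember which column is COMPLETIONS.
    NR == 1 { for (i = 1; i <= NF; i++) if ($i == "COMPLETIONS") col = i; next }
    # Data rows: print the job name when nothing has completed.
    col && $col == "0/1" { print $1 }
  '
}

# Hypothetical helper: deletes every job reported by stuck_jobs in kube-system.
# Assumption: all "0/1" jobs in the namespace are safe to delete.
delete_stuck_jobs() {
  kubectl get jobs -n kube-system | stuck_jobs | while read -r job; do
    kubectl delete job "$job" -n kube-system
  done
}
```

To preview the affected jobs without deleting them, run `kubectl get jobs -n kube-system | stuck_jobs` after sourcing the script, and call `delete_stuck_jobs` only once the list looks right.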