[BUG] FQDN use or customizable cluster domain for DD_AGENT_HOST #35526
Labels
team/container-platform
The Container Platform Team
team/dynamic-instrumentation
Dynamic Instrumentation
team/triage
We're debugging an excess of DNS requests to CoreDNS. We see datadog looking for weird search domains:
So it makes sense, as by default in Kubernetes' resolv.conf has:
and
datadog.datadog.svc.cluster.local
has only 4 dots, it starts looking for additional domains, duplicating the search domains. This is a 6x increase in requests for everydatadog.datadog
resolution.Whenever a service is calling the short service name:
http://<service>
this hits on the first try of a search domain. With the next levelhttp://<service>.<namespace>
, it hits on the second try. I think that in case a service uses the full domain of the cluster, the recommended way would be using FQDNs (adding a.
likedatadog.datadog.svc.cluster.local.
) to completely skip searching through the domains, and not forcing to search through all searchable domains.Moreover, if a service decides to use a cluster domain different from
cluster.local
this:datadog-agent/pkg/clusteragent/admission/mutate/config/mutator.go
Lines 73 to 88 in dc550eb
will fail, as it hard-codes the domain.
What are your thoughts on using FQDNs or customizing the cluster domain? That way, I cannot only use
example.org
domain, if I need to, but also useexample.org.
orcluster.local.
to set FQDNs, and reduce the amount of DNS calls.Agent Environment
Agent (v7.62.0)
Describe what happened:
Agent creates an admission controller with a full domain but without FQDN
Describe what you expected:
I expected a customizable MutationWebhookConfiguration object.
Steps to reproduce the issue:
Additional environment details (Operating System, Cloud provider, etc):
The text was updated successfully, but these errors were encountered: