Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kepler node platform metric not available on BM #1897

Open
vprashar2929 opened this issue Dec 20, 2024 · 1 comment
Open

Kepler node platform metric not available on BM #1897

vprashar2929 opened this issue Dec 20, 2024 · 1 comment
Labels
kind/bug report bug issue

Comments

@vprashar2929
Copy link
Collaborator

vprashar2929 commented Dec 20, 2024

What happened?

In case of uncore rapl domain available on a BM machine Kepler doesn't produce kepler_node_platform metric

Ref:
Screenshot 2024-12-20 at 12 17 28 PM

Screenshot 2024-12-20 at 12 17 42 PM

Screenshot 2024-12-20 at 12 17 59 PM

What did you expect to happen?

Since kepler_node_platform metric represents the total energy consumption of the host https://github.com/sustainable-computing-io/kepler-doc/blob/main/docs/design/metrics.md we should see the metric available irrespective of uncore rapl domain available or not

How can we reproduce it (as minimally and precisely as possible)?

Use a BM machine where uncore rapl domain is available. To check what all domains are available use below command:

for file in $(sudo find -L /sys/class/powercap/intel-rapl -name name  2>/dev/null); do cat $file;  done  | sort -n| uniq

Anything else we need to know?

No response

Kepler image tag

latest

Kubernetes version

$ kubectl version
# paste output here

Cloud provider or bare metal

Bare metal

OS version

# On Linux:
$ cat /etc/os-release
# paste output here
$ uname -a
# paste output here

# On Windows:
C:\> wmic os get Caption, Version, BuildNumber, OSArchitecture
# paste output here

Install tools

Kepler deployment config

For on kubernetes:

$ KEPLER_NAMESPACE=kepler

# provide kepler configmap
$ kubectl get configmap kepler-cfm -n ${KEPLER_NAMESPACE}
# paste output here

# provide kepler deployment description
$ kubectl describe deployment kepler-exporter -n ${KEPLER_NAMESPACE}

For standalone:

put your Kepler command argument here

Container runtime (CRI) and version (if applicable)

Related plugins (CNI, CSI, ...) and versions (if applicable)

@vprashar2929 vprashar2929 added the kind/bug report bug issue label Dec 20, 2024
@vprashar2929
Copy link
Collaborator Author

cc: @rootfs @KaiyiLiu1234 sustainable-computing-io/kepler-metal-ci#335 (comment) I also see the same weird behaviour in our MSE and MAPE numbers as well in case when uncore rapl domain is available

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug report bug issue
Projects
None yet
Development

No branches or pull requests

1 participant