Add more Prometheus metrics #3682

yuvipanda · 2018-06-13T18:30:59Z

Now that #3490 has been merged, we should add more prometheus metrics to the notebook server!

Some ideas for metrics to add...

Number of kernels running (labeled by type)
Number of sessions open (not sure if this is useful?)
Number of terminals open
Mirror of the activity tracking metrics
Kernel start / stop latency metrics
0mq metrics

Am sure there's more that I don't know of!

yuvipanda · 2018-06-13T18:32:06Z

Can someone with rights mark this as 'good first issue'? :)

dhirschfeld · 2018-06-13T20:46:44Z

I'm interested in RAM - per kernel and total. Not sure if that's already available though?

rgbkrk · 2018-06-13T20:52:31Z

There's some light amount of collaboration around RAM on a kernel level in jupyter/jupyter#264, though it's on a spec level (especially since the actual kernel make be several child processes deep). @ivanov would probably enjoy having a collaborator on the Python side -- I'm looking forward to the UI portion of using it. The notebook server can use it as well since it has access to the messages as it transports them from ZeroMQ to WebSocket.

yuvipanda · 2018-06-20T19:40:19Z

@dhirschfeld @rgbkrk we could possibly also do it from the default Kernel Manager, since it is just spawning local processes and knows how to collect metrics for them (and their children). This lets other kernel managers report their own metrics as they wish, and works across all kernels without any extra work. It would be complimentary to jupyter/jupyter#264.

GoelJatin · 2018-07-07T10:32:06Z

Hi @yuvipanda ,

If no one is working on this, then I would like to take it up.

I would like to start with the very first one for now.

But have a few doubts.

Do we want the number of kernels running at the time when the API is called / keep on collecting them over the complete period of time since the notebook server was started?

Please let me know accordingly.

CC @Madhu94

Issue jupyter#3682

GoelJatin · 2018-07-09T09:06:34Z

Guys,

Any update on this?

manuhortet · 2018-07-13T09:09:45Z

Hey @GoelJatin, I'm up to work on this with you.

I'm interested in RAM - per kernel and total. Not sure if that's already available though?

As I think this will be the most useful metric to add, may I take it? :)

Will try to do it from the default Kernel Manager, as mentioned by @yuvipanda

GoelJatin · 2018-07-13T09:38:48Z

Hey @manuhortet , sure go ahead.

No concerns from my end. :)

LiryChen · 2018-09-06T17:39:10Z

Hey, I am a first-timer looking for tasks to do too. I found this issue pretty interesting. Anything I can help with?

konnermacias · 2018-09-10T05:07:11Z

@manuhortet Have you been able to make any progress on adding that metric? I'm a first-timer as well and would love to help out!

manuhortet · 2018-09-11T14:15:57Z

Hey! Sincerely I've been delaying some OS contributions in order to gain time for personal projects. I'm sorry I delayed you two too doing that! You can take this issue if you want to. In fact, feel free to ask me if you face any problems. Good luck! @konnermacias @LiryChen

LiryChen · 2018-09-12T03:03:13Z

Alright thanks! I see look into the issue and may ask you a few questions to understand the problem!

Hyaxia · 2018-09-19T23:49:06Z

@manuhortet Hey, I would be glad to try and help too.
Not sure how all of this is done, but can I just choose one of the options above and start working on it?
Is there something left to work on?

manuhortet · 2018-09-20T07:54:21Z

@Hyaxia of course, you can. Choose some metric you feel relevant from the first comment on this issue and go for it.
I guess there are already people working on the "RAM - per kernel and total" one, so I'd try to avoid that one!

LiryChen · 2018-10-02T01:42:42Z

Sorry, I don't think I will continue to work on this due to the limited time I have besides school :( I would like to pick something up in the future once I have more free time!

Hyaxia · 2018-10-02T22:03:07Z

Few questions.
First, I wanted to clarify something about number 4.
Does the kind of tracking that is talked about is tracking the very last thing that the user did and its timestamp?

Second, is anyone still working on the RAM per kernel?

Third, what does number 6 mean?

Thanks.

manuhortet · 2018-10-04T09:33:12Z

For number 4, the last done thing and timestamp would be the logical thing IMO.
Don't think there's anyone working on the RAM per kernel metric, maybe @konnermacias ?

Can't really help on the explanation for number 6. Some help here @yuvipanda ?

konnermacias · 2018-10-04T16:24:45Z

@manuhortet I apologize, school has picked up and I was planning on working on it in a week or two when eveyrthing dies down. @Hyaxia feel free to go for it!

Hyaxia · 2018-10-04T17:56:18Z

Ok then, I will start working on the ram per kernel metric in a few days.

vinaycalastry · 2019-06-12T20:05:59Z

Hello.. This issue seems to be open and there has been no status change since October. Can we get the current status on this please ?

santosh2702 · 2019-10-01T08:33:32Z

anything i can help with

Hyaxia · 2019-10-01T14:26:15Z

So I guess I'm the last one who was working on it.
When I opened the PR #4075, @minrk pointed out that it should go into the jupyter_client project.
Then I opened a PR in jupyter_client jupyter/jupyter_client#407 and you can read yourselves to the point we stopped.

TL;DR - goto jupyter/jupyter_client#407 , you should implement some kind of generic way to expose different kernel statistics.
I started to work on it in the PR, but just didn't have the time to finish it.
As the last comments there state, check how to do it using entry points.
For further information just read the comments in the PR itself, all of the information I had at that time is there.

GL.

Franky12 · 2020-04-13T12:24:26Z

Hi can I try this? looking for some beginner-friendly issues

sudo-k-runner · 2020-10-10T19:33:40Z

Hey! not sure if anyone is still looking into this? I would like to work on this.

kevin-bates · 2020-10-16T21:21:00Z

Hi @sudo-k-runner - thank you for your interest. In light of the fact that the primary notebook server will eventually be based on the jupyter server project, you will likely find better traction on metrics gathering via the jupyter telemetry project - which the jupyter server plans on utilizing.

dhivyasreedhar · 2021-08-25T17:02:35Z

Hi, can I work on this? I'm new so can someone guide me?

kevin-bates · 2021-08-25T18:14:44Z

Hi @dhivyasreedhar - please see the previous comment. This repository is currently focused on bug fixes and security issues.

rgbkrk added the good first issue label Jun 13, 2018

takluyver added the help wanted label Jun 13, 2018

GoelJatin added a commit to GoelJatin/notebook that referenced this issue Jul 7, 2018

Return the kernels running and their details with the metrics API

a34cc7f

Issue jupyter#3682

GoelJatin mentioned this issue Jul 7, 2018

[WIP]: #3682, add metrics #3743

Closed

Hyaxia mentioned this issue Sep 26, 2018

Added metrics for currently running terminals and labeled by type kernels #4036

Merged

Hyaxia mentioned this issue Oct 7, 2018

Kernel ram metric #4075

Closed

Hyaxia mentioned this issue Oct 1, 2019

[WIP]Add method for kernel manager to retrieve statistics jupyter/jupyter_client#407

Closed

MartinForReal mentioned this issue Aug 15, 2020

[notebook controller] Integration with prometheus operator kubeflow/kubeflow#5216

Closed

jtpio removed the good first issue label Oct 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more Prometheus metrics #3682

Add more Prometheus metrics #3682

yuvipanda commented Jun 13, 2018

yuvipanda commented Jun 13, 2018

dhirschfeld commented Jun 13, 2018

rgbkrk commented Jun 13, 2018

yuvipanda commented Jun 20, 2018

GoelJatin commented Jul 7, 2018

GoelJatin commented Jul 9, 2018

manuhortet commented Jul 13, 2018

GoelJatin commented Jul 13, 2018

LiryChen commented Sep 6, 2018

konnermacias commented Sep 10, 2018

manuhortet commented Sep 11, 2018 •

edited

Loading

LiryChen commented Sep 12, 2018

Hyaxia commented Sep 19, 2018 •

edited

Loading

manuhortet commented Sep 20, 2018

LiryChen commented Oct 2, 2018

Hyaxia commented Oct 2, 2018

manuhortet commented Oct 4, 2018

konnermacias commented Oct 4, 2018

Hyaxia commented Oct 4, 2018

vinaycalastry commented Jun 12, 2019

santosh2702 commented Oct 1, 2019

Hyaxia commented Oct 1, 2019 •

edited

Loading

Franky12 commented Apr 13, 2020

sudo-k-runner commented Oct 10, 2020

kevin-bates commented Oct 16, 2020

dhivyasreedhar commented Aug 25, 2021

kevin-bates commented Aug 25, 2021

Add more Prometheus metrics #3682

Add more Prometheus metrics #3682

Comments

yuvipanda commented Jun 13, 2018

yuvipanda commented Jun 13, 2018

dhirschfeld commented Jun 13, 2018

rgbkrk commented Jun 13, 2018

yuvipanda commented Jun 20, 2018

GoelJatin commented Jul 7, 2018

GoelJatin commented Jul 9, 2018

manuhortet commented Jul 13, 2018

GoelJatin commented Jul 13, 2018

LiryChen commented Sep 6, 2018

konnermacias commented Sep 10, 2018

manuhortet commented Sep 11, 2018 • edited Loading

LiryChen commented Sep 12, 2018

Hyaxia commented Sep 19, 2018 • edited Loading

manuhortet commented Sep 20, 2018

LiryChen commented Oct 2, 2018

Hyaxia commented Oct 2, 2018

manuhortet commented Oct 4, 2018

konnermacias commented Oct 4, 2018

Hyaxia commented Oct 4, 2018

vinaycalastry commented Jun 12, 2019

santosh2702 commented Oct 1, 2019

Hyaxia commented Oct 1, 2019 • edited Loading

Franky12 commented Apr 13, 2020

sudo-k-runner commented Oct 10, 2020

kevin-bates commented Oct 16, 2020

dhivyasreedhar commented Aug 25, 2021

kevin-bates commented Aug 25, 2021

manuhortet commented Sep 11, 2018 •

edited

Loading

Hyaxia commented Sep 19, 2018 •

edited

Loading

Hyaxia commented Oct 1, 2019 •

edited

Loading