This repo contains a dashboard solution that allows you to monitor the status of your ARPA Network AVS Operator Node.
To set it up, please refer to the setup instructions.
To check our reward policies and related metrics, please refer to the document here.
Scrape interval: 30 seconds
- Custom Exporter
  - Functionality: Pull on-chain data
  - Default URL: localhost:8000
- Prometheus
  - Functionality: Metrics collection
  - Default URL: localhost:9090
- Grafana
  - Functionality: Visualization and alerting
  - Default URL: localhost:3000
- Data Collection: Custom Exporter is responsible for data collection as it:
  - pulls on-chain data
  - extracts information from stored logs
- Data Processing:
  - Prometheus transforms the collected data into metrics (gauges)
- Visualization and Alerting:
  - Grafana receives the metrics from Prometheus
  - Displays them on dashboards
  - Triggers alerts based on predefined rules
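
To make the data flow above more concrete, here is a minimal sketch of how an exporter could expose gauges for Prometheus to scrape. It is not the Custom Exporter shipped in this repo: the metric names `node_state` and `eth_balance`, the placeholder values in `collect()`, and the helper names are all assumptions for illustration. Only the text exposition format and the port from the default URL (localhost:8000) reflect what is described above.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


def render_metrics(metrics: dict) -> str:
    """Render a {name: value} mapping as gauges in the Prometheus text format."""
    lines = []
    for name, value in sorted(metrics.items()):
        lines.append(f"# TYPE {name} gauge")
        lines.append(f"{name} {value}")
    return "\n".join(lines) + "\n"


def collect() -> dict:
    # Hypothetical placeholder values. A real exporter would pull on-chain
    # data and parse stored logs here.
    return {"node_state": 1, "eth_balance": 0.25}


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Serve the current gauges on every scrape.
        body = render_metrics(collect()).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(host: str = "localhost", port: int = 8000) -> None:
    """Block forever, answering scrape requests on host:port."""
    HTTPServer((host, port), MetricsHandler).serve_forever()
```

Calling `serve()` starts the endpoint; Prometheus would then read it once per scrape interval and Grafana would query Prometheus for the resulting series.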
You may access metrics from different components. For example, you can retrieve data from Prometheus programmatically or view it in Grafana. You can also grab data from Custom Exporter directly as needed.
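
As an illustration of programmatic access, the sketch below queries the Prometheus HTTP API (`/api/v1/query`) at the default URL listed above. The metric name `eth_balance` used in the usage note is an assumption; check the Custom Exporter output or the Prometheus UI for the names actually exposed.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

PROMETHEUS_URL = "http://localhost:9090"  # default URL from this README


def build_query_url(expr: str, base_url: str = PROMETHEUS_URL) -> str:
    """Build an instant-query URL for a PromQL expression."""
    return f"{base_url}/api/v1/query?{urlencode({'query': expr})}"


def parse_instant_vector(payload: dict) -> list:
    """Turn an instant-vector response into (labels, value) pairs."""
    if payload.get("status") != "success":
        raise RuntimeError(f"query failed: {payload.get('error', 'unknown error')}")
    # Each sample looks like {"metric": {...}, "value": [timestamp, "value"]}.
    return [(s["metric"], float(s["value"][1])) for s in payload["data"]["result"]]


def query(expr: str) -> list:
    """Run an instant query against a running Prometheus instance."""
    with urlopen(build_query_url(expr), timeout=10) as resp:
        return parse_instant_vector(json.load(resp))
```

With Prometheus running, `query("eth_balance")` would return one `(labels, value)` pair per matching series, or an empty list if no such metric exists.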
- Node State: if the node is activated
- Up Time: how long the node has been in the "activated" state (Note: this duration is currently determined from on-chain events, and all historical activated periods count)
- Node Address: the node account address you are monitoring
- ETH Balance: current ETH balance of the monitored node account
- Group Index: index of the group this node belongs to
- Group State: if the group is functioning
- Group Size: count of group members
- DKG Grouping State: whether the DKG process has finished, is still processing, or has overrun
- Committer Addresses: committer nodes of the current group
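
The Up Time note above says that all historical activated periods count. One way to read that is sketched below: sum every interval between an activation event and the next deactivation event, plus the still-open interval if the node is currently activated. The event representation (`(timestamp, kind)` tuples) and the function name are assumptions for illustration, not the exporter's actual implementation.

```python
def total_uptime(events: list, now: int) -> int:
    """Sum the seconds spent in the "activated" state.

    events: (unix_timestamp, kind) tuples sorted by timestamp, where kind is
    "activated" or "deactivated" (hypothetical event names).
    """
    total = 0
    activated_at = None
    for ts, kind in events:
        if kind == "activated" and activated_at is None:
            activated_at = ts
        elif kind == "deactivated" and activated_at is not None:
            total += ts - activated_at
            activated_at = None
    # Count the open interval if the node is still activated.
    if activated_at is not None:
        total += now - activated_at
    return total
```

For example, a node activated at t=100, deactivated at t=160, and activated again at t=200 would have an up time of 90 seconds when evaluated at t=230.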
Note: The metrics below may not always have data because current traffic is low.
- Partial Signature Generation Processing Time: average processing time of requests per minute
- Randomness Task Count: tasks received per minute
- Reconnection Attempts Count: count of listener-interrupted errors per minute
We currently have 3 alert rules set by default:
- If node is deactivated
- If node account balance is less than 0.1 ETH
- If there are more than 10 RPC reconnection attempts within a minute
To use these rules, follow step #5 of the setup instructions.
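
The logic of the three default rules can be summarized in a few lines. The sketch below is only a plain-Python restatement of the thresholds listed above; the actual rules are configured in Grafana, and the function, parameter, and alert names here are hypothetical.

```python
def firing_alerts(node_activated: bool, eth_balance: float, reconnects_last_minute: int) -> list:
    """Return the names of the default rules whose conditions are met."""
    alerts = []
    if not node_activated:
        alerts.append("node_deactivated")
    if eth_balance < 0.1:  # threshold in ETH
        alerts.append("low_eth_balance")
    if reconnects_last_minute > 10:  # RPC reconnection attempts within a minute
        alerts.append("rpc_reconnections")
    return alerts
```

A healthy node, for instance `firing_alerts(True, 0.5, 0)`, produces an empty list, while a deactivated node with 0.05 ETH would trigger the first two rules.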