Validator Operations Playbook
This runbook collects the key operational tasks for maintaining a production Sei validator, with emphasis on the latest mempool and consensus changes (sei-tendermint@02c9462f1).
Daily Checklist
| Block production | Verify `seid status | jq .SyncInfo.catching_up` returns `false` and height increases steadily. |
| Mempool saturation | Monitor `mempool/size` and `mempool/cache_size` metrics; ensure they are below configured maxima. |
| Validator signing | Check `consensus/validators_signed` > 0 in the last 100 blocks. |
| Oracle participation | If applicable, verify oracle votes landed within the window. |
Configuration Highlights
mempool.cache_size = 10000(default). Withsei-tendermint@02c9462f1, the cache cap is enforced accurately. Raise to 20,000 on high-load validators.- Keep
mempool.broadcastenabled to propagate transactions quickly. consensus.create_empty_blocks = true(default) ensures liveness even under low load. Avoid disabling unless you understand the implications.
Monitoring Metrics
Track these Prometheus metrics:
consensus_height,consensus_round– detect stalls.consensus_validator_power– verify voting power changes.mempool_size,mempool_cache_size– watch for saturation.rpc_trace_pending– ensure tracer load stays belowmax_concurrent_trace_calls.
Incident Response
⚠️
Always snapshot your validator before modifying configuration or restarting under duress.
-
Consensus halt
- Confirm majority of validators are on the same binary.
- Check logs for
nil vote extensionor duplicate tx warnings. - Coordinate restart if required; use state sync if node falls far behind.
-
Mempool overflow
- Increase
mempool.cache_sizegradually (requiressei-tendermint@02c9462f1). - Prune invalid transactions by restarting with
--mempool.recheck=truetemporarily.
- Increase
-
RPC saturation
- Scale out dedicated RPC nodes; validator should keep RPC closed to the public when possible.
Troubleshooting
| Error | Cause | Fix |
|---|---|---|
Duplicate transaction rejected repeatedly | Cache size too small for workload. | Increase `mempool.cache_size` and restart during low traffic. |
Validator missed blocks | Node lagging or signing key offline. | Check hardware load, ensure sentry nodes are reachable, and restart if necessary. |
Vote extension warnings in logs | Experimental flag toggled vote extensions. | Revert configuration; once enabled, the protocol expects extensions. |
Last updated on