{{indexmenu_n>2}} ====== Components and indicators ====== ===== Monitoring coverage ===== flowchart LR EXP["fast-epdg
/metrics :9817"] EXP --> CFG["Config
2 metrics"] EXP --> NET["Network
1 metric"] EXP --> PROTO["Protocols L5-L7
15 metrics"] EXP --> SVC["Service KPI
4 metrics"] EXP --> SESS["Session State
4 metrics"] EXP --> APP["Application
3 metrics"] EXP --> SYS["System
4 metrics"] PROTO --> IKEV2["IKEv2
SWu — 3"] PROTO --> GTPC["GTPv2-C
S2b — 4"] PROTO --> GTPU["GTP-U
S2b data — 3"] PROTO --> DIA["Diameter
SWm/SWx/S6b — 5"]
===== Quantitative review by category ===== ^ Category ^ Number of metrics ^ Survey interval ^ Key indicators ^ | **Config** | 2 | 10 sec | Configuration status, reload counter | | **Network** | 1 | 10 sec | Node connection status (PGW/AAA/HSS) | | **IKEv2 (SWu)** | 3 | 10 sec | Reports by type (IKE_SA_INIT, IKE_AUTH, CREATE_CHILD_SA), delay diagram, errors | | **GTPv2-C (S2b)** | 4 | 10 sec | Messages (Create/Modify/Delete Session), delays, errors, relays | | **GTP-U data plane** | 3 | 10 sec | Packets/bytes, tunneling errors | | **Diameter (SWm/SWx/S6b)** | 5 | 10 sec | Command code messages (DER/DEA, MAR/MAA, AAR/AAA), delays, errors, watchdog, connection status | | **Service KPI** | 4 | 10 sec | Percentage of successful attempts, duration histogram, service availability, uptime | | **Session State** | 4 | 10 sec | IKE SA, Child SA, GTP sessions, all users | | **Application** | 3 | 10 sec | Number of streams, memory, log messages by levels | | **System** | 4 | 10 sec | CPU recycling, memory, memory disposal, open FD | | **Total** | **33 metrics** | | | ===== Naming principles ===== All metrics have the prefix ''epdg_'' and are organized in a hierarchy: epdg_ ├── config_* # Configuration ├── network_* # Network layer ├── ikev2_* # SWu (IKEv2/IPSec) ├── gtp_* # S2b control-plane GTPv2-C ├── gtpu_* # S2b data-plane GTP-U ├── diameter_* # SWm/SWx/S6b ├── service_* # Service KPIs (attach, availability, uptime) ├── session_* # Session Status (IKE SA, Child SA, GTP, subscribers) ├── app_* # App Metrics (memory, threads, logs) └── system_* # System metrics (CPU, disk, network)