1

Degradation Point -- The user count where p95 response time starts exceeding SLA. Example: "At 4,200 users, login p95 jumped from 1.2s to 3.8s." This is your capacity ceiling.

2

Error Threshold -- The user count where error rate exceeds the acceptable limit. Example: "At 5,100 users, error rate jumped from 0.05% to 4.2%, mostly HTTP 503 (connection pool exhausted)."

3

Resource Saturation -- Which resource maxes out first? CPU? Memory? Database connections? Disk I/O? This tells you what to scale or optimize.

4

Recovery Behavior -- After load decreases, does the system recover to normal? Some systems get stuck in a degraded state (thread pool exhaustion, connection leak) even after load drops.

5

Cascading Failures -- Does one component failure cause others to fail? A slow database response can back up the connection pool, which backs up the thread pool, which causes timeouts upstream.

Parameter	Baseline Test Configuration
Virtual Users	10-100 (just enough to validate the setup)
Ramp-up	1 minute
Duration	10-15 minutes
Think Time	Same as load test (keep it realistic)
What to Check	All requests succeed, correlation works, response times are reasonable, no script errors
Pass Criterion	0% errors, response times within expected single-user range
Action if Failed	Fix scripts, fix environment, do NOT proceed to load test

Issue Found in Soak Tests	Symptom	Root Cause	How to Detect
Memory Leak	Heap usage climbs steadily, GC becomes more frequent	Objects not released, caches without TTL	Grafana: JVM heap usage graph shows upward trend
Connection Leak	Database connections exhausted after hours	Connections not returned to pool in error paths	Monitor DB connection pool: active count climbs, never drops
Log Rotation Failure	Disk fills up, app crashes	Log files not rotated, debug logging left on	Monitor disk usage: /var/log growing without bounds
Session Accumulation	Memory grows as sessions are never cleaned	Session timeout too long or cleanup job disabled	Monitor session count: should plateau, not climb
Thread Leak	Thread count climbs, eventually hits OS limit	Threads created but never terminated	Monitor JVM thread count: should be stable after warm-up

Parameter	Baseline Test Configuration
Virtual Users	10-100 (just enough to validate the setup)
Ramp-up	1 minute
Duration	10-15 minutes
Think Time	Same as load test (keep it realistic)
What to Check	All requests succeed, correlation works, response times are reasonable, no script errors
Pass Criterion	0% errors, response times within expected single-user range
Action if Failed	Fix scripts, fix environment, do NOT proceed to load test

Issue Found in Soak Tests	Symptom	Root Cause	How to Detect
Memory Leak	Heap usage climbs steadily, GC becomes more frequent	Objects not released, caches without TTL	Grafana: JVM heap usage graph shows upward trend
Connection Leak	Database connections exhausted after hours	Connections not returned to pool in error paths	Monitor DB connection pool: active count climbs, never drops
Log Rotation Failure	Disk fills up, app crashes	Log files not rotated, debug logging left on	Monitor disk usage: /var/log growing without bounds
Session Accumulation	Memory grows as sessions are never cleaned	Session timeout too long or cleanup job disabled	Monitor session count: should plateau, not climb
Thread Leak	Thread count climbs, eventually hits OS limit	Threads created but never terminated	Monitor JVM thread count: should be stable after warm-up

Test Execution Strategy -- Baseline to Soak

The Five-Phase Execution Strategy

Test Execution Order

Phase 1: Baseline Test

Phase 2: Load Test

Phase 3: Stress Test

What to Look For in Stress Test Results

Phase 4: Spike Test

Phase 5: Soak (Endurance) Test