Commit 8b1f1bf
committed
fix: extend Polly timeouts for local Ollama chat inference
qwen2.5-coder:7b consistently takes >30s, causing Polly's default
TotalRequestTimeout to reject every chat response. Override via
PostConfigureAll<HttpStandardResilienceOptions> when UseLocalAI=true
(dev-only path):
- TotalRequestTimeout: 30s 10min
- AttemptTimeout: 10s 5min
- CircuitBreaker.SamplingDuration: 30s 11min (Polly requires >= 2x AttemptTimeout)
The global override is acceptable here: this code path only runs
when the Ollama local-AI flag is set, which is developer-only.1 parent bea1fb9 commit 8b1f1bf
1 file changed
Lines changed: 16 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| 22 | + | |
22 | 23 | | |
23 | 24 | | |
24 | 25 | | |
| |||
243 | 244 | | |
244 | 245 | | |
245 | 246 | | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
246 | 262 | | |
247 | 263 | | |
248 | 264 | | |
| |||
0 commit comments