fix: honour server-provided Retry-After in bot rate limiting/retries by praisonai-triage-agent[bot] · Pull Request #2431 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-06-28T09:19:57Z

Summary

Bot outbound send/retry paths now honour an explicit server throttle signal instead of falling back to a generic exponential backoff.

New server_retry_after(err) helper in bots/_resilience.py extracts the mandated wait:
- Telegram: parameters.retry_after (and the .retry_after attr exposed by python-telegram-bot's RetryAfter).
- HTTP channels (Slack/Discord/WhatsApp): Retry-After header as integer seconds or an HTTP-date.
- Text fallback: retry after <n> / retry_after: <n> in the message.
The honoured value takes precedence over the policy backoff in:
- _resilience.deliver_with_retry
- _resilience.ConnectionMonitor.record_error
- _delivery.deliver_with_retry
- _outbox.OutboundQueue.drain (gates the next retry by the stored hint)
RateLimiter gains adaptive penalise(channel_id, seconds) lane-widening (per-channel + global penalty windows), wired into _delivery.deliver_with_retry via a new optional rate_limiter= arg so the next sends do not immediately re-trip the limit.

Layer placement

Wrapper-only (src/praisonai/praisonai/bots/) transport resilience. No core/Agent changes, no new dependencies, no new user-facing surface, fully backward-compatible.

Test plan

New tests/unit/bots/test_server_retry_after.py (12 tests): hint extraction (Telegram attr/params, HTTP seconds/HTTP-date, text fallback, none), ConnectionMonitor honour-vs-backoff, RateLimiter.penalise lane widening + zero no-op, and deliver_with_retry honouring 30s flood_wait + penalising the lane. All pass.
Existing test_outbound_retry.py, test_delivery_imports.py, test_dlq.py, test_outbox_reconcile.py (28 tests) still pass.

Generated with Claude Code

Summary by CodeRabbit

New Features
- Retry handling now respects server-provided wait times, including Retry-After and similar throttle signals.
- Rate limiting can now apply temporary penalties to slow down repeated requests after throttling.
Bug Fixes
- Retry delays now avoid resending too early when the server specifies a longer wait.
- Retry logging now clearly shows when a server-mandated delay is being honored.

…ixes #2427) Outbound send/retry paths now extract the server-mandated wait (Telegram parameters.retry_after / HTTP Retry-After header, with a text fallback) via a new server_retry_after() helper and honour it over the generic exponential backoff. The RateLimiter gains an adaptive penalise() that widens the affected channel (or global) lane for the window so the next sends do not immediately re-trip the platform's throttle. Co-authored-by: MervinPraison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-28T09:20:08Z

@coderabbitai review

MervinPraison · 2026-06-28T09:20:08Z

/review

qodo-code-review · 2026-06-28T09:20:12Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

coderabbitai · 2026-06-28T09:20:13Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-06-28T09:20:25Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 53696c4b-4f1a-4dad-be7d-ce08b6fa618f

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds server_retry_after() to _resilience.py to extract server-mandated wait times from Telegram flood-wait errors and HTTP Retry-After headers. RateLimiter gains per-channel and global penalty windows via a new penalise method. deliver_with_retry, ConnectionMonitor.record_error, and OutboundQueue.drain are updated to prefer server-mandated delays over generic backoff, with deliver_with_retry also accepting an optional rate_limiter to penalize channels on throttle. A new unit test module validates all new behaviors.

Changes

Server Retry-After honoring

Layer / File(s)	Summary
`server_retry_after()` parser `bots/_resilience.py`	Adds `_parse_http_date_seconds` and `server_retry_after()` extracting mandated wait from Telegram `retry_after` fields, HTTP `Retry-After` headers (numeric and HTTP-date), and text fallbacks. Adds `import re`.
RateLimiter penalty windows `bots/_rate_limit.py`	`__init__` adds per-channel and global penalty state; `acquire` subtracts active penalty windows; `penalise` records hold-off durations per-channel or globally; `reset` clears penalty state.
Retry paths wired to server hint `bots/_resilience.py`, `bots/_delivery.py`, `bots/_outbox.py`	`ConnectionMonitor.record_error` and `deliver_with_retry` in `_resilience.py` prefer `server_retry_after()` over `compute_backoff`; `deliver_with_retry` in `_delivery.py` gains optional `rate_limiter` and penalizes on throttle; `OutboundQueue.drain` enforces `max(backoff, mandated)`.
Unit tests `tests/unit/bots/test_server_retry_after.py`	New test module with error fakes; covers `server_retry_after()` parsing, `ConnectionMonitor` mandated-wait vs backoff, `RateLimiter.penalise` channel gating and noop, and `deliver_with_retry` delay honoring and lane penalization.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐇 No more guessing — the server has spoken!
With Retry-After parsed, no throttle-ban broken.
The penalise window holds each channel in line,
flood_wait or header, the timing's just fine.
A rabbit respects what the platform decrees! 🕐

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 32.50% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly matches the main change: honoring server-provided Retry-After in bot retries and rate limiting.
Linked Issues check	✅ Passed	The PR implements server-driven retry waits, HTTP/Telegram hint parsing, and limiter widening as requested in `#2427`.
Out of Scope Changes check	✅ Passed	The changes stay within the bot wrapper resilience/rate-limit scope and add only supporting tests and internals.

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-2427-20260628-0912

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands.}

MervinPraison · 2026-06-28T09:20:32Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

greptile-apps · 2026-06-28T09:22:09Z

Greptile Summary

This PR makes bot retry handling use server-provided throttle delays.

Adds Retry-After extraction for Telegram, HTTP headers, and text messages.
Applies server delays in delivery retry and connection monitor paths.
Adds adaptive rate-limiter penalty windows.
Persists retry hints for outbox retry gating.

Confidence Score: 4/5

This is close, but one durable retry path should be fixed before merging.

The direct outbox path can persist a parsed retry hint.
The durable drain wrapper can still drop structured throttle fields before the outbox stores the failure.
That can make a queued message retry too early after a server throttle.

src/praisonai/praisonai/bots/_delivery.py

Important Files Changed

Filename	Overview
src/praisonai/praisonai/bots/_delivery.py	Adds retry hint handling and optional rate-limiter penalties, but the durable drain wrapper can still lose structured retry hints.
src/praisonai/praisonai/bots/_outbox.py	Adds retry hint persistence and gating for stored outbox errors.
src/praisonai/praisonai/bots/_rate_limit.py	Adds per-channel and global penalty windows for sends after server throttling.
src/praisonai/praisonai/bots/_resilience.py	Adds centralized Retry-After parsing and uses it in resilience retry paths.
src/praisonai/tests/unit/bots/test_server_retry_after.py	Adds tests for retry hint parsing, retry delays, limiter penalties, and direct outbox gating.

_{Reviews (2): Last reviewed commit: "chore: drop unused AsyncMock import in s..." | Re-trigger Greptile}

coderabbitai

Actionable comments posted: 5

🧹 Nitpick comments (1)

src/praisonai/tests/unit/bots/test_server_retry_after.py (1)
5-10: 📐 Maintainability & Code Quality | 🔵 Trivial | ⚡ Quick win

Add the missing _outbox.OutboundQueue.drain() test or drop this claim.

This module says it covers OutboundQueue gating, but it never imports or exercises _outbox. Since _outbox.OutboundQueue.drain() is one of the changed retry paths in this PR, that branch is still unverified here.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/praisonai/tests/unit/bots/test_server_retry_after.py` around lines 5 -
10, The test module currently claims coverage for OutboundQueue gating but does
not import or exercise _outbox.OutboundQueue.drain(). Add a focused test that
drives the drain() retry path using the stored server hint, or remove that claim
from the module docstring if you are not covering it. Use the existing
server_retry_after(), deliver_with_retry(), and RateLimiter.penalise() test
patterns as a guide and reference _outbox.OutboundQueue.drain() directly so the
changed branch is actually verified.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai/praisonai/bots/_delivery.py`:
- Around line 354-365: Only penalize the limiter when the wait is
server-mandated, not for generic backoff. In `_delivery.py` within the
send/retry flow where `server_retry_after(e)` and `compute_backoff(backoff,
attempt)` are used, update the `rate_limiter.penalise(channel_id, delay)` call
so it runs only when `mandated` is not `None` and `delay` comes from that
mandated value. Keep the backoff delay for sleeping/retry logic, but avoid
widening the channel lane on ordinary transient errors.

In `@src/praisonai/praisonai/bots/_outbox.py`:
- Around line 355-358: Persist the retry-after hint on the outbox entry instead
of reconstructing it from entry.error in the retry path. Update the
failure-marking flow in the outbox logic so the structured retry delay/timestamp
is saved when the error is recorded, then have the retry handling around
server_retry_after in the outbox processor read that persisted value directly
rather than wrapping entry.error in Exception. Use the existing outbox
entry/error handling methods and the server_retry_after call site to locate the
change.

In `@src/praisonai/praisonai/bots/_rate_limit.py`:
- Around line 118-120: The global penalty handling in the rate limiter should
not collapse all waiting callers onto the same wake-up without preserving token
spacing. Update the reservation logic in the rate-limit path around
_global_penalty_until and global_wait so the delayed time is used as the new
token reservation anchor, or fold the penalty directly into the token
reservation calculation, ensuring queued sends still remain staggered after the
penalty expires.

In `@src/praisonai/tests/unit/bots/test_server_retry_after.py`:
- Line 224: The unpacked tuple element in the test run result is unused, which
keeps Ruff RUF059 active. Update the `run()` result handling in
`test_server_retry_after` so the second value is intentionally ignored or
clearly marked unused, while keeping the existing `success` assertion flow
unchanged.
- Around line 198-227: This test only verifies lane penalisation, not that
deliver_with_retry in _delivery honors the server Retry-After value. Update the
test around deliver_with_retry and fake_sleep_with_abort to capture the sleep
duration used after HTTPRateLimit("5"), and assert it waits approximately 5
seconds rather than falling back to generic backoff. Keep the existing
RateLimiter and Adapter setup, but strengthen the assertion on the _delivery
retry path so the behavior in deliver_with_retry is validated directly.

---

Nitpick comments:
In `@src/praisonai/tests/unit/bots/test_server_retry_after.py`:
- Around line 5-10: The test module currently claims coverage for OutboundQueue
gating but does not import or exercise _outbox.OutboundQueue.drain(). Add a
focused test that drives the drain() retry path using the stored server hint, or
remove that claim from the module docstring if you are not covering it. Use the
existing server_retry_after(), deliver_with_retry(), and RateLimiter.penalise()
test patterns as a guide and reference _outbox.OutboundQueue.drain() directly so
the changed branch is actually verified.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 57588c21-e9c6-49d1-98e3-f41e716ab129

📥 Commits

Reviewing files that changed from the base of the PR and between 03e374b and 66de502.

📒 Files selected for processing (5)

src/praisonai/praisonai/bots/_delivery.py
src/praisonai/praisonai/bots/_outbox.py
src/praisonai/praisonai/bots/_rate_limit.py
src/praisonai/praisonai/bots/_resilience.py
src/praisonai/tests/unit/bots/test_server_retry_after.py

coderabbitai · 2026-06-28T09:24:50Z

+            # Honour a server-mandated wait (Telegram retry_after, HTTP
+            # Retry-After) over the generic backoff estimate.
+            mandated = server_retry_after(e)
+            delay = mandated if mandated is not None else compute_backoff(backoff, attempt)

+            # Widen this channel's lane so the next send does not immediately
+            # re-trip the platform limit.
+            if rate_limiter is not None and delay > 0:
+                try:
+                    await rate_limiter.penalise(channel_id, delay)
+                except Exception:
+                    pass


🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

Only penalize the limiter for server-mandated waits.

When mandated is None, delay is just generic backoff, but Line 363 still widens the channel lane. That can unnecessarily throttle future sends after ordinary transient errors.

Proposed fix

- if rate_limiter is not None and delay > 0: + if mandated is not None and rate_limiter is not None and delay > 0: try: await rate_limiter.penalise(channel_id, delay) except Exception: pass

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Honour a server-mandated wait (Telegram retry_after, HTTP

# Retry-After) over the generic backoff estimate.

mandated = server_retry_after(e)

delay = mandated if mandated is not None else compute_backoff(backoff, attempt)

# Widen this channel's lane so the next send does not immediately

# re-trip the platform limit.

if rate_limiter is not None and delay > 0:

try:

await rate_limiter.penalise(channel_id, delay)

except Exception:

pass

# Honour a server-mandated wait (Telegram retry_after, HTTP

# Retry-After) over the generic backoff estimate.

mandated = server_retry_after(e)

delay = mandated if mandated is not None else compute_backoff(backoff, attempt)

# Widen this channel's lane so the next send does not immediately

# re-trip the platform limit.

if mandated is not None and rate_limiter is not None and delay > 0:

try:

await rate_limiter.penalise(channel_id, delay)

except Exception:

pass

🧰 Tools

🪛 Ruff (0.15.18)

[error] 364-365: try-except-pass detected, consider logging the exception

(S110)

[warning] 364-364: Do not catch blind exception: Exception

(BLE001)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/bots/_delivery.py` around lines 354 - 365, Only penalize the limiter when the wait is server-mandated, not for generic backoff. In `_delivery.py` within the send/retry flow where `server_retry_after(e)` and `compute_backoff(backoff, attempt)` are used, update the `rate_limiter.penalise(channel_id, delay)` call so it runs only when `mandated` is not `None` and `delay` comes from that mandated value. Keep the backoff delay for sleeping/retry logic, but avoid widening the channel lane on ordinary transient errors.

coderabbitai · 2026-06-28T09:24:50Z

+                if entry.error:
+                    mandated = server_retry_after(Exception(entry.error))
+                    if mandated is not None:
+                        delay = max(delay, mandated)


🗄️ Data Integrity & Integration | 🟠 Major | 🏗️ Heavy lift

Persist the structured retry hint instead of reparsing str(error).

Exception(entry.error) loses Telegram attributes and HTTP headers, so durable retries only honor hints that happen to survive in text. Store a retry-after timestamp/delay when marking the failure, then use that persisted value here.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/bots/_outbox.py` around lines 355 - 358, Persist the retry-after hint on the outbox entry instead of reconstructing it from entry.error in the retry path. Update the failure-marking flow in the outbox logic so the structured retry delay/timestamp is saved when the error is recorded, then have the retry handling around server_retry_after in the outbox processor read that persisted value directly rather than wrapping entry.error in Exception. Use the existing outbox entry/error handling methods and the server_retry_after call site to locate the change.

coderabbitai · 2026-06-28T09:24:50Z

+            # Honour any active global penalty window (e.g. a platform-wide 429).
+            if self._global_penalty_until > now + global_wait:
+                global_wait = self._global_penalty_until - now


🚀 Performance & Scalability | 🟠 Major | ⚡ Quick win

Preserve token spacing after a global penalty.

Line 120 collapses all callers to the same _global_penalty_until wake-up without reserving token slots at that delayed time, so a burst of queued sends can fire together when the global hold-off expires. Update the token reservation anchor after applying the penalty, or fold the penalty into the reservation calculation.

Possible direction

# Honour any active global penalty window (e.g. a platform-wide 429). if self._global_penalty_until > now + global_wait: global_wait = self._global_penalty_until - now + self._last_refill = max(self._last_refill, now + global_wait)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

# Honour any active global penalty window (e.g. a platform-wide 429).

if self._global_penalty_until > now + global_wait:

global_wait = self._global_penalty_until - now

# Honour any active global penalty window (e.g. a platform-wide 429).

if self._global_penalty_until > now + global_wait:

global_wait = self._global_penalty_until - now

self._last_refill = max(self._last_refill, now + global_wait)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/bots/_rate_limit.py` around lines 118 - 120, The global penalty handling in the rate limiter should not collapse all waiting callers onto the same wake-up without preserving token spacing. Update the reservation logic in the rate-limit path around _global_penalty_until and global_wait so the delayed time is used as the new token reservation anchor, or fold the penalty directly into the token reservation calculation, ensuring queued sends still remain staggered after the penalty expires.

coderabbitai · 2026-06-28T09:24:50Z

+    async def fake_sleep_with_abort(seconds, abort_signal=None):
+        return True
+
+    monkeypatch.setattr(_delivery, "sleep_with_abort", fake_sleep_with_abort)
+
+    limiter = RateLimiter(RateLimitConfig(messages_per_second=1000, per_channel_delay=0.0, burst_size=100))
+
+    calls = {"n": 0}
+
+    class Adapter:
+        async def send_message(self, channel_id, content, **kwargs):
+            calls["n"] += 1
+            if calls["n"] == 1:
+                raise HTTPRateLimit("5")
+            return "ok"
+
+    async def run():
+        return await _delivery.deliver_with_retry(
+            Adapter(),
+            "chanX",
+            "hi",
+            max_attempts=3,
+            platform="discord",
+            rate_limiter=limiter,
+        )
+
+    success, err = asyncio.run(run())
+    assert success is True
+    # The lane should have been penalised by ~5s.
+    assert limiter._channel_penalty_until.get("chanX", 0) > 0


🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

Assert the _delivery wait duration, not just lane penalisation.

Right now this test only proves that rate_limiter.penalise() was called. It would still pass if _delivery.deliver_with_retry() ignored the server's Retry-After: 5 and slept its generic backoff instead, which is one of the main behaviors this PR is changing.

Proposed assertion

+ slept = [] + async def fake_sleep_with_abort(seconds, abort_signal=None): + slept.append(seconds) return True @@ - success, err = asyncio.run(run()) + success, err = asyncio.run(run()) assert success is True + assert slept == [pytest.approx(5.0, abs=0.01)] # The lane should have been penalised by ~5s. assert limiter._channel_penalty_until.get("chanX", 0) > 0

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

async def fake_sleep_with_abort(seconds, abort_signal=None):

return True

monkeypatch.setattr(_delivery, "sleep_with_abort", fake_sleep_with_abort)

limiter = RateLimiter(RateLimitConfig(messages_per_second=1000, per_channel_delay=0.0, burst_size=100))

calls = {"n": 0}

class Adapter:

async def send_message(self, channel_id, content, **kwargs):

calls["n"] += 1

if calls["n"] == 1:

raise HTTPRateLimit("5")

return "ok"

async def run():

return await _delivery.deliver_with_retry(

Adapter(),

"chanX",

"hi",

max_attempts=3,

platform="discord",

rate_limiter=limiter,

)

success, err = asyncio.run(run())

assert success is True

# The lane should have been penalised by ~5s.

assert limiter._channel_penalty_until.get("chanX", 0) > 0

slept = []

async def fake_sleep_with_abort(seconds, abort_signal=None):

slept.append(seconds)

return True

monkeypatch.setattr(_delivery, "sleep_with_abort", fake_sleep_with_abort)

limiter = RateLimiter(RateLimitConfig(messages_per_second=1000, per_channel_delay=0.0, burst_size=100))

calls = {"n": 0}

class Adapter:

async def send_message(self, channel_id, content, **kwargs):

calls["n"] += 1

if calls["n"] == 1:

raise HTTPRateLimit("5")

return "ok"

async def run():

return await _delivery.deliver_with_retry(

Adapter(),

"chanX",

"hi",

max_attempts=3,

platform="discord",

rate_limiter=limiter,

)

success, err = asyncio.run(run())

assert success is True

assert slept == [pytest.approx(5.0, abs=0.01)]

# The lane should have been penalised by ~5s.

assert limiter._channel_penalty_until.get("chanX", 0) > 0

🧰 Tools

🪛 Ruff (0.15.18)

[warning] 224-224: Unpacked variable err is never used

Prefix it with an underscore or any other dummy variable pattern

(RUF059)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/tests/unit/bots/test_server_retry_after.py` around lines 198 - 227, This test only verifies lane penalisation, not that deliver_with_retry in _delivery honors the server Retry-After value. Update the test around deliver_with_retry and fake_sleep_with_abort to capture the sleep duration used after HTTPRateLimit("5"), and assert it waits approximately 5 seconds rather than falling back to generic backoff. Keep the existing RateLimiter and Adapter setup, but strengthen the assertion on the _delivery retry path so the behavior in deliver_with_retry is validated directly.

coderabbitai · 2026-06-28T09:24:50Z

+            rate_limiter=limiter,
+        )
+
+    success, err = asyncio.run(run())


📐 Maintainability & Code Quality | 🟡 Minor | ⚡ Quick win

Rename the unused tuple element.

err is never used after unpacking, so this keeps Ruff's RUF059 warning alive.

Proposed fix

- success, err = asyncio.run(run()) + success, _err = asyncio.run(run())

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

success, err = asyncio.run(run())

success, _err = asyncio.run(run())

🧰 Tools

🪛 Ruff (0.15.18)

[warning] 224-224: Unpacked variable err is never used

Prefix it with an underscore or any other dummy variable pattern

(RUF059)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/tests/unit/bots/test_server_retry_after.py` at line 224, The unpacked tuple element in the test run result is unused, which keeps Ruff RUF059 active. Update the `run()` result handling in `test_server_retry_after` so the second value is intentionally ignored or clearly marked unused, while keeping the existing `success` assertion flow unchanged.

Source: Linters/SAST tools

MervinPraison · 2026-06-28T09:30:22Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions
SDK value: review in depth whether the change genuinely adds value to the SDK — never add features for the sake of adding them. It must strengthen the SDK (simpler, more user-friendly, robust, world-class, secure). If it does not clearly add value, request changes or recommend rejecting/closing rather than merging scope creep
Do not bloat the Agent class with additional params — only if absolutely required; we already support many params.

Phase 2: FIX Valid Issues
5. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
6. Push all code fixes directly to THIS branch (do NOT create a new PR)
7. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
8. If all issues are resolved, approve the PR / close the Issue
9. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-28T09:30:40Z

Claude finished @MervinPraison's task —— View job

Final Architecture Review — PR #2431 ✅

Pushed fixes to claude/issue-2427-20260628-0912 (commit f0bb0aad).

Phase 1 — Architecture (per AGENTS.md)

Routing: Correct. All changes are wrapper-only (src/praisonai/praisonai/bots/) transport resilience. No core SDK / agent.py changes, no new params, no new deps. ✅
Backward compatible: rate_limiter= is optional; _TransientDeliveryError is internal; outbox schema unchanged. Zero regressions. ✅
Performance: import re is stdlib; server_retry_after only runs on the error path, not the hot send path. ✅
SDK value: Genuinely strengthens robustness — honouring a server's explicit throttle signal avoids 429 ban-loops that generic backoff causes. Worth merging. ✅

Phase 2 — Fixes applied (reviewer findings verified valid)

#	Finding (source)	Fix
1	Generic backoff penalises the lane on ordinary transient errors (Greptile P1 + CodeRabbit Major)	`_delivery.py` — gated `penalise()` on `mandated is not None`
2	Global penalty collapses token spacing → burst on expiry (CodeRabbit Major)	`_rate_limit.py` — anchor `_last_refill` to penalty end
3	Serialized errors lose structured Retry-After → durable retries fire early (Greptile P1 + CodeRabbit Major)	`_outbox.py` — persist hint from live exception into stored error text; `_delivery.py` raises recoverable `_TransientDeliveryError` so the hint survives the transient path; `_resilience.py` classifies it recoverable
4	Test only asserts `penalise()` called, not the wait duration (CodeRabbit Major)	added `assert slept == [~5.0]`
5	Ruff RUF059 unused `err` (CodeRabbit Minor)	renamed to `_err`
6	Module claims `OutboundQueue.drain` coverage but doesn't test it (CodeRabbit Nitpick)	added `test_outbox_drain_gates_next_retry_by_server_hint`

Skipped: none of the flagged items — all were valid and addressed. (Pre-existing E501 line-length warnings in the test file are not introduced by this PR and left untouched per minimal-change scope.)

Phase 3 — Verdict

Approve. All reviewer-flagged issues resolved. Tests: 41 passed (test_server_retry_after, test_outbound_retry, test_delivery_imports, test_dlq, test_outbox_reconcile). New outbox-drain test confirmed to fail without the _outbox.py persistence fix (raw 429 text → None; with hint → 50.0), proving the change is necessary and correct. Ruff F841/RUF059 clean on the touched test.

Note: I cannot formally approve via GitHub review (security restriction), but from an architecture standpoint this is ready to merge.

Job run · Branch

- _delivery: only penalise the rate limiter on a server-mandated wait, not on generic backoff after ordinary transient errors (Greptile P1 + CodeRabbit) - _rate_limit: anchor refill clock to a global penalty's end so queued sends stay staggered instead of firing together when the hold-off expires (CodeRabbit) - _outbox: persist the structured server Retry-After hint from the live exception into the stored error text so the durable retry gate honours it (Greptile P1 + CodeRabbit); surface transient drain failures via a recoverable _TransientDeliveryError so the hint is not lost - tests: assert deliver_with_retry sleeps the mandated 5s, fix RUF059 unused var, add OutboundQueue.drain gating coverage Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-28T09:35:30Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions
SDK value: review in depth whether the change genuinely adds value to the SDK — never add features for the sake of adding them. It must strengthen the SDK (simpler, more user-friendly, robust, world-class, secure). If it does not clearly add value, request changes or recommend rejecting/closing rather than merging scope creep
Do not bloat the Agent class with additional params — only if absolutely required; we already support many params.

Phase 2: FIX Valid Issues
5. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
6. Push all code fixes directly to THIS branch (do NOT create a new PR)
7. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
8. If all issues are resolved, approve the PR / close the Issue
9. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-28T09:35:54Z

Claude finished @MervinPraison's task —— View job

Final Architecture Review — PR #2431 ✅

Read AGENTS.md & current branch state
Verify reviewer fixes are present (Greptile/CodeRabbit)
Run tests (41 passed)
Final verdict

Pushed 8dd601a8 (one residual lint fix); all prior reviewer fixes from f0bb0aad verified present.

Phase 1 — Architecture (per AGENTS.md)

Check	Result
Routing	✅ Wrapper-only (`src/praisonai/praisonai/bots/`). No `praisonaiagents` core / `agent.py` changes, no new Agent params, no new deps (`re` is stdlib).
Backward compatible	✅ `rate_limiter=` is optional; `server_retry_after` returns `None` when no hint → falls through to existing backoff; outbox schema unchanged. Zero regressions.
Performance	✅ `server_retry_after` only runs on the error/retry path, never the hot send path. Penalty windows are O(1) dict lookups, bounded to 4096 entries (LRU eviction).
SDK value	✅ Genuine. Honouring a platform's explicit `Retry-After` / Telegram `flood_wait` avoids the 429 ban-loops generic exponential backoff triggers. Not scope creep.

Phase 2 — Reviewer findings verified against current code

#	Finding (source)	Status
1	Penalise lane only on server-mandated wait, not generic backoff (Greptile P1 + CodeRabbit Major)	✅ Fixed — `_delivery.py:373` gates on `mandated is not None`
2	Global penalty collapses token spacing → burst on expiry (CodeRabbit Major)	✅ Fixed — `_rate_limit.py:124` anchors `_last_refill` to penalty end
3	Serialized errors lose structured Retry-After (Greptile P1 + CodeRabbit Major)	✅ Fixed — `_outbox.py:416-418` persists the live exception's hint into the error text; `drain()` recovers it via `server_retry_after` text-fallback (`_outbox.py:356`)
4	Test must assert wait duration, not just that `penalise` was called (CodeRabbit Major)	✅ Fixed — `test:230` asserts `slept == [~5.0]`
5	Ruff RUF059 unused `err` (CodeRabbit Minor)	✅ Fixed — renamed `_err` (`test:227`)
6	Module claimed `OutboundQueue.drain` coverage but didn't test it (CodeRabbit Nitpick)	✅ Fixed — `test_outbox_drain_gates_next_retry_by_server_hint` (`test:237`)
7	Ruff F401 unused `AsyncMock` import (newly introduced by this PR)	✅ Fixed this run — `8dd601a8`

Skipped (with reason):

_resilience.py:401 last_error = e F841 — pre-existing (PR fix: make wrapper framework selection, tool sources, and managed backends extensible #2363, d05d0f62), not introduced here; left untouched per minimal-change scope.
Pre-existing E501 line-length nits in the test file — not introduced by this PR.

Phase 3 — Verdict

Approve (architecturally). All reviewer-flagged issues are resolved and verified in the current tree.

Tests: 41 passed (test_server_retry_after, test_outbound_retry, test_delivery_imports, test_dlq, test_outbox_reconcile).
Ruff F401/F841/RUF059 clean on the touched test file.

Note: I cannot submit a formal GitHub approval (security restriction), but from an architecture standpoint this is ready to merge.

Job run · Branch

Resolves the Ruff F401 introduced by the new test module. Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-28T09:38:35Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions
SDK value: review in depth whether the change genuinely adds value to the SDK — never add features for the sake of adding them. It must strengthen the SDK (simpler, more user-friendly, robust, world-class, secure). If it does not clearly add value, request changes or recommend rejecting/closing rather than merging scope creep
Do not bloat the Agent class with additional params — only if absolutely required; we already support many params.

Phase 2: FIX Valid Issues
5. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
6. Push all code fixes directly to THIS branch (do NOT create a new PR)
7. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
8. If all issues are resolved, approve the PR / close the Issue
9. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-28T09:38:52Z

Claude finished @MervinPraison's task —— View job

Final Architecture Review — PR #2431 ✅

Read AGENTS.md + current branch state
Verify prior reviewer fixes still present in tree
Run tests (41 passed)
Final verdict

Branch tip 8dd601a8 — verified clean. This is the third review pass; all reviewer-flagged issues from Greptile, CodeRabbit, and Qodo were already fixed in f0bb0aad/8dd601a8. I re-verified each against the current tree and no new code changes were required.

Phase 1 — Architecture (per AGENTS.md)

Check	Result
Routing	✅ Wrapper-only (`src/praisonai/praisonai/bots/`). No `praisonaiagents` core / `agent.py` changes, no new Agent params, no new deps (`re`, `email.utils` are stdlib).
Backward compatible	✅ `rate_limiter=` is optional; `server_retry_after` returns `None` when no hint → falls through to existing backoff; outbox schema unchanged; `_TransientDeliveryError` is internal. Zero regressions.
Performance	✅ `server_retry_after` runs only on the error/retry path, never the hot send path. Penalty windows are O(1) dict ops, LRU-bounded to 4096 entries.
SDK value	✅ Genuine. Honouring a platform's explicit `Retry-After` / Telegram `flood_wait` avoids 429 ban-loops that generic exponential backoff triggers. Not scope creep.

Phase 2 — Reviewer findings verified against current code

#	Finding (source)	Status
1	Penalise lane only on server-mandated wait, not generic backoff (Greptile P1 + CodeRabbit Major)	✅ `_delivery.py:373` — `if mandated is not None and rate_limiter is not None and delay > 0`
2	Global penalty collapses token spacing → burst on expiry (CodeRabbit Major)	✅ `_rate_limit.py:124` — `self._last_refill = max(self._last_refill, now + global_wait)`
3	Serialized errors lose structured Retry-After (Greptile P1 + CodeRabbit Major)	✅ `_outbox.py:416-418` persists live exception's hint as `[retry_after: N]`; `drain` gate at `_outbox.py:356` recovers via text fallback; `_delivery.py:636` raises `_TransientDeliveryError` so the hint survives
4	Test must assert wait duration, not just `penalise` called (CodeRabbit Major)	✅ `test:229` — `assert slept == [pytest.approx(5.0, abs=0.01)]`
5	Ruff RUF059 unused `err` (CodeRabbit Minor)	✅ `test:226` — renamed `_err`
6	Module claimed `OutboundQueue.drain` coverage but didn't test it (CodeRabbit Nitpick)	✅ `test:236` — `test_outbox_drain_gates_next_retry_by_server_hint`
7	Ruff F401 unused `AsyncMock` import	✅ Fixed in `8dd601a8`

Skipped (with reason):

_resilience.py:401 last_error = e F841 — pre-existing (PR fix: make wrapper framework selection, tool sources, and managed backends extensible #2363), not introduced here; left untouched per minimal-change scope.
Pre-existing E501 line-length nits in the test file — not introduced by this PR.

Phase 3 — Verdict

Approve (architecturally). Ready to merge. All reviewer-flagged issues are resolved and verified in the current tree; no further code changes needed.

Tests: 41 passed (test_server_retry_after, test_outbound_retry, test_delivery_imports, test_dlq, test_outbox_reconcile) in 0.63s.
Lint fixes (F401/F841/RUF059) verified present by inspection (ruff not installed in this runner).

Note: I cannot submit a formal GitHub approval (security restriction), but from an architecture standpoint this is ready to merge.

Job run · Branch

greptile-apps · 2026-06-28T09:41:02Z

+            # Retry-After hint carried in that text. Returning False would store
+            # a generic "Delivery returned false" and lose the hint.
+            if not success and error:
+                raise _TransientDeliveryError(error)


Retry hint lost When drain_pending() retries an outbox entry, it calls deliver_with_retry() with one attempt. A server throttle exception is caught there and converted into a plain error string, and this line wraps only that string in _TransientDeliveryError. The outbox then sees the wrapper instead of the original Telegram or HTTP exception, so fields like .retry_after, .parameters, and response.headers are gone. Common messages such as Retry in 30 seconds or 429 Too Many Requests do not match the text fallback, so the stored error lacks a recoverable retry_after hint and the next drain can retry after the local backoff instead of the server-mandated wait.

praisonai-triage-agent Bot mentioned this pull request Jun 28, 2026

Bot rate limiting/retries ignore server-provided Retry-After (Telegram flood_wait, HTTP 429) #2427

Open

MervinPraison added pipeline/blocked:ci Blocked: CI not green on HEAD pipeline/blocked:no-final Blocked: no FINAL @claude trigger yet pipeline/final-claude-pending Reviews done; waiting for FINAL @claude labels Jun 28, 2026

greptile-apps Bot reviewed Jun 28, 2026

View reviewed changes

Comment thread src/praisonai/praisonai/bots/_outbox.py

Comment thread src/praisonai/praisonai/bots/_delivery.py Outdated

coderabbitai Bot reviewed Jun 28, 2026

View reviewed changes

chore: drop unused AsyncMock import in server retry-after test

8dd601a

Resolves the Ruff F401 introduced by the new test module. Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

greptile-apps Bot reviewed Jun 28, 2026

View reviewed changes

MervinPraison removed the pipeline/blocked:cooldown Blocked: post-push or @claude cooldown label Jun 28, 2026

	success, err = asyncio.run(run())
	success, _err = asyncio.run(run())

Uh oh!

Conversation

praisonai-triage-agent Bot commented Jun 28, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Layer placement

Test plan

Summary by CodeRabbit

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

qodo-code-review Bot commented Jun 28, 2026

Qodo reviews are paused for this user.

Uh oh!

coderabbitai Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

greptile-apps Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 28, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Final Architecture Review — PR #2431 ✅

Phase 1 — Architecture (per AGENTS.md)

Phase 2 — Fixes applied (reviewer findings verified valid)

Phase 3 — Verdict

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Final Architecture Review — PR #2431 ✅

Phase 1 — Architecture (per AGENTS.md)

Phase 2 — Reviewer findings verified against current code

Phase 3 — Verdict

Uh oh!

MervinPraison commented Jun 28, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Final Architecture Review — PR #2431 ✅

Phase 1 — Architecture (per AGENTS.md)

praisonai-triage-agent Bot commented Jun 28, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 28, 2026 •

edited

Loading

coderabbitai Bot commented Jun 28, 2026 •

edited

Loading

greptile-apps Bot commented Jun 28, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 28, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 28, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 28, 2026 •

edited

Loading