awesome-agentic-patterns/patterns/self-rewriting-meta-prompt-loop.md at main · nibzard/awesome-agentic-patterns

title

Self-Rewriting Meta-Prompt Loop

status

emerging

authors

Nikola Balic (@nibzard)

based_on

Noah D. Goodman (Meta-Prompt)

Problem

Static system prompts become stale or overly brittle as an agent encounters new tasks and edge-cases. Manually editing them is slow and error-prone.

Solution

Let the agent rewrite its own system prompt after each interaction:

Reflect on the latest dialogue or episode.
Draft improvements to the instructions (add heuristics, refine tool advice, retire bad rules).
Validate the draft (internal sanity-check or external gate).
Replace the old system prompt with the revised version; persist in version control.
Use the new prompt on the next episode, closing the self-improvement loop.

# pseudo-code
dialogue = run_episode()
delta = LLM("Reflect on dialogue and propose prompt edits", dialogue)
if passes_guardrails(delta):
    system_prompt += delta
    save(system_prompt)

Evidence

Evidence Grade: high (academic), low (direct production implementation)
Key Findings: Strong academic foundation from Reflexion, APE, Self-Refine, DSPy, and Constitutional AI. Direct autonomous implementations are rare in production due to safety concerns (drift, jailbreak risk). Industry prefers hybrid approaches with guardrails and human oversight.
Best Practice: Pair with canary rollouts, multi-layer guardrails, and version control integration.

How to use it

Best for low-risk domains with high-volume, well-defined workflows (e.g., formatting, style)
Requires strong guardrails: structural validation, intent preservation checks, change magnitude limits
Include version control integration and rollback capability
Consider dual-agent architecture (executor + critic) for safer delta generation
Avoid in safety-critical or high-regulation domains without human approval gates

Trade-offs

Pros: Rapid adaptation; data-driven improvements; no training infrastructure required.

Cons: Risk of drift or jailbreak; prompt bloat; oscillation and instability.

References

Goodman, Meta-Prompt: A Simple Self-Improving Language Agent. (noahgoodman.substack.com)
Shinn et al., Reflexion: Language Agents with Verbal Reinforcement Learning. arXiv:2303.11366 (2023)
Madaan et al., Self-Refine: Large Language Models Can Self-Correct. arXiv:2303.05125 (2023)
Khattab et al., DSPy: Declarative Self-Improving Language Programs. (github.com/stanfordnlp/dspy)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem

Solution

Evidence

How to use it

Trade-offs

References

FilesExpand file tree

self-rewriting-meta-prompt-loop.md

Latest commit

History

self-rewriting-meta-prompt-loop.md

File metadata and controls

Problem

Solution

Evidence

How to use it

Trade-offs

References