Awesome LLM Research Collections
  • Home
  • Papers
    • Attention
    • LLMs
    • Multimodal LLMs
    • Embeddings
    • SFT
    • Training
    • Reinforcement Learning
    • Agents Application
    • Vision
    • Auto-Prompt
  • Notes
  • Blogs
  • English
  • 中文

Auto-Prompt

Prompt optimization, evaluator prompting, prompt ensembles, and test-time prompt learning.
中文

Research category

Prompt optimization, evaluator prompting, prompt ensembles, and test-time prompt learning.

3Papers
4Resource links
2025.12Latest month
Prompt Optimization Judge Prompting

1 paper

Prompt Optimization

2025.07 Prompt Optimization

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

GEPA introduces a prompt optimizer that uses natural language reflection to learn high-level rules from trial and error, outperforming GRPO by 6% on average with up to 35x fewer rollouts. It also beats MIPROv2 by over 10% and shows promising results as an inference-time search strategy for code optimization.

Paper Code

2 papers

Judge Prompting

2025.12 Judge Prompting

Becoming Experienced Judges: Selective Test-Time Learning for Evaluators

This paper introduces Learning While Evaluating (LWE), enabling LLM-as-a-judge systems to improve sequentially at inference time by updating an evolving meta-prompt with self-generated feedback. It further proposes Selective LWE, which updates only on self-inconsistent cases to improve evaluation quality with better cost efficiency.

Paper
2025.10 Judge Prompting

Auto-Prompt Ensemble for LLM Judge

APE improves LLM-as-a-judge reliability by automatically discovering auxiliary evaluation dimensions from failure cases and ensembling them with confidence-aware selection. It boosts agreement with human-aligned benchmarks by using test-time computation more effectively.

Paper
  • View source
  • Report an issue