Awesome LLM Research Collections

Curated LLM research map

A browsable Quarto edition of the repository README, organized for fast scanning across papers, projects, code, and model resources.

92 Papers

10 Categories

216 Resource links

2026.06 Latest month

Browse

Research Categories

4 papers

Attention

Transformer internals, attention variants, KV/cache behavior, and depth-wise information flow.

2026.06 Attention Architecture

18 papers

LLMs

Foundation model reports, inference methods, long-context language modeling, coding, and reasoning systems.

2026.05 Detection, Foundation Models, Inference

15 papers

Multimodal LLMs

Vision-language, video-language, and VLA research that connects perception with language reasoning.

2026.05 Multimodal Reasoning, VLA, Vision-Language

1 paper

Embeddings

Representation learning, retrieval, semantic matching, and embedding model research.

2025.06 Direct collection

3 papers

SFT

Supervised fine-tuning methods, data recipes, token weighting, and reasoning generalization studies.

2026.05 SFT Methods

2 papers

Training

Reusable training recipes, SFT methods, data selection, distillation, and optimization practice.

2026.05 Distillation, Optimization

32 papers

Reinforcement Learning

Reward modeling, RLHF-style optimization, reasoning RL, agent RL, and VLA policy learning.

2026.06 Agentic RL, Multimodal RL, OPD, Policy Optimization, Reasoning RL, Reward Modeling, VLA RL, Video Generation RL

13 papers

Agents Application

Agent systems, tool use, memory, AI research workflows, and reusable skill ecosystems.

2026.05 AI Research, Agent Development, Agent Skills, Memory, Tool Use

1 paper

Vision

Computer vision methods that are useful background for modern multimodal systems.

2022.03 Object Detection

3 papers

Auto-Prompt

Prompt optimization, evaluator prompting, prompt ensembles, and test-time prompt learning.

2025.12 Judge Prompting, Prompt Optimization

Fresh index

Recent Papers

2026.06 FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention (Attention)

2026.06 Rethinking the Divergence Regularization in LLM RL (Reinforcement Learning)

2026.05 GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding (Attention)

2026.05 The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence (LLMs)

2026.05 Base Models Look Human To AI Detectors (LLMs)

2026.05 Lance: Unified Multimodal Modeling by Multi-Task Synergy (Multimodal LLMs)

2026.05 Data Difficulty and the Generalization–Extrapolation Tradeoff in LLM Fine-Tuning (SFT)

2026.05 PowLU: An Activation Function for Stable Pre-Training of LLMs (Training)