feat(memory): add token budgeting, fact summaries, and similarity ranking #975
base: develop
Conversation
Force-pushed from b1e667e to 00e7086.
Hi @fazlerahmanejazi! Could you please resolve the conflicts? :)
Force-pushed …mory-enhancements: 99ce88e to 784ab2d, then 784ab2d to b742e55.
Done @aozherelyeva, ready for review!
Thanks for your pull request and contribution!
I see the idea as very valuable and practical, though I have some design suggestions to consider.
AgentMemoryProvider is already an interface that defines how to retrieve facts, and it's used by the AgentMemory feature. So I propose moving all the additional filters, embedders and summarizers into an abstract class or an implementation of that AgentMemoryProvider interface. Basically, you'd need to override the suspend fun load(concept: Concept, subject: MemorySubject, scope: MemoryScope): List<Fact> and suspend fun save(fact: Fact, subject: MemorySubject, scope: MemoryScope) methods for that.
load can do the ranking, summarizing and filtering.
Additionally, please consider using ai.koog.rag.base.RankedDocumentStorage (https://docs.koog.ai/ranked-document-storage/) for ranking -- it's a generic interface that can be implemented via LLM embeddings plus local storage as well as with vector databases (ai.koog.rag.vector.EmbeddingBasedDocumentStorage).
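As a rough sketch of that suggestion, a decorator can delegate persistence to an existing provider and layer ranking and trimming on top of load. The types below are simplified, non-suspend stand-ins for the real Koog interfaces; all names and signatures here are illustrative assumptions, not the library's actual API:

```kotlin
// Simplified, non-suspend stand-ins for Koog's types; the real
// AgentMemoryProvider methods are suspend functions with richer parameters.
data class Fact(val text: String, val score: Double = 0.0)

interface AgentMemoryProvider {
    fun load(concept: String): List<Fact>
    fun save(fact: Fact)
}

// Decorator: delegates persistence, layers ranking and trimming on load().
class SmartAgentMemoryProvider(
    private val delegate: AgentMemoryProvider,
    private val maxFacts: Int = 5,
    private val rank: (concept: String, fact: Fact) -> Double,
) : AgentMemoryProvider {
    override fun load(concept: String): List<Fact> =
        delegate.load(concept)
            .map { it.copy(score = rank(concept, it)) }
            .sortedByDescending { it.score }
            .take(maxFacts)

    override fun save(fact: Fact) = delegate.save(fact)
}

// Trivial in-memory delegate for demonstration.
class InMemoryProvider : AgentMemoryProvider {
    private val facts = mutableListOf<Fact>()
    override fun load(concept: String): List<Fact> = facts.toList()
    override fun save(fact: Fact) { facts.add(fact) }
}
```

The key design point is that callers keep the plain AgentMemoryProvider contract; ranking only applies when the decorator is in the chain.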
 *
 * Implementations can wrap existing embedding infrastructure or provide lightweight in-memory behaviour.
 */
public interface EmbeddingProvider {
Could you please elaborate why ai.koog.embeddings.base.Embedder (and classes that implement it) doesn't fit here?
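For context, if the new EmbeddingProvider is only needed to score similarity, it could be a thin adapter over an Embedder-style interface rather than a parallel abstraction. The signatures below are assumptions for illustration only (Koog's real ai.koog.embeddings.base.Embedder is suspend-based and works with its own Vector type):

```kotlin
import kotlin.math.abs

// Assumed shapes for illustration; not the real ai.koog signatures.
interface Embedder {
    fun embed(text: String): DoubleArray
    fun diff(a: DoubleArray, b: DoubleArray): Double // distance, lower = closer
}

interface EmbeddingProvider {
    fun similarity(a: String, b: String): Double     // higher = closer
}

// Adapter: reuse an existing Embedder instead of a second embedding interface.
class EmbedderBackedProvider(private val embedder: Embedder) : EmbeddingProvider {
    override fun similarity(a: String, b: String): Double =
        1.0 / (1.0 + embedder.diff(embedder.embed(a), embedder.embed(b)))
}
```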
    }
}

private fun estimateTokens(text: String): Int = max(1, text.length / 4 + 1)
Please consider checking ai.koog.prompt.tokenizer.Tokenizer interface (there's already a simple regex-based tokenizer implementation, as well as TiktokenEncoder)
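For comparison, a minimal sketch of moving the length/4 heuristic behind a pluggable tokenizer interface; the shape of the real ai.koog.prompt.tokenizer.Tokenizer may differ, so treat the interface below as an assumption:

```kotlin
import kotlin.math.max

// Assumed minimal shape for illustration; the real
// ai.koog.prompt.tokenizer.Tokenizer interface may differ.
fun interface Tokenizer {
    fun countTokens(text: String): Int
}

// The length/4 heuristic from the diff, expressed behind the interface.
val heuristic = Tokenizer { text -> max(1, text.length / 4 + 1) }

// A simple regex-based alternative: count word and punctuation chunks.
val regexTokenizer = Tokenizer { text ->
    max(1, Regex("""\w+|[^\w\s]""").findAll(text).count())
}
```

Making the tokenizer a constructor parameter lets callers swap the cheap heuristic for a model-accurate encoder without touching the budgeting logic.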
@Ololoshechkin thanks again for the thoughtful review. I’ve pushed a set of changes that address each point.
Happy to dig into any of the details!
Motivation and Context
Support configurable token budgets, summaries, and similarity ranking so memory injections stay concise and relevant.
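As a sketch of the token-budgeting idea described here (illustrative names, not the PR's actual code): rank facts first, then greedily keep them while a running token estimate stays within the budget.

```kotlin
import kotlin.math.max

// Hypothetical sketch of budget-aware fact selection; the function and
// parameter names are illustrative, not taken from the PR.
fun estimateTokens(text: String): Int = max(1, text.length / 4 + 1)

// Greedily keep ranked facts while the running token estimate fits the budget.
fun selectWithinBudget(rankedFacts: List<String>, tokenBudget: Int): List<String> {
    val selected = mutableListOf<String>()
    var used = 0
    for (fact in rankedFacts) {          // assumed sorted most-relevant first
        val cost = estimateTokens(fact)
        if (used + cost > tokenBudget) break
        selected.add(fact)
        used += cost
    }
    return selected
}
```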
Changes are layered on top of AgentMemory via the SmartAgentMemoryProvider decorator and shared FactReplayProcessor (uses Embedder, Tokenizer, RankedDocumentStorage; defaults keep legacy behaviour when the decorator isn’t applied). Covered by new tests (AgentMemoryEnhancementsTest, AgentMemoryEnrichmentTest).
Breaking Changes
No breaking changes: new controls are optional and preserve existing defaults.
Type of the changes
Checklist
develop as the base branch
Additional steps for pull requests adding a new feature