Appeng 3801-A agent performance fixes - tool use stability #132

etsien · 2025-10-19T22:05:33Z

renamed tools - emphasis on concise, distinct names so the model picks the right tool
reworked tool descriptions - shortened by ~60% and removed conflicting text, to help the model send the right parameters
refactored scripts and config files to reference tools through constants instead of string literals
added unit tests to check tool names, their uniqueness, and proper referencing of tools in scripts
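
A minimal sketch of the constants-plus-uniqueness-test idea in the list above. The class name `ToolNames` and the members `CODE_SEMANTIC_SEARCH` and `CODE_KEYWORD_SEARCH` appear later in this thread; the string values, the `CVE_WEB_SEARCH` member, and both helper functions are assumptions made for illustration, not the code in this PR.

```python
class ToolNames:
    """Hypothetical stand-in for vuln_analysis.tools.tool_names.ToolNames."""

    CODE_SEMANTIC_SEARCH = "Code Semantic Search"  # assumed value
    CODE_KEYWORD_SEARCH = "Code Keyword Search"    # assumed value
    CVE_WEB_SEARCH = "CVE Web Search"              # assumed member name


def all_tool_names(cls=ToolNames) -> list[str]:
    """Collect every upper-case string constant defined on the class."""
    return [
        value
        for name, value in vars(cls).items()
        if name.isupper() and isinstance(value, str)
    ]


def find_duplicates(names: list[str]) -> set[str]:
    """Return names that collide once case and surrounding whitespace are ignored."""
    seen: set[str] = set()
    duplicates: set[str] = set()
    for name in names:
        key = name.strip().lower()
        if key in seen:
            duplicates.add(name)
        seen.add(key)
    return duplicates


def test_tool_names_are_unique() -> None:
    # Fails as soon as two constants resolve to the same visible name.
    assert find_duplicates(all_tool_names()) == set()
```

Comparing normalized names is a design choice of this sketch: two names that differ only in case or spacing are treated as a collision, since they would be hard for the model to tell apart.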

zvigrinberg · 2025-10-20T05:32:11Z

/ok-to-test

zvigrinberg · 2025-10-20T05:50:09Z

/retest

zvigrinberg · 2025-10-20T05:52:02Z

/retest vulnerability-analysis-on-pr

zvigrinberg

@etsien
It looks very good and very promising.
Let's wait for the CM results to be published so we can verify the improvements before approving and merging.

zvigrinberg · 2025-10-21T05:59:54Z

src/vuln_analysis/tools/transitive_code_search.py

+        description=(
+            "Checks if a function from a package is reachable from application code through the call chain. "
+            "Input format: 'package_name,function_name' (comma-separated). "
+            "Example: 'urllib,parse'. "
+            "Returns: (is_reachable: bool, call_hierarchy_path: list)."
+        )
+    )


@etsien I asked Shimon to extend the tool description to support more cases, since the target is not always just a function inside a package. For example, in Python it is sometimes a method of a class (prefixed or not by the containing module, depending on the import statement) that should be checked for whether it is called from source code, and then the format could sometimes be , ()... He should do this as an addition to his transitive code search support work for the C programming language.

Please coordinate with him, as he is currently making changes to the prompts and tool descriptions, and I believe he has contacted you to consult with you. Please make sure that his changes follow tool-calling and prompt best practices and that they are consistent with your changes (possibly after tailoring them accordingly).
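
For reference, a hedged sketch of how the comma-separated input from the description above ('package_name,function_name') could be validated before the search runs. `SearchTarget` and `parse_tool_input` are hypothetical names, not part of `transitive_code_search.py`, and the extended formats for class methods are not modeled here because they have not been settled in this thread.

```python
from typing import NamedTuple


class SearchTarget(NamedTuple):
    """Hypothetical container for the two parts of the tool input."""

    package: str
    function: str


def parse_tool_input(raw: str) -> SearchTarget:
    """Split 'package_name,function_name' into its two parts.

    Surrounding whitespace and quotes are tolerated, since a model may
    copy the quoted example from the description literally.
    """
    cleaned = raw.strip().strip("'\"")
    parts = [part.strip() for part in cleaned.split(",")]
    if len(parts) != 2 or not all(parts):
        raise ValueError(
            f"Expected 'package_name,function_name', got: {raw!r}"
        )
    return SearchTarget(package=parts[0], function=parts[1])
```

Rejecting malformed input with a message that repeats the expected format gives the agent something concrete to correct on its next tool call.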

zvigrinberg · 2025-10-21T13:21:35Z

@etsien I found a bug that eliminates all tools except CVE Web Search from the tools list.
Please apply this patch to the head branch of this PR:

diff --git a/src/vuln_analysis/functions/cve_generate_vdbs.py b/src/vuln_analysis/functions/cve_generate_vdbs.py
index 7772e88..2a9dc31 100644
--- a/src/vuln_analysis/functions/cve_generate_vdbs.py
+++ b/src/vuln_analysis/functions/cve_generate_vdbs.py
@@ -27,6 +27,7 @@ from aiq.data_models.function import FunctionBaseConfig
 from pydantic import Field
 
 from vuln_analysis.logging.loggers_factory import LoggingFactory, trace_id
+from vuln_analysis.tools.tool_names import ToolNames
 
 logger = LoggingFactory.get_agent_logger(__name__)
 
@@ -69,11 +70,11 @@ async def generate_vdb(config: CVEGenerateVDBsToolConfig, builder: Builder):
     assert isinstance(agent_config, CVEAgentExecutorToolConfig)
 
     # Update config based on tools available in agent config
-    if "Container Image Code QA System" not in agent_config.tool_names:
+    if ToolNames.CODE_SEMANTIC_SEARCH not in agent_config.tool_names:
         logger.info("Container Image Code QA System tool is not enabled, setting ignore_code_embedding to True")
         config.ignore_code_embedding = True
 
-    if "Lexical Search Container Image Code QA System" not in agent_config.tool_names:
+    if ToolNames.CODE_KEYWORD_SEARCH not in agent_config.tool_names:
         logger.info(
             "Lexical Search Container Image Code QA System tool is not enabled, setting ignore_code_index to True")
         config.ignore_code_index = True

Please also apply it to the other PR #134 that was branched out from this head branch
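
A small sketch of why the patch above matters, under stated assumptions: the two old literals are taken from the removed lines of the diff, while the new string values and the helper `vdb_flags` are hypothetical. Once the tools are renamed, a membership check against the old literal can never match, so both ignore flags are set even though the tools are enabled.

```python
class ToolNames:
    """Hypothetical stand-in; the real values live in tool_names.py."""

    CODE_SEMANTIC_SEARCH = "Code Semantic Search"  # assumed value
    CODE_KEYWORD_SEARCH = "Code Keyword Search"    # assumed value


# Literals from the removed lines of the diff (pre-rename tool names).
OLD_SEMANTIC = "Container Image Code QA System"
OLD_KEYWORD = "Lexical Search Container Image Code QA System"


def vdb_flags(enabled_tools: list[str], semantic_name: str, keyword_name: str) -> dict[str, bool]:
    """Mirror the two checks in generate_vdb: a tool missing from the list sets its ignore flag."""
    return {
        "ignore_code_embedding": semantic_name not in enabled_tools,
        "ignore_code_index": keyword_name not in enabled_tools,
    }


# The agent config lists the renamed tools.
enabled = [ToolNames.CODE_SEMANTIC_SEARCH, ToolNames.CODE_KEYWORD_SEARCH]

# Stale literals never match, so both flags are wrongly set.
stale = vdb_flags(enabled, OLD_SEMANTIC, OLD_KEYWORD)

# Shared constants keep the check in step with any future rename.
fixed = vdb_flags(enabled, ToolNames.CODE_SEMANTIC_SEARCH, ToolNames.CODE_KEYWORD_SEARCH)
```

With the assumed values, `stale` has both flags set to `True` and `fixed` has both set to `False`.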

adding in constants during the vdb generation check

etsien · 2025-10-21T18:06:24Z

...
Please also apply it to the other PR #134 that was branched out from this head branch

Bugfix pushed to both PR branches

etsien · 2025-10-28T16:20:15Z

Closing this PR in favor of APPENG-3801-B instead, which has these changes and other agent/tool changes.

etsien added 6 commits October 2, 2025 16:13

bugfix in the testing env

a1834bd

update tool descriptions for clarity

1d79035

refactor tool names to be class constants instead of disparate strings

6864fa7

add initial unit tests

e05ea7a

rename tool names to be more consistent and distinct

8893f5c

update unit tests with tool names and tool constants

dd18463

etsien marked this pull request as draft October 19, 2025 22:06

etsien changed the base branch from main to rh-aiq-main October 19, 2025 22:06

etsien marked this pull request as ready for review October 19, 2025 22:06

cleanup startup guide notebook

4efffcd

etsien mentioned this pull request Oct 20, 2025

APPENG-3801-B - Agent performance fixes - all agent stages #134

Open

zvigrinberg reviewed Oct 21, 2025


bug patch for vdb generation

476c805

adding in constants during the vdb generation check

bugfix by Tamar

9c284c3

etsien closed this Oct 28, 2025
