Bugfix: Add max_line_length to search_code_advanced to prevent token flooding #22

marksolly · 2025-08-08T06:39:21Z

Introduces a new max_line_length parameter to the search_code_advanced tool to address an issue where search results containing very long lines (e.g., from minified JavaScript files) could lead to excessive token usage and unexpected behavior when consumed by Large Language Models (LLMs).

Problem:

When using the search_code_advanced tool with context_lines, if a match is found in a file with extremely long lines (such as a .min.js file), the entire line is returned as context. This can result in a massive, unexpected output, leading to:

Token Flooding: The large output consumes a significant number of tokens, which can be costly and inefficient.
LLM Input Issues: The excessive input can confuse or overwhelm LLMs, leading to poor or irrelevant responses.
Performance Degradation: Processing and transmitting large amounts of unnecessary data can slow down the entire workflow.

Solution:

This PR introduces a max_line_length parameter to the search_code_advanced tool, which defaults to 200 characters. This parameter truncates any line in the search results that exceeds the specified length, appending ... (truncated) to indicate that the line has been shortened.

Key Changes:

src/code_index_mcp/server.py: The search_code_advanced tool now accepts a max_line_length parameter with a default value of 200.
src/code_index_mcp/services/search_service.py: The search_code method now accepts and passes the max_line_length parameter to the underlying search strategies.
src/code_index_mcp/search/base.py: The parse_search_output function now includes logic to truncate lines based on max_line_length. The SearchStrategy abstract base class has been updated to include this parameter in the search method.
src/code_index_mcp/search/*.py: All concrete SearchStrategy implementations (ugrep, ripgrep, ag, grep, and basic) have been updated to accept and utilize the max_line_length parameter.

Limit response line length in search_code_advanced

4ed55d9

marksolly mentioned this pull request Aug 8, 2025

Results may contain extremely long lines marksolly/code-index-mcp#2

Closed

johnhuang316 force-pushed the master branch from 98e22c8 to 6f7f8d0 Compare August 11, 2025 05:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bugfix: Add max_line_length to search_code_advanced to prevent token flooding #22

Bugfix: Add max_line_length to search_code_advanced to prevent token flooding #22

Uh oh!

marksolly commented Aug 8, 2025

Uh oh!

Uh oh!

Bugfix: Add max_line_length to search_code_advanced to prevent token flooding #22

Are you sure you want to change the base?

Bugfix: Add max_line_length to search_code_advanced to prevent token flooding #22

Uh oh!

Conversation

marksolly commented Aug 8, 2025

Uh oh!

Uh oh!