Skip to content

Conversation

petern48
Copy link
Contributor

What changes were proposed in this pull request?

Cache the converter between Arrow and pandas using memoization to avoid recreating duplicate ones unnecessarily.

Why are the changes needed?

Performance improvement

Does this PR introduce any user-facing change?

No

How was this patch tested?

Passes existing tests

Was this patch authored or co-authored using generative AI tooling?

No

@petern48 petern48 changed the title [SPARK-43579][PS] optim: Cache the converter between Arrow and pandas for reuse [SPARK-43579][PYTHON] optim: Cache the converter between Arrow and pandas for reuse Sep 13, 2025
@petern48 petern48 marked this pull request as ready for review September 13, 2025 03:35
@petern48
Copy link
Contributor Author

@xinrong-meng @ueshin

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants