Add iOS-OCR-Server provider support for local OCR processing #655
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR implements support for iOS-OCR-Server as a new OCR provider, enabling users to turn their iPhone or iPad into a powerful local OCR server using Apple's Vision Framework.
What's New
iOS-OCR-Server Provider (
ios_ocr
)Key Features
Configuration
Add the following environment variables to use iOS-OCR-Server:
Implementation Details
The implementation follows the established OCR provider patterns in the codebase:
Provider
interface withProcessImage()
methodretryablehttp.Client
for reliable communication with iOS devicesTesting
Documentation
Updated README.md with:
This addition expands paperless-gpt's OCR capabilities to include a privacy-focused, cost-effective option for users with iOS devices, complementing the existing cloud-based and self-hosted OCR providers.
Warning
Firewall rules blocked me from connecting to one or more addresses (expand for details)
I tried to connect to the following addresses, but was blocked by firewall rules:
esm.ubuntu.com
/usr/lib/apt/methods/https
(dns block)openaipublic.blob.core.windows.net
/tmp/go-build1921101790/b001/paperless-gpt.test -test.testlogfile=/tmp/go-build1921101790/b001/testlog.txt -test.paniconexit0 -test.timeout=10m0s
(dns block)If you need me to access, download, or install something from one of these locations, you can either:
Original prompt
This section details on the original issue you should resolve
<issue_title>New OCR Provider: iOS-OCR-Server</issue_title>
<issue_description>https://github.com/riddleling/iOS-OCR-Server
It has API, should be easy to integrated
I run it with the
LLM-Based OCR: Compare for Yourself
examples on the index.OCR Results
Example 1
Example 2