shankeleven

Solves #13

Regex patterns have been modified to be Unicode-aware.

I went through the regex patterns and thought it was feasible to include tokens from other languages by making the patterns Unicode-aware:

[A-Za-z]+ -> \p{L}+
[0-9]+ -> \p{N}+ (or \p{Nd}+)
[^A-Za-z0-9\s]+ -> [^\p{L}\p{N}\s]+

For example, in "Hello, 世界!", \p{L}+ matches "Hello" and "世界".

Added tests for different languages.
Added a new dependency, regex, because it handles Unicode better than re (the standard-library re module does not support \p{...} property classes).
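
To make the three classes above concrete, here is a minimal standard-library sketch that emulates them with `unicodedata`. It is not the code in this PR, which uses the third-party regex package. The helper names `char_class` and `tokenize` are hypothetical and exist only for illustration. With the regex package installed, `regex.findall(r"\p{L}+|\p{N}+|[^\p{L}\p{N}\s]+", text)` should give comparable results.

```python
import unicodedata
from itertools import groupby


def char_class(ch: str) -> str:
    """Map a character to the class the Unicode-aware patterns would put it in."""
    category = unicodedata.category(ch)
    if category.startswith("L"):   # roughly \p{L}: any letter, any script
        return "letter"
    if category.startswith("N"):   # roughly \p{N}: any numeric character
        return "number"
    if ch.isspace():               # roughly \s: whitespace, dropped below
        return "space"
    return "other"                 # roughly [^\p{L}\p{N}\s]: punctuation, symbols


def tokenize(text: str) -> list[str]:
    """Group consecutive characters of the same class into tokens (hypothetical helper)."""
    tokens = []
    for kind, group in groupby(text, key=char_class):
        if kind != "space":
            tokens.append("".join(group))
    return tokens


print(tokenize("Hello, 世界! 123"))
```

An ASCII-only pattern such as `[A-Za-z]+` would never capture "世界" as a letter token, which is the gap the Unicode-aware classes are meant to close.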

aksg87 and others added 22 commits July 22, 2025 01:39
- Switch from badge.fury.io to shields.io for working PyPI badge
- Convert relative paths to absolute GitHub URLs for PyPI compatibility
- Bump version to 0.1.3
- Add GitHub Actions workflow for automated PyPI publishing via OIDC
- Configure trusted publishing environment for verified releases
- Update project metadata with proper URLs and license format
- Prepare for v1.0.0 stable release with production-ready automation
- Add pylibmagic>=0.5.0 dependency for bundled libraries
- Add [full] install option and pre-import handling
- Update README with troubleshooting and Docker sections
- Bump version to 1.0.1

Fixes google#6
Deleted an inline comment referencing the output directory in save_annotated_documents.
…ples.md

docs: clarify output_dir behavior in medication_examples.md
Prevents confusion from default `test_output/...` by explicitly saving to current directory.
docs: add output_dir="." to all save_annotated_documents examples
feat: add code formatting and linting pipeline
Introduces a common base exception class that all library-specific exceptions inherit from, enabling users to catch all LangExtract errors with a single except clause.
Add LangExtractError base exception for centralized error handling
Fixes google#25 - Windows installation failure due to pylibmagic build requirements

Breaking change: LangFunLanguageModel removed. Use GeminiLanguageModel or OllamaLanguageModel instead.
fix: Remove LangFun and pylibmagic dependencies to fix Windows installation and OpenAI SDK v1.x compatibility
- Modified save_annotated_documents to accept both pathlib.Path and string paths
- Convert string paths to Path objects before calling mkdir()
- This fixes the error when using output_dir='.' as shown in the README example
…-mkdir

Fix save_annotated_documents to handle string paths
feat: Add OpenAI language model support

github-actions bot commented Aug 6, 2025

Manual validation results:

Size: 205 lines
Template: ✗
Linked issue: ✗

Run ID: 16790882125


github-actions bot commented Aug 6, 2025

Manual validation results:

Size: 205 lines
Template: ✗
Linked issue: ✗

Run ID: 16791204003


github-actions bot commented Aug 6, 2025

Manual Validation Results

Status: ❌ Failed

PR Size: 205 lines
Template: Missing required sections
Linked Issue: Missing Fixes/Closes #XXX

Errors:

  • ❌ Missing PR template sections: # Description, Fixes #, # How Has This Been Tested?, # Checklist
  • ❌ No linked issue found



github-actions bot commented Aug 7, 2025

Manual Validation Results

Status: ❌ Failed

PR Size: 205 lines
Template: Missing required sections
Linked Issue: Missing Fixes/Closes #XXX

Errors:

  • ❌ Missing PR template sections: # Description, Fixes #, # How Has This Been Tested?, # Checklist
  • ❌ No linked issue found


@aksg87 aksg87 added the size/M Pull request with 150-600 lines changed label Aug 7, 2025

github-actions bot commented Aug 7, 2025

⚠️ Branch Update Required

Your branch is 24 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 51 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 86 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 98 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


github-actions bot commented Sep 7, 2025

⚠️ Branch Update Required

Your branch is 106 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 107 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 109 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


⚠️ Branch Update Required

Your branch is 110 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.


github-actions bot commented Oct 7, 2025

⚠️ Branch Update Required

Your branch is 111 commits behind main. Please update your branch to ensure CI checks run with the latest code:

git fetch origin main
git merge origin/main
git push

Note: Enable "Allow edits by maintainers" to allow automatic updates.

Labels

size/M Pull request with 150-600 lines changed

3 participants