-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Comparing changes
Open a pull request
base repository: google/langextract
base: v1.0.5
head repository: google/langextract
compare: v1.0.6
- 17 commits
- 44 files changed
- 4 contributors
Commits on Aug 8, 2025
-
Remove duplicate exceptions.py from root directory (#94)
The exceptions.py file existed in both the root directory and langextract/ directory with identical content. This removes the duplicate from the root to avoid confusion and maintain proper package structure.
Configuration menu - View commit details
-
Copy full SHA for f3c1553 - Browse repository at this point
Copy the full SHA f3c1553View commit details -
Configuration menu - View commit details
-
Copy full SHA for 845258c - Browse repository at this point
Copy the full SHA 845258cView commit details -
Add provider registry infrastructure and custom provider plugin examp…
…le (#97) Introduces a provider registry system enabling third-party providers to be dynamically registered and discovered through a plugin architecture. Users can now integrate custom LLM backends (Azure OpenAI, AWS Bedrock, custom inference servers) without modifying core LangExtract code. Fixes #80, #67, #54, #49, #48, #53 Key Changes: **Provider Registry** (`langextract/providers/registry.py`) - Pattern-based registration with priority resolution - Automatic discovery via Python entry points - Lazy loading for performance **Factory Enhancements** (`langextract/factory.py`) - `ModelConfig` dataclass for structured configuration - Explicit provider selection when patterns overlap - Full backward compatibility maintained **Plugin Example** (`examples/custom_provider_plugin/`) - Complete working example with entry point configuration - Shows how to create custom providers for any backend **Documentation** - Comprehensive provider system README with architecture diagrams - Step-by-step plugin creation guide **Dependencies** - Move openai to optional dependencies - Update tox.ini to include openai in test environments **Lint Fixes** - Add appropriate pylint suppressions for legitimate patterns - Fix unused variable warnings in tests - Address import and global statement warnings No anticipated breakage - full backward compatibility maintained. Given significant internal changes to provider loading, issues should be reported if unexpected behavior is encountered.
Configuration menu - View commit details
-
Copy full SHA for 00acc43 - Browse repository at this point
Copy the full SHA 00acc43View commit details -
Configuration menu - View commit details
-
Copy full SHA for c8aa788 - Browse repository at this point
Copy the full SHA c8aa788View commit details
Commits on Aug 9, 2025
-
Update .gitignore with additional development patterns
Add common development files, tools, and temporary file patterns
Configuration menu - View commit details
-
Copy full SHA for f069d6f - Browse repository at this point
Copy the full SHA f069d6fView commit details -
Update custom provider example to clarify planned model passing feature
- Show current approach using factory.create_model() - Add note that direct model passing to extract() is coming soon - Keep planned API as commented code for reference
1Configuration menu - View commit details
-
Copy full SHA for 0c08fd1 - Browse repository at this point
Copy the full SHA 0c08fd1View commit details
Commits on Aug 11, 2025
-
Fix lazy loading for provider pattern registration (#113)
Ensure providers are loaded before pattern matching to prevent API key errors when using local models. Optimize to skip loading when provider is explicitly specified.
Configuration menu - View commit details
-
Copy full SHA for 1a25621 - Browse repository at this point
Copy the full SHA 1a25621View commit details -
Improve PR validation workflow based on expert review
- Add proper permissions (issues: write for comments) - Skip draft PRs to avoid noise - Prevent duplicate comments with hidden marker - Search both title and body for issue links - Support all keyword variants and cross-repo references - Count unique users for reactions, not total count - Include 'write' permission for maintainer override - Add concurrency control for rapid edits - Handle cross-repo issues gracefully
Configuration menu - View commit details
-
Copy full SHA for a209903 - Browse repository at this point
Copy the full SHA a209903View commit details -
Add tests for provider plugin system (#114)
- 6 tests: plugin discovery, loading, idempotency, error handling - Smart CI triggers for integration test on provider changes - New tox environments: plugin-smoke and plugin-integration
Configuration menu - View commit details
-
Copy full SHA for 8989620 - Browse repository at this point
Copy the full SHA 8989620View commit details -
Adding model and config parameters to extract() (#119)
* Adding model and config parameters to extract() * Adding extract precedence tests * Applying changes from #120
Configuration menu - View commit details
-
Copy full SHA for 1a24cd0 - Browse repository at this point
Copy the full SHA 1a24cd0View commit details
Commits on Aug 13, 2025
-
Refactor schema system to support provider plugins (#130)
Enable providers to define custom schema implementations via BaseSchema abstraction. Add property-based fence output, FormatModeSchema for JSON/YAML providers, and move GeminiSchema to providers/schemas/.
Configuration menu - View commit details
-
Copy full SHA for 77b7b95 - Browse repository at this point
Copy the full SHA 77b7b95View commit details -
Support Hugging Face style model IDs for Ollama provider (#131)
- Add HF-style patterns (e.g., meta-llama/Llama-3.2-1B-Instruct) to Ollama registry - Add gpt-oss pattern to support issue #116 - Add comprehensive documentation for using Ollama with extract() - Include example for direct provider instantiation when ID conflicts exist - Add test coverage for HF-style model ID patterns
Configuration menu - View commit details
-
Copy full SHA for 50ba182 - Browse repository at this point
Copy the full SHA 50ba182View commit details -
Add tests for Ollama format parameter handling
Test that format='json' is correctly passed to Ollama API
Configuration menu - View commit details
-
Copy full SHA for 031c41c - Browse repository at this point
Copy the full SHA 031c41cView commit details -
Update Ollama quickstart to use ModelConfig with JSON mode
Show both ModelConfig and direct model_id usage patterns
Configuration menu - View commit details
-
Copy full SHA for 38528bf - Browse repository at this point
Copy the full SHA 38528bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for e056a85 - Browse repository at this point
Copy the full SHA e056a85View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a91ce1 - Browse repository at this point
Copy the full SHA 6a91ce1View commit details -
Configuration menu - View commit details
-
Copy full SHA for bdcd416 - Browse repository at this point
Copy the full SHA bdcd416View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.0.5...v1.0.6