Comparing changes

base repository: google/langextract
base: v1.0.5
...
head repository: google/langextract
compare: v1.0.6
  • 17 commits
  • 44 files changed
  • 4 contributors

Commits on Aug 8, 2025

  1. Remove duplicate exceptions.py from root directory (#94)

    The exceptions.py file existed in both the root directory and langextract/ directory with identical content. This removes the duplicate from the root to avoid confusion and maintain proper package structure.
    aksg87 authored Aug 8, 2025
    f3c1553
  2. 845258c
  3. Add provider registry infrastructure and custom provider plugin example (#97)

    Introduces a provider registry system enabling third-party providers to be dynamically registered and discovered through a plugin architecture. Users can now integrate custom LLM backends (Azure OpenAI, AWS Bedrock, custom inference servers) without modifying core LangExtract code.
    
    Fixes #80, #67, #54, #49, #48, #53
    
    Key Changes:
    
    **Provider Registry** (`langextract/providers/registry.py`)
    - Pattern-based registration with priority resolution
    - Automatic discovery via Python entry points
    - Lazy loading for performance
    
    **Factory Enhancements** (`langextract/factory.py`)
    - `ModelConfig` dataclass for structured configuration
    - Explicit provider selection when patterns overlap
    - Full backward compatibility maintained
    
    **Plugin Example** (`examples/custom_provider_plugin/`)
    - Complete working example with entry point configuration
    - Shows how to create custom providers for any backend
    
    **Documentation**
    - Comprehensive provider system README with architecture diagrams
    - Step-by-step plugin creation guide
    
    **Dependencies**
    - Move openai to optional dependencies
    - Update tox.ini to include openai in test environments
    
    **Lint Fixes**
    - Add appropriate pylint suppressions for legitimate patterns
    - Fix unused variable warnings in tests
    - Address import and global statement warnings
    
    No breakage is anticipated; full backward compatibility is maintained. Because provider loading changed significantly internally, please report any unexpected behavior.
    aksg87 authored Aug 8, 2025
    00acc43
  4. Update provider documentation

    aksg87 committed Aug 8, 2025
    c8aa788
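
The Provider Registry commit (#97) describes pattern-based registration with priority resolution and lazy loading. A minimal sketch of that idea follows. Every name in it (`ProviderRegistry`, `register`, `resolve`, the loader functions, the regex patterns) is hypothetical and chosen for illustration; it is not the code in `langextract/providers/registry.py`.

```python
import re
from typing import Callable, Dict, List, Tuple


class ProviderRegistry:
    """Hypothetical registry sketch; not the langextract implementation."""

    def __init__(self) -> None:
        # Each entry is (compiled pattern, priority, loader callable).
        self._entries: List[Tuple[re.Pattern, int, Callable[[], type]]] = []
        # Provider classes that have already been loaded, keyed by entry index.
        self._loaded: Dict[int, type] = {}

    def register(self, pattern: str, loader: Callable[[], type], priority: int = 0) -> None:
        """Store a pattern and a loader; the loader is not called yet."""
        self._entries.append((re.compile(pattern), priority, loader))

    def resolve(self, model_id: str) -> type:
        """Return the provider class whose pattern matches model_id."""
        candidates = [
            (priority, index)
            for index, (pattern, priority, _) in enumerate(self._entries)
            if pattern.search(model_id)
        ]
        if not candidates:
            raise ValueError(f"No provider registered for {model_id!r}")
        # Highest priority wins; on a tie, the later registration wins.
        _, index = max(candidates)
        if index not in self._loaded:
            # Lazy loading: run the loader only on first use.
            self._loaded[index] = self._entries[index][2]()
        return self._loaded[index]


class BuiltinProvider:
    """Stand-in for a provider shipped with the library."""


class PluginProvider:
    """Stand-in for a third-party provider."""


load_log: List[str] = []


def load_builtin() -> type:
    load_log.append("builtin")
    return BuiltinProvider


def load_plugin() -> type:
    load_log.append("plugin")
    return PluginProvider


registry = ProviderRegistry()
registry.register(r"^gemini", load_builtin, priority=0)
registry.register(r"^gemini-custom", load_plugin, priority=10)

# Both patterns match this ID; the higher-priority plugin is chosen,
# and only its loader has run so far.
provider = registry.resolve("gemini-custom-v1")
```

Deferring the loader call until `resolve` keeps optional dependencies from being imported unless a matching model ID is actually requested.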

Commits on Aug 9, 2025

  1. Update .gitignore with additional development patterns

    Add common development files, tools, and temporary file patterns
    aksg87 committed Aug 9, 2025
    f069d6f
  2. Update custom provider example to clarify planned model passing feature

    - Show current approach using factory.create_model()
    - Add note that direct model passing to extract() is coming soon
    - Keep planned API as commented code for reference
    aksg87 committed Aug 9, 2025
    0c08fd1

Commits on Aug 11, 2025

  1. Fix lazy loading for provider pattern registration (#113)

    Ensure providers are loaded before pattern matching to prevent API key
    errors when using local models. Optimize to skip loading when provider
    is explicitly specified.
    aksg87 authored Aug 11, 2025
    1a25621
  2. Improve PR validation workflow based on expert review

    - Add proper permissions (issues: write for comments)
    - Skip draft PRs to avoid noise
    - Prevent duplicate comments with hidden marker
    - Search both title and body for issue links
    - Support all keyword variants and cross-repo references
    - Count unique users for reactions, not total count
    - Include 'write' permission for maintainer override
    - Add concurrency control for rapid edits
    - Handle cross-repo issues gracefully
    aksg87 committed Aug 11, 2025
    a209903
  3. Add tests for provider plugin system (#114)

    - 6 tests: plugin discovery, loading, idempotency, error handling
    - Smart CI triggers for integration test on provider changes
    - New tox environments: plugin-smoke and plugin-integration
    aksg87 authored Aug 11, 2025
    8989620
  4. Adding model and config parameters to extract() (#119)

    * Adding model and config parameters to extract()
    
    * Adding extract precedence tests
    
    * Applying changes from #120
    mariano authored Aug 11, 2025
    1a24cd0
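
Commit #119 adds `model` and `config` parameters to `extract()` along with precedence tests. The commit message does not state the order, so the sketch below assumes one plausible rule: an explicit `model` wins, then `config`, then a bare `model_id`. The helper `pick_model`, the `FakeModel` class, and the simplified `ModelConfig` and `create_model` are hypothetical stand-ins, not the library's own definitions.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class ModelConfig:
    """Simplified stand-in for a structured model configuration."""
    model_id: Optional[str] = None
    provider: Optional[str] = None
    provider_kwargs: Dict[str, Any] = field(default_factory=dict)


@dataclass
class FakeModel:
    """Stand-in for an instantiated language model."""
    model_id: Optional[str]


def create_model(config: ModelConfig) -> FakeModel:
    """Stand-in factory: build a model from a config."""
    return FakeModel(model_id=config.model_id)


def pick_model(
    model: Optional[FakeModel] = None,
    config: Optional[ModelConfig] = None,
    model_id: Optional[str] = None,
) -> FakeModel:
    """Assumed precedence: model, then config, then model_id."""
    if model is not None:
        return model
    if config is not None:
        return create_model(config)
    if model_id is not None:
        return create_model(ModelConfig(model_id=model_id))
    raise ValueError("Provide one of: model, config, model_id")


prebuilt = FakeModel(model_id="prebuilt")
chosen = pick_model(
    model=prebuilt,
    config=ModelConfig(model_id="from-config"),
    model_id="from-id",
)
```

Under this assumed rule, `chosen` is the prebuilt instance and the other two arguments are ignored.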

Commits on Aug 13, 2025

  1. Refactor schema system to support provider plugins (#130)

    Enable providers to define custom schema implementations via BaseSchema abstraction.
    Add property-based fence output, FormatModeSchema for JSON/YAML providers, and
    move GeminiSchema to providers/schemas/.
    aksg87 authored Aug 13, 2025
    77b7b95
  2. Support Hugging Face style model IDs for Ollama provider (#131)

    - Add HF-style patterns (e.g., meta-llama/Llama-3.2-1B-Instruct) to Ollama registry
    - Add gpt-oss pattern to support issue #116
    - Add comprehensive documentation for using Ollama with extract()
    - Include example for direct provider instantiation when ID conflicts exist
    - Add test coverage for HF-style model ID patterns
    aksg87 authored Aug 13, 2025
    50ba182
  3. Add tests for Ollama format parameter handling

    Test that format='json' is correctly passed to Ollama API
    aksg87 committed Aug 13, 2025
    031c41c
  4. Update Ollama quickstart to use ModelConfig with JSON mode

    Show both ModelConfig and direct model_id usage patterns
    aksg87 committed Aug 13, 2025
    38528bf
  5. e056a85
  6. 6a91ce1
  7. Bump version to 1.0.6 (#134)

    aksg87 authored Aug 13, 2025
    bdcd416
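
Commit #131 routes Hugging Face style model IDs and `gpt-oss` names to the Ollama provider, and mentions direct provider selection when IDs conflict. The sketch below shows how such routing could look. The two regular expressions and the `routes_to_ollama` helper are assumptions made for illustration; the patterns actually registered by the library may differ.

```python
import re
from typing import Optional

# Assumed shape of an HF-style ID: "<org>/<model-name>".
HF_STYLE = re.compile(r"^[\w.\-]+/[\w.\-]+$")
# Assumed prefix pattern for gpt-oss model names.
GPT_OSS = re.compile(r"^gpt-oss")

OLLAMA_PATTERNS = [GPT_OSS, HF_STYLE]


def routes_to_ollama(model_id: str, provider: Optional[str] = None) -> bool:
    """Return True if this hypothetical router would pick Ollama."""
    if provider is not None:
        # An explicit provider overrides pattern matching entirely.
        return provider.lower() == "ollama"
    return any(pattern.search(model_id) for pattern in OLLAMA_PATTERNS)


hf_match = routes_to_ollama("meta-llama/Llama-3.2-1B-Instruct")
gpt_oss_match = routes_to_ollama("gpt-oss:20b")
gemini_match = routes_to_ollama("gemini-2.5-flash")
forced_elsewhere = routes_to_ollama("meta-llama/Llama-3.2-1B-Instruct", provider="openai")
```

Checking the explicit `provider` first is one way to handle IDs that match more than one provider's patterns.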