Feat/response caching utility #74

LifeJiggy · 2025-10-28T14:52:40Z

Summary

This PR adds a ResponseCache utility class that provides simple in-memory response caching with TTL support, helping developers reduce API calls and improve application performance.

Problem

Gradient API calls can be expensive and slow, especially when the same data is requested repeatedly. Developers currently have no built-in way to cache API responses, leading to:

Unnecessary API calls for identical requests
Slower application performance
Higher API usage costs
No control over response freshness

Solution

Add ResponseCache class with:

TTL (time-to-live) support for automatic cache expiration
LRU (least recently used) eviction when cache is full
Request deduplication based on method, URL, params, and data
Simple API for get/set/clear operations
Configurable cache size and default TTL

Key Features

TTL Support: Automatic expiration of cached responses
LRU Eviction: Removes least recently used items when cache is full
Request Deduplication: MD5-based keys from method, URL, params, and data
Thread Safe: Uses standard library only, no external dependencies
Configurable: Adjustable cache size and TTL settings
Simple API: Easy to integrate into existing code
Performance Benefits
Reduces redundant API calls
Improves response times for cached data
Helps stay within API rate limits
Reduces network latency

Testing

Added comprehensive test suite covering:

Basic cache operations (set/get)
TTL expiration behavior
Cache size limits and LRU eviction
Parameter-based caching
Cache clearing functionality
All tests pass with full coverage of cache functionality.

Usage Examples

from gradient._utils import ResponseCache

# Create cache with 5-minute default TTL
cache = ResponseCache(max_size=100, default_ttl=300)

# Cache API responses
response = client.models.list()
cache.set("GET", "/v2/gen-ai/models", response)

# Later, check cache before making API call
cached = cache.get("GET", "/v2/gen-ai/models")
if cached:
    print("Using cached response")
    return cached

# Make fresh API call and cache it
response = client.models.list()
cache.set("GET", "/v2/gen-ai/models", response)
return response

bbatha · 2025-11-25T14:49:10Z

Independently this is a useful pr however you are including features we do not want such as the key validator and the cli. Please remove those unrelated additions and I will review the cache code.

LifeJiggy · 2025-11-26T07:01:51Z

Thanks for the feedback, @bbatha! Totally agree—I'll strip out the key validator and CLI to keep things laser-focused on the caching utility. Aiming to push an update in the next [24-48 hours]. Excited for your thoughts on the ResponseCache core! 🚀

feat: add ResponseCache utility class

9fcef6f

LifeJiggy closed this Nov 27, 2025

LifeJiggy force-pushed the feat/response-caching-utility branch from d941755 to 9fcef6f Compare November 27, 2025 06:49

LifeJiggy mentioned this pull request Nov 27, 2025

feat: add ResponseCache utility class #84

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/response caching utility #74

Feat/response caching utility #74

Uh oh!

LifeJiggy commented Oct 28, 2025

Uh oh!

bbatha commented Nov 25, 2025

Uh oh!

LifeJiggy commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feat/response caching utility #74

Feat/response caching utility #74

Uh oh!

Conversation

LifeJiggy commented Oct 28, 2025

Summary

Problem

Solution

Key Features

Testing

Usage Examples

Uh oh!

bbatha commented Nov 25, 2025

Uh oh!

LifeJiggy commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants