- Add in-memory cache with 60s TTL for article/url/author queries - Check cache before network fetch to reduce redundant queries - Support force flag to bypass cache when needed - Stream cached results through onHighlight callback for consistency