Features:
Receive content in real time
Our streaming API lets you index content in real time, as soon as we discover it. Our client installs as a daemon, runs in the background, and spools content to disk.
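As a rough illustration of how an application might consume content the client has spooled to disk, here is a minimal Python sketch. The spool layout is an assumption made for this example: a directory of complete `*.jsonl` files with one JSON document per line. The directory names and the `drain_spool` helper are hypothetical, not part of our client.

```python
import json
from pathlib import Path
from typing import Any, Dict, Iterator


def drain_spool(spool_dir: Path, done_dir: Path) -> Iterator[Dict[str, Any]]:
    """Yield documents from spooled files, oldest file name first.

    Assumes (hypothetically) that each spool file is JSON Lines and is
    fully written before it appears in spool_dir.
    """
    done_dir.mkdir(parents=True, exist_ok=True)
    # sorted() materializes the file list, so renaming files below is safe.
    for path in sorted(spool_dir.glob("*.jsonl")):
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if line:  # skip blank lines
                    yield json.loads(line)
        # Move the file aside only after every line has been consumed.
        path.rename(done_dir / path.name)
```

A file is moved only after all of its lines have been yielded, so if the consumer stops partway through, that file stays in the spool directory and is read again on the next run.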
Advanced filtering with boolean logic
Our streaming API supports advanced filtering with boolean logic on any field, or within a field. For example, you can search for documents in English, by publisher type, or containing specific terms or tags.
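To make the idea of boolean filtering concrete, the following Python sketch combines field-level tests with AND, OR, and NOT. It illustrates the logic only and is not our query syntax. The field names (`lang`, `publisher_type`, `tags`, `title`) and the helper functions are assumptions chosen for the example.

```python
from typing import Any, Callable, Dict

Doc = Dict[str, Any]
Predicate = Callable[[Doc], bool]


def field_equals(field: str, value: Any) -> Predicate:
    """Match documents whose field is exactly equal to value."""
    return lambda doc: doc.get(field) == value


def field_contains(field: str, term: str) -> Predicate:
    """Match within a field: substring for text, membership for tag lists."""
    def pred(doc: Doc) -> bool:
        value = doc.get(field)
        if isinstance(value, str):
            return term.lower() in value.lower()
        if isinstance(value, (list, tuple, set)):
            return term in value
        return False
    return pred


def all_of(*preds: Predicate) -> Predicate:  # boolean AND
    return lambda doc: all(p(doc) for p in preds)


def any_of(*preds: Predicate) -> Predicate:  # boolean OR
    return lambda doc: any(p(doc) for p in preds)


def negate(pred: Predicate) -> Predicate:  # boolean NOT
    return lambda doc: not pred(doc)


# English news documents about the economy or inflation, excluding sports.
query = all_of(
    field_equals("lang", "en"),
    field_equals("publisher_type", "news"),
    any_of(field_contains("tags", "economy"), field_contains("title", "inflation")),
    negate(field_contains("tags", "sports")),
)

# Made-up sample documents, for illustration only.
docs = [
    {"id": 1, "lang": "en", "publisher_type": "news", "tags": ["economy"], "title": "Markets rally"},
    {"id": 2, "lang": "fr", "publisher_type": "news", "tags": ["economy"], "title": "Les marchés"},
    {"id": 3, "lang": "en", "publisher_type": "news", "tags": ["sports"], "title": "Inflation hits ticket prices"},
    {"id": 4, "lang": "en", "publisher_type": "blog", "tags": ["economy"], "title": "My budget"},
    {"id": 5, "lang": "en", "publisher_type": "news", "tags": [], "title": "Inflation slows"},
]

matches = [doc["id"] for doc in docs if query(doc)]
print(matches)  # [1, 5]
```

Composing small predicates keeps each field test independent, so a filter of any depth can be built from the same three combinators.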
High throughput
Our streaming API is designed to scale. We serve more than 100 TB to our customers per month. Our infrastructure is built on a highly parallel cluster design that has been in production for nearly a decade.