Scrapy Cloud API

Modified on Wed, 3 Feb, 2021 at 7:34 AM

The Scrapy Cloud API (often also referred as the Zyte API) is a HTTP API that you can use to control your spiders and consume the scraped data, among other things.


It is the recommended way to consume scraped data from spiders run on Zyte, regardless of whether they're built with Scrapy or Portia. You can use tags to mark jobs consumed and skip them on next reads.

For more information, please refer to the API reference documentation here:
https://docs.zyte.com/scrapy-cloud.html


Note: Portia is no longer available for new users. It has been disabled for all the new organisations from August 20, 2018 onward.

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article