
Modify Periodic Jobs Programmatically?

Hello,


Is there any way to create or modify periodic jobs programmatically? I'd like to set up a process where I autogenerate spiders and commit them to GitHub, from where ScrapingHub pulls them down automatically.


Then I'd like to script changes to a periodic job, adding the new spiders to it so that they run on a periodic basis.


Is this possible? The Jobs API only seems to be for one-off jobs (https://doc.scrapinghub.com/api/jobs.html).


Thanks!


Best Answer

This is a very popular request, and we are already working on an API for Periodic Jobs. It should be ready sometime this year. For now, the only option is to do it via the UI.


Any update on this requirement?



We are also looking at something similar.


We want to scrape Instagram (IG) post pages by registering a spider for a specific post link and running it every X amount of time. In some cases we also need to remove the job. We can have hundreds of posts to scrape, so all of this needs to happen dynamically.


Workflow:

1. Register the spider in Scrapy Cloud and start scraping every X amount of time

2. If applicable, delete the spider job

3. Retrieve the extracted data, or be notified with the data when the job finishes


Any documentation on the above subjects that would help us accomplish this scenario would be appreciated.
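
Until an API for Periodic Jobs exists, steps 1 and 2 of the workflow above could in principle be approximated on the client side by keeping the schedule yourself and starting one-off jobs when they are due. The following is only a minimal sketch under that assumption, not a ScrapingHub feature: `PeriodicRegistry`, `register`, `remove`, `run_due` and the `launch` callback are hypothetical names, and `launch` is assumed to wrap whatever call starts a single job (for example through the Jobs API) and to return a job identifier.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class PeriodicEntry:
    """One client-side schedule entry (hypothetical structure)."""
    spider: str
    interval_s: float
    args: Dict[str, str] = field(default_factory=dict)
    next_run: float = 0.0


class PeriodicRegistry:
    """Keeps periodic schedules locally and starts one-off jobs when due."""

    def __init__(self, launch: Callable[[str, Dict[str, str]], str]):
        # `launch` is an assumption: a caller-supplied function that starts
        # one job for (spider, args) and returns its job id.
        self._launch = launch
        self._entries: Dict[str, PeriodicEntry] = {}

    def register(self, key: str, spider: str, interval_s: float,
                 args: Optional[Dict[str, str]] = None,
                 now: Optional[float] = None) -> None:
        """Step 1: schedule `spider` to run every `interval_s` seconds."""
        now = time.time() if now is None else now
        self._entries[key] = PeriodicEntry(spider, interval_s,
                                           dict(args or {}), next_run=now)

    def remove(self, key: str) -> bool:
        """Step 2: drop a schedule; returns False if the key was unknown."""
        return self._entries.pop(key, None) is not None

    def run_due(self, now: Optional[float] = None) -> List[str]:
        """Start every entry whose next run time has passed."""
        now = time.time() if now is None else now
        started = []
        for entry in self._entries.values():
            if entry.next_run <= now:
                started.append(self._launch(entry.spider, entry.args))
                entry.next_run = now + entry.interval_s
        return started
```

In use, `run_due()` would be called from a loop or a cron task on your own machine. Step 3 (retrieving the data or being notified) is not covered by this sketch.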



Hi, any update on when this feature will be available?



Any update on this? We've been waiting 2+ years...
