Is there any way to create or modify periodic jobs programmatically? I'd like to create a process where I autogenerate spiders and commit them to github, where they are pulled down automatically by ScrapingHub.
Then I'd like to script modifying a periodic job to add new spiders to the job to be run on a periodic basis.
This is a very popular request and we are already working on an API for Periodic Jobs, it should be ready sometime this year. For now, the only option is to do it via the UI.
0 Votes
5 Comments
Sorted by
A
Aaron Cowperposted
over 3 years ago
Any update on this? Been waiting 2+ years...
0 Votes
H
Hareesh Kadaliposted
over 4 years ago
any update on this requirement?
1 Votes
g
george8posted
almost 5 years ago
We are also looking at something similar.
We want to scrape IG post page with registering a spider for the specific post link every X timespan. We also need to remove the job in some cases. The reason is we can have hundreds of posts to scrape and need to happen dynamically.
Workflow:
1. Register spider in scrapy cloud and start scraping every X timespan
2. If applicable delete the spider job
3. Retrieve the data extracted or notified that the job is finished with the data
Any documentation on the above subjects that can help accomplish the scenario would be helpful
1 Votes
A
Aaron Cowperposted
about 5 years ago
Hi, any update on when this feature will be available?
1 Votes
nestorposted
almost 6 years ago
AdminAnswer
This is a very popular request and we are already working on an API for Periodic Jobs, it should be ready sometime this year. For now, the only option is to do it via the UI.
Hello,
Is there any way to create or modify periodic jobs programmatically? I'd like to create a process where I autogenerate spiders and commit them to github, where they are pulled down automatically by ScrapingHub.
Then I'd like to script modifying a periodic job to add new spiders to the job to be run on a periodic basis.
Is this possible? The Jobs API only seems to be for one-off jobs (https://doc.scrapinghub.com/api/jobs.html).
thanks!
0 Votes
nestor posted almost 6 years ago Admin Best Answer
This is a very popular request and we are already working on an API for Periodic Jobs, it should be ready sometime this year. For now, the only option is to do it via the UI.
0 Votes
5 Comments
Aaron Cowper posted over 3 years ago
Any update on this? Been waiting 2+ years...
0 Votes
Hareesh Kadali posted over 4 years ago
any update on this requirement?
1 Votes
george8 posted almost 5 years ago
We are also looking at something similar.
We want to scrape IG post page with registering a spider for the specific post link every X timespan. We also need to remove the job in some cases. The reason is we can have hundreds of posts to scrape and need to happen dynamically.
Workflow:
1. Register spider in scrapy cloud and start scraping every X timespan
2. If applicable delete the spider job
3. Retrieve the data extracted or notified that the job is finished with the data
Any documentation on the above subjects that can help accomplish the scenario would be helpful
1 Votes
Aaron Cowper posted about 5 years ago
Hi, any update on when this feature will be available?
1 Votes
nestor posted almost 6 years ago Admin Answer
This is a very popular request and we are already working on an API for Periodic Jobs, it should be ready sometime this year. For now, the only option is to do it via the UI.
0 Votes
Login to post a comment