Cancelled (stalled) Job outcome because of scrapy_dotpersistence syncing over an hour
c
chops
started a topic
over 6 years ago
My job outcome is cancelled (stalled) repeatedly after the scraping is over and the scrapy_dotpersitence addon stores the .scrapy directory to S3:
[scrapy_dotpersistence] Syncing .scrapy directory to s3://scrapinghub-app-dash-addons/org-176226/[...]/dot-scrapy/immo[...]/
1090: 2017-12-26 17:50:02 INFO
[scrapy.crawler] Received SIGTERM, shutting down gracefully. Send again to force
I tried to delete the httpcache folder in the console, but the syncing duration is over an hour and the job is getting canceled anyway.
How can I solve this issue? Can I "reset" the S3 folder directly?
Best Answer
n
nestor
said
over 6 years ago
Jobs will get cancelled if they're not doing anything for an hour, you could add some log every hour or so, so that the job doesn't get cancelled.
chops
My job outcome is cancelled (stalled) repeatedly after the scraping is over and the scrapy_dotpersitence addon stores the .scrapy directory to S3:
I tried to delete the httpcache folder in the console, but the syncing duration is over an hour and the job is getting canceled anyway.
How can I solve this issue? Can I "reset" the S3 folder directly?
Jobs will get cancelled if they're not doing anything for an hour, you could add some log every hour or so, so that the job doesn't get cancelled.
- Oldest First
- Popular
- Newest First
Sorted by Oldest Firstnestor
Jobs will get cancelled if they're not doing anything for an hour, you could add some log every hour or so, so that the job doesn't get cancelled.
chops
thriveni
Do let us know if you are still facing the issue? I do not see any jobs getting stalled in the account.
chops
Is it possible to insert own S3 Credentials for scrapy_dotpersistence?
-
Unable to select Scrapy project in GitHub
-
ScrapyCloud can't call spider?
-
Unhandled error in Deferred
-
Item API - Filtering
-
newbie to web scraping but need data from zillow
-
ValueError: Invalid control character
-
Cancelling account
-
Best Practices
-
Beautifulsoup with ScrapingHub
-
Delete a project in ScrapingHub
See all 458 topics