I seem to have an issue with a simple Sitemap scraper with extruct.
There should be some wordpress plugin that creates wrong jsonld so that scrapy crashes.
I checked stackoverflow, and apparently someone else has similar issues, but got no answer.
Also the error from lxml isn't very clear, since response[802] doesn't show any risky char (new line , utf8 letters or \ / ' ).
Any Idea?
ValueError: Invalid control character at: line 14 column 170 (char 802)
Best Answer
v
vaz
said
about 7 years ago
Hi ma,
seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.
Best,
Pablo
1 Comment
vaz
said
about 7 years ago
Answer
Hi ma,
seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.
ma
I seem to have an issue with a simple Sitemap scraper with extruct.
There should be some wordpress plugin that creates wrong jsonld so that scrapy crashes.
I checked stackoverflow, and apparently someone else has similar issues, but got no answer.
Also the error from lxml isn't very clear, since response[802] doesn't show any risky char (new line , utf8 letters or \ / ' ).
Any Idea?
Hi ma,
seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.
Best,
Pablo
vaz
Hi ma,
seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.
Best,
Pablo
-
Unable to select Scrapy project in GitHub
-
ScrapyCloud can't call spider?
-
Unhandled error in Deferred
-
Item API - Filtering
-
newbie to web scraping but need data from zillow
-
Cancelling account
-
Best Practices
-
Beautifulsoup with ScrapingHub
-
Delete a project in ScrapingHub
See all 460 topics