videocamWeb Data Extraction Summit - September 30th, 2021.
Join some of the greatest minds in web scraping to educate, inspire, and innovate.
Register for free!
Start a new topic
Answered

ValueError: Invalid control character

I seem to have an issue with a simple Sitemap scraper with extruct.

There should be some wordpress plugin that creates wrong jsonld so that scrapy crashes.

I checked stackoverflow, and apparently someone else has similar issues, but got no answer. 

Also the error from lxml isn't very clear, since response[802] doesn't show any risky char (new line , utf8 letters or \ / ' ).


Any Idea? 

ValueError: Invalid control character at: line 14 column 170 (char 802)

Best Answer

Hi ma,


seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.


Best,


Pablo

1 Comment

Answer

Hi ma,


seems a script issue. Unfortunately we don't provide coding assistance, but you can ask in reddit or Stack overflow using "scrapy" tag. Many of our best developers contribute there actively.


Best,


Pablo

Login to post a comment