HN2new | past | comments | ask | show | jobs | submit | damjon's favoriteslogin

Wrapper induction library is separated from Scrapy: https://github.com/scrapy/scrapely. It is used in Portia under the hood. Portia can be seen as a tool to annotate scrapely templates and define crawling rules and post-processing rules.

I'm not a Portia developer/user myself, but I think it is possible to get script code from Portia; it exports Scrapy spider to some folder. But I don't really know what I'm talking about, it is better to ask at https://groups.google.com/forum/#!forum/portia-scraper or at stackoverflow (use tag 'Portia').


Sounds like fun but only if you’re not a programmer, for whom it would just be work.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: