Subscribe Free

Join 2670+ others. No spamming.
I promise!

We are currently under high development. Follow us at github.


Looking for Python Tutorials?
Check these awesome tutorials



scrapinghub/portia

READ ME

Portia

Portia is a tool that allows you to visually scrape websites without any programming knowledge required. With Portia you can annotate a web page to identify the data you wish to extract, and Portia will understand based on these annotations how to scrape data from similar pages.

Running Portia

The easiest way to run Portia is using Vagrant.

Clone the repository:

git clone https://github.com/scrapinghub/portia

Then inside the Portia directory, run:

vagrant up

For more detailed instructions, and alternatives to using Vagrant, see the Installation docs.

Documentation

Documentation can be found here. Source files can be found in the docs directory.