WebScraper.IO

Making web data extraction easy and accessible for everyone.

Integrate the WebScraper.IO API with the Python API

Set up the WebScraper.IO API trigger to run a workflow that integrates with the Python API. Pipedream's integration platform allows you to integrate WebScraper.IO and Python remarkably fast. Free for developers.

Run Python Code with Python API on New Scraping Completed from WebScraper.IO API
WebScraper.IO + Python
 
New Scraping Completed from the WebScraper.IO API

Emits a new event when a scraping job has completed. See the docs here

 
Create Scraping Job with the WebScraper.IO API

Creates a scraping job (scrapes a sitemap). See the docs here

 
Run Python Code with the Python API

Write Python and use any of the 350k+ PyPI packages available. Refer to the Pipedream Python docs to learn more.

 
Create Sitemap with the WebScraper.IO API

Creates a sitemap for the selected website. See the docs here

 
Get Scraping Jobs with the WebScraper.IO API

Retrieves a list of scraping jobs for a sitemap. See the docs here

 

Overview of WebScraper.IO

The WebScraper.IO API allows you to programmatically perform web scraping tasks, extracting structured data from websites. With the API, you can automate the gathering of web content for analysis, monitoring, and integration with other data sources. In Pipedream, you can leverage this API to build workflows that process, analyze, and act on the data you scrape without writing code for backend infrastructure.

Connect WebScraper.IO

import { axios } from "@pipedream/platform"
export default defineComponent({
  props: {
    webscraper_io: {
      type: "app",
      app: "webscraper_io",
    }
  },
  async run({steps, $}) {
    return await axios($, {
      url: `https://api.webscraper.io/api/v1/sitemaps`,
      params: {
        api_token: `${this.webscraper_io.$auth.api_key}`,
      },
    })
  },
})
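
Since this page pairs WebScraper.IO with Python, the same sitemaps request can be sketched in Python using only the standard library. This is a minimal illustration, not Pipedream's own implementation: the endpoint and the `api_token` query parameter come from the component above, while the function names `build_sitemaps_request` and `list_sitemaps` are hypothetical, and the token is assumed to be supplied by the caller.

```python
import json
import urllib.parse
import urllib.request

# Base URL taken from the Node.js component above.
API_BASE = "https://api.webscraper.io/api/v1"


def build_sitemaps_request(api_token: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for the sitemaps endpoint.

    Hypothetical helper: the token is passed as the `api_token` query
    parameter, mirroring the component above.
    """
    query = urllib.parse.urlencode({"api_token": api_token})
    return urllib.request.Request(f"{API_BASE}/sitemaps?{query}", method="GET")


def list_sitemaps(api_token: str) -> dict:
    """Send the request and decode the JSON body (performs a network call)."""
    with urllib.request.urlopen(build_sitemaps_request(api_token)) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from the network call makes the URL easy to inspect before anything is sent.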

Overview of Python

Develop, run, and deploy your Python code in Pipedream workflows. Python steps work alongside no-code steps and connected accounts, and can use Data Stores and manipulate files within a workflow.

This includes installing PyPI packages from within your code, without having to manage a requirements.txt file or run pip.

Below is an example of using Python to access data from the trigger of the workflow, and sharing it with subsequent workflow steps:

Connect Python

def handler(pd: "pipedream"):
  # Reference data from previous steps
  print(pd.steps["trigger"]["context"]["id"])
  # Return data for use in future steps
  return {"foo": {"test":True}}
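
Building on the handler above, a Python step that runs after the New Scraping Completed trigger might summarize the finished job for later steps. This is only a sketch under assumptions: the `event` key on the trigger step and the field names `scrapingjob_id`, `sitemap_name`, and `status` (including the `"finished"` value) are assumed for illustration and should be checked against the actual event your trigger emits.

```python
def handler(pd: "pipedream"):
    # Assumption: the trigger exposes its payload under the "event" key.
    event = pd.steps["trigger"].get("event", {})

    # Assumption: these field names are illustrative, not confirmed
    # by this page; inspect a real trigger event to verify them.
    job_id = event.get("scrapingjob_id")
    sitemap_name = event.get("sitemap_name")
    status = event.get("status")

    # Return a compact summary for use in future steps.
    return {
        "job_id": job_id,
        "sitemap_name": sitemap_name,
        "finished": status == "finished",
    }
```

Using `.get` with defaults means the step returns `None` values rather than raising if a field is missing from the event.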