with Zyte API and Scrapeless?
Crawl any website at scale and say goodbye to blocks. See the documentation
Retrieve the result of a completed scraping job. See the documentation
Endpoints for fresh, structured data from 100+ popular sites. See the documentation
Submit a new web scraping job with specified target URL and extraction rules. See the documentation
Access any website at scale and say goodbye to blocks. See the documentation
The Zyte API provides programmatic access to web data extraction services, allowing you to pull structured data from websites efficiently. Within Pipedream, you can leverage the Zyte API to create powerful serverless workflows that automate data collection, monitor web content changes, or enrich your datasets with web-sourced information. By connecting Zyte to other apps on Pipedream, you can easily integrate web scraping into your data processing pipelines, event-driven applications, and more, with minimal setup and no server maintenance.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
zyte_api: {
type: "app",
app: "zyte_api",
}
},
async run({steps, $}) {
const data = {
"url": "https://books.toscrape.com/",
"httpResponseBody": true
}
return await axios($, {
method: "post",
url: `https://api.zyte.com/v1/extract`,
headers: {
"Content-Type": `application/json`,
},
auth: {
username: `${this.zyte_api.$auth.api_key}`,
password: ``,
},
data,
})
},
})
Scrapeless – your go-to platform for powerful, compliant web data extraction. With tools like Universal Scraping API, Scrapeless makes it easy to access and gather data from complex sites. Focus on insights while we handle the technical hurdles. Scrapeless – data extraction made simple.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
scrapeless: {
type: "app",
app: "scrapeless",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://api.scrapeless.com/api/v1/me`,
headers: {
"x-api-token": `${this.scrapeless.$auth.api_key}`,
},
})
},
})