Easy Web Scraping for Anyone. No code is the best code. Octoparse lets anyone build the reliable web scrapers they need, with no coding required.
Get a URL and emit the full HTTP event on every request (including headers and query parameters). You can also configure the HTTP response code, body, and more.
Get a URL and emit the HTTP body as an event on every request.
Emit new event when the content of the URL changes.
Send an HTTP request using any method and URL. Optionally configure query string parameters, headers, and basic auth.
Send an HTTP GET request to any URL. Optionally configure query string parameters, headers and basic auth.
Send an HTTP POST request to any URL. Optionally configure query string parameters, headers and basic auth.
Send an HTTP PUT request to any URL. Optionally configure query string parameters, headers and basic auth.
Use with an HTTP trigger that uses "Return a custom response from your workflow" as its HTTP Response.
The Octoparse API allows you to automate the extraction of web data without coding, making it a powerful tool for data-driven workflows. With this API, you can control your scraping tasks, retrieve extracted data, and manage your account programmatically. When combined with Pipedream's serverless execution environment, you can build custom workflows to process, store, or act upon the data fetched by Octoparse. This integration can be a cornerstone for solutions in market research, competitor analysis, price monitoring, or lead generation.
import { axios } from "@pipedream/platform"

export default defineComponent({
  props: {
    octoparse: {
      type: "app",
      app: "octoparse",
    }
  },
  async run({steps, $}) {
    return await axios($, {
      url: `https://openapi.octoparse.com/taskGroup`,
      headers: {
        Authorization: `Bearer ${this.octoparse.$auth.oauth_access_token}`,
      },
    })
  },
})
Build, test, and send HTTP requests without code using your Pipedream workflows. The HTTP / Webhook action is a tool to build HTTP requests with a Postman-like graphical interface.
Define the target URL, HTTP verb, headers, query parameters, and payload body without writing custom code.
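For readers who think in code, the fields that the action's form collects can be pictured as one plain request object. The sketch below is only an illustration of that idea, not how the action is implemented: `buildRequest` and its parameter names are hypothetical, and the example URL is a placeholder.

```javascript
// Hypothetical helper: gathers the same fields the form asks for
// (URL, verb, headers, query parameters, body) into one plain object.
function buildRequest({ url, method = "GET", headers = {}, params = {}, body = null }) {
  const target = new URL(url);

  // Append each query parameter to the URL; URLSearchParams handles encoding.
  for (const [key, value] of Object.entries(params)) {
    target.searchParams.append(key, String(value));
  }

  const request = {
    url: target.toString(),
    method: method.toUpperCase(),
    headers: { ...headers },
  };

  // Only attach a payload when one was supplied; JSON is assumed here.
  if (body !== null) {
    request.headers["Content-Type"] = request.headers["Content-Type"] ?? "application/json";
    request.body = JSON.stringify(body);
  }
  return request;
}

// Placeholder values for illustration only.
const exampleRequest = buildRequest({
  url: "https://example.com/api/items",
  method: "post",
  params: { page: 2, q: "web scraping" },
  headers: { Accept: "application/json" },
  body: { name: "demo" },
});
console.log(exampleRequest);
```

An object shaped like this could be handed to an HTTP client such as `fetch(exampleRequest.url, exampleRequest)`; the graphical action spares you from assembling it by hand.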
This action can also use your connected accounts with third-party APIs. Selecting an integrated app will automatically update the request’s headers to authenticate with the app properly, and even inject your token dynamically.
Pipedream integrates with thousands of APIs, but if you can’t find an integration for the service you need, you can authenticate by referencing Environment Variables in your request headers.
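In a Node.js code step, the same idea might look like the sketch below, which reads a token from `process.env` and turns it into a request header. The variable name `MY_SERVICE_API_TOKEN`, the helper `authHeadersFromEnv`, and the use of Bearer authentication are all assumptions made for illustration; check the target API's documentation for the scheme it actually expects.

```javascript
// Hypothetical variable name; define your own in your environment settings.
const TOKEN_VAR = "MY_SERVICE_API_TOKEN";

// Builds an Authorization header from an environment variable.
// `env` defaults to process.env, but can be swapped for a plain object in tests.
function authHeadersFromEnv(env = process.env, name = TOKEN_VAR) {
  const token = env[name];
  if (!token) {
    throw new Error(`Environment variable ${name} is not set`);
  }
  // Bearer auth is an assumption; some APIs use a custom header instead.
  return { Authorization: `Bearer ${token}` };
}

// Usage with an explicit object so the sketch also runs outside a workflow.
const demoHeaders = authHeadersFromEnv({ MY_SERVICE_API_TOKEN: "secret-123" });
console.log(demoHeaders.Authorization);
```

Keeping the secret in an environment variable rather than in the step itself means the token never appears in the workflow's code or configuration.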
The HTTP/Webhook action exports HTTP response data for use in subsequent workflow steps, enabling easy data transformation, further API calls, database storage, and more.
Response data is available for both coded (Node.js, Python) and no-code steps within your workflow.
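As a rough sketch of what using that exported data can look like in a later Node.js step: code steps receive a `steps` object, and a previous step's returned data is read from its `$return_value`. The step name `custom_request`, the helper `summarizeResponse`, and the mock response below are hypothetical; substitute the step name and fields from your own workflow.

```javascript
// Reads the data exported by an earlier step and keeps only what a later
// step needs. "custom_request" is a hypothetical step name.
function summarizeResponse(steps, stepName = "custom_request") {
  const step = steps[stepName];
  if (!step || step.$return_value === undefined) {
    throw new Error(`No exported data found for step "${stepName}"`);
  }
  const data = step.$return_value;
  // Field names are illustrative; they depend on the API that was called.
  return { name: data.name, speciesUrl: data.species?.url ?? null };
}

// Mock of the `steps` object, with made-up response data, so the sketch
// runs outside a workflow.
const mockSteps = {
  custom_request: {
    $return_value: { name: "charizard", species: { url: "https://example.com/species/1" } },
  },
};
console.log(summarizeResponse(mockSteps));
```

Inside a real workflow, the body of `summarizeResponse` would sit in the step's `run({ steps, $ })` method, using the `steps` argument that Pipedream passes in.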
// To use any npm package on Pipedream, just import it
import axios from "axios"

export default defineComponent({
  async run({ steps, $ }) {
    const { data } = await axios({
      method: "GET",
      url: "https://pokeapi.co/api/v2/pokemon/charizard",
    })
    return data.species
  },
})