An incredibly powerful web scraper.
Get a URL and emit the full HTTP event on every request (including headers and query parameters). You can also configure the HTTP response code, body, and more.
Get a URL and emit the HTTP body as an event on every request
Emit new event when the content of the URL changes.
Returns the data extracted by a specified run. See the documentation
Retrieves the details of a specified project within the user's account. See the documentation
Initiates an instance of a specified project on the Parsehub cloud. See the documentation
Send an HTTP request using any method and URL. Optionally configure query string parameters, headers, and basic auth.
Send an HTTP GET request to any URL. Optionally configure query string parameters, headers and basic auth.
The ParseHub API allows you to leverage the power of web scraping directly within Pipedream. By integrating ParseHub, you can automate the collection of data from web pages, manipulate and transform it with Pipedream’s built-in code steps or pre-built actions, and connect it to hundreds of other apps. You can extract structured data from any website, run scraping jobs, retrieve results and integrate with other services for data processing, visualization, or storage.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
parsehub: {
type: "app",
app: "parsehub",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://www.parsehub.com/api/v2/projects`,
params: {
api_key: `${this.parsehub.$auth.api_key}`,
},
})
},
})
Build, test, and send HTTP requests without code using your Pipedream workflows. The HTTP / Webhook action is a tool to build HTTP requests with a Postman-like graphical interface.
Define the target URL, HTTP verb, headers, query parameters, and payload body without writing custom code.
This action can also use your connected accounts with third-party APIs. Selecting an integrated app will automatically update the request’s headers to authenticate with the app properly, and even inject your token dynamically.
Pipedream integrates with thousands of APIs, but if you can’t find a Pipedream integration simply use Environment Variables in your request headers to authenticate with.
The HTTP/Webhook action exports HTTP response data for use in subsequent workflow steps, enabling easy data transformation, further API calls, database storage, and more.
Response data is available for both coded (Node.js, Python) and no-code steps within your workflow.
// To use any npm package on Pipedream, just import it
import axios from "axios"
export default defineComponent({
async run({ steps, $ }) {
const { data } = await axios({
method: "GET",
url: "https://pokeapi.co/api/v2/pokemon/charizard",
})
return data.species
},
})