Never write another web scraper. Diffbot structures information from the web, so you don't have to.
Get a URL and emit the full HTTP event on every request (including headers and query parameters). You can also configure the HTTP response code, body, and more.
Get a URL and emit the HTTP body as an event on every request
Emit new event when the content of the URL changes.
Enrich a person or organization record with partial data input [See the documentation] (https://docs.diffbot.com/reference/enhancepost)
Automatically classify a page and extract data according to its type. See the documentation
Send an HTTP request using any method and URL. Optionally configure query string parameters, headers, and basic auth.
Send an HTTP GET request to any URL. Optionally configure query string parameters, headers and basic auth.
Send an HTTP POST request to any URL. Optionally configure query string parameters, headers and basic auth.
The Diffbot API enables you to extract structured data from web pages automatically. It transforms the chaos of the web into usable information through web scraping and natural language processing. On Pipedream, you can use Diffbot to monitor changes on websites, extract article data, or process web pages for specific information. By tapping into Pipedream’s ability to integrate with hundreds of other services, you can create powerful workflows that automate data extraction and act on the data in real-time.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
diffbot: {
type: "app",
app: "diffbot",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://api.diffbot.com/v4/account`,
headers: {
"Accept": `application/json`,
},
params: {
token: `${this.diffbot.$auth.api_token}`,
},
})
},
})
Build, test, and send HTTP requests without code using your Pipedream workflows. The HTTP / Webhook action is a tool to build HTTP requests with a Postman-like graphical interface.
Define the target URL, HTTP verb, headers, query parameters, and payload body without writing custom code.
This action can also use your connected accounts with third-party APIs. Selecting an integrated app will automatically update the request’s headers to authenticate with the app properly, and even inject your token dynamically.
Pipedream integrates with thousands of APIs, but if you can’t find a Pipedream integration simply use Environment Variables in your request headers to authenticate with.
The HTTP/Webhook action exports HTTP response data for use in subsequent workflow steps, enabling easy data transformation, further API calls, database storage, and more.
Response data is available for both coded (Node.js, Python) and no-code steps within your workflow.
// To use any npm package on Pipedream, just import it
import axios from "axios"
export default defineComponent({
async run({ steps, $ }) {
const { data } = await axios({
method: "GET",
url: "https://pokeapi.co/api/v2/pokemon/charizard",
})
return data.species
},
})