Never write another web scraper. Diffbot structures information from the web, so you don't have to.
Enrich a person or organization record from partial input data. [See the documentation](https://docs.diffbot.com/reference/enhancepost)
Write custom Node.js code and use any of the 400k+ npm packages available. Refer to the Pipedream Node docs to learn more.
Automatically classify a page and extract data according to its type. See the documentation
The Diffbot API lets you extract structured data from web pages automatically. It turns unstructured web content into usable information through web scraping and natural language processing. On Pipedream, you can use Diffbot to monitor changes on websites, extract article data, or process web pages for specific information. Because Pipedream integrates with hundreds of other services, you can build workflows that automate data extraction and act on the data in real time.
import { axios } from "@pipedream/platform"
export default defineComponent({
  props: {
    diffbot: {
      type: "app",
      app: "diffbot",
    }
  },
  async run({steps, $}) {
    return await axios($, {
      url: `https://api.diffbot.com/v4/account`,
      headers: {
        "Accept": `application/json`,
      },
      params: {
        token: `${this.diffbot.$auth.api_token}`,
      },
    })
  },
})
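The same request pattern could be pointed at page extraction rather than the account endpoint. The sketch below only builds the request URL and makes no network call. It assumes Diffbot's Analyze API is served at `https://api.diffbot.com/v3/analyze` and accepts `token` and `url` query parameters; `buildAnalyzeUrl` is a hypothetical helper name, so check the Diffbot reference before relying on any of it.

```javascript
// Hypothetical helper: builds a request URL for Diffbot's Analyze API.
// The endpoint path and parameter names are assumptions, not confirmed by this page.
function buildAnalyzeUrl(token, pageUrl) {
  if (!token || !pageUrl) {
    throw new TypeError("both token and pageUrl are required");
  }
  const endpoint = new URL("https://api.diffbot.com/v3/analyze");
  // searchParams handles percent-encoding of the target page URL
  endpoint.searchParams.set("token", token);
  endpoint.searchParams.set("url", pageUrl);
  return endpoint.toString();
}
```

Inside a code step, the returned string could be passed as the `url` option of the `axios($, { ... })` call shown above, with the token taken from `this.diffbot.$auth.api_token`.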
Develop, run, and deploy your Node.js code in Pipedream workflows. Use it between no-code steps, with connected accounts, or together with Data Stores and File Stores.
This includes installing npm packages within your code, without having to manage a package.json file or run npm install.
Below is an example of installing the axios package in a Pipedream Node.js code step. Pipedream imports the axios package, performs the API request, and shares the response with subsequent workflow steps:
// To use previous step data, pass the `steps` object to the run() function
export default defineComponent({
  async run({ steps, $ }) {
    // Return data to use it in future steps
    return steps.trigger.event
  },
})
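A later step would typically pick specific fields out of an earlier step's return value instead of passing the whole event along. The sketch below is one way to do that for a Diffbot extraction result. The response shape (an `objects` array whose items carry `type`, `title`, and `pageUrl`) and the helper name `summarizeObjects` are assumptions made for illustration.

```javascript
// Hypothetical helper: reduces an extraction response to a few fields.
// Assumes the response looks like { objects: [{ type, title, pageUrl, ... }] }.
function summarizeObjects(response) {
  // Treat a missing or malformed `objects` field as an empty list
  const objects = Array.isArray(response?.objects) ? response.objects : [];
  return objects.map((obj) => ({
    type: obj.type ?? "unknown",
    title: obj.title ?? null,
    pageUrl: obj.pageUrl ?? null,
  }));
}
```

In a code step this could be called as `summarizeObjects(steps.diffbot_step.$return_value)`, where `diffbot_step` is a placeholder for whatever the earlier step is named.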