Never write another web scraper. Diffbot structures information from the web, so you don't have to.
Go to siteThe Diffbot API enables you to extract structured data from web pages automatically. It transforms the chaos of the web into usable information through web scraping and natural language processing. On Pipedream, you can use Diffbot to monitor changes on websites, extract article data, or process web pages for specific information. By tapping into Pipedream’s ability to integrate with hundreds of other services, you can create powerful workflows that automate data extraction and act on the data in real-time.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
diffbot: {
type: "app",
app: "diffbot",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://api.diffbot.com/v4/account`,
headers: {
"Accept": `application/json`,
},
params: {
token: `${this.diffbot.$auth.api_token}`,
},
})
},
})
Content Change Detection
Use Diffbot to monitor a product page for changes and trigger a Pipedream workflow whenever the price or availability changes. This can be connected to a notification service like Slack or email to alert your team immediately.
Article Extraction and Analysis
Extract articles from a set of URLs using Diffbot and send the extracted text to a sentiment analysis API like IBM Watson. Store the sentiment scores in a Google Sheet for easy tracking and visualization of the sentiment of news articles or blog posts over time.
Lead Generation Automation
Scrape contact information from a list of company websites using Diffbot. Combine this with the Clearbit API to enrich the data and add it to a CRM like Salesforce or HubSpot, automating the lead generation process.
Enrich a person or organization record with partial data input [See the documentation] (https://docs.diffbot.com/reference/enhancepost)
Automatically classify a page and extract data according to its type. See the documentation
Diffbot uses API keys for authentication. When you connect your Diffbot account, Pipedream securely stores the keys so you can easily authenticate to Diffbot APIs in both code and no-code steps.
Sign in and copy your API token from your dashboard on the top right