Real Simple Syndication
Extract data from a specified URL See the docs here
Retrieve multiple RSS feeds and return a merged array of items sorted by date See documentation
The RSS app allows users to automatically fetch and parse updates from web feeds. This functionality is pivotal for staying abreast of content changes or updates from websites, blogs, and news outlets that offer RSS feeds. With Pipedream, you can harness the RSS API to trigger workflows that enable a broad range of automations, like content aggregation, monitoring for specific keywords, notifications, and data synchronization across platforms.
module.exports = defineComponent({
props: {
rss: {
type: "app",
app: "rss",
}
},
async run({steps, $}) {
// Retrieve items from a sample feed
const Parser = require('rss-parser');
const parser = new Parser();
const stories = []
// Replace with your feed URL
const url = "https://pipedream.com/community/latest.rss"
const feed = await parser.parseURL(url);
const { title, items } = feed
this.title = title
if (!items.length) {
$end("No new stories")
}
this.items = items
},
})
The Automatic Data Extraction API by Zyte specializes in extracting structured data from web pages. When incorporated into Pipedream workflows, this API allows you to automate the process of gathering web data, which can feed into various tasks such as market research, price monitoring, or even lead generation. By triggering workflows with new data inputs, processing and storing the extracted data, and connecting to other apps, Pipedream amplifies the API's utility.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
automatic_data_extraction: {
type: "app",
app: "automatic_data_extraction",
}
},
async run({steps, $}) {
const data = JSON.stringify([{
'url': 'http://books.toscrape.com/catalogue/a-light-in-the-attic_1000/index.html',
'pageType': 'product',
}]);
return await axios($, {
method: "post",
url: `https://autoextract.scrapinghub.com/v1/extract`,
headers: {
"Content-Type": `application/json`,
},
auth: {
username: `${this.automatic_data_extraction.$auth.api_key}`,
password: ``,
},
data,
})
},
})