with RSS and Scrapeless?
Crawl any website at scale and say goodbye to blocks. See the documentation
Retrieve multiple RSS feeds and return a merged array of items sorted by date See documentation
Retrieve the result of a completed scraping job. See the documentation
Endpoints for fresh, structured data from 100+ popular sites. See the documentation
Submit a new web scraping job with specified target URL and extraction rules. See the documentation
The RSS app allows users to automatically fetch and parse updates from web feeds. This functionality is pivotal for staying abreast of content changes or updates from websites, blogs, and news outlets that offer RSS feeds. With Pipedream, you can harness the RSS API to trigger workflows that enable a broad range of automations, like content aggregation, monitoring for specific keywords, notifications, and data synchronization across platforms.
module.exports = defineComponent({
props: {
rss: {
type: "app",
app: "rss",
}
},
async run({steps, $}) {
// Retrieve items from a sample feed
const Parser = require('rss-parser');
const parser = new Parser();
const stories = []
// Replace with your feed URL
const url = "https://pipedream.com/community/latest.rss"
const feed = await parser.parseURL(url);
const { title, items } = feed
this.title = title
if (!items.length) {
$end("No new stories")
}
this.items = items
},
})
Scrapeless – your go-to platform for powerful, compliant web data extraction. With tools like Universal Scraping API, Scrapeless makes it easy to access and gather data from complex sites. Focus on insights while we handle the technical hurdles. Scrapeless – data extraction made simple.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
scrapeless: {
type: "app",
app: "scrapeless",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://api.scrapeless.com/api/v1/me`,
headers: {
"x-api-token": `${this.scrapeless.$auth.api_key}`,
},
})
},
})