Really Simple Syndication
Gets an answer to a question about a given webpage. See the documentation.
Retrieves multiple RSS feeds and returns a merged array of items sorted by date. See the documentation.
Returns the full HTML content of a webpage specified by the URL. See the documentation.
Returns the visible text content of a webpage specified by the URL. See the documentation.
The RSS app lets you automatically fetch and parse updates from web feeds, which is useful for tracking content changes on websites, blogs, and news outlets that publish RSS feeds. With Pipedream, you can use the RSS app to trigger workflows for content aggregation, keyword monitoring, notifications, and data synchronization across platforms.
module.exports = defineComponent({
  props: {
    rss: {
      type: "app",
      app: "rss",
    },
  },
  async run({ steps, $ }) {
    // Retrieve items from a sample feed
    const Parser = require("rss-parser");
    const parser = new Parser();
    // Replace with your feed URL
    const url = "https://pipedream.com/community/latest.rss";
    const feed = await parser.parseURL(url);
    const { title, items } = feed;
    // End the workflow early when the feed has no items
    if (!items.length) {
      return $.flow.exit("No new stories");
    }
    // Export the feed title and items for use in later steps
    $.export("title", title);
    $.export("items", items);
  },
});
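The content-aggregation and keyword-monitoring use cases mentioned above can be sketched as two small helpers that work on feeds that have already been parsed. This is a minimal sketch, not the RSS app's actual implementation: `mergeItemsByDate` and `filterByKeyword` are hypothetical names, the sample feeds are made up, and the code assumes each item carries the `isoDate` (or `pubDate`), `title`, and `contentSnippet` fields that `rss-parser` typically produces.

```javascript
// Hypothetical helper: merge items from several parsed feeds, newest first.
// Assumes items carry an `isoDate` or `pubDate` string, as rss-parser usually provides.
function mergeItemsByDate(feeds) {
  const timestamp = (item) => {
    const t = Date.parse(item.isoDate || item.pubDate || "");
    return Number.isNaN(t) ? 0 : t; // undated items sort last
  };
  return feeds
    .flatMap((feed) => feed.items || [])
    .sort((a, b) => timestamp(b) - timestamp(a));
}

// Hypothetical helper: keep items whose title or snippet mentions the keyword.
function filterByKeyword(items, keyword) {
  const needle = keyword.toLowerCase();
  return items.filter((item) =>
    `${item.title || ""} ${item.contentSnippet || ""}`.toLowerCase().includes(needle)
  );
}

// Made-up sample data standing in for the results of parser.parseURL().
const sampleFeeds = [
  {
    title: "Feed A",
    items: [
      { title: "Workflow tips", isoDate: "2024-01-02T00:00:00.000Z" },
      { title: "Release notes", isoDate: "2024-01-05T00:00:00.000Z" },
    ],
  },
  {
    title: "Feed B",
    items: [
      { title: "New workflow triggers", isoDate: "2024-01-03T00:00:00.000Z" },
    ],
  },
];

const merged = mergeItemsByDate(sampleFeeds);
const matches = filterByKeyword(merged, "workflow");
console.log(merged.map((item) => item.title));
console.log(matches.map((item) => item.title));
```

Inside a Pipedream step, the `feeds` array could come from calling `parser.parseURL` once per feed URL, for example with `Promise.all`.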
The WebScraping.AI API provides tools for extracting data from websites, so you can retrieve structured information without setting up a custom scraper. It handles proxy rotation, browsers, and CAPTCHAs, allowing you to focus on data collection. With Pipedream, you can use it to create automated workflows that trigger on various events, process web content, and connect with other apps to feed data pipelines, monitor changes, or populate databases.
import { axios } from "@pipedream/platform"
export default defineComponent({
  props: {
    webscraping_ai: {
      type: "app",
      app: "webscraping_ai",
    }
  },
  async run({steps, $}) {
    // Fetch account details, authenticating with the connected account's API key
    return await axios($, {
      url: "https://api.webscraping.ai/account",
      params: {
        api_key: `${this.webscraping_ai.$auth.api_key}`,
      },
    })
  },
})
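The same request pattern could be pointed at the page-retrieval actions described earlier (full HTML and visible text). The sketch below only builds the request configuration, so it can be inspected without a network call. The `/html` and `/text` paths and the `url` query parameter are assumptions that should be checked against the WebScraping.AI documentation, and `buildScrapeRequest` is a hypothetical helper name.

```javascript
// Hypothetical helper: build an axios-style config for a WebScraping.AI call.
// The endpoint paths and the `url` parameter are assumptions; verify them in the API docs.
const BASE_URL = "https://api.webscraping.ai";
const ENDPOINTS = { html: "/html", text: "/text" };

function buildScrapeRequest(kind, targetUrl, apiKey) {
  // Only accept the kinds listed in ENDPOINTS (ignores inherited object keys).
  if (!Object.prototype.hasOwnProperty.call(ENDPOINTS, kind)) {
    throw new Error(`Unsupported kind: ${kind}`);
  }
  return {
    url: `${BASE_URL}${ENDPOINTS[kind]}`,
    params: {
      api_key: apiKey,
      url: targetUrl, // the page to scrape
    },
  };
}

// Placeholder values for illustration only.
const request = buildScrapeRequest("text", "https://example.com", "YOUR_API_KEY");
console.log(request);
```

In a Pipedream step, the returned object could be passed as the second argument to `axios($, ...)`, with `this.webscraping_ai.$auth.api_key` supplied as the key.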