RSS

Real Simple Syndication

Integrate the RSS API with the Playwright API

Setup the RSS API trigger to run a workflow which integrates with the Playwright API. Pipedream's integration platform allows you to integrate RSS and Playwright remarkably fast. Free for developers.

Get Page HTML with Playwright API on New Item in Feed from RSS API
RSS + Playwright
 
Try it
Get Page Title with Playwright API on New Item in Feed from RSS API
RSS + Playwright
 
Try it
Page PDF with Playwright API on New Item in Feed from RSS API
RSS + Playwright
 
Try it
Take Screenshot with Playwright API on New Item in Feed from RSS API
RSS + Playwright
 
Try it
Get Page HTML with Playwright API on New Item From Multiple RSS Feeds from RSS API
RSS + Playwright
 
Try it
New Item in Feed from the RSS API

Emit new items from an RSS feed

 
Try it
New Item From Multiple RSS Feeds from the RSS API

Emit new items from multiple RSS feeds

 
Try it
Random item from multiple RSS feeds from the RSS API

Emit a random item from multiple RSS feeds

 
Try it
Get Page HTML with the Playwright API

Returns the page's html. See the documentation

 
Try it
Merge RSS Feeds with the RSS API

Retrieve multiple RSS feeds and return a merged array of items sorted by date See documentation

 
Try it
Get Page Title with the Playwright API

Returns the page's title. See the documentation

 
Try it
Page PDF with the Playwright API

Generates a pdf of the page and store it on /tmp directory. See the documentation

 
Try it
Take Screenshot with the Playwright API

Store a new screenshot file on /tmp directory. See the documentation

 
Try it

Overview of RSS

The RSS app allows users to automatically fetch and parse updates from web feeds. This functionality is pivotal for staying abreast of content changes or updates from websites, blogs, and news outlets that offer RSS feeds. With Pipedream, you can harness the RSS API to trigger workflows that enable a broad range of automations, like content aggregation, monitoring for specific keywords, notifications, and data synchronization across platforms.

Connect RSS

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
module.exports = defineComponent({
  props: {
    rss: {
      type: "app",
      app: "rss",
    }
  },
  async run({steps, $}) {
    // Retrieve items from a sample feed
    const Parser = require('rss-parser');
    const parser = new Parser();
    
    const stories = []
    
    // Replace with your feed URL
    const url = "https://pipedream.com/community/latest.rss"
    
    const feed = await parser.parseURL(url);
    const { title, items } = feed
    this.title = title
    
    if (!items.length) {
      $end("No new stories")
    }
    
    this.items = items
  },
})

Overview of Playwright

Playwright is a Node.js library which provides a high-level API to control Chrome/Chromium over the DevTools Protocol. Playwright runs in headless mode on Chromium on Pipedream.

Using Playwright you can perform tasks including:

  • Capture Screenshots: Convert webpages into images.
  • Processing PDFs: parse and scan PDFs.
  • Web Scraping: Extract data from websites.
  • UI/UX Testing: Verify user interface and experience.
  • Integration with Test Frameworks: Combine with testing frameworks.
  • Task Automation: Automate web-related tasks like form filling.
  • Functional Testing: Automate user interactions to test web application functionality.
  • Regression Testing: Ensure new code changes don't introduce bugs.

Connect Playwright

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
import { playwright } from '@pipedream/browsers';

export default defineComponent({
  async run({steps, $}) {
    const browser = await playwright.launch();
    
    // Interact with the web page programmatically
    // See Playwright's Page documentation for available methods:
    // https://playwright.dev/docs/api/class-page
    const page = await browser.newPage();

    await page.goto('https://pipedream.com/');
    const title = await page.title();
    const content = await page.content();

    // Close context and browser otherwise the step will hang
    await page.context().close()
    await browser.close();

    return { title, content }
  },
})