with Affinda and Playwright?
Emit new event when a document parsing is completed in Affinda See docs here
Emit new event when a document is validated in Affinda. See docs here
Generates a pdf of the page and store it on /tmp directory. See the documentation
Store a new screenshot file on /tmp directory. See the documentation
The Affinda API offers a suite of powerful tools for document analysis and extraction, enabling you to automate the processing of resumes, invoices, bank statements, and other documents. By parsing detailed information from documents, Affinda can transform unstructured data into structured, actionable insights. With Pipedream, you can integrate Affinda's capabilities into serverless workflows, connecting to hundreds of apps to automate tasks like candidate screening, financial data aggregation, or content management.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
affinda: {
type: "app",
app: "affinda",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://${this.affinda.$auth.api}.affinda.com/v3/organizations`,
headers: {
Authorization: `Bearer ${this.affinda.$auth.api_key}`,
"Accept": `application/json`,
},
})
},
})
Playwright is a Node.js library which provides a high-level API to control Chrome/Chromium over the DevTools Protocol. Playwright runs in headless mode on Chromium on Pipedream.
Using Playwright you can perform tasks including:
import { playwright } from '@pipedream/browsers';
export default defineComponent({
async run({steps, $}) {
const browser = await playwright.launch();
// Interact with the web page programmatically
// See Playwright's Page documentation for available methods:
// https://playwright.dev/docs/api/class-page
const page = await browser.newPage();
await page.goto('https://pipedream.com/');
const title = await page.title();
const content = await page.content();
// Close context and browser otherwise the step will hang
await page.context().close()
await browser.close();
return { title, content }
},
})