Hi, we’re Zyte, the central point of entry for all your web data needs.
Create a new document in a collection of your choice. See the docs here
Execute an aggregation pipeline on a MongoDB collection. See the documentation
The Zyte API provides programmatic access to web data extraction services, allowing you to pull structured data from websites efficiently. Within Pipedream, you can leverage the Zyte API to create powerful serverless workflows that automate data collection, monitor web content changes, or enrich your datasets with web-sourced information. By connecting Zyte to other apps on Pipedream, you can easily integrate web scraping into your data processing pipelines, event-driven applications, and more, with minimal setup and no server maintenance.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
zyte_api: {
type: "app",
app: "zyte_api",
}
},
async run({steps, $}) {
const data = {
"url": "https://books.toscrape.com/",
"httpResponseBody": true
}
return await axios($, {
method: "post",
url: `https://api.zyte.com/v1/extract`,
headers: {
"Content-Type": `application/json`,
},
auth: {
username: `${this.zyte_api.$auth.api_key}`,
password: ``,
},
data,
})
},
})
The MongoDB API provides powerful capabilities to interact with a MongoDB database, allowing you to perform CRUD (Create, Read, Update, Delete) operations, manage databases, and execute sophisticated queries. With Pipedream, you can harness these abilities to automate tasks, sync data across various apps, and react to events in real-time. It’s a combo that’s particularly potent for managing data workflows, syncing application states, or triggering actions based on changes to your data.
import mongodb from 'mongodb'
export default defineComponent({
props: {
mongodb: {
type: "app",
app: "mongodb",
},
collection: {
type: "string"
},
filter: {
type: "object"
}
},
async run({steps, $}) {
const MongoClient = mongodb.MongoClient
const {
database,
hostname,
username,
password,
} = this.mongodb.$auth
const url = `mongodb+srv://${username}:${password}@${hostname}/test?retryWrites=true&w=majority`
const client = await MongoClient.connect(url, {
useNewUrlParser: true,
useUnifiedTopology: true
})
const db = client.db(database)
const results = await db.collection(this.collection).find(this.filter).toArray();
$.export('results', results);
await client.close()
},
})