Effortless data extraction from any website. We manage headless browsers and rotate proxies for you.
Write Python and use any of the 350k+ PyPi packages available. Refer to the Pipedream Python docs to learn more.
The Soax API provides access to a robust proxy and scraping service that allows users to gather data efficiently and safely from across the web. By leveraging Soax’s rotating proxies, you can access and retrieve data without triggering anti-scraping measures, making it useful for tasks like competitive analysis, market research, SEO monitoring, and more. Integrating Soax with Pipedream enhances these capabilities by automating data collection workflows, enriching the data with other services, and triggering actions based on the retrieved data.
import { axios } from "@pipedream/platform"
export default defineComponent({
props: {
scrapein_: {
type: "app",
app: "scrapein_",
}
},
async run({steps, $}) {
return await axios($, {
url: `https://api.scrapein.app/v1/`,
params: {
apikey: `${this.scrapein_.$auth.api_key}`,
url: `https://dashboard.scrapein.app/testdata.html`,
},
})
},
})
Develop, run and deploy your Python code in Pipedream workflows. Integrate seamlessly between no-code steps, with connected accounts, or integrate Data Stores and manipulate files within a workflow.
This includes installing PyPI packages, within your code without having to manage a requirements.txt
file or running pip
.
Below is an example of using Python to access data from the trigger of the workflow, and sharing it with subsequent workflow steps:
def handler(pd: "pipedream"):
# Reference data from previous steps
print(pd.steps["trigger"]["context"]["id"])
# Return data for use in future steps
return {"foo": {"test":True}}