← Piloterr

Get Website Crawler with Piloterr API

Pipedream makes it easy to connect APIs for Piloterr and 2,000+ other apps remarkably fast.

Trigger workflow on
HTTP requests, schedules and app events
Next, do this
Get Website Crawler with the Piloterr API
No credit card required
Intro to Pipedream
Watch us build a workflow
Watch us build a workflow
4 min
Watch now ➜

Trusted by 800,000+ developers from startups to Fortune 500 companies

Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo
Adyen logo
Appcues logo
Bandwidth logo
Checkr logo
ChartMogul logo
Dataminr logo
Gopuff logo
Gorgias logo
LinkedIn logo
Logitech logo
Replicated logo
Rudderstack logo
SAS logo
Scale AI logo
Webflow logo
Warner Bros. logo

Developers Pipedream

Getting Started

Create a workflow to Get Website Crawler with the Piloterr API. When you configure and deploy the workflow, it will run on Pipedream's servers 24x7 for free.

  1. Configure the Get Website Crawler action
    1. Connect your Piloterr account
    2. Configure Website URL
    3. Optional- Select a Impersonate Version
    4. Optional- Configure Allow Redirects
    5. Optional- Configure Return Page Source
  2. Select a trigger to run your workflow on HTTP requests, schedules or app events
  3. Deploy the workflow
  4. Send a test event to validate your setup
  5. Turn on the trigger

Integrations

Get Website Crawler with Piloterr API on New Requests (Payload Only) from HTTP / Webhook API
HTTP / Webhook + Piloterr
 
Try it
Get Website Crawler with Piloterr API on New Submission from Typeform API
Typeform + Piloterr
 
Try it
Get Website Crawler with Piloterr API on Custom Events from Zoom API
Zoom + Piloterr
 
Try it
Get Website Crawler with Piloterr API on New Submission (Instant) from Jotform API
Jotform + Piloterr
 
Try it
Get Website Crawler with Piloterr API on New Scheduled Tasks from Pipedream API
Pipedream + Piloterr
 
Try it

Details

This is a pre-built, source-available component from Pipedream's GitHub repo. The component is developed by Pipedream and the community, and verified and maintained by Pipedream.

To contribute an update to an existing component or create a new component, create a PR on GitHub. If you're new to Pipedream component development, you can start with quickstarts for trigger span and action development, and then review the component API reference.

Get Website Crawler on Piloterr
Description:Obtains HTML from a given website through web scraping for high performance access and interpretation. [See the documentation](https://docs.piloterr.com/v2/api-reference/website/crawler)
Version:0.0.1
Key:piloterr-get-website-crawler

Code

import piloterr from "../../piloterr.app.mjs";
import constants from "../../common/constants.mjs";

export default {
  key: "piloterr-get-website-crawler",
  name: "Get Website Crawler",
  description: "Obtains HTML from a given website through web scraping for high performance access and interpretation. [See the documentation](https://docs.piloterr.com/v2/api-reference/website/crawler)",
  version: "0.0.1",
  type: "action",
  props: {
    piloterr,
    url: {
      type: "string",
      label: "Website URL",
      description: "The URL of the website to obtain HTML from",
    },
    impersonateVersion: {
      type: "string",
      label: "Impersonate Version",
      description: "Impersonate a browser version",
      options: constants.BROWSER_VERSION,
      optional: true,
    },
    allowRedirects: {
      type: "boolean",
      label: "Allow Redirects",
      description: "If set to `false`, do not follow redirects. `true` by default.",
      optional: true,
    },
    returnPageSource: {
      type: "boolean",
      label: "Return Page Source",
      description: "If set to `false`, the response will be a JSON object with the response body of the page. `true` by default.",
      optional: true,
    },
  },
  async run({ $ }) {
    const response = await this.piloterr.scrapeWebsite({
      params: {
        query: this.url,
        impersonate_version: this.impersonateVersion,
        allow_redirects: this.allowRedirects,
        return_page_source: this.returnPageSource,
      },
      $,
    });
    $.export("$summary", `Successfully obtained HTML from ${this.url}`);
    return response;
  },
};

Configuration

This component may be configured based on the props defined in the component code. Pipedream automatically prompts for input values in the UI and CLI.
LabelPropTypeDescription
PiloterrpiloterrappThis component uses the Piloterr app.
Website URLurlstring

The URL of the website to obtain HTML from

Impersonate VersionimpersonateVersionstringSelect a value from the drop down menu:chrome99chrome100chrome101chrome104chrome107chrome110chrome99_androidedge99edge101safari15_3safari15_5
Allow RedirectsallowRedirectsboolean

If set to false, do not follow redirects. true by default.

Return Page SourcereturnPageSourceboolean

If set to false, the response will be a JSON object with the response body of the page. true by default.

Authentication

Piloterr uses API keys for authentication. When you connect your Piloterr account, Pipedream securely stores the keys so you can easily authenticate to Piloterr APIs in both code and no-code steps.

About Piloterr

Web scraping, made easy.

More Ways to Use Piloterr

Actions

Get Company Database with the Piloterr API

Fetches specified data for a company using a domain name. See the documentation

 
Try it
Get Website Technology with the Piloterr API

Retrieves the core technology used on a designated website. (CMS, Framework, Analytics, CDN, Hosting, etc.) See the documentation

 
Try it

Explore Other Apps

1
-
24
of
2,000+
apps by most popular

HTTP / Webhook
HTTP / Webhook
Get a unique URL where you can send HTTP or webhook requests
Node
Node
Anything you can do with Node.js, you can do in a Pipedream workflow. This includes using most of npm's 400,000+ packages.
Python
Python
Anything you can do in Python can be done in a Pipedream Workflow. This includes using any of the 350,000+ PyPi packages available in your Python powered workflows.
OpenAI (ChatGPT)
OpenAI (ChatGPT)
OpenAI is an AI research and deployment company with the mission to ensure that artificial general intelligence benefits all of humanity. They are the makers of popular models like ChatGPT, DALL-E, and Whisper.
Salesforce (REST API)
Salesforce (REST API)
Web services API for interacting with Salesforce
HubSpot
HubSpot
HubSpot's CRM platform contains the marketing, sales, service, operations, and website-building software you need to grow your business.
Zoho CRM
Zoho CRM
Zoho CRM is an online Sales CRM software that manages your sales, marketing, and support in one CRM platform.
Stripe
Stripe
Stripe powers online and in-person payment processing and financial solutions for businesses of all sizes.
Shopify Developer App
Shopify Developer App
Shopify is a user-friendly e-commerce platform that helps small businesses build an online store and sell online through one streamlined dashboard.
WooCommerce
WooCommerce
WooCommerce is the open-source ecommerce platform for WordPress.
Snowflake
Snowflake
A data warehouse built for the cloud
MongoDB
MongoDB
MongoDB is an open source NoSQL database management program.
Supabase
Supabase
Supabase is an open source Firebase alternative.
MySQL
MySQL
MySQL is an open-source relational database management system.
PostgreSQL
PostgreSQL
PostgreSQL is a free and open-source relational database management system emphasizing extensibility and SQL compliance.
AWS
AWS
Amazon Web Services (AWS) offers reliable, scalable, and inexpensive cloud computing services.
Twilio SendGrid
Twilio SendGrid
Send marketing and transactional email through the Twilio SendGrid platform with the Email API, proprietary mail transfer agent, and infrastructure for scalable delivery.
Amazon SES
Amazon SES
Amazon SES is a cloud-based email service provider that can integrate into any application for high volume email automation
Klaviyo
Klaviyo
Email Marketing and SMS Marketing Platform
Zendesk
Zendesk
Zendesk is award-winning customer service software trusted by 200K+ customers. Make customers happy via text, mobile, phone, email, live chat, social media.
ServiceNow
ServiceNow
The smarter way to workflow
Notion
Notion
Notion is a new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team.
Slack
Slack
Slack is a channel-based messaging platform. With Slack, people can work together more effectively, connect all their software tools and services, and find the information they need to do their best work — all within a secure, enterprise-grade environment.
Microsoft Teams
Microsoft Teams
Microsoft Teams has communities, events, chats, channels, meetings, storage, tasks, and calendars in one place.