AI Data Extraction with Scrapfly API on New Commit (Instant) from GitLab API

The GitLab API provides programmatic access to your GitLab projects, allowing you to automate common tasks, manage issues, merge requests, and more. With the GitLab API on Pipedream, you can create customized workflows that integrate with other services, streamline your development process, and enhance project management. By leveraging the power of serverless, you can set up triggers for GitLab events and perform actions across a variety of apps without managing infrastructure.

The file containing the content of the page you want to extract data from. The content must be in the format specified by Content Type. Provide either a file URL or a path to a file in the /tmp directory (for example, /tmp/myFile.txt)

This URL is used to transform any relative URLs in the document into absolute URLs automatically. It can be either the base URL or the exact URL of the document. Must be url encoded

Charset of the document pass in the body. If you are not sure, you can use the auto value and we will try to detect it. Bad charset can lead to bad extraction, so it's important to set it correctly. The most common charset is utf-8 for text document and ascii for binary. The symptom of a bad charset is that the text is not correctly displayed (accent, special characters, etc).

Define an extraction template to get structured data. Use an ephemeral template (declared on the fly on the API call) or a stored template (declared in the dashboard) by using the template name.

Instruction to extract data or ask a question on the scraped content with an LLM (Large Language Model). Must be url encoded

AI Extraction to auto parse document to get structured data. E.g., product, review, real-estate, article.

Queue you scrape request and redirect API response to a provided webhook endpoint. You can create a webhook endpoint from your dashboard, it takes the name of the webhook. Webhooks are scoped to the given project/env.

Emit new event when a new commit is pushed to a branch

Emit new event when a new branch is created

Emit new event when a project (i.e. repository) is created

Emit new event when a new audit event is created

Emit new event when a commit receives a comment

Create a new branch in the repository. See the documentation

Creates a new epic. See the documentation

Creates a new issue. See the documentation

Gets a single issue from repository. See the documentation

Get a single project repository branch. See the documentation

Label	Prop	Type	Description
GitLab	`gitlab`	`app`	This component uses the GitLab app.
N/A	`db`	`$.service.db`	This component uses `$.service.db` to maintain state between executions.
N/A	`http`	`$.interface.http`	This component uses `$.interface.http` to generate a unique URL when the component is first instantiated. Each request to the URL will trigger the `run()` method of the component.
Project ID	`projectId`	`integer`	Select a value from the drop down menu.
Branch Name	`refName`	`string`	Select a value from the drop down menu.

Label	Prop	Type	Description
Scrapfly	`scrapfly`	`app`	This component uses the Scrapfly app.
File Path or URL	`body`	`string`	The file containing the content of the page you want to extract data from. The content must be in the format specified by `Content Type`. Provide either a file URL or a path to a file in the `/tmp` directory (for example, `/tmp/myFile.txt`)
Content Type	`contentType`	`string`	Select a value from the drop down menu:`application/jsonapplication/jsonldapplication/xmltext/plaintext/htmltext/markdowntext/csvapplication/xhtml+xml`
URL	`url`	`string`	This URL is used to transform any relative URLs in the document into absolute URLs automatically. It can be either the base URL or the exact URL of the document. Must be url encoded
Charset	`charset`	`string`	Charset of the document pass in the body. If you are not sure, you can use the `auto` value and we will try to detect it. Bad charset can lead to bad extraction, so it's important to set it correctly. The most common charset is `utf-8` for text document and `ascii` for binary. The symptom of a bad charset is that the text is not correctly displayed (accent, special characters, etc).
Extraction Template	`extractionTemplate`	`string`	Define an extraction template to get structured data. Use an ephemeral template (declared on the fly on the API call) or a stored template (declared in the dashboard) by using the template name.
Extraction Prompt	`extractionPrompt`	`string`	Instruction to extract data or ask a question on the scraped content with an LLM (Large Language Model). Must be url encoded
Extraction Model	`extractionModel`	`string`	AI Extraction to auto parse document to get structured data. E.g., `product`, `review`, `real-estate`, `article`.
Webhook Name	`webhookName`	`string`	Queue you scrape request and redirect API response to a provided webhook endpoint. You can create a webhook endpoint from your `dashboard`, it takes the name of the webhook. Webhooks are scoped to the given project/env.
syncDir	`syncDir`	`dir`

AI Data Extraction with Scrapfly API on New Commit (Instant) from GitLab API

Pipedream makes it easy to connect APIs for Scrapfly, GitLab and 3,000+ other apps remarkably fast.

Trusted by 1,000,000+ developers from startups to Fortune 500 companies

Developers ♥ Pipedream

Getting Started#

Details#

Trigger#

GitLab Overview#

Trigger Code#

Trigger Configuration#

Trigger Authentication#

About GitLab#

Action#

Action Code#

Action Configuration#

Action Authentication#

About Scrapfly#

More Ways to Connect Scrapfly + GitLab#

Other Popular Integrations#

Popular Triggers#

Popular Actions#

Explore Other Apps#

1
-
24
of
3,000+
apps by most popular

AI Data Extraction with Scrapfly API on New Commit (Instant) from GitLab API

Pipedream makes it easy to connect APIs for Scrapfly, GitLab and 3,000+ other apps remarkably fast.

Trusted by 1,000,000+ developers from startups to Fortune 500 companies

Developers ♥ Pipedream

1-24of3,000+apps by most popular

1
-
24
of
3,000+
apps by most popular