This is a Jitsu.Classic documentation. For the lastest version, please visit docs.jitsu.com. Read about differences here.

🚀 Quick Start

Deploy to Heroku

Deploy on Plural

Build from sources

Configuration Source

Deploy on Kubernetes

1.35 - 17 August 2021

1.34 - 27 July 2021

1.33 - 19 July 2021

1.32.0 - 23 June 2021

1.31.0 - 07 May 2021

1.30.0 - 21 Apr 2021

1.29.0 - 19 Mar 2021

1.28.0 - 15 Feb 2021

1.27.0 - 14 Jan 2021

1.25.0 - 15 Dec 2020

1.21.0 - 25 Nov 2020

1.17.0 - 11 Nov 2020

1.15.0 - 25 Oct 2020

✈️ Sending data

Parameters Reference

Methods Reference

Migration Guide

Android SDK 1.0

React Native SDK 1.0

Segment Integration

Events Interception

Javascript Configuration

Installing with Npm or Yarn

📜 Configuration

Directories Structure

Enrichment Rules

Google Authorization

Primary Keys Configuration

Schema and Mapping

Table Names and Filters

Configuration UI

Facebook Conversion API

Global destinations

Google Analytics

Synchronization Scheduling

Singer Based Sources

Airbyte sources in K8S

How To Implement a Source

❤️ Features

Key-Value Storage

Segment Compatibility

Destination Tags

Data Warehouses

dbt Cloud integration

Test Mapping with Dry-Run

Clickhouse specifics

Redis optimization

Geo Data resolution

Admin Endpoints

Application Metrics

👩‍🔬 Extending Jitsu

Destination Extensions

Source Extensions

Jitsu Internals

🚀 Quick Start

Deploy to Heroku

Deploy on Plural

Build from sources

Configuration Source

Deploy on Kubernetes

1.35 - 17 August 2021

1.34 - 27 July 2021

1.33 - 19 July 2021

1.32.0 - 23 June 2021

1.31.0 - 07 May 2021

1.30.0 - 21 Apr 2021

1.29.0 - 19 Mar 2021

1.28.0 - 15 Feb 2021

1.27.0 - 14 Jan 2021

1.25.0 - 15 Dec 2020

1.21.0 - 25 Nov 2020

1.17.0 - 11 Nov 2020

1.15.0 - 25 Oct 2020

✈️ Sending data

Parameters Reference

Methods Reference

Migration Guide

Android SDK 1.0

React Native SDK 1.0

Segment Integration

Events Interception

Javascript Configuration

Installing with Npm or Yarn

📜 Configuration

Directories Structure

Enrichment Rules

Google Authorization

Primary Keys Configuration

Schema and Mapping

Table Names and Filters

Configuration UI

Facebook Conversion API

Global destinations

Google Analytics

Synchronization Scheduling

Singer Based Sources

Airbyte sources in K8S

How To Implement a Source

❤️ Features

Key-Value Storage

Segment Compatibility

Destination Tags

Data Warehouses

dbt Cloud integration

Test Mapping with Dry-Run

Clickhouse specifics

Redis optimization

Geo Data resolution

Admin Endpoints

Application Metrics

👩‍🔬 Extending Jitsu

Destination Extensions

Source Extensions

Jitsu Internals

Apify Dataset

Overview

Apify is a web scraping and web automation platform providing both ready-made and custom solutions, an open-source SDK for web scraping, proxies, and many other tools to help you build and run web automation jobs at scale. The results of a scraping job are usually stored in Apify Dataset. This connector allows you to automatically sync the contents of a dataset to your chosen destination. To sync data from a dataset, all you need to know is its ID. You will find it in Apify console under storages.

The source is using Airbyte docker image (@airbyte/source-apify-dataset). Learn more how Airbyte-based sources work

How to connect

Obtain Apify Dataset ID.

Connection Parameters

Parameter	Documentation
`datasetId`^* string (required)	ID of the dataset you would like to load to Airbyte.
`clean` boolean (not required)	If set to true, only clean items will be downloaded from the dataset. See description of what clean means in Apify API docs. If not sure, set clean to false.

Edit this page on GitHub