Elasticsearch Writer

Writer expects that mapping of types and indexes in your Elasticsearch exists. If it is missing and you have enabled automatic index creation, new mapping will be created.

Configuration

Configuration has 2 parts - elastic and tables
The elastic section defines connection info and import config
- host - server address
- port - elasticsearch listening port
- username - elasticsearch username
- #password - elasticsearch password
- bulkSize (optional) - size of a batch to upload to Elasticsearch (default is 10.000)
- ssh - SSH tunnel configuration
  - enabled - enable SSH tunnel for connection to Elasticsearch
  - sshHost - address of the SSH server
  - sshPort (optional) - SSH listening port (default is 22)
  - user - SSH login
  - keys
    - #private - Your private key used for authentication. Note that keys MUST have maintain linebreaks every 72 bytes, according to rfc4716 section 3. When copying the contents of a key file, it is important to replace all true linebreaks with "\n" so that it can be accepted by the configuration editor, and be parsed correctly in order to establish the SSH tunnel successfully.
The tables section defines database tables, their columns and their data types
- file or tableId
  - file - CSV file of the table we want to write into Elasticsearch (see https://github.com/keboola/docker-bundle/blob/master/ENVIRONMENT.md#input-mapping) for more info about Input Mapping
  - tableId - (deprecated) - StorageAPI table ID of the table we want to write into Elasticsearch (see https://github.com/keboola/docker-bundle/blob/master/ENVIRONMENT.md#input-mapping) for more info about Input Mapping (Works only if destination attribute is not set in table configuration)
- index - index name in ES
- type - type of the data, determines the type in ES,
- id (optional) - determines in which column of table is the document's ID/primary key
- export - whether this table shall be exported to ES
The optionalitems section defines columns mapping
- name string (required) - name of the column in CSV file
- dbName string (required) - name in the database
- type string (required) - type in the database. Special type "ignore" serves to ignore column present in CSV file.
- nullable bool (required) - is nullable?

Example

{
    "elastic": {
        "host": "my.hostname.com",
        "port": 9200,
        "bulkSize": 10000
    },
    "tables": [
        {
            "file": "products.csv",
            "index": "production",
            "type": "products",
            "id": "id",
            "export": true
        }
    ]
}

Example with SSH

{
    "elastic": {
        "host": "my.hostname.com",
        "port": 9200,
        "bulkSize": 10000,
        "ssh": {
            "enabled": true,
            "sshHost": "10.112.1.1",
            "sshPort": 22,
            "user": "extractor",
            "keys": {
                "private": "YOUR\nPRIVATE\nKEY\nWITHOUT\nPASSPHRASE"
            }
        }
    },
    "tables": [
        {
            "file": "products.csv",
            "index": "production",
            "type": "products",
            "id": "id",
            "export": true
        }
    ]
}

Example with columns mapping

{
    "elastic": "...",
    "tables": [
        {
            "file": "products.csv",
            "index": "production",
            "type": "products",
            "id": "id",
            "export": true,
            "items": [
                {
                    "name": "order",
                    "dbName": "order",
                    "type": "integer",
                    "nullable": false
                },
                {
                    "name": "vat",
                    "dbName": "vat-usa",
                    "type": "double",
                    "nullable": true
                }
            ]
        }
    ]
}

Configuring in Keboola Connection

Elasticsearch Writer is integrated in Keboola Connection.

Available with standard KB Docker Generic UI

License

MIT licensed, see LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
.github/workflows		.github/workflows
docker		docker
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
composer.json		composer.json
composer.lock		composer.lock
docker-compose.yml		docker-compose.yml
phpcs.xml		phpcs.xml
phpstan.neon		phpstan.neon
phpunit.xml.dist		phpunit.xml.dist

License

keboola/elastic-writer

Folders and files

Latest commit

History

Repository files navigation

Elasticsearch Writer

Configuration

Example

Example with SSH

Example with columns mapping

Configuring in Keboola Connection

License

About

Resources

License

Stars

Watchers

Forks

Languages