No description
Find a file
2021-04-21 14:44:58 +02:00
config large refactoring, new subpackage structure, Gradle 5.2.1, Java 11 2019-02-18 17:21:57 +01:00
elx-api align with es7102 2021-04-21 14:44:58 +02:00
elx-common align with es7102 2021-04-21 14:44:58 +02:00
elx-http split into admin, bulk, search client, fix HTTP client 2020-05-22 14:31:39 +02:00
elx-node align with es7102 2021-04-21 14:44:58 +02:00
elx-transport align with es7102 2021-04-21 14:44:58 +02:00
gradle working on dynamic bulk requests 2021-04-16 09:19:36 +02:00
http update to Gradle 5.2.1, simpler API without TimeValue, fixing testframework/netty dependencies 2019-02-15 11:10:15 +01:00
node Merge branch '2.2' 2019-02-20 19:14:51 +01:00
transport Merge branch '2.2' 2019-02-20 19:14:51 +01:00
.gitignore cosmetic 2020-05-13 20:44:47 +02:00
build.gradle split into admin, bulk, search client, fix HTTP client 2020-05-22 14:31:39 +02:00
gradle.properties prefix bulk for parameters, fine tuning of auto volume bulk indexing (ezb web with 72 MB/s) 2021-04-19 19:10:09 +02:00
gradlew fix time unit, avoiding zero timeout intervals 2020-11-12 15:43:32 +01:00
gradlew.bat fix time unit, avoiding zero timeout intervals 2020-11-12 15:43:32 +01:00
LICENSE.txt initial commit 2016-11-01 17:28:05 +01:00
README.adoc cosmetic 2020-05-13 20:44:47 +02:00
settings.gradle switch to Junit 5 2019-05-02 19:46:21 +02:00

# Elasticsearch Clients

image:https://api.travis-ci.org/xbib/content.svg[title="Build status", link="https://travis-ci.org/jprante/elasticsearch-extras-client/"]
image:https://img.shields.io/sonar/http/nemo.sonarqube.com/org.xbib%3Aelasticsearch-extras-client/coverage.svg?style=flat-square[title="Coverage", link="https://sonarqube.com/dashboard/index?id=org.xbib%3Aelasticsearch-extras-client"]
image:https://maven-badges.herokuapp.com/maven-central/org.xbib/elasticsearch-extras-client/badge.svg[title="Maven Central", link="http://search.maven.org/#search%7Cga%7C1%7Cxbib%20elasticsearch-extras-client"]
image:https://img.shields.io/badge/License-Apache%202.0-blue.svg[title="Apache License 2.0", link="https://opensource.org/licenses/Apache-2.0"]

This Java library extends the Elasticsearch Java Client classes for better convenience.

It is not a plugin for Elasticsearch. Use it by importing the jar from Maven Central into your project.

The Elasticsearch node client and transport client APIs are unified in a `ClientMethods` interface. This interface uses
bulk services and index management under the hood, like index creation, alias managent, and retention policies.

Two classes `BulkNodeClient` and `BulkTransportClient` combine the client methods with the `BulkProcessor`,
provide some logging convenience, and still offer the `Client` interface of Elasticsearch by using the `client()` method.

A `MockTransportClient` implements the `BulkTransportClient` API but does not need a running Elasticsearch node
to connect to. This is useful for unit testing.

The client classes are enriched by metrics that can measure document count, size, and speed.

A `ClientBuilder` helps to build client instances. For example

[source,java]
----
       ClientBuilder clientBuilder = ClientBuilder.builder()
                .put(elasticsearchSettings)
                .put("client.transport.ping_timeout", settings.get("timeout", "30s"))
                .put(ClientBuilder.MAX_ACTIONS_PER_REQUEST, settings.getAsInt("maxbulkactions", 1000))
                .put(ClientBuilder.MAX_CONCURRENT_REQUESTS, settings.getAsInt("maxconcurrentbulkrequests",
                        Runtime.getRuntime().availableProcessors()))
                .setMetric(new SimpleBulkMetric())
                .setControl(new SimpleBulkControl());
       BulkTransportClient client = clientBuilder.toBulkTransportClient();
----

For more examples, consult the integration etsts at `src/integration-test/java`.

A re-implemented `BulkProcessor` allows flushing of documents before closing.

Also, a light-weight re-implementation of the `TransportClient` class is provided with the following differences to the original `TransportClient`:

- no retry mechanism, no exponential back off, if an error or exception is encountered, the client fails fast

- no _sniffing_, that means, no additional nodes are detected during runtime

- methods of `TransportClient`, `TransportClientNodesServce`, `TransportClientProxy` classes are merged into one class

- configurable ping timeout

#### Some interesting methods

Here are some methods from the `ClientMethods` API, these are not all methods, but maybe
some of which can demonstrate the convenience.

Create new index, use settings and mappings from input streams.
----
ClientMethods newIndex(String index, String type, InputStream settings, InputStream mappings) throws IOException
----

Switch an index to bulk mode - disable replicas, set refresh interval.
----
ClientMethods startBulk(String index, long startRefreshIntervalSeconds, long stopRefreshIntervalSeconds) throws IOException
----

Index document, use bulk mode automatically.
----
ClientMethods index(String index, String type, String id, String source);
----

Wait for outstanding bulk responsed from the cluster.
----
ClientMethods waitForResponses(TimeValue maxWait) throws InterruptedException, ExecutionException;
----

Update replica level on an index.
----
int updateReplicaLevel(String index, int level) throws IOException;
----

Switch aliases from a previously created index with a timestamp to a current index under the common base name `index`.

----
void switchAliases(String index, String concreteIndex, List<String> extraAliases, IndexAliasAdder adder);
----

Retention policy for an index. All indices before `timestampdiff` should be deleted,
but `mintokeep` indices must be kept.

----
void performRetentionPolicy(String index, String concreteIndex, int timestampdiff, int mintokeep);
----

## Prerequisites

You will need Java 8, although Elasticsearch 2.x requires Java 7. Java 7 is not supported.

## Dependencies

This project depends only on https://github.com/xbib/metrics which is a slim version of Coda Hale's metrics library,
Elasticsearch, and Log4j2 API.

## How to decode the Elasticsearch version

This project uses semantic versioning to determine the Elasticsearch upstream version it is built against.

The first three version numbers are the corresponding Elasticsearch version. The last version number is
an incrementing number, the version of this project.

Please use exactly the Elasticsearch version which is declared in the project's version.
Other Elasticsearch versions do not work and will never work, it is not worth to try it.
This is by design of the Elasticsearch project because the internal node communication protocol depends on the
exact same API implementation. Also, the exact same version of Java virtual machine is remoonded on server
and client side.