joerg/elx - Forgejo: Git with a cup of tea

joerg/elx

No description

Find a file

Jörg Prante 6d434a98c6 align with es7102		2021-04-21 14:44:58 +02:00
config	large refactoring, new subpackage structure, Gradle 5.2.1, Java 11	2019-02-18 17:21:57 +01:00
elx-api	align with es7102	2021-04-21 14:44:58 +02:00
elx-common	align with es7102	2021-04-21 14:44:58 +02:00
elx-http	split into admin, bulk, search client, fix HTTP client	2020-05-22 14:31:39 +02:00
elx-node	align with es7102	2021-04-21 14:44:58 +02:00
elx-transport	align with es7102	2021-04-21 14:44:58 +02:00
gradle	working on dynamic bulk requests	2021-04-16 09:19:36 +02:00
http	update to Gradle 5.2.1, simpler API without TimeValue, fixing testframework/netty dependencies	2019-02-15 11:10:15 +01:00
node	Merge branch '2.2'	2019-02-20 19:14:51 +01:00
transport	Merge branch '2.2'	2019-02-20 19:14:51 +01:00
.gitignore	cosmetic	2020-05-13 20:44:47 +02:00
build.gradle	split into admin, bulk, search client, fix HTTP client	2020-05-22 14:31:39 +02:00
gradle.properties	prefix bulk for parameters, fine tuning of auto volume bulk indexing (ezb web with 72 MB/s)	2021-04-19 19:10:09 +02:00
gradlew	fix time unit, avoiding zero timeout intervals	2020-11-12 15:43:32 +01:00
gradlew.bat	fix time unit, avoiding zero timeout intervals	2020-11-12 15:43:32 +01:00
LICENSE.txt	initial commit	2016-11-01 17:28:05 +01:00
README.adoc	cosmetic	2020-05-13 20:44:47 +02:00
settings.gradle	switch to Junit 5	2019-05-02 19:46:21 +02:00

README.adoc

# Elasticsearch Clients

image:https://api.travis-ci.org/xbib/content.svg[title="Build status", link="https://travis-ci.org/jprante/elasticsearch-extras-client/"]
image:https://img.shields.io/sonar/http/nemo.sonarqube.com/org.xbib%3Aelasticsearch-extras-client/coverage.svg?style=flat-square[title="Coverage", link="https://sonarqube.com/dashboard/index?id=org.xbib%3Aelasticsearch-extras-client"]
image:https://maven-badges.herokuapp.com/maven-central/org.xbib/elasticsearch-extras-client/badge.svg[title="Maven Central", link="http://search.maven.org/#search%7Cga%7C1%7Cxbib%20elasticsearch-extras-client"]
image:https://img.shields.io/badge/License-Apache%202.0-blue.svg[title="Apache License 2.0", link="https://opensource.org/licenses/Apache-2.0"]

This Java library extends the Elasticsearch Java Client classes for better convenience.

It is not a plugin for Elasticsearch. Use it by importing the jar from Maven Central into your project.

The Elasticsearch node client and transport client APIs are unified in a `ClientMethods` interface. This interface uses
bulk services and index management under the hood, like index creation, alias managent, and retention policies.

Two classes `BulkNodeClient` and `BulkTransportClient` combine the client methods with the `BulkProcessor`,
provide some logging convenience, and still offer the `Client` interface of Elasticsearch by using the `client()` method.

A `MockTransportClient` implements the `BulkTransportClient` API but does not need a running Elasticsearch node
to connect to. This is useful for unit testing.

The client classes are enriched by metrics that can measure document count, size, and speed.

A `ClientBuilder` helps to build client instances. For example

[source,java]
----
ClientBuilder clientBuilder = ClientBuilder.builder()
.put(elasticsearchSettings)
.put("client.transport.ping_timeout", settings.get("timeout", "30s"))
.put(ClientBuilder.MAX_ACTIONS_PER_REQUEST, settings.getAsInt("maxbulkactions", 1000))
.put(ClientBuilder.MAX_CONCURRENT_REQUESTS, settings.getAsInt("maxconcurrentbulkrequests",
Runtime.getRuntime().availableProcessors()))
.setMetric(new SimpleBulkMetric())
.setControl(new SimpleBulkControl());
BulkTransportClient client = clientBuilder.toBulkTransportClient();
----

For more examples, consult the integration etsts at `src/integration-test/java`.

A re-implemented `BulkProcessor` allows flushing of documents before closing.

Also, a light-weight re-implementation of the `TransportClient` class is provided with the following differences to the original `TransportClient`:

- no retry mechanism, no exponential back off, if an error or exception is encountered, the client fails fast

- no _sniffing_, that means, no additional nodes are detected during runtime

- methods of `TransportClient`, `TransportClientNodesServce`, `TransportClientProxy` classes are merged into one class

- configurable ping timeout

#### Some interesting methods

Here are some methods from the `ClientMethods` API, these are not all methods, but maybe
some of which can demonstrate the convenience.

Create new index, use settings and mappings from input streams.
----
ClientMethods newIndex(String index, String type, InputStream settings, InputStream mappings) throws IOException
----

Switch an index to bulk mode - disable replicas, set refresh interval.
----
ClientMethods startBulk(String index, long startRefreshIntervalSeconds, long stopRefreshIntervalSeconds) throws IOException
----

Index document, use bulk mode automatically.
----
ClientMethods index(String index, String type, String id, String source);
----

Wait for outstanding bulk responsed from the cluster.
----
ClientMethods waitForResponses(TimeValue maxWait) throws InterruptedException, ExecutionException;
----

Update replica level on an index.
----
int updateReplicaLevel(String index, int level) throws IOException;
----

Switch aliases from a previously created index with a timestamp to a current index under the common base name `index`.

----
void switchAliases(String index, String concreteIndex, List<String> extraAliases, IndexAliasAdder adder);
----

Retention policy for an index. All indices before `timestampdiff` should be deleted,
but `mintokeep` indices must be kept.

----
void performRetentionPolicy(String index, String concreteIndex, int timestampdiff, int mintokeep);
----

## Prerequisites

You will need Java 8, although Elasticsearch 2.x requires Java 7. Java 7 is not supported.

## Dependencies

This project depends only on https://github.com/xbib/metrics which is a slim version of Coda Hale's metrics library,
Elasticsearch, and Log4j2 API.

## How to decode the Elasticsearch version

This project uses semantic versioning to determine the Elasticsearch upstream version it is built against.

The first three version numbers are the corresponding Elasticsearch version. The last version number is
an incrementing number, the version of this project.

Please use exactly the Elasticsearch version which is declared in the project's version.
Other Elasticsearch versions do not work and will never work, it is not worth to try it.
This is by design of the Elasticsearch project because the internal node communication protocol depends on the
exact same API implementation. Also, the exact same version of Java virtual machine is remoonded on server
and client side.