Friday, March 14, 2025

Elastic Cross-Cluster Ops - Replication + Search

Summary

This is the final post in a three-part series on configuring Elasticsearch (ES) cross-cluster replication and search.  The first two posts set up two distinct ES clusters: one implemented in Kubernetes (ECK), and the other in Docker.

Cross-cluster Replication

Architecture


Configuration

Networking

The two clusters (West and East) are both implemented in Docker.  Although West is a K8s implementation, the underlying architecture of Kind is, in fact, Docker.  Each cluster is in its own Docker network, so for cross-cluster operations to function, we need to link those two networks.  The Docker commands below do just that.

echo -e "\n*** Connect Networks ***"
docker network connect kind east-es01-1
docker network connect east_net kind-control-plane
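After connecting, a quick inspection confirms that each network now carries the peer's node (the container names match those used in the connect commands above):

# Each network's container list should now include the peer's node
docker network inspect kind --format '{{range .Containers}}{{.Name}} {{end}}'
docker network inspect east_net --format '{{range .Containers}}{{.Name}} {{end}}'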

Remote Cluster Configuration

The West Cluster needs to be added to the East as a remote cluster.  The commands below do that and then wait for the remote configuration to complete.

echo -e "\n*** Activate West as a Remote Cluster on East ***"
WEST_TRANS_IP=$(kubectl get service westcluster-es-transport -o=jsonpath='{.status.loadBalancer.ingress[0].ip}')
json=$(jq -nc \
--arg proxy_address "$WEST_TRANS_IP":9300 \
'{
persistent: {
cluster: {
remote: {
west_remote: {
mode: "proxy",
proxy_address: $proxy_address
}
}
}
}
}')
curl -s -k -u "elastic:elastic" -X PUT "https://$EAST_ELASTIC_IP:9200/_cluster/settings" \
-H "Content-Type: application/json" \
-d "$json" > /dev/null
REMOTE_STATUS=$(curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/_resolve/cluster/west_remote:*" | jq '.west_remote.connected')
while [[ $REMOTE_STATUS != "true" ]]
do
sleep 5
REMOTE_STATUS=$(curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/_resolve/cluster/west_remote:*" | jq '.west_remote.connected')
done
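Once the loop exits, the remote's connection details can also be inspected directly.  A sanity check, assuming the same endpoint and credentials as above:

# _remote/info reports the connection mode, proxy address, and connected state
curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/_remote/info" | jq '.west_remote'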

East Follower Index

In this scenario, we are setting up a leader index on the West cluster (named west_ccr) and its corresponding follower index on the East (named east_ccr).  This will allow one-way replication from West to East.

echo -e "\n*** Create a Follower Index, east_ccr on East Cluster ***"
json=$(jq -nc \
'{
remote_cluster: "west_remote",
leader_index: "west_ccr",
max_read_request_operation_count: 5120,
max_outstanding_read_requests: 12,
max_read_request_size: "32mb",
max_write_request_operation_count: 5120,
max_write_request_size: "9223372036854775807b",
max_outstanding_write_requests: 9,
max_write_buffer_count: 2147483647,
max_write_buffer_size: "512mb",
max_retry_delay: "500ms",
read_poll_timeout: "1m"
}')
curl -s -k -u "elastic:elastic" -X PUT "https://$EAST_ELASTIC_IP:9200/east_ccr/_ccr/follow" \
-H "Content-Type: application/json" \
-d "$json" > /dev/null

Demo

At this point, true replication of the West index (west_ccr), including mappings (schema), has been accomplished.  This can be verified with a simple Node.js Elasticsearch client application located in the src/javascript directory.

$ node ccr_test.js
*** West CCR ***
{
  name: 'Snow Crash',
  author: 'Neal Stephenson',
  release_date: '1992-06-01',
  page_count: 470
}
{
  name: 'Revelation Space',
  author: 'Alastair Reynolds',
  release_date: '2000-03-15',
  page_count: 585
}
*** East CCR ***
{
  name: 'Snow Crash',
  author: 'Neal Stephenson',
  release_date: '1992-06-01',
  page_count: 470
}
{
  name: 'Revelation Space',
  author: 'Alastair Reynolds',
  release_date: '2000-03-15',
  page_count: 585
}
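If you'd rather not run the Node.js app, a rough curl equivalent produces the same comparison.  This sketch assumes the WEST_ELASTIC_IP and EAST_ELASTIC_IP variables from the cluster build scripts are still set:

# Pull the documents from the leader and the follower and compare
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/west_ccr/_search" | jq '.hits.hits[]._source'
curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/east_ccr/_search" | jq '.hits.hits[]._source'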

Cross-cluster Search

Architecture

Configuration

Remote Cluster Configuration

Similar to the prior exercise of establishing the West cluster as a remote cluster on the East, the East cluster now needs to be configured as a remote cluster on the West.

echo -e "\n*** Activate East as a Remote Cluster on West ***"
json=$(jq -nc \
  --arg proxy_address "$EAST_ELASTIC_IP":9300 \
  '{
    persistent: {
      cluster: {
        remote: {
          east_remote: {
            mode: "proxy",
            proxy_address: $proxy_address
          }
        }
      }
    }
  }')
WEST_ELASTIC_IP=$(kubectl get service westcluster-es-http -o=jsonpath='{.status.loadBalancer.ingress[0].ip}')
curl -s -k -u "elastic:elastic" -X PUT "https://$WEST_ELASTIC_IP:9200/_cluster/settings" \
  -H "Content-Type: application/json" \
  -d "$json"

Demo

At this point, everything is in place to execute queries that span the two clusters.  A Python Elasticsearch client script is included in the src/python directory.  That script executes a search against the West ES endpoint that spans both the west_ccs index on West and the east_ccs index on East.

from elasticsearch import Elasticsearch

WEST_IP = "172.18.0.4"
client = Elasticsearch(f"https://{WEST_IP}:9200", ca_certs="../../west/west-http-ca.crt", basic_auth=("elastic", "elastic"))
resp = client.search(index=["west_ccs", "east_remote:east_ccs"], query={"range": {"release_date": {"gte": 1985}}})
for hit in resp["hits"]["hits"]:
    print(hit["_source"])
For reference, the two indices contain the following documents:
{ "index" : { "_index" : "east_ccs" } }
{"name": "Brave New World", "author": "Aldous Huxley", "release_date": "1932-06-01", "page_count": 268}
{ "index" : { "_index" : "east_ccs" } }
{"name": "The Handmaid'"'"'s Tale", "author": "Margaret Atwood", "release_date": "1985-06-01", "page_count": 311}
{ "index" : { "_index" : "west_ccs" } }
{"name": "1984", "author": "George Orwell", "release_date": "1985-06-01", "page_count": 328}
{ "index" : { "_index" : "west_ccs" } }
{"name": "Fahrenheit 451", "author": "Ray Bradbury", "release_date": "1953-10-15", "page_count": 227}
Running the script returns only the documents with a release_date in or after 1985, drawn from both clusters:
$ python3 ccs_test.py
{'name': "The Handmaid's Tale", 'author': 'Margaret Atwood', 'release_date': '1985-06-01', 'page_count': 311}
{'name': '1984', 'author': 'George Orwell', 'release_date': '1985-06-01', 'page_count': 328}
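The same cross-cluster search can be issued with curl by listing the local and remote indices together.  A sketch, assuming the West HTTP endpoint and credentials from the build scripts:

# A comma-separated index list mixes the local index and the remote-prefixed one
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/west_ccs,east_remote:east_ccs/_search" \
  -H "Content-Type: application/json" \
  -d '{"query": {"range": {"release_date": {"gte": 1985}}}}' | jq '.hits.hits[]._source'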

Source

Elastic Cross-Cluster Ops - East Cluster Build

Summary

This is Part 2 in the three-part series on Elasticsearch (ES) cross-cluster operations.  This post covers the East cluster, which is implemented in Docker.  Like the West cluster, the configuration yields a single ES node and one Kibana node.

Architecture



Configuration

Docker-compose

For the most part, I used the reference docker-compose file.  I added a bind mount so that I could add the West cluster's transport CA to the trusted CAs for the East cluster.


volumes:
  - certs:/usr/share/elasticsearch/config/certs
  - ${PWD}/../west/west-ca.crt:/usr/share/elasticsearch/config/remote/west-ca.crt
environment:
  - xpack.security.transport.ssl.certificate_authorities=certs/ca/ca.crt,remote/west-ca.crt
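A quick check that the mount worked, assuming the compose project yields a container named east-es01-1 (the name used in the cross-cluster networking step):

# The West CA should be visible inside the East node's config directory
docker exec east-es01-1 ls -l /usr/share/elasticsearch/config/remote/west-ca.crt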

Index

Below, a minimal index is built via the REST API.  This index is used to demonstrate cross-cluster search in the next post.

echo -e "\n*** Create east_ccs index ***"
curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/_bulk?pretty" \
-H "Content-Type: application/json" \
-d'
{ "index" : { "_index" : "east_ccs" } }
{"name": "Brave New World", "author": "Aldous Huxley", "release_date": "1932-06-01", "page_count": 268}
{ "index" : { "_index" : "east_ccs" } }
{"name": "The Handmaid'"'"'s Tale", "author": "Margaret Atwood", "release_date": "1985-06-01", "page_count": 311}
' > /dev/null
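A count query verifies that both documents landed:

# Expect a count of 2
curl -s -k -u "elastic:elastic" "https://$EAST_ELASTIC_IP:9200/east_ccs/_count" | jq '.count'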

Source

Elastic Cross-Cluster Ops - West Cluster Build

Summary

This is Part 1 in a three-part series on a multi-cluster build of Elasticsearch (ES) with cross-cluster replication and search enablement.  This post covers the build of the West cluster which is implemented in Kubernetes.

Architecture

The West cluster is implemented with Elastic Cloud on Kubernetes (ECK).  I'm using Kind for the Kubernetes environment, which allows for a self-contained setup that runs on a capable laptop.  Additionally, I use cloud-provider-kind to provide native load-balancer functionality.

Configuration

Kind/Cloud-Provider-Kind


kind create cluster
docker run -d --rm --name cloud-provider-kind --network kind \
  -v /var/run/docker.sock:/var/run/docker.sock registry.k8s.io/cloud-provider-kind/cloud-controller-manager:v0.6.0

ECK Operator


helm repo add elastic https://helm.elastic.co
helm repo update elastic
helm install elastic-operator elastic/eck-operator -n elastic-system --create-namespace
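Before applying any Elastic manifests, it's worth confirming the operator pod is up:

# The operator pod in elastic-system should reach Running status
kubectl get pods -n elastic-system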

Elasticsearch + Kibana


apiVersion: v1
kind: Secret
type: Opaque
metadata:
  name: westcluster-es-elastic-user
data:
  elastic: ZWxhc3RpYw==
---
apiVersion: v1
kind: Secret
metadata:
  name: eck-trial-license
  namespace: elastic-system
  labels:
    license.k8s.elastic.co/type: enterprise_trial
  annotations:
    elastic.co/eula: accepted
---
apiVersion: elasticsearch.k8s.elastic.co/v1
kind: Elasticsearch
metadata:
  name: westcluster
spec:
  version: 8.17.2
  nodeSets:
    - name: default
      count: 1
      config:
        node.store.allow_mmap: false
  http:
    service:
      spec:
        type: LoadBalancer
  transport:
    service:
      spec:
        type: LoadBalancer
apiVersion: kibana.k8s.elastic.co/v1
kind: Kibana
metadata:
  name: kibana
spec:
  version: 8.17.2
  count: 1
  elasticsearchRef:
    name: westcluster
  http:
    service:
      spec:
        type: LoadBalancer
With both resources applied, the transport and HTTP CAs are exported and the load-balancer IPs captured for use in later steps:
kubectl get secret westcluster-es-transport-certs-public -o jsonpath='{.data.ca\.crt}' | base64 --decode > west-ca.crt
kubectl get secret westcluster-es-http-certs-public -o jsonpath='{.data.ca\.crt}' | base64 --decode > west-http-ca.crt
WEST_ELASTIC_IP=$(kubectl get service westcluster-es-http -o=jsonpath='{.status.loadBalancer.ingress[0].ip}')
WEST_KIBANA_IP=$(kubectl get service kibana-kb-http -o=jsonpath='{.status.loadBalancer.ingress[0].ip}')
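With the load-balancer IPs in hand, a quick smoke test against the HTTP endpoint confirms the cluster is reachable (self-signed cert, hence -k):

# A "green" or "yellow" status means the single-node cluster is up
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/_cluster/health" | jq '.status'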

Indices

Two minimal indices are created with the REST API.  These indices will be used in a later post on cross-cluster replication and search.

echo -e "\n*** Create west_ccr index ***"
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/_bulk?pretty" \
-H "Content-Type: application/json" \
-d'
{ "index" : { "_index" : "west_ccr" } }
{"name": "Snow Crash", "author": "Neal Stephenson", "release_date": "1992-06-01", "page_count": 470}
{ "index" : { "_index" : "west_ccr" } }
{"name": "Revelation Space", "author": "Alastair Reynolds", "release_date": "2000-03-15", "page_count": 585}
' > /dev/null
echo -e "\n*** Create west_ccs index ***"
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/_bulk?pretty" \
-H "Content-Type: application/json" \
-d'
{ "index" : { "_index" : "west_ccs" } }
{"name": "1984", "author": "George Orwell", "release_date": "1985-06-01", "page_count": 328}
{ "index" : { "_index" : "west_ccs" } }
{"name": "Fahrenheit 451", "author": "Ray Bradbury", "release_date": "1953-10-15", "page_count": 227}
' > /dev/null
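Since the bulk requests rely on dynamic mapping, the generated schema can be checked afterward.  This is the mapping that cross-cluster replication will later copy to the follower:

# Expect author, name, page_count, and release_date fields
curl -s -k -u "elastic:elastic" "https://$WEST_ELASTIC_IP:9200/west_ccr/_mapping" | jq '.west_ccr.mappings.properties | keys'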

Source

Sunday, March 2, 2025

Geospatial Search with Redis and Apache Pinot

Summary

In this post, I'll walk through a Redis Enterprise and Apache Pinot setup in Docker:
  • 3-node Redis environment
  • 4-node Pinot environment
  • 1M synthetic JSON records representing a user's geographic location
  • Equivalent geospatial search queries for both environments

Architecture


Data Generation

Synthetic records are created with a Python function that utilizes the Faker library. They are saved to a file called locations.json.  A snippet of the data generator is below.

import random
from uuid import uuid4

from faker import Faker
from shapely.geometry import Point

fake = Faker()

def worker(ndocs):
    global fake
    result = []
    for _ in range(ndocs):
        uuid = str(uuid4())
        coords = fake.local_latlng(country_code="US", coords_only=True)
        # faker's local_latlng draws from a fixed list and generates duplicates;
        # jittering the coordinates is a workaround
        lat = round(float(coords[0]) + random.uniform(-1, 1), 5)
        lng = round(float(coords[1]) + random.uniform(-1, 1), 5)
        point_lnglat = f'{lng} {lat}'
        point_wkt = Point(lng, lat).wkt
        dob = fake.date_of_birth(minimum_age=10, maximum_age=90).year
        result.append({"uuid": uuid, "point_lnglat": point_lnglat, "point_wkt": point_wkt, "dob": dob})
    return result
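A quick spot-check of the output, assuming the generator writes one JSON document per line to locations.json (the exact file layout isn't shown here):

# The first record should contain uuid, point_lnglat, point_wkt, and dob
head -n 1 locations.json | jq
wc -l < locations.json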

Redis Deployment

Docker

Three Redis nodes are created.

re1:
  image: redislabs/redis:latest
  container_name: re1
  restart: unless-stopped
  tty: true
  cap_add:
    - sys_resource
  ports:
    - 12000:12000
    - 8443:8443
    - 9443:9443
  networks:
    redis-net:
      ipv4_address: 192.168.20.2
re2:
  image: redislabs/redis:latest
  container_name: re2
  restart: unless-stopped
  tty: true
  cap_add:
    - sys_resource
  networks:
    redis-net:
      ipv4_address: 192.168.20.3
re3:
  image: redislabs/redis:latest
  container_name: re3
  restart: unless-stopped
  tty: true
  cap_add:
    - sys_resource
  networks:
    redis-net:
      ipv4_address: 192.168.20.4

Cluster Build, DB creation, Index Build, Data Load

Nodes are joined into a Redis cluster. A single-shard database is then created, along with an index. Finally, the database is populated from a JSON file with Riot.

echo -e "\n*** Build Redis Cluster ***"
docker exec -it re1 /opt/redislabs/bin/rladmin cluster create name cluster.local username redis@redis.com password redis
docker exec -it re2 /opt/redislabs/bin/rladmin cluster join nodes 192.168.20.2 username redis@redis.com password redis
docker exec -it re3 /opt/redislabs/bin/rladmin cluster join nodes 192.168.20.2 username redis@redis.com password redis
sleep 1
echo -e "\n*** Build Target Redis DB ***"
curl -s -o /dev/null -k -u "redis@redis.com:redis" https://localhost:9443/v1/bdbs -H "Content-Type:application/json" -d @$PWD/redis/redb.json
echo -e "\n*** Create Redis Index ***"
redis-cli -h localhost -p 12000 FT.CREATE locationidx ON JSON PREFIX 1 location: SCHEMA $.point_lnglat AS point_lnglat GEO SORTABLE $.point_wkt AS point_wkt GEOSHAPE SPHERICAL SORTABLE $.dob AS dob NUMERIC SORTABLE $.uuid AS uuid TAG SORTABLE
echo -e "\n*** Ingest Data to Redis ***"
riot file-import -h localhost -p 12000 --threads 20 $DATA json.set --keyspace location --key uuid
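Two quick checks confirm the load, assuming the defaults above:

# num_docs in FT.INFO should approach 1M as indexing completes
redis-cli -h localhost -p 12000 FT.INFO locationidx | grep -A1 num_docs
redis-cli -h localhost -p 12000 DBSIZE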

Redis Insight

Pinot Deployment

Docker

A 4-node Pinot cluster is created with docker-compose.  The Pinot Controller definition is below.

pinot-controller:
  image: apachepinot/pinot:latest
  command: "StartController -zkAddress pinot-zookeeper:2181"
  container_name: "pinot-controller"
  restart: unless-stopped
  ports:
    - "9000:9000"
  environment:
    JAVA_OPTS: "-Dplugins.dir=/opt/pinot/plugins -Xms1G -Xmx4G -XX:+UseG1GC -XX:MaxGCPauseMillis=200 -Xloggc:gc-pinot-controller.log"
  depends_on:
    pinot-zookeeper:
      condition: service_healthy
  networks:
    - pinot-net
  volumes:
    - $PWD/data:/tmp/pinot/data
    - $PWD/pinot:/tmp/pinot/config
  healthcheck:
    test: ["CMD-SHELL", "curl -f http://localhost:9000/health || exit 1"]
    interval: 30s
    timeout: 10s
    retries: 5
    start_period: 10s

Schema/Table Build, Data Load


echo -e "\n*** Build Pinot Schema and Table ***"
docker exec -it pinot-controller /opt/pinot/bin/pinot-admin.sh AddTable -tableConfigFile /tmp/pinot/config/table.json -schemaFile /tmp/pinot/config/schema.json -controllerHost localhost -controllerPort 9000 -exec > /dev/null 2>&1
sleep 1
echo -e "\n*** Ingest Data to Pinot ***"
docker exec -it pinot-controller /opt/pinot/bin/pinot-admin.sh LaunchDataIngestionJob -jobSpecFile /tmp/pinot/config/job-spec.yaml > /dev/null 2>&1
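Once the ingestion job finishes, a count query against the broker confirms all 1M records arrived.  The table name "locations" is an assumption here; substitute whatever table.json defines:

# Broker SQL endpoint; expect 1000000 rows (table name is an assumption)
curl -s -X POST -H "Content-Type: application/json" \
  -d '{"sql": "SELECT COUNT(*) FROM locations"}' http://localhost:8099/query/sql | jq '.resultTable.rows'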

Pinot Console

Queries

The query below finds the count of users within a polygon defined as the boundaries of the State of Colorado.

Redis


$ redis-cli -p 12000 -3 FT.SEARCH locationidx '@point_wkt:[WITHIN $Colorado]' PARAMS 2 Colorado 'POLYGON((-109.0448 37.0004,-102.0424 36.9949,-102.0534 41.0006,-109.0489 40.9996,-109.0448 37.0004,-109.0448 37.0004))' LIMIT 0 0 DIALECT 2
1# attributes => (empty array)
2# warning => (empty array)
3# total_results => (integer) 12500
4# format => STRING
5# results => (empty array)

Pinot


$ curl -s -X POST -H "Content-Type: application/json" -d @pinot/s4.json http://localhost:8099/query/sql \
  | jq '.resultTable'
{
  "dataSchema": {
    "columnNames": [
      "CO_Total"
    ],
    "columnDataTypes": [
      "LONG"
    ]
  },
  "rows": [
    [
      12500
    ]
  ]
}
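The payload in pinot/s4.json isn't reproduced above.  A plausible reconstruction, assuming a table named locations and the WKT point stored in a point_wkt string column (both assumptions), pairs Pinot's ST_Contains and ST_GeomFromText functions with the same Colorado polygon used in the Redis query:

# Hedged sketch of pinot/s4.json; table and column names are assumptions
cat <<'EOF' > pinot/s4.json
{
  "sql": "SELECT COUNT(*) AS CO_Total FROM locations WHERE ST_Contains(ST_GeomFromText('POLYGON((-109.0448 37.0004,-102.0424 36.9949,-102.0534 41.0006,-109.0489 40.9996,-109.0448 37.0004))'), ST_GeomFromText(point_wkt)) = 1"
}
EOF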

Source