Events over SQL.
Simple. Reliable. Fast.
Able to publish and consume thousands of events per second on a single Postgres instance.
With sharding, it can easily support tens or even hundreds of thousands of events per second, scaling virtually without limit.
For proof, see the benchmarks.
We need just a few tables. Column names are prefixed because some of them are reserved keywords in some databases (Postgres syntax; the schema is fully managed by EventSQL):
CREATE TABLE topic (
eql_name TEXT PRIMARY KEY,
eql_partitions SMALLINT NOT NULL,
eql_created_at TIMESTAMP NOT NULL DEFAULT NOW()
);
CREATE TABLE consumer (
eql_topic TEXT NOT NULL,
eql_name TEXT NOT NULL,
eql_partition SMALLINT NOT NULL,
eql_first_event_id BIGINT,
eql_last_event_id BIGINT,
eql_last_consumption_at TIMESTAMP,
eql_consumed_events BIGINT NOT NULL,
eql_created_at TIMESTAMP NOT NULL DEFAULT NOW(),
PRIMARY KEY (eql_topic, eql_name, eql_partition)
);
CREATE TABLE {topic}_event (
eql_id BIGSERIAL PRIMARY KEY,
eql_partition SMALLINT NOT NULL,
eql_key TEXT,
eql_value BYTEA NOT NULL,
eql_buffered_at TIMESTAMP NOT NULL,
eql_created_at TIMESTAMP NOT NULL DEFAULT NOW(),
eql_metadata JSON NOT NULL
);
-- Same schema as {topic}_event, just not partitioned by topic.
-- It is used to handle the eventual consistency of auto increment:
-- there is no guarantee that the record with id 2 becomes visible only after the record with id 1 does.
-- Events are first inserted into event_buffer;
-- they are then moved to the {topic}_event table in bulk, by a single, serialized writer (per topic).
-- Because there is only one writer, this fixes the eventual consistency issue.
CREATE TABLE event_buffer (
eql_id BIGSERIAL PRIMARY KEY,
eql_topic TEXT NOT NULL,
eql_partition SMALLINT NOT NULL,
eql_key TEXT,
eql_value BYTEA NOT NULL,
eql_created_at TIMESTAMP NOT NULL DEFAULT NOW(),
eql_metadata JSON NOT NULL
);
CREATE INDEX event_buffer_topic_id ON event_buffer (eql_topic, eql_id);
-- Used to lock a single (per topic) event_buffer to {topic}_event writer
CREATE TABLE event_buffer_lock (
eql_topic TEXT PRIMARY KEY
);
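To make the buffer flush concrete, here is a sketch of what a single writer's move might look like - the real statements are internal to EventSQL and may differ; :topic, :limit and :last_moved_id are placeholders:
BEGIN;
-- take the per-topic writer lock; if another writer holds it,
-- zero rows are returned and this flush is simply skipped
SELECT * FROM event_buffer_lock
WHERE eql_topic = :topic
FOR UPDATE SKIP LOCKED;
-- copy a batch of buffered events, in id order, to the topic's event table
INSERT INTO {topic}_event (eql_partition, eql_key, eql_value, eql_buffered_at, eql_metadata)
SELECT eql_partition, eql_key, eql_value, eql_created_at, eql_metadata
FROM event_buffer
WHERE eql_topic = :topic
ORDER BY eql_id
LIMIT :limit;
-- remove moved events; :last_moved_id is the max eql_id of the copied batch
DELETE FROM event_buffer
WHERE eql_topic = :topic
AND eql_id <= :last_moved_id;
COMMIT;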
To consume events, we periodically run the following (every second or every few seconds):
BEGIN;
SELECT * FROM consumer
WHERE eql_topic = :topic
AND eql_name = :c_name
FOR UPDATE SKIP LOCKED;
SELECT * FROM {topic}_event
WHERE (:last_event_id IS NULL OR eql_id > :last_event_id)
ORDER BY eql_id
LIMIT :limit;
(process events)
UPDATE consumer
SET eql_last_event_id = :id,
eql_last_consumption_at = :now
WHERE eql_topic = :topic
AND eql_name = :c_name;
COMMIT;
Optionally, to increase throughput and concurrency, we might have a partitioned topic and consumers (a partition of -1 stands for a non-partitioned topic/consumer/event).
Distribution of partitioned events is the sole responsibility of the publisher.
Consumption of such events per partition (0 in the example below) might look like this:
BEGIN;
SELECT * FROM consumer
WHERE eql_topic = :topic
AND eql_name = :c_name
AND eql_partition = 0
FOR UPDATE SKIP LOCKED;
SELECT * FROM {topic}_event
WHERE (:last_event_id IS NULL OR eql_id > :last_event_id)
AND eql_partition = 0
ORDER BY eql_id
LIMIT :limit;
(process events)
UPDATE consumer
SET eql_last_event_id = :id,
eql_last_consumption_at = :now
WHERE eql_topic = :topic
AND eql_name = :c_name
AND eql_partition = 0;
COMMIT;
The limitation is that if a consumer is partitioned, it must have exactly the same number of partitions as its topic definition. It's a rather acceptable tradeoff and easy to enforce at the library level.
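A hypothetical sketch of such a check (names and accessors are illustrative, assuming TopicDefinition and ConsumerDefinition are records with accessors matching their constructors - this is not EventSQL's actual internals):
// hypothetical validation run when a consumer is registered;
// a partitioned consumer simply inherits its topic's partition count
void validateConsumerPartitioning(TopicDefinition topic, ConsumerDefinition consumer) {
    if (consumer.partitioned() && topic.partitions() <= 0) {
        throw new IllegalArgumentException(
            "Consumer %s is partitioned, but its topic %s is not"
                .formatted(consumer.name(), topic.name()));
    }
}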
EventSQL is an entrypoint to the whole library. It requires a standard Java javax.sql.DataSource or a list of them:
import com.binaryigor.eventsql.EventSQL;
// dialect of your events backend - POSTGRES, MYSQL and MARIADB are supported, as of now
import com.binaryigor.eventsql.EventSQLDialect;
import javax.sql.DataSource;
var eventSQL = new EventSQL(dataSource, EventSQLDialect.POSTGRES);
var shardedEventSQL = new EventSQL(dataSources, EventSQLDialect.POSTGRES);
The sharded version works in the same vein - it just assumes that topics and consumers are hosted on multiple databases.
Required tables are managed automatically by the library, but if you want to customize their schema a bit, you can provide your own EventSQLRegistry.TablesManager implementation. See EventSQLRegistry for details.
Having an EventSQL instance, we can register topics and their consumers:
// all operations are idempotent
eventSQL.registry()
// -1 stands for a non-partitioned topic
.registerTopic(new TopicDefinition("account_created", -1))
.registerTopic(new TopicDefinition("invoice_issued", 5))
// third argument (true/false) determines whether the consumer is partitioned or not
.registerConsumer(new ConsumerDefinition("account_created", "consumer-1", false))
.registerConsumer(new ConsumerDefinition("invoice_issued", "consumer-2", true));
Topics and consumers can both be partitioned or not. Partitioned topics make it possible to have partitioned consumers, increasing parallelism. Parallelism of a partitioned consumer is capped at the consumed topic's number of partitions, because events have an ordering guarantee within a partition. As a consequence, for a given consumer, each partition can be processed by only a single thread at a time.
For a consumer to be partitioned, its topic must be partitioned as well - the consumer will then have the same number of partitions. The opposite does not have to be true - a consumer might not be partitioned even though its topic is; this has performance implications though, since as described above, consumer parallelism is capped at its number of partitions.
With sharding, partitions are multiplied by the number of shards (db instances).
For example, if we have 3 shards (3 dbs) and a topic with 10 partitions, each shard (db) will host 10 partitions, giving 30 partitions in total. The same goes for consumers of a sharded topic - they will all be multiplied by the number of shards.
For events, it works differently - in the example above, each shard will host ~33% (1/3) of the topic's event data (assuming even partition distribution). To get all events, we must read them from all shards.
There will be 30 consumer instances in this particular case (3 shards * 10 partitions), each consuming from one partition hosted on a given shard.
Each event is published to one partition of a single shard - as a consequence, events are globally unique, across all shards.
We can publish single events and batches of arbitrary data and type:
var publisher = eventSQL.publisher();
publisher.publish(new EventPublication("txt_topic", "txt event".getBytes(StandardCharsets.UTF_8)));
publisher.publish(new EventPublication("raw_topic", new byte[]{1, 2, 3}));
publisher.publish(new EventPublication("json_topic",
"""
{
"id": 2,
"name: "some-user"
}
""".getBytes(StandardCharsets.UTF_8)));
// events can have keys and metadata as well;
// key determines event distribution - if it's null, partition is randomly assigned
publisher.publish(new EventPublication("txt_topic",
"event-key",
"txt event".getBytes(StandardCharsets.UTF_8),
Map.of("some-tag", "some-meta-info")));
// events can be also published in batches, for improved throughput
publisher.publishAll(List.of(
new EventPublication("txt_topic", "txt event 1".getBytes(StandardCharsets.UTF_8)),
new EventPublication("txt_topic", "txt event 2".getBytes(StandardCharsets.UTF_8)),
new EventPublication("txt_topic", "txt event 3".getBytes(StandardCharsets.UTF_8))));
Event partition is determined by EventSQLPublisher.Partitioner. By default, the following implementation is used:
public class DefaultPartitioner implements EventSQLPublisher.Partitioner {
private static final Random RANDOM = new Random();
@Override
public int partition(EventPublication publication, int topicPartitions) {
if (topicPartitions == -1) {
return -1;
}
if (publication.key() == null) {
return RANDOM.nextInt(topicPartitions);
}
return keyHash(publication.key())
.mod(BigInteger.valueOf(topicPartitions))
.intValue();
}
...
For a non-partitioned topic, no partition is assigned.
If the topic is partitioned and the key is null, the partition is random. If a key is defined, the partition is assigned based on the key's hash. Thanks to this, we have a guarantee that events associated with the same key always land in the same partition.
If you want to change this behavior, you can provide your own implementation and configure it by calling the EventSQLPublisher.configurePartitioner method.
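As an example, a custom partitioner might assign keyless events round-robin instead of randomly - a minimal sketch (the Partitioner interface and the -1 convention are as shown above; the SHA-256 key hash and the registration call on publisher() are this sketch's assumptions):
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.concurrent.atomic.AtomicLong;

public class RoundRobinPartitioner implements EventSQLPublisher.Partitioner {

    private final AtomicLong counter = new AtomicLong();

    @Override
    public int partition(EventPublication publication, int topicPartitions) {
        // keep the -1 convention for non-partitioned topics
        if (topicPartitions == -1) {
            return -1;
        }
        // keyless events: round-robin instead of the default random assignment
        if (publication.key() == null) {
            return Math.floorMod(counter.getAndIncrement(), topicPartitions);
        }
        // keyed events: stable hash, so the same key always lands in the same partition
        return keyHash(publication.key())
            .mod(BigInteger.valueOf(topicPartitions))
            .intValue();
    }

    private BigInteger keyHash(String key) {
        try {
            var digest = MessageDigest.getInstance("SHA-256")
                .digest(key.getBytes(StandardCharsets.UTF_8));
            return new BigInteger(1, digest);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }
}

eventSQL.publisher().configurePartitioner(new RoundRobinPartitioner());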
We can have both single event and batch consumers:
var consumers = eventSQL.consumers();
consumers.startConsumer("txt_topic", "single-consumer", event -> {
// handle single event
});
// with more frequent polling - by default it is 1 second
consumers.startConsumer("txt_topic", "single-consumer-customized", event -> {
// handle single event
}, Duration.ofMillis(100));
consumers.startBatchConsumer("txt_topic", "batch-consumer", events -> {
// handle events batch for better performance
}, // customize batch behavior:
// minEvents, maxEvents,
// pollingDelay and maxPollingDelay - how long to wait for minEvents
EventSQLConsumers.ConsumptionConfig.of(5, 100,
Duration.ofSeconds(1), Duration.ofSeconds(10)));
If we register a topic with a DLT (dead letter topic) as follows:
eventSQL.registry()
.registerTopic(new TopicDefinition("account_created", -1))
.registerTopic(new TopicDefinition("account_created_dlt", -1));
Under certain circumstances, it will get special treatment.
When a consumer throws EventSQLConsumptionException, DefaultDLTEventFactory takes over and publishes the failed event to the associated DLT, if it can find one:
...
@Override
public Optional<EventPublication> create(EventSQLConsumptionException exception, String consumer) {
var event = exception.event();
var dltTopic = event.topic() + "_dlt";
var dltTopicDefinitionOpt = topicDefinitionsCache.getLoadingIf(dltTopic, true);
if (dltTopicDefinitionOpt.isEmpty()) {
return Optional.empty();
}
...
// creates dlt event
This factory can be customized by calling the EventSQLConsumers.configureDLTEventFactory method.
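A custom factory might, for example, route all failures to one shared DLT. A sketch, with loud caveats: the EventSQLConsumers.DLTEventFactory interface name and the event.key()/event.value() accessors are inferred here, not confirmed by the snippets above:
import java.util.Map;
import java.util.Optional;

// sketch: publish every failed event to one shared DLT,
// preserving the original topic and consumer in metadata
public class SharedDLTEventFactory implements EventSQLConsumers.DLTEventFactory {

    @Override
    public Optional<EventPublication> create(EventSQLConsumptionException exception, String consumer) {
        var event = exception.event();
        return Optional.of(new EventPublication("shared_dlt",
            event.key(),
            event.value(),
            Map.of("failed-topic", event.topic(),
                "failed-consumer", consumer)));
    }
}

eventSQL.consumers().configureDLTEventFactory(new SharedDLTEventFactory());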
What is also worth noting is that any exception thrown by a single event consumer is wrapped into EventSQLConsumptionException automatically - see the ConsumerWrapper class.
When you use EventSQLConsumers.startBatchConsumer, you have to do the wrapping yourself.
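That might look as follows - a sketch, assuming an EventSQLConsumptionException constructor taking the cause and the failing event (check its actual signature):
consumers.startBatchConsumer("txt_topic", "wrapping-batch-consumer", events -> {
    for (var event : events) {
        try {
            // handle a single event of the batch
        } catch (Exception e) {
            // wrap manually, so that the DLT machinery described above can kick in;
            // the constructor arguments here are an assumption
            throw new EventSQLConsumptionException(e, event);
        }
    }
}, EventSQLConsumers.ConsumptionConfig.of(5, 100,
    Duration.ofSeconds(1), Duration.ofSeconds(10)));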
Maven:
<dependency>
<groupId>com.binaryigor</groupId>
<artifactId>eventsql</artifactId>
<version>1.0.0</version>
</dependency>
Gradle (Groovy or Kotlin DSL):
implementation 'com.binaryigor:eventsql:1.0.0'
implementation("com.binaryigor:eventsql:1.0.0")