Building the Correlator #2: Window architecture

In the previous article, I described what the window correlator is good for, analyzing long series of data to gather security incidents and produce alerts. I also discussed the memory issue that comes with storing long series of data composed of events, which come to the application at a frequency of tens of thousands of events per second. The proposed solution is to store only relevant parts of the series, here called segments, in memory. Now, we'll take a look at the segments in more detail, so we know exactly what structures they are composed of.

Predicate and evaluate phases: Organizing incoming events

First, take a look at the following picture:

Window correlator architecture

When a relevant event comes to the correlator application and is accepted by the correlation rule in the predicate (filter) phase, an evaluation begins. Based on the primary dimension, defined in the given correlation rule (such as user name, device ID etc.), the event is inserted into the data series. The place of its insertion is defined by the timestamp, which is part of the event and specifies the exact time the event originated. If there is an available segment, it checks its start time and end time. If the event time matches the time range defined by the segment, the event is stored inside the segment or, more specifically, inside a cell of the segment. Each segment has multiple cells. The cell size is defined by the parameter called resolution and it can be any length of time, from a single second to several days. Each cell for a single correlation rule is the same length of time, so by the start time and the count of cells inside the segment, we can calculate the segment's end time.

The cells are the part of the segment that is stored in the memory-mapped file, here called a shard. There is another memory-mapped file that holds attributes from the event itself, such as ID. This second file is connected with the shard and is called the dynamic file. Each cell then contains information about offset in the dynamic file, where it stores its list of events. The cell itself may also contain numeric metrics calculated from the events, such as their sum/count, mean spike, median and so on based on the definition of the correlation rule. When the information about the event is inserted in a given cell and some of its attributes in the dynamic file, the evaluation phase ends. The row with its segments is synchronized to LevelDB to reconstruct the same memory structure after a possible restart of the application. In the previous article, I mentioned the issue of storing pointers to a memory mapped from a file, so LevelDB comes here for help to store offsets of segments that are used to reconstruct the pointers after restart.

Analysis phase: The sliding window starts its work

The last phase is called analysis and is triggered either after an event is inserted into memory, or periodically after a defined amount of time. In both cases, the sliding time window is constructed to look at some amount of cells, determined by the correlation rule. For example, if we wish to analyze failed user login attempts from the past five minutes, we can set the resolution (and thus cells) to one minute. Thus, the sliding time window needs to cover five of these one-minute cells at once, for a total of five minutes. There may be more sliding time windows constructed, each starting at some time before or at the changed cell, to ensure all cases are covered.

Segments and cells

Once the time window is prepared, it executes an aggregator function such as sum or unique count. Utilizing the row iterator object, it traverses all encapsulated cells (in our example, always five). The row iterator iterates through the segment and if it reaches the time gap, it simply calculates the number of missing virtual cells (green cells in the picture above) and continues the iterations over them – the metrics of the virtual cells are always empty or equal to zero. When the gap is over, it continues to another segment and so on, until the requested number of cells is reached.

Test and trigger phases: Detecting and taking action

The aggregator then passes the metrics to the window, which it gives to the analysis function for the test. If the result of the aggregator meets the test conditions, such as a sum of failed logins exceeding ten, an alert is triggered. If not, another time window is constructed until all possible cases are covered.

Flexibility

That is simply how the window correlator, an application utilizing time windows, works. As you can see, there is a lot of opportunity for parametrization, so that most known security incidents can be covered. The code is implemented in C to optimize performance, given the potential for thousands of iterations per second. There may be more of the shards and dynamic files, based on the size of the data series, each containing information about the cells and events inside a given segment and row. The alerts produced by the window correlator are then handled by alert management and notification services, so that every security incident is tracked and acted upon.

Want to know more?

Visit the previous article about building the Correlator, or our beginner-friendly introduction to event correlation.

About the Author

Premysl Cerny

Software Developer at TeskaLabs

TeskaLabs LogMan.io

Log Management and SIEM

Tweets by @TeskaLabs

Most Recent Articles

You Might Be Interested in Reading These Articles

You Can Build Apps for the Apple TV, But Do You Know How to Do It Securely?

Apple will want to dominate the market for TV apps. To achieve this objective, it’s understandable that Apple makes it easy for app developers to create apps and games for the Apple TV platform using tvOS and profit from them just as they have already done so for the iPhone and iPad devices. Developers can leverage similar frameworks and technologies since tvOS is just a modified version of the iOS. They can even retrofit the apps that were previously developed for iOS to support the Apple TV’s tvOS.

Continue reading ...

mobile security

Published on June 29, 2016

Android Nougat: Google OS' Tightest Security Yet

Officially released a month ago, the latest Google mobile OS version has made a few major adjustments, particularly in its security features. The search giant has improved the security in the Android Nougat (or also known as Android N) from strengthening the Android itself to some tools that helps developers to keep things as it is while users install apps.

Continue reading ...

security android

Published on November 15, 2016

Entangled ways of product development in the area of cybersecurity #1 - Asynchronous or parallel?

I started working at TeskaLabs at the beginning of autumn 2017 as a student at the Faculty of Information Technology of CTU. In the job advertisement, I was particularly interested in the fact that it is a small, product-based company that does not focus on just one technology or one programming language.

Continue reading ...

development tech premek

Published on November 15, 2022

Tags: security, development

Follow @TeskaLabs