Sort by Regex

Rax string reduction, /cat(), on the IMPALA backend was not trivial to implement. The IMPALA documentation states: GROUP_CONCAT ... does not support the OVER clause, ... Effectively this means no ORDER BY on GROUP_CONCAT in IMPALA. This must be implemented eventually, because the whole GROUP_CONCAT has limited use without… Read more →

Unveiling the online journey with Rax

Introduction It is generally believed that there is a lot of valuable information hidden in people's online-behavior data. By uncovering patterns in this behaviour, we could possibly achieve better segmentation and ad targeting, better website personalization, etc. However, looking for patterns in the (online) behavioral data is not easy. It… Read more →

Rax 1.1 released

We are happy to announce that Rax 1.1 is avalaible from today! This release contains many stability and performance improvements. Most importantly: Added r'years and a'weeks magic tags to relative duration. Added a sample \ operator for sets: some_set\10 will list random 10 elements of some_set. In… Read more →

R comes to SQL Server

Database vendors understand that SQL, while great for simple queries, is not a good enough interface for complex analytics. Here at Rax'db we would say: use Rax :-). Since R so popular among data scientists, though, many database vendors attempt to somehow integrate R into their technology. This post is… Read more →

Sessionization with Temporal Union

A common problem in web analytics is sessionization or session reconstruction: taking a series of user events and splitting the series into a set of sessions [1]. Often, it is done based solely on the timestamps of the events: a session ends after a certain time of user inactivity. As… Read more →