Visualization & streaming: How to use it to your advantage

Visualization & Streaming

Importance of data visualization: Many people would say that knowledge sharing is one of the noblest things any human being can do. Aside from helping other people grow and improve, people who share their knowledge become happier, develop new professional or private connections, and bring more purpose to their lives. This is why the […]

Apache Spark – How to create a powerful streaming application

Apache Spark

After setting up Apache Kafka, the next step in our Retail Business Intelligence Platform project was to set up Spark, another widely used software solution from the Apache workshop. Apache Spark, the real spark necessary to ignite our project and turn it into a true stream processing arrangement, is probably the most famous streaming engine […]

How to set up a Kafka Streaming project

Streaming Kafka

If you’ve read the previous blog post about this project, you should already know its main idea: we’re trying to estimate the unit sales of more than 3000 distinct items in 10 different Walmart stores. The dataset provides us with more than 5 years of data about the dates on […]

Let’s dive into data science tools and algorithms

Data science tools

Croatia Osiguranje & BIRD Incubator Data Challenge – Part 2: Having explained fundamental data science concepts and techniques in the first part of this post, we turn in the second part to the tools and algorithms that were used during this Data Challenge. Data science tools: As always, it is very important to use the […]

Let’s dive into time series forecasting concepts

Data Science

Croatia Osiguranje & BIRD Incubator Data Challenge – Part 1: Data science isn’t only about using fancy visualization tools and robust machine learning algorithms. There is a lot of manual hard work beneath the shiny and aesthetically pleasing solutions. In this blog post, divided into two parts, we tried to sum up data science concepts, […]