Training

What is Kafka Connect?

Kafka Connect is an integral part of Apache Kafka and integrates other systems with Kafka. For example, Kafka Connect can be used to transfer changes from a database (source) to Kafka and write them from there to another data storage system (sink), thus allowing other applications/services (e.g. dashboard) to access real-time data. Kafka Connect provides…

Read article
Ray: Distributed Data Processing with Python

Ray is a project started by RISELab in 2017 that conducts research on real-time data processing systems and artificial intelligence. Developed as an open-source library with a focus on parallel and distributed computing, Ray has recently become a frequently used tool in data analysis, artificial intelligence, and machine learning projects by data scientists and Python…

Read article