
Become a Big Data Engineer
Get the skills to be a Big Data Engineer, even if you’re just starting out.
Create Real-time Big Data Pipelines
Become an advanced Data Engineer and create real-time big data pipelines with the latest technologies.
From the Blog
Thoughts on Cloudera Merging/Buying Hortonworks
By Jesse. Cloudera has merged with/purchased Hortonworks. As a former Clouderan, it’s interesting to see this move on several levels. I’m going to share my insights from the outside as a former insider. Full Disclosure: Although I’m former Cloudera, I don’t own any shares of...
Creating Work Queues with Apache Kafka and Apache Pulsar
A common use case for Kafka and Pulsar is creating work queues. The two technologies offer different implementations for this use case. I’ll discuss the ways of implementing work queues in Kafka and Pulsar as well as the relative strengths of...
InfiniteConf Keynote – Why Real-time is the Future
Here is my keynote from InfiniteConf 2018. I talk about why real-time is gaining so much momentum, what it does for businesses, how it helps data science, and some common use cases.
What is a Data Pipeline?
I’ve been seeing some questions about data pipelines lately. I realized I haven’t written a post that gives the level of detail necessary for a good definition of a data pipeline in the context of data engineering. Instead of just giving my opinion, I’ve brought...
Professional Data Engineering Review – Sanjoy Roy
Note: this is a guest post from Sanjoy Roy, who is reviewing my Professional Data Engineering course. Since late 2014, I have been drawn into various analytics projects which required a good mix of skills for both data engineering and data science. There are a lot of...