Course Outline

Introduction

Overview of Spark Streaming Features and Architecture

  • Supported data sources
  • Core APIs

Preparing the Environment

  • Dependencies
  • Spark and streaming context
  • Connecting to Kafka

Processing Messages

  • Parsing inbound messages as JSON
  • ETL processes
  • Starting the streaming context

Performing a Windowed Stream Processing

  • Slide interval
  • Checkpoint delivery configuration
  • Launching the environment

Prototyping the Processing Code

  • Connecting to a Kafka topic
  • Retrieving JSON from data source using Paw
  • Variations and additional processing

Streaming the Code

  • Job control variables
  • Defining values to match
  • Functions and conditions

Acquiring Stream Output

  • Counters
  • Kafka output (matched and non-matched)

Troubleshooting

Summary and Conclusion

Requirements

  • Experience with Python and Apache Kafka
  • Familiarity with stream-processing platforms

Audience

  • Data engineers
  • Data scientists
  • Programmers
 7 Hours

Number of participants



Price per participant

Testimonials (5)

Related Courses

Data Analysis with Python, Pandas and Numpy

14 Hours

Accelerating Python Pandas Workflows with Modin

14 Hours

Machine Learning with Python and Pandas

14 Hours

Scaling Data Analysis with Python and Dask

14 Hours

FARM (FastAPI, React, and MongoDB) Full Stack Development

14 Hours

Developing APIs with Python and FastAPI

14 Hours

Scientific Computing with Python SciPy

7 Hours

Game Development with PyGame

7 Hours

Web application development with Flask

14 Hours

Advanced Flask

14 Hours

Build REST APIs with Python and Flask

14 Hours

GUI Programming with Python and Tkinter

14 Hours

Kivy: Building Android Apps with Python

7 Hours

GUI Programming with Python and PyQt

21 Hours

Web Development with Web2Py

28 Hours

Related Categories

1