Course Outline

Introduction

SMACK Stack Overview

  • What is Apache Spark? Apache Spark features
  • What is Apache Mesos? Apache Mesos features
  • What is Akka? Akka features
  • What is Apache Cassandra? Apache Cassandra features
  • What is Apache Kafka? Apache Kafka features

Scala Language

  • Scala syntax and structure
  • Scala control flow

Preparing the Development Environment

  • Installing and configuring the SMACK stack
  • Installing and configuring Docker

Akka

  • Using actors

Apache Cassandra

  • Creating a database for read operations
  • Working with backups and recovery

Connectors

  • Creating a stream
  • Building an Akka application
  • Storing data with Cassandra
  • Reviewing connectors

Apache Kafka

  • Working with clusters
  • Creating, publishing, and consuming messages

Apache Mesos

  • Allocating resources
  • Running clusters
  • Working with Apache Aurora and Docker
  • Running services and jobs
  • Deploying Spark, Cassandra, and Kafka on Mesos

Apache Spark

  • Managing data flows
  • Working with RDDs and DataFrames
  • Performing data analysis

Troubleshooting

  • Handling service failures and errors

Summary and Conclusion

Requirements

  • An understanding of data processing systems

Audience

  • Data Scientists
Duration

  • 14 Hours

