

Apache Spark – advanced usage
Overview
This course is aimed at participants who want to deepen their knowledge of the Spark environment, including components such as Spark Streaming.
The examples in this course are written primarily in Python, but other programming languages, e.g. Scala, are also used. Exercises are done in a standalone or a cluster environment, depending on the assignment the participants are working on.
Target audience
- System architects
- Development engineers
- Business analysts
Prerequisites
- Basic Python knowledge
- Knowledge of object-oriented programming
- Advanced knowledge of SQL
Content
Participants will get all the information they need to set up a streaming process for real-time data processing. They will learn about the MLlib machine learning library, build a machine learning model with it, and be shown how the model is trained. Through a few examples based on the GraphX library for processing graph data, we will show how to use it efficiently in practice.
If you have any questions, do not hesitate to contact us by e-mail at learn@croz.net.