The Advanced Spark training course provides a deeper dive into Spark. Spark internals and the debugging and troubleshooting of Spark applications are a central focus. Also covered is integration with external storage systems such as Cassandra, HBase, and other NoSQL implementations. The course begins with a review of core Apache Spark concepts, followed by lessons on understanding Spark internals for performance. Next, it covers the new features of Spark 2 and how to use them. The course then covers clustering, integration, and machine learning with Spark. It concludes with lessons on advanced Spark SQL and streaming, high-performance Spark applications, and best practices.
Upon successful completion of this course, the student will be able to:
- Apply the Spark fundamentals to gain a deeper understanding of Spark internals
- Identify operational tweaks that maximize Spark performance
- Describe how to use GraphX and MLlib for graph processing and machine learning
This course is intended for developers who have taken an Introduction to Spark course or who have equivalent experience.