[email protected] +91 9541 551 557 +91 9035 406 484
Synergific Store LMS Login Training Calendar

Apache Spark and Scala

Live Online (VILT) & Classroom Corporate Training Course

Apache Spark is a big data processing framework and its popularity lies in the fact that it is fast, easy to use and offers sophisticated solutions to data analysis. Its built-in modules for streaming, machine learning, SQL, and graph processing make it useful in diverse Industries.

Expert-Led VILT & Classroom Hands-On CloudLabs Certification Voucher Available
CloudLabs
Projects
Assessments
24/7 Support
Lifetime Access

Overview

Apache Spark and Scala course is designed to help you become proficient in Apache Spark Development. You will learn about topics such as Apache Spark Core, Motivation for Apache Spark, Spark Internals, RDD, SparkSQL, Spark Streaming, MLlib, and GraphX that form key constituents of the Apache Spark course.

Objectives

At the end of Apache Spark & Scala training course, participants will

  • Master the concepts of the Apache Spark framework
  • Understand the Spark Internals RDD and use of Spark’s API and Scala functions to create RDDs and transform RDDs
  • Master the RDD Combiners, SparkSQL, Spark Context, Spark Streaming, MLlib, and GraphX

Prerequisites

Hadoop Basics

Course Outline

  • Overview of Hadoop
  • Architecture of  HDFS  & YARN
  • Overview of Spark version 2.2.0
  • Spark Architecture
  • Spark  Components
  • Comparison of  Spark &  Hadoop
  • Installation of Spark v 2.2.0 on Linux 64 bit

  • Exploring the Spark shell
  • Creating Spark Context
  • Operations on Resilient Distributed Dataset – RDD
  • Transformations & Actions
  • Loading Data and Saving Data

  • Introduction to SQL  Operations
  • SQL Context
  • Data Frame
  • Working with Hive
  • Loading Partitioned Tables
  • Processing  CSV, Json ,Parquet files

  • Introduction to Scala
  • Feature of Scala
  • Scala vs Java Comparison
  • Data types
  • Data Structure
  • Arrays
  • Literals
  • Logical Operators
  • Mutable & Immutable variables
  • Type interface

  • Oops  vs Functions
  • Anonymous
  • Recursive
  • Call-by-name
  • Currying
  • Conditional statement

  • List
  • Map
  • Sets
  • Options
  • Tuples
  • Mutable collection
  • Immutable collection
  • Iterating
  • Filtering and counting
  • Group By
  • Flat Map
  • Word count
  • File Access

  • Classes, Objects & Properties
  • Inheritance

  • Maven  build tool implementation
  • Build Libraries
  • Create  Jar files
  • Spark-Submit

  • Overview  of Spark Streaming
  • Architecture of Spark Streaming
  • File streaming
  • Twitter Streaming

  • Overview  of Kafka Streaming
  • Architecture of Kafka Streaming
  • Kafka Installation
  • Topic
  • Producer
  • Consumer
  • File streaming
  • Twitter Streaming

  • Overview  of Machine Learning Algorithm
  • Linear Regression
  • Logistic Regression

  • GraphX overview
  • Vertices
  • Edges
  • Triplets
  • Page Rank
  • Pregel

  • On-Off-heap memory tuning
  • Kryo Serialization
  • Broadcast Variable
  • Accumulator Variable
  • DAG Scheduler
  • Data Locality
  • Check Pointing
  • Speculative Execution
  • Garbage Collection

  • Master – Driver Node capacity
  • Slave –   Worker Node capacity
  • Executor capacity
  • Executor core capacity
  • Project scenario and execution
  • Out-of-memory error handling
  • Master logs, Worker logs, Driver  logs
  • Monitoring Web UI
  • Heap memory dump

Available Training Modes

Pick the format that fits your team.

Same authorised curriculum, same trainers, same hands-on cloud labs — delivered the way that works for you.

Live Online (VILT)

Real-time instructor-led sessions over Zoom or Teams. Same classroom, different time zones.

Most popular

Classroom

Face-to-face training delivered at your office, our Bengaluru centre, or any partner venue worldwide.

Onsite

Self-Paced

Recorded sessions plus 24/7 access to cloud labs and assessments. Learn at the pace that works for each engineer.

On-demand

Blended

Live workshops with self-paced reinforcement and project-based labs. Best for hybrid teams across regions.

Hybrid teams
All modes include: hands-on cloud labs, recordings, assessments, certificate of completion. Talk to a solutions advisor →

Our Training Process

How a course becomes measurable skill.

One contract, five steps, zero handoffs. From discovery to deployment, the same Synergific team owns the outcome — not a chain of vendors.

5 Steps from your scoping call to certified, productive engineers.
01

Discover & set goals

We start with a scoping call to understand your team's current skill level, target outcomes, deadlines, and certification needs — then translate that into a measurable success plan with named owners on both sides.

02

Curate the right path

We map the optimal learning path — instructor-led, self-paced, or blended — with hands-on cloud labs, prerequisite refreshers, and certification vouchers built in. No filler modules, no padded curriculum.

03

Deliver hands-on training

Authorised trainers run live sessions backed by 24/7 cloud labs and real-world projects. Theory and practice on the same day — learners stop forgetting concepts before they get to apply them.

04

Assess & mentor

Continuous skill checks, mock exams, and 1:1 mentoring keep the program honest. If anyone falls behind, we course-correct in-flight — you'll never find out at the end that two engineers couldn't keep up.

05

Certify & apply on the job

Voucher-backed certification, post-training office hours, and 30-day reinforcement so skills land on real work — not just on the exam scorecard. Success measured after the course ends, not before.

Client Stories

What our clients say

Voices from L&D leaders, architects, and program managers who’ve trusted us with their upskilling.