---------------------------------------------------------------------------------------------------------------------------------

Mr. Kalyan, Big Data Solution Architect,
Apache Contributor, 11+ years of IT exp, 7+ years of Big Data exp,

Cloudera CCA175 Certified Consultant, IIT Kharagpur, Gold Medalist

---------------------------------------------------------------------------------------------------------------------------------

Spark and Scala Training In Hyderabad @ ORIENIT @ KALYAN

Spark and Scala Training Course Content Link

---------------------------------------------------------------------------------------------------------------------------------

Spark & Scala Course Content (Spark-1.x & Spark-2.x)

(Development and Administration)

---------------------------------------------------------------------------------------------------------------------------------

Introduction to Big Data and Hadoop

Big Data
- What is Big Data?
- Why all industries are talking about Big Data?
- What are the issues in Big Data?
  - Storage
    - What are the challenges for storing big data?
  - Processing
    - What are the challenges for processing big data?
- What are the technologies support big data?
  - Hadoop
  - Spark
  - Data Bases
    - Traditional
    - NOSQL
Hadoop
- What is Hadoop?
- Why Hadoop?
- History of Hadoop
- Hadoop Use cases
- Advantages and Disadvantages of Hadoop
Importance of Different Ecosystems of Hadoop
Importance of Integration with other Big Data solutions
Batch vs Real Time Big Data Analytics
Real Time Analytics
- Streaming Data – Storm / Kafka / Flume
- In Memory Data - Spark

Introduction to Spark

What is Spark
Why Spark
Who Uses Spark
Brief History of Spark
Storage Layers for Spark
Why Spark is 100 times faster than MapReduce
Difference between Spark-1.x and Spark-2.x
Unified Stack of Spark
- Spark Core
- Spark Sql
- Spark Streaming
- Spark MLLib
- Spark GraphX
Spark Architecture explanation
- Master Slave architecture
- Spark Driver
- Workers
- Executors
Installation of Spark in different modes
- Local mode
- Pseudo mode
- Cluster mode

Basics of Spark

Creating the Spark Context
Creating the Spark Conf
Creating the Spark Session
Configuring Spark Context with Spark Conf
Caching Overview
Distributed Persistence
Combine scala and java seamlessly
Deploying Applications with spark-submit
Verify spark jobs in Spark Web UI
SBT
- Installing sbt
- Building a Spark Project with sbt
- Running Spark Project with sbt
MAVEN
- Installing maven
- Building a Spark Project with maven
- Running Spark Project with maven

Resilient Distributed Dataset (RDD)

What is RDD
Creating RDDs
RDD Operations
- Transformations
- Actions
Passing Functions to Spark

Python, Java, Scala

Working with Key/Value Pairs

Creating Pair RDDs
Transformations on Pair RDDs
- Aggregations
- Grouping Data
- Joins
- Sorting Data
Data Partitioning
- Determining an RDD’s Partitioner
- Custom Partitioners

Loading and Saving Your Data

File Formats
- Text, json, csv, tsv, Object files
- Hadoop Input and Output Formats
Loading Data using RDD
Saving Data using RDD
MapReduce and Pair RDD Operations
Scala and Hadoop Integrations

Broadcast and Accumulators

Accumulators
- Introduction to Accumulators
- Practical Examples on Accumulators
- Creating Custom Accumulators
Broadcast variables
- Introduction to Broadcast variables
- Practical Examples on Broadcast variables
- Optimizing Broadcasts

Working with Spark in different programming languages

Python
- Installing Python
- How to use 'pyspark'
- Practical examples on spark in python
Scala
- Installing Scala
- How to use 'spark-shell'
- Practical examples on spark in Scala
Java
- Installing Java
- How to use 'Java'
- Practical examples on spark in Java
R
- Installing R
- How to use 'SparkR'
- Practical examples on spark in R

Apache Spark SQL

Spark SQL & Hive Architecture explanation
Working with Spark SQL DataSets
Working with Spark SQL DataFrames
Practice on Spark SQL Context
Practical examples on Spark SQL
Integrating Spark SQL with
- Hive
- Phoenix
- Cassandra
- RDBMS
Processing different files using Spark
- Text
- Json
- Csv
- Tsv
- Parquet
Spark SQL UDFs
Spark SQL Performance Tuning Options
JDBC/ODBC Server

Apache Spark Streaming

Spark Streaming Architecture explanation
Creating the Streaming Context
Discretized Streams (DStreams)
Transformations on Dstreams
- UpdateStateByKey Operation
- Transform Operation
- Window Operations
- Join Operations
Output Operations on DStreams
Streaming UI explanation
Spark Streaming Sources
- Basic Sources
- Advanced Sources
Integrating Spark Streaming with
- Flume
- Kafka
- Twitter
- HDFS
Performance Considerations
Practical examples on Spark Streaming

Apache Spark MLLib

Machine Learning Basics
Machine Learning Algorithms
- Classification
- Clustering
- Collaborative Filtering
Performance Considerations
Practical examples on Spark MLib

Apache Spark Graphx

Introduction to Spark Graphx
Practical Examples on Spark Graphx

Apache Mesos

Introduction to Apache Mesos
Apache Mesos Architecture explanation
Practical Examples on Apache Mesos

Apache Mahout

Introduction to Apache Mahout
Apache Mahout Architecture explanation
Practical Examples on Apache Mahout

Apache Storm

Introduction to Apache Storm
Apache Storm Architecture explanation
Practical Examples on Apache Storm

Apache Kafka

Introduction to Apache Kafka
Installing Apache Kafka
Apache Kafka Architecture explanation
Practical Examples on Apache Kafka

Apache Flume

Introduction to flume
Flume installation
Flume Architecture
- Agent
- Sources
- Channels
- Sinks
Practical Examples on Flume

Apache Phoenix

Introduction to Phoenix
Installing Phoenix
Integrating with Hbase
Practical Examples on Phoenix

Apache Cassandra

Introduction to Cassandra
Installing Cassandra
Practical Examples on Cassandra

Apache Zeppelin

Introduction to Zeppelin
Installing Zeppelin
Practical Examples on Zeppelin
Data Visualization using Zeppelin

Play Framework

Introduction to Play Framework
Installing Play Framework
Practical Examples on Play Framework
Spark Project using Play Framework

Introduction of Scala

What is Scala?
Why Scala?
Advantages of Scala?
Using the Scala REPL(Read Evaluate print loop)
What is Type Inference
Interoperability between Scala and Java

Scala using Command Line

Installing Java & Scala
Interactive Scala
Writing Scala Scripts
Compiling Scala Programs

Basics of Scala

Defining Variables
Defining Functions
String Interpolation
IDE for Scala

Scala Type Less, Do More

Semicolons
Variable Declarations
Method Declarations
Type Inference
Immutability
Reserved Words
Operators
Precedence Rules
Literals
Options
Arrays, Lists, Ranges, Tuples

Expressions and Conditionals

If expressions
If-Else expressions
Match Expressions
For Loops
While Loops
Do-While Loops
Conditional Operators
Enumerations
Pattern Matching
Using try, catch, and finally Clauses

Functional Programming in Scala

What is Functional Programming?
Functional Literals and Closures
Recursions
Currying
Functional Data Structures
Sequences,Maps,Sets
Traversing
Traversal, Mapping, Filtering, Folding and Reducing
Implicit Function Parameters
Call by Name, Call by Value

Object-Oriented Programming in Scala

Class and Object Basics
Value Classes
Parent Types
Constructors in Scala
Fields in Classes
Nested Types
Traits as Mixins
Stackable Traits
Creating Traits
Visibility Rules

Scala for Big Data

Improving MapReduce with Scala
Moving Beyond MapReduce
Categories for Mathematics
A List of Scala-Based Data Tools

Spark with Big Data Integrations:

Spark and Hive integration
Spark and Phoenix integration
Spark and Cassandra integration
Spark and Flume integration
Spark and Kafka integraion
Spark and RDBMS integration

Real Time Big Data Projects

We willl be sharing End-to-End Big Data Projects
We are providing Big Data Project Practice on Our Lab
We are providing Important Recorded Videos on Our YouTube Channel
Any information search in Google / YouTube by keyword is 'Kalyan Hadoop'

Spark Administration topics:

Hadoop Installation
Hive Installation
Hbase Installation
Zookeeper Installation
Phoenix Installation
Kafka Installation
Flume Installation
Zeppelin Installation
Play Framework Installation
MySql Installation
Java Installation
Scala Installation
Python Installation
R Installation
Eclipse Installation
Cloudera Distribution installation

Free Big Data Workshops:

Spark & Scala
NOSQL (Cassandra, MongoDB, Hbase)
Search engine & E-commerce solutions
Big Data Analytics (R, Mahout, Spark ML)

What we are offering to you:

Hands on Practice on Cloudera CCA175 Spark and Hadoop Developer Certification
Tips to Crack the CCA175 Certification
Hands on Practice on Spark & Scala Real-Time Examples
Providing 1 Major project on Spark.
Providing 2 Mini projects on Spark.
Real Time Big Data projects will be shared
Free Big Data Workshops on new & advanced technologies
Free Weekly Online Hadoop Certification
Hands on installation Spark and it's relative software's in your laptop.
Well documented Spark & Scala material with all the topics covering in the course.
Well documented Spark blog contains frequent interview questions along with the answers and latest updates on BigData technology.
Discussing about Spark & Scala interview questions daily base.
Resume preparation with POC's or Project's based on your experience.

Kalyan Hadoop and Spark Training in Hyderabad Learn Big Data From Basics... @ Kalyan @

Spark and Scala Training in Hyderabad

Spark with Big Data Integrations:

Free Big Data Workshops:

No comments :

Post a Comment