
Opening Hours: Monday to Sunday, 9 AM to 7 PM


Need help? Call us free

+91 6364630111

Spark & Scala In Bangalore

Learn Spark & Scala from scratch, for beginners

  • Teacher
    Trainer
    Spark & Scala
  • Category
    Spark & Scala
  • Review
    • (0 Reviews)
Courses
Course Summary

Spark and Scala training teaches you to build Spark applications using the Scala programming language. The course covers techniques for improving application performance and enabling high-speed processing with Spark RDDs. Topics include Big Data, Hadoop core concepts, Scala basics, Apache Spark, working with RDDs, Spark SQL for structured data (Hive with Spark SQL) and batch processing, and Spark Streaming for unstructured data and real-time processing.

  1. Understanding Data & Hadoop: Basic Concepts.
  2. What is Big Data?
  3. Characteristics of Big Data.
  4. Challenges with traditional systems.
  5. Problems with Big Data.
  6. Handling Big Data.
  7. Problems with existing distributed systems in dealing with Big Data.
  8. Why Hadoop: an overview and history of Hadoop.
  9. Requirements of a new approach.
  10. The Hadoop project and Hadoop components.

  1. Scala installation.
  2. Understanding classes in Scala.
  3. Object orientation in Scala.
  4. Primitive data types.
  5. Scala Simple Build Tool – SBT.
  6. Functional programming in Scala – closures, currying, anonymous functions.
  7. Exploring mutable and immutable variables.
  8. Executing Scala code through the REPL or CLI.
  9. Working with basic programming constructs.
  10. Collections – Array, Set, List.
  1. Introduction to Apache Spark.
  2. Hadoop vs. Spark.
  3. Why Spark.
  4. Spark vs. MapReduce.
  5. Batch vs. real-time Big Data analytics.
  6. Spark installation and configuration.
  7. Spark execution architecture.
  8. Components of Spark – Spark SQL, Spark Streaming, MLlib, GraphX.
  9. Understanding the Spark context.
  10. Resilient Distributed Datasets (RDD) – partitions, features, parallelism.
  11. RDD operations – transformations and actions.
  12. RDD deep dive – persistence/caching, lineage.
  13. Types of RDD – pair RDD, chain RDD.
  14. Spark API programming.
  15. Executing a Spark program with SBT and spark-assembly.
  16. Understanding spark-submit.
  17. Running a Spark program in local mode and on a cluster.

  1. Spark SQL overview.
  2. Understanding DataFrames and Datasets.
  3. DataFrames vs. RDDs.
  4. Processing data using DataFrames.
  5. Hive context.
  6. Custom case classes.
  7. Temp tables vs. persistent tables.
  8. Inferring schema programmatically.
  9. Querying files as tables – CSV, text, JSON, Parquet.
  10. Standard transformations in querying.
  11. Analytics and window functions in SQL.
  12. How Spark SQL works in native and Hive contexts.

  1. Features of Spark Streaming.
  2. Understanding DStreams.
  3. Use case 1: Streaming data from a netcat server.
  4. Use case 2: Flume and Spark Streaming integration.
  5. Use case 3: Kafka and Spark Streaming integration (Kafka as a messaging service).
  6. Sliding window operations.
  7. Transformers and Estimators.

Instructor
Spark & Scala Trainer

A dynamic and self-motivated trainer and system administrator, aspiring to a bright and challenging career in training and networking technology that enables me to keep up with emerging trends and technologies, for the benefit of professional growth and the accomplishment of organizational goals.

Student Reviews

Course Features

  • Duration: 50 Hours
  • Price: ₹ 21240
