Course details

The Basics of Scalding Programming

Scalding is a Scala library, built on Cascading, that abstracts low-level MapReduce operations such as map and reduce. In this course, you will learn to create simple Scalding programs using functions and classes.

Target Audience
This course is intended for programmers who want to learn the basics of Scalding programming. A working knowledge of Cascading or Scala may be beneficial.


Expected Duration (hours)

Lesson Objectives

The Basics of Scalding Programming

  • start the course
  • identify the features and users of Scalding, and the platforms supported in Scalding
  • download and install the Simple Build Tool (sbt)
  • download and install Scalding
  • describe the basics of REPL and run the Scalding REPL
  • create a Scalding program
  • start Scalding in local mode
  • use Scalding to understand the basic pipeline functionality
  • identify how to write and save data, and describe sinks
  • describe how Scalding uses snapshots to partially persist data
  • use Scalding to read text and identify text data sources
  • use Scalding functions to manipulate text
  • use Scalding group functions to aggregate data
  • describe how Scalding infers data types in saving computations
  • describe the SQL aggregation functions used in Scalding
  • use list value SQL clauses in Scalding
  • use advanced SQL aggregation techniques in Scalding
  • use map-like functions in Scalding
  • use filter and collect functions in Scalding
  • use the project function in Scalding
  • use grouping functions in Scalding
  • use join operations in Scalding
  • use record objects in Scalding
  • describe the functions Scalding uses for mappers
  • describe the functions Scalding uses for reducers
  • describe how Scalding uses Scala, Cascading, and Java in MapReduce jobs
  • create a comprehensive Scalding program

Course Number

Expertise Level