PySpark Essential Training: Introduction to Building Data Pipelines

With Sam Bail Liked by 52 users

Duration: 1h 18m Skill level: Intermediate Released: 8/7/2025

Start my 1-month free trial Buy for my team

Course details

PySpark is a powerful library that brings Apache Spark’s distributed computing capabilities to Python, making it a key tool for processing large-scale data efficiently. In this course, data engineer and analyst Sam Bail provides a structured and hands-on introduction to PySpark, starting with an overview of Apache Spark, its architecture, and its ecosystem. Learn about Spark’s core concepts, such as the DataFrame API, transformations, lazy evaluations, and actions, before setting up a lab environment and working with a real dataset. Plus, gain insights into how PySpark fits into a broader data engineering ecosystem and best practices on running PySpark in a production environment.

Skills you’ll gain

Earn a sharable certificate

Share what you’ve learned, and be a standout professional in your desired industry with a certificate showcasing your knowledge gained from the course.

LinkedIn Learning
Certificate of Completion

Showcase on your LinkedIn profile under “Licenses and Certificate” section
Download or print out as PDF to share with others
Share as image online to demonstrate your skill

Meet the instructor

Sam Bail

Founder & Tastemaker @ Third Place Bar, Data Engineer, Writer, Public Speaker

Learner reviews

4.7 out of 5

66 ratings

5 star
Current value: 53 80%
4 star
Current value: 9 14%
3 star
Current value: 3 5%
2 star
Current value: 0 0%
1 star
Current value: 1 2%

Maciej Kosowski

Maciej Kosowski

Data Analysis | Programming | Process Automation | Reporting

5/5 October 2, 2025

Thank you, very good as a refresher!

Helpful · Report
Shlomi K

Shlomi K

Data Engineer | Product Manager | ETL, Python, SQL, Pandas, PySpark, PowerBI

5/5 September 30, 2025

Fantastic course, very practical with good and clear examples.

Helpful · Report

What’s included

Learn on the go Access on tablet and phone

Similar courses

Download courses

Use your iOS or Android LinkedIn Learning app, and watch courses on your mobile device without an internet connection.

PySpark Essential Training: Introduction to Building Data Pipelines

With Sam Bail Liked by 52 users

Duration: 1h 18m Skill level: Intermediate Released: 8/7/2025

Course details

Skills you’ll gain

Earn a sharable certificate

LinkedIn Learning
Certificate of Completion

Meet the instructor

Sam Bail

Founder & Tastemaker @ Third Place Bar, Data Engineer, Writer, Public Speaker

Learner reviews

4.7 out of 5

Maciej Kosowski

Data Analysis | Programming | Process Automation | Reporting

Shlomi K

Data Engineer | Product Manager | ETL, Python, SQL, Pandas, PySpark, PowerBI

Contents

What’s included

Similar courses

High-Performance PySpark: Advanced Strategies for Optimal Data Processing

Data Engineering Foundations

Data Engineering Project: Build Streaming Ingestion Pipelines for Snowflake with AWS

Download courses

Start learning today.

Explore Business Topics

Explore Creative Topics

Explore Technology Topics

PySpark Essential Training: Introduction to Building Data Pipelines

With Sam Bail Liked by 52 users Duration: 1h 18m Skill level: Intermediate Released: 8/7/2025

Course details

Skills you’ll gain

Earn a sharable certificate

Learning LinkedIn Learning Certificate of Completion

Meet the instructor

Sam Bail

Founder & Tastemaker @ Third Place Bar, Data Engineer, Writer, Public Speaker

Learner reviews

4.7 out of 5

Maciej Kosowski

Data Analysis | Programming | Process Automation | Reporting

Shlomi K

Data Engineer | Product Manager | ETL, Python, SQL, Pandas, PySpark, PowerBI

Contents

What’s included

Similar courses

High-Performance PySpark: Advanced Strategies for Optimal Data Processing

Data Engineering Foundations

Data Engineering Project: Build Streaming Ingestion Pipelines for Snowflake with AWS

Download courses

Start learning today.

Explore Business Topics

Explore Creative Topics

Explore Technology Topics

With Sam Bail Liked by 52 users

Duration: 1h 18m Skill level: Intermediate Released: 8/7/2025

LinkedIn Learning
Certificate of Completion