Python, Spark, and Hadoop for Big Data Training Course

Python is a scalable, flexible, and widely used programming language for data science and machine learning. Spark is a data processing engine used in querying, analyzing, and transforming big data, while Hadoop is a software library framework for large-scale data storage and processing.

This instructor-led, live training (online or onsite) is aimed at developers who wish to use and integrate Spark, Hadoop, and Python to process, analyze, and transform large and complex data sets.

By the end of this training, participants will be able to:

Set up the necessary environment to start processing big data with Spark, Hadoop, and Python.
Understand the features, core components, and architecture of Spark and Hadoop.
Learn how to integrate Spark, Hadoop, and Python for big data processing.
Explore the tools in the Spark ecosystem (Spark MlLib, Spark Streaming, Kafka, Sqoop, Kafka, and Flume).
Build collaborative filtering recommendation systems similar to Netflix, YouTube, Amazon, Spotify, and Google.
Use Apache Mahout to scale machine learning algorithms.

Format of the Course

Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request a customized training for this course, please contact us to arrange.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Testimonials (3)

The fact that we were able to take with us most of the information/course/presentation/exercises done, so that we can look over them and perhaps redo what we didint understand first time or improve what we already did.

Raul Mihail Rat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

I liked that it managed to lay the foundations of the topic and go to some quite advanced exercises. Also provided easy ways to write/test the code.

Ionut Goga - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

The live examples

Ahmet Bolat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

5100 EUR (Classroom)

Python, Spark, and Hadoop for Big Data Training Course

Course Outline

Requirements

Testimonials (3)

Raul Mihail Rat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Ionut Goga - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Ahmet Bolat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Provisional Upcoming Courses (Contact Us For More Information)

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Python, Spark, and Hadoop for Big Data Training Course

Course Outline

Requirements

Testimonials (3)

Raul Mihail Rat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Ionut Goga - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Ahmet Bolat - Accenture Industrial SS

Course - Python, Spark, and Hadoop for Big Data

Provisional Upcoming Courses (Contact Us For More Information)

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Python, Spark, and Hadoop for Big Data

Related Courses

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

Big Data Analytics with Google Colab and Apache Spark

Big Data Analytics in Health

Introduction to Graph Computing

Hadoop and Spark for Administrators

Hortonworks Data Platform (HDP) for Administrators

Data Analysis with Hive/HiveQL

Impala for Business Intelligence

A Practical Introduction to Stream Processing

SMACK Stack for Data Science

Apache Spark Fundamentals

Administration of Apache Spark

Python and Spark for Big Data (PySpark)

Apache Spark MLlib

Related Categories

Hadoop

Apache Spark

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites