3
NAYA College הנדיב71 , הרצליה | טלפון: 0732865544 | פקס: 0732865417 | www.naya-college.co.il Python for Data Analysis 40 hours Course Overview: Python is a general purpose language, and is very user friendly for new users. In terms of structure and syntax, it’s a well-designed, intuitive, and exceedingly powerful general-purpose programming language. As such it has many features that definitely won’t be relevant for everyone. In last year's we have seen that Python is an increasingly popular tool for data analysis. And it is not necessary to become proficient in Python programming to be able to perform data analysis. Still, you need to invest time to learn the fundamentals of the language before you jump into applications. This course is designed to build strong basis in Python and its related packages for operating with data. The course includes the main tools for data analysts and other users with little or no serious programming experience who just want to get things done in data analysis. The course includes the basic and intermediate Python programming and the most important libraries for data analysis: Numpy is the fundamental package for scientific computing with Python. A good understanding of Numpy will help you use tools like Pandas effectively. Matplotlib is a widely-used package for scientific graphics. This part will include an introduction to the matplotlib objects, to their basic functionalities and a survey of the basic plot types. Pandas is a package for data manipulation, and gives a set of easy-to-use capabilities resembling SQL (on the data processing side) and Excel (on the visualization side). In addition, the course will introduce how to work with Python in Big Data environment, including introduction to Hadoop and Spark, and the main library for this purpose – Pyspark. Course Objectives: To get an intermediate skill level of Python programming to perform data analysis. To use the numpy library to create and manipulate arrays. To create data visualizations using matplotlib with python. To use the pandas with Python to create and structure data. To work with various data formats within python, including: JSON, HTML, and MS Excel Worksheets. Who Should Attend: This course is intended for data analysts, BI experts, developers and everyone who wants to perform a data manipulation with Python. Required Skills: Basic math skills Basic to Intermediate Python Skills are the benefit Course Contents: Basic Python Fundamentals o Intro o Python essentials o The working environment Data types o Numbers

Python for Data Analysis 40 hours - NAYA College › wp-content › uploads › ... · Python for Data Analysis 40 hours Course Overview: Python is a general purpose language, and

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Python for Data Analysis 40 hours - NAYA College › wp-content › uploads › ... · Python for Data Analysis 40 hours Course Overview: Python is a general purpose language, and

NAYA College

www.naya-college.co.il | 0732865417: פקס | 0732865544 :, הרצליה | טלפון71הנדיב

Python for Data Analysis

40 hours

Course Overview:

Python is a general purpose language, and is very user friendly for new users. In terms of structure and syntax, it’s a well-designed, intuitive, and exceedingly powerful general-purpose programming language. As such it has many features that definitely won’t be relevant for everyone. In last year's we have seen that Python is an increasingly popular tool for data analysis. And it is not necessary to become proficient in Python programming to be able to perform data analysis. Still, you need to invest time to learn the fundamentals of the language before you jump into applications.

This course is designed to build strong basis in Python and its related packages for operating with data. The course includes the main tools for data analysts and other users with little or no serious programming experience who just want to get things done in data analysis.

The course includes the basic and intermediate Python programming and the most important libraries for data analysis:

Numpy is the fundamental package for scientific computing with Python. A good understanding of Numpy will help you use tools like Pandas effectively.

Matplotlib is a widely-used package for scientific graphics. This part will include an introduction to the matplotlib objects, to their basic functionalities and a survey of the basic plot types.

Pandas is a package for data manipulation, and gives a set of easy-to-use capabilities resembling SQL (on the data processing side) and Excel (on the visualization side).

In addition, the course will introduce how to work with Python in Big Data environment, including introduction to Hadoop and Spark, and the main library for this purpose – Pyspark.

Course Objectives:

To get an intermediate skill level of Python programming to perform data analysis.

To use the numpy library to create and manipulate arrays.

To create data visualizations using matplotlib with python.

To use the pandas with Python to create and structure data.

To work with various data formats within python, including: JSON, HTML, and MS Excel Worksheets.

Who Should Attend:

This course is intended for data analysts, BI experts, developers and everyone who wants to perform a data manipulation with Python.

Required Skills:

Basic math skills

Basic to Intermediate Python Skills are the benefit

Course Contents:

Basic Python

Fundamentals o Intro o Python essentials o The working environment

Data types o Numbers

Page 2: Python for Data Analysis 40 hours - NAYA College › wp-content › uploads › ... · Python for Data Analysis 40 hours Course Overview: Python is a general purpose language, and

NAYA College

www.naya-college.co.il | 0732865417: פקס | 0732865544 :, הרצליה | טלפון71הנדיב

o Strings o Booleans o None

Collections o Lists o Tuples o Dictionaries o Sets

Control flow o if…else o for…in o list comprehension o while o continue & break

Textual interface o input o format

Intermediate Python

Functions o User-defined functions o *args and **kwargs o Built-in functions o Lambda expressions

Debugging and Error Handling

Text files

Object-Oriented Programming (OOP) Basics

The standard library o import o datetime

Python Tools for working with Data

The pandas library o The NumPy library

Array Broadcasting

o The matplotlib library matplotlib objects Plotting Seaborn

o The pandas library Series and Index DataFrame GroupBy Visualizations

o Use cases

General tools o Regular expressions (re) o JSON o API's and Connecting with Data Resources

Page 3: Python for Data Analysis 40 hours - NAYA College › wp-content › uploads › ... · Python for Data Analysis 40 hours Course Overview: Python is a general purpose language, and

NAYA College

www.naya-college.co.il | 0732865417: פקס | 0732865544 :, הרצליה | טלפון71הנדיב

Introduction to Python for big data – short overview

Introduction to big data o Background o Hadoop o Spark o PySpark

Working environment o databricks

PySpark and RDDs

PySpark SQL and Datasets