Data Science with Python: Data Analysis and Visualization

Course Overview

This class is a comprehensive introduction to data science with the Python programming language. It targets people who have some basic programming knowledge and want to take it to the next level. It introduces how to work with different data structures in Python and covers the most popular data analytics and visualization modules, including NumPy, SciPy, pandas, Matplotlib, and seaborn. We use the IPython notebook to demonstrate results and to modify code interactively throughout the class.

* Tuition paid for part-time courses can be applied to the Data Science Bootcamp if admitted within 9 months.

Instructors

Tony Schultz
Tony received his Ph.D. in Physics from the City University of New York and has taught at Sarah Lawrence College for the past decade. Tony specializes in developing machine learning and pattern recognition algorithms for processing motion capture data. He is passionate about teaching scientific computing and studying deep structures in human motion.

Alexander Baransky
Alex received his degree in Environmental Biology from Columbia University. He has experience with multiple computer languages, including Python, R, and SQL. An engineer at heart and a biologist by training, Alex is passionate about animal behavior and finding innovative ways to use data science in the field of biology.

Product Description


Overview

This class is a comprehensive introduction to Python for Data Analysis and Visualization. It targets people who have some basic programming knowledge and want to take it to the next level. It introduces how to work with different data structures in Python and covers the most popular Python data analysis and visualization modules, including NumPy, SciPy, pandas, Matplotlib, and seaborn. We use the IPython notebook to demonstrate results and to modify code interactively throughout the class.

Details

Prerequisites

If you have a good knowledge of basic data types (e.g. strings, numbers) and data structures (e.g. lists, tuples, dictionaries), and are familiar with list comprehensions and for/while loops, you are ready for the Python for Data Analysis and Visualization course. We also cover these basic Python programming topics in the course, but at a relatively fast pace.
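As a rough self-check of the prerequisites above, the short sketch below uses each of the listed building blocks once. The data and variable names are hypothetical and are not taken from the course materials; if every line reads naturally to you, the early units should feel comfortable.

```python
# Hypothetical self-check data; the names and values are illustrative only.
scores = {"alice": 82, "bob": 67, "carol": 91}  # dictionary of string -> number
passing_mark = 70

# List comprehension: names of everyone at or above the passing mark.
passed = [name for name, score in scores.items() if score >= passing_mark]

# for loop over a tuple of (name, score) pairs to add up all scores.
total = 0
for name, score in tuple(scores.items()):
    total += score

# while loop: count how many halvings bring the total below 10.
halvings = 0
remaining = total
while remaining >= 10:
    remaining /= 2
    halvings += 1

print(passed, total, halvings)
```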

Certificate

Certificates are awarded at the end of the program upon satisfactory completion of the course.

Students are evaluated on a pass/fail basis for their performance on the required homework and final project (where applicable). Students who complete 80% of the homework and attend a minimum of 85% of all classes are eligible for the certificate of completion.


Syllabus

Unit 1: Introduction to Python

Python is a high-level programming language. You will learn the basic syntax and data structures in Python. We demonstrate and run code in the IPython notebook, which provides a robust and productive environment for interactive, exploratory computing.
  • Introduction to the IPython notebook
  • Basic objects in Python
  • Variables and user-defined functions
  • Control flow
  • Data structures

Unit 2: Explore Deeper with Python

Python supports object-oriented programming (OOP). Having some basic knowledge of OOP will help you understand how Python code works. More often than not, you will have to deal with data that is dirty and unstructured. You will learn several ways to clean your data, such as applying regular expressions.
  • Introduction to object-oriented programming
  • How to deal with files
  • Run Python scripts
  • Handling and processing strings
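To make the two themes of this unit concrete, here is a minimal sketch that combines a small class with regular-expression cleaning from the standard `re` module. The `PhoneCleaner` class, its method, and the sample strings are hypothetical illustrations, not course material.

```python
import re


class PhoneCleaner:
    """Hypothetical helper that normalizes messy 10-digit phone strings."""

    def __init__(self, separator="-"):
        self.separator = separator

    def clean(self, raw):
        # Drop every character that is not a digit.
        digits = re.sub(r"\D", "", raw)
        if len(digits) != 10:
            return None  # cannot be normalized to the expected format
        return self.separator.join([digits[:3], digits[3:6], digits[6:]])


cleaner = PhoneCleaner()
raw_values = ["(212) 555-0147", "212.555.0199 ", "n/a"]
cleaned = [cleaner.clean(value) for value in raw_values]
print(cleaned)
```

Keeping the cleaning rule inside a class means the separator is configured once and reused for every value, which is one common reason to reach for OOP in data-cleaning scripts.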

Unit 3: Scientific Computation Tools

Two modules for scientific computation make Python powerful for data analysis: NumPy and SciPy. NumPy is the fundamental package for scientific computing in Python. SciPy is an expanding collection of packages that build on it to address scientific computing.
  • NumPy
  • SciPy
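The sketch below illustrates the kind of array computing NumPy provides: arithmetic applied to a whole array at once and selection with a boolean mask. The temperature values are hypothetical, and SciPy is not shown here.

```python
import numpy as np

# Hypothetical daily temperatures in degrees Celsius.
celsius = np.array([18.5, 21.0, 19.5, 25.0, 22.0])

# Vectorized arithmetic: the formula is applied to every element at once.
fahrenheit = celsius * 9 / 5 + 32

# Boolean masking selects the days at or above 20 degrees Celsius.
warm_days = celsius[celsius >= 20]

# Aggregations are available as array methods.
mean_c = celsius.mean()

print(fahrenheit, warm_days, mean_c)
```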

Unit 4: Data Visualization

Python can also generate graphics easily using “Matplotlib” and “Seaborn”. Matplotlib is the most popular Python library for producing plots and other 2D data visualizations. Seaborn is a Python visualization library built on Matplotlib that provides a high-level interface for drawing statistical graphics.
  • Seaborn
  • Matplotlib
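As a minimal sketch of a Matplotlib plot, the example below draws one labeled line chart and renders it to an in-memory PNG. The data are hypothetical; the `Agg` backend is an assumption made so the script runs without a display, and in a notebook the figure would normally appear inline instead. Seaborn is not shown.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script runs without a display
import matplotlib.pyplot as plt

# Hypothetical data: hours studied versus quiz score.
hours = [1, 2, 3, 4, 5]
scores = [52, 60, 71, 78, 90]

fig, ax = plt.subplots(figsize=(5, 3))
ax.plot(hours, scores, marker="o", label="quiz score")
ax.set_xlabel("Hours studied")
ax.set_ylabel("Score")
ax.set_title("Score by study time")
ax.legend()
fig.tight_layout()

# Render the figure to an in-memory PNG instead of a file on disk.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
```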

Unit 5: Data Manipulation with Pandas

Pandas provides rich data structures and functions for working with structured data. The “DataFrame” object in Pandas is similar to the “data.frame” object in R. Pandas makes data manipulation (filter, select, group, aggregate, etc.) as easy as in R.
  • Pandas
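The operations named above (filter, select, group, aggregate) can be sketched on a small DataFrame as follows. The table and its column names are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical sales records; the column names are illustrative only.
sales = pd.DataFrame({
    "region": ["East", "West", "East", "West", "East"],
    "product": ["A", "A", "B", "B", "A"],
    "units": [10, 4, 7, 12, 3],
})

# Filter: keep rows with at least 5 units.
large_orders = sales[sales["units"] >= 5]

# Select: keep only two of the columns.
region_units = sales[["region", "units"]]

# Group and aggregate: total units per region.
units_by_region = sales.groupby("region")["units"].sum()

print(units_by_region)
```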

Final Project

After 20 hours of structured lectures, students are encouraged to work on an exploratory data analysis project based on their own interests. A project presentation demo will be arranged afterwards.


Recommended Readings

  • Learn Python the Hard Way: http://learnpythonthehardway.org/
  • Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython

Preparation – How to set up Python environment

[IMPORTANT] In the class we will use Python 3. If you are following the setup video to prepare your Python environment, please make sure you download the Python 3.x version at the step that starts at 1 min 23 s in the video.


Testimonials

Anthony S. Romano, Research Scientist at New York City Department of Health and Mental Hygiene

"Good Starter"

In the past I've used a lot of online courses to try and learn how to use python in my day to day work but I never felt like I "got it." This in person experience was well organized, provided relevant material and challenging homework. The first day of the class really scared me and had me thinking I bit off more than I can chew, but by the end I felt like I'd learned a lot and am thankful to the instructor Tony for his in class comments and examples. I would recommend this course to anyone who is already conducting data analytics but is looking to branch out to python. The course provides a lot of food for thought and made me feel like I have a solid foundation in python now.
Bob Trieste, Equities | FX Trader at Five Tree Capital

"Cutting-edge for all of today's industries"

This course was not only extremely cutting-edge for all of today's industries and work areas but it gave me a great deal of confidence to continue my data and computer science studies at the next level. This course has actually inspired me to not only enroll in other NYC Data Science Classes but also think longer-term about continuing graduate level work in CS. This was my first class at NYCDSA and all I can say is I am extremely grateful to have had Dr. Anthony Schultz as an instructor. Anthony kept the material vibrant and fun with colorful analogies that made the subject matter easy to digest and remember. This allowed for both an esoteric understanding of what was happening "inside the code" coupled with real-world applications that began to streamline and automate a lot of my day-to-day work right after Class 1. I cannot say enough good things about NYCDSA and Dr. Schultz!
John Chen, Software Engineer at American Express

It's important when learning anything to get the fundamentals right. If you build bad habits, it can become difficult to fix them later on, especially if you have also built many dependencies on those bad habits. This is why when I wanted to start learning about data science, I chose to take this course to help me make the right choices from the very beginning.

I would say that I got exactly what I came for. Tony is a very good instructor. He is able to express complicated concepts in an understandable way, and I would definitely say that now I understand enough about the Python ecosystem that I could start learning on my own if I wanted.

Aaron Ouyang, Analyst at Annalect

Great class. For only a 5 week class it is very comprehensive. Covers the basics and commonly used libraries in python for data analysis as well as how to use them. Notebooks used in the class are a great go-to resource after the class ends. Also a great community of data professionals and networking if you are thinking about a new gig.

Michael Caruana, Senior Product Manager, Data Science at Fusion

Great comprehensive course that gives you a thorough overview of Python and how it can be used in the field of Data Science.

Matt Gray, Analyst, Insights & Strategy at NBCUniversal, Inc

As a novice coder, this class was a great way to learn how I can manipulate and analyze data in Python. Would recommend for anyone interested in learning how to use python and apply to daily work.

Diana Enriquez, Content Researcher, TED Content at TED Conferences

I enjoyed this class. I would give it a 4, only because it went a little too fast for me at some points. I am a beginner of the most clearly beginner level. I had played with some front end programming, but never attempted backend work. The 5 hour classes on Saturdays were tough because they required a lot of homework and studying during the week, but the instructor was good about answering questions and pushing us to keep working on new and interesting things. The program was extremely supportive of me while I was trying to learn new material. I have recommended and will continue to recommend this class/NYC Data school.

Kannan Sankaran, Software Engineer, Business Systems at AppNexus

I took the first offering of Data Science using Python a few weeks ago, and definitely recommend it to anyone who loves hands-on learning with some guidance. Let me explain: Last year, I took Coursera’s Machine Learning/Intro to Data Science courses and did well, but did not do a hands-on project that would enable me to retain a lot of knowledge. But this course required me to pick a detailed project and present it to a live audience, who then determined whether I did well or not. So I learned how to do web scraping, extract social media API data, write object-oriented Python, utilize a NoSQL database (MongoDB) to store results, and finally create visualizations in D3 and HighCharts. And then the pressure to present well, just to pass the class. Our instructor John was competent, knowledgeable and helpful, and covered a variety of useful tools like Pandas and Scikit Learn, including machine learning algorithms. And Vivian is always pushing us harder to do better. Sounds familiar?

Christopher Crosbie, Healthcare and Life Science Solution Architect at Amazon Web Services

The instructor, John Downs, was very knowledgeable and did an excellent job of providing an overview in the key areas of Python. After the five week class I went from knowing essentially nothing about Python to using it as one of my “go to” tools in which I am able to accomplish tasks at work on a daily basis.

Sasha Bartashnik, Analytics at Zulily

I took the beginner level Python class with John Downs and really had a great experience. John is very knowledgeable about Python and programming in general, and was able to be helpful to students of all levels in the class. The exercises in class and the homework got our hands dirty with the language and the final project was a great way to create a real result by the end of the course. Overall it was challenging, but a valuable intro to a useful tool that was easier to approach with real-life sessions than self-study demos on my own. I’ll definitely take classes with NYC Data Science Academy in the future and would recommend it to my friends.

Pia Ramchandani, Manager at PwC Advisory Analytics

John Downs’s Python for Data Analysis class was a helpful introduction to using python toolkits such as Pandas and Scikit Learn to work with large and complex data structures. John started the class off slowly to get the group adjusted to Python syntax, but made sure to include all of the essential data management/analysis techniques to get started (e.g. dataset merging, manipulation, basic stats/regression, etc). In a short course, John did a great job of including numerous examples in ipython notebooks that he gives to the class; this approach was very helpful for exposing beginners to more complex techniques that they can go back to when they are ready. I definitely recommend this course to any beginner interested in learning how python can help make data analysis faster and easier.

Paul Schaffer, Director at Analytics Media Group

I strongly recommend this class to all potential students who have some programming background. The pace at the beginning is necessarily rapid to cover the basics of syntax and structure, so that more time can be devoted to numpy/scipy/pandas/etc. John was a fantastic instructor, and impressively it was his first time teaching the course! Super nice/patient/knowledgeable, and he has a real knack for explaining stuff. Taking introduction to Python for Data Analysis was a great decision for me. In a relatively short period of time, I was introduced to the top analytical code libraries in Python and gained experience using them. Well worth the time and money: I’d do it again in a heartbeat.

Date and Time

January Session (early-bird pricing)

Jan 19 - Feb 16, 2020, 1:00-5:00pm
Day 1: January 19, 2020
Day 2: January 26, 2020
Day 3: February 2, 2020
Day 4: February 9, 2020
Day 5: February 16, 2020
Price: $1590.00 (early-bird price: $1510.50)
Register before Dec 20th to take advantage of this price!

March Session (early-bird pricing)

Mar 7 - Apr 4, 2020, 1:00-5:00pm
Day 1: March 7, 2020
Day 2: March 14, 2020
Day 3: March 21, 2020
Day 4: March 28, 2020
Day 5: April 4, 2020
Price: $1590.00 (early-bird price: $1510.50)
Register before Feb 6th to take advantage of this price!

April Session (early-bird pricing)

Apr 19 - May 17, 2020, 1:00-5:00pm
Day 1: April 19, 2020
Day 2: April 26, 2020
Day 3: May 3, 2020
Day 4: May 10, 2020
Day 5: May 17, 2020
Price: $1590.00 (early-bird price: $1510.50)
Register before Mar 20th to take advantage of this price!

June Session (early-bird pricing)

Jun 13 - Jul 18, 2020, 1:00-5:00pm
Day 1: June 13, 2020
Day 2: June 20, 2020
Day 3: June 27, 2020
Day 4: July 11, 2020
Day 5: July 18, 2020
Price: $1590.00 (early-bird price: $1510.50)
Register before May 14th to take advantage of this price!
