This course offers an accelerated intensive learning experience with Tableau – the growing standard in business intelligence for data visualization and dashboard creation. Without prior experience, students will learn to work with multiple data sources, create compelling visualizations, and roll out their data science products for continuous, scalable outputs to key stakeholders. By building insight and weaving narrative, students will be empowered to harness data in a striking way that provides value to organizations large and small.
This is a class for computer-literate people with no programming background who wish to learn basic Python programming. The course is aimed at those who want to learn “data wrangling” – manipulating downloaded files to make them amenable to analysis. We concentrate on language basics such as list and string manipulation, control structures, simple data analysis packages, and introduce modules for downloading data from the web.
This course is an introduction to ML systems in production that will demonstrate and give students exposure to how real production ML systems operate. Using Python, Docker, Kubernetes, Google Cloud and various open-source tools, students will bring the different components of an ML system to life and setup real, automated infrastructure.
This course demonstrates a practical and intuitive approach to NLP applications through variety of different use-cases. Essentials and practical fundamentals of NLP methods are presented via generic Python packages including but not limited to Regex, NLTK, SpaCy and Huggingface. The high-level foundations followed by hands-on code examples on a notebook environment will be studied touching on different aspects of NLP from conventional statistical text analytics approaches to the state-of-the-art deep/transfer learning models paired with result interpretations, industry challenges, visualizations and a prototype web application.
This is a 6-week evening program providing a hands-on introduction to the Hadoop and Spark ecosystem of Big Data technologies. The course will cover these key components of Apache Hadoop: HDFS, MapReduce with streaming, Hive, and Spark. Programming will be done in Python. The course will begin with a review of Python concepts needed for our examples. The course format is interactive. Students will need to bring laptops to class.
This course is a 35-hour program designed to provide a comprehensive introduction to R. You’ll learn how to load, save, and transform data as well as how to write functions, generate graphs, and fit basic statistical models with data. In addition to a theoretical framework in which you will learn the process of data analysis, this course focuses on the practical tools needed in data analysis and visualization. By the end of the course, you will have mastered the essential skills of processing, manipulating and analyzing data of various types, creating advanced visualizations, generating reports, and documenting your codes.
NYC Data Science Academy’s mission is to provide accelerated data science training
programs that prepare people for employment as data science professionals and to offer
continuing education courses for professional development.