< Go back to Python for Data Scientists training course
Editor (add & remove topics to suit your needs)
INSTALLATION & PACKAGING
Installation packagaing and virtualization of Python such as Conda & Anaconda. Conda is a cross-platform tool for managing packages and environments. Anaconda is a free enterprise-ready Python distribution that includes 150 installed of the most popular python packages for science, math, engineering and data analysis. Anaconda comes with conda to manage libraries and environments.
Python Developement environments including Jupyter
Become familiar with the most common development environments for Python and discover how to develop, test and debug Python programs using modern best practice. We'll go through Jupyter, iPython, PyCharm and PyDev.
Introduction to Python's data structures, basic constructs and core components of the standard library. Overview of how Python is used in the field of Data Science, its benefits as fast prototyping tool as well as its production-ready capabilities.
Designing your Python Script
Understanding of Object Orientation is key to using many of Python's excellent Data Analytics libraries and is the most popular programming paradigm in use today. Object Orientation introduces concepts such as objects, classes, inheritance and polymorphism and focuses on the manipulation of mutable data.
In addition to Python's object oriented features, Python also fully supports the Function Programming paradigm. Functional programming concentrates on transformation of immutable data using functions. Topics include comprehensions, iterators, generators, lambdas and decorators. Functional programming concepts open the way for parallelism and provide the intuitions used by many popular Big Data frameworks.
Arrays & Matrices using Numpy & SciPy
These libraries are the building blocks for the more advanced Data Analytical libraries. Numpy provides high speed manipulation of multi-dimensional data. This core Python library is written in C and hence is extremely fast and can work with very large data sets.
Data Visualization with SEABORN, BOKEH & MatplotLib
MatplotLib is an outstanding plotting library, essential for visualizing 'Big Data'. It is very mature and can perform almost any plotting you could possibly need including interactive plots and simulations.
DATAFRAMES WITH Pandas
Most real data these days is provided in the form of spreadsheets and Pandas and OpenPyXl are two libraries that make analysis of such data as simple as possible. Pandas is particularly good at handling large data sets with missing or faulty rows and columns.
MACHINE Learning & Predictive analytics with SciKitLearn
SciKitLearn provides simple and efficient tools for data classification, mining and analysis. SciKitLearn is a sophisticated data modelling package.
Write conditional constructs to tweak the execution of your scripts and get to know the Pandas DataFrame: the key data structure for Data Science in Python.
Audience & Requirements
Duration, location and dates
If you have a course length in mind, please select from the list
More than 5 days
Where would you like the course to be held?
On your site
At our London offices
Please quote for both options
Where is your site located?
If you have a date and location in mind, enter them here....
Complete the form oppositebelow to get your quote >>>
GET A CUSTOM COURSE QUOTE
Complete this form for an instant quote Or for more information call 0800 028 6400