Overview
This Data Science with Python training course teaches engineers, data scientists, statisticians, and other quantitative professionals the Python programming skills they need to analyze and chart data.
Objectives
At the end of Data Science with Python training course, participants will be able to
- Understand the difference between Python basic data types
- Know when to use different python collections
- Implement python functions
- Understand control flow constructs in Python
- Handle errors via exception handling constructs
- Be able to quantitatively define an answerable, actionable question
- Import both structured and unstructured data into Python
- Parse unstructured data into structured formats
- Understand the differences between NumPy arrays and pandas dataframes
- Understand where Python fits in the Python/Hadoop/Spark ecosystem
- Simulate data through random number generation
- Understand mechanisms for missing data and analytic implications
- Explore and Clean Data
- Create compelling graphics to reveal analytic results
- Reshape and merge data to prepare for advanced analytics
- Find test for group differences using inferential statistics
- Implement linear regression from a frequentist perspective
- Understand non-linear terms, confounding, and interaction in linear regression
- Extend to logistic regression to model binary outcomes
- Understand the difference between machine learning and frequentist approaches to statistics
- Implement classification and regression models using machine learning
- Score new datasets, evaluate model fit, and quantify variable importance
Prerequisites
All attendees should have prior programming experience and an understanding of basic statistics.
Course Outline
- History and current use
- Installing the Software
- Python Distributions
- String Literals and numeric objects
- Collections (lists, tuples, dicts)
- Datetime classes in Python
- Memory Management in Python
- Control Flow
- Functions
- Exception Handling
- Defining the quantitative construct to make inference on the question
- Identifying the data needed to support the constructs
- Identifying limitations to the data and analytic approach
- Constructing Sensitivity analyses
- Structured Data
- Structured Text Files
- Excel workbooks
- SQL databases
- Working with Unstructured Text Data
- Reading Unstructured Text
- Introduction to Natural Language Processing with Python
- Introduction to the ndarray
- NumPy operations
- Broadcasting
- Missing data in NumPy (masked array)
- NumPy Structured arrays
- Random number generation
- Filtering
- Creating and deleting variables
- Discretization of Continuous Data
- Scaling and standardizing data
- Identifying Duplicates
- Dummy Coding
- Combining Datasets
- Transposing Data
- Long to wide and back
- Univariate Statistical Summaries and Detecting Outliers
- Multivariate Statistical Summaries and Outlier Detection
- Group-wise calculations using Pandas
- Pivot Tables
- Histogram
- Box-and-whiskers plot
- Scatter plots
- Forest Plots
- Group-by plotting
Testimonials
Synergific Software Team has been very supportive, and working with them has been a best decision that we
could ever made, They are just a call away. You guys are AWESOME, Thank You, Keep up the Good Work!!!
Shamsudeen Bawa
Vice President, J.P Morgan, CIS, USA
Synergific Software has been of great help and I plan to continue to use your services in the future for
my business needs.
Farhan Hafiz
Data Architect, Fiserv
I think Synergific Software is great. I liked that it was hassle free and easy to set up. Again, it's a great feature for a fast and cheap set up, which gives me peace of mind, as I know have a terms of use agreement.
Dr. Sahdev Singh
Under Secretary, Ministry of Law & Justice, Govt. of India
I liked using Synergific Software very much. I thought the website was easy to navigate and the instructions for generating the terms was clear. I even recommended you on a Facebook Group I am a member of.
M Chikanna Swamy
Director & Learning Head, Mindtree