Python Programming and Data Analytics Project

Title: Python Programming and Data Analytics 

Author: (Your Full Name) 

Bio: dasclab.uonbi.ac.ke/analytics/members/your-account 
Date: YYYY-MM-DD 

 

*All questions all compulsory 

Instructions: Fill in your answers in the space after each section. Include pictures of the plots where applicable. 

For questions 2 and 3 use the provided workbooks linked for each question. 


General 

  1. Name 5 Python Modules in the Standard Library and describe what they are mainly used for.  

  2. Name 5 external modules of Python and describe the main use cases of each of these modules 


Data Analysis 


1. Vehicle Dataset 
 




Resources 

Student Workbook

Vehicle Dataset 
If you are having problems please refer to this document: 

Data Analysis with Python Pandas Notebook 

Instructions 
 

Import all the libraries listed in the first cell. Make sure all modules are installed. 

Use the provided data set to answer the following: 

Use pandas to come up with:
 

  1. The titles and prices of 10 Cars with highest price 
  2. The titles and prices of 5 Buses & Microbuses with highest price 
  3. The titles and prices of 5 Trucks & Trailers with highest price  


    Plotting 

Use matplotlib to come up with a plot indicating the top 10 brands that we have in the vehicle_dataset 

Key performance Metrics: 

  • Ensure all the plots have a Title 

  • Ensure all plots have x labels and y labels where applicable 

  • Your plots should be clearly visible. Change the size of your plot to a comfortable width and height. 

  • Save all your plots 


2. Time Series Data 


 




Resources 
 

  1. Student Notebook 
  2. Clean Dataset 

    If you are having problems please refer to this document: 
     
  3. Time Series Notebook 


Instructions 

Import all the libraries listed in the first cell. Make sure all modules are installed. 

Use the data set provided to answer the following: 

  1. a) What is the lowest price for Safaricom (SCOM) b) What was the date when Safaricom had the lowest price? 

  1. a) What is the highest price Safaricom stock reached in the data b) What was the date when Safaricom stock recorded the highest price? 

  1. Create a line plot for Safaricom stock and verify if the information provided above is indeed correct. 

  1. Select one of the sectors provided (agric, comm, bank, const, energy, insur, invest, manu) 

  1. a) Use pandas to create a subset containing all the rows of the dataframe and only companies in your selected sector. Rename this dataframe to the sector_name_df 

  1. b) Using the subset for the sector, use matplotlib subplot to create subplots to fit all the sector stocks in one plot. One row can have a maximum of 3 charts. 

  • c) Using your sector DataFrame use the corr() DataFrame method to come up with a correlogram. Create a DataFrame for these correlations 
  • d) Use Seaborn to plot the correlation plot for your sector stocks.
     


Key performance Metrics: 
 

  • Go an extra step to produce charts that are visually appealing 
  • Ensure all the plots have a Title 
  • Ensure all plots have x labels and y labels where applicable 
  • Your plots should be clearly visible. Change the size of your plot to a comfortable width and height. 
  • Save all your plots 


 


Submission Portal

Last updated 2022-03-18 16:10:39 by Kennedy Waweru