Python Programming and Data Analytics Project
Title: Python Programming and Data Analytics
Author: (Your Full Name)
Bio: dasclab.uonbi.ac.ke/analytics/members/your-account
Date: YYYY-MM-DD
*All questions all compulsory
Instructions: Fill in your answers in the space after each section. Include pictures of the plots where applicable.
For questions 2 and 3 use the provided workbooks linked for each question.
General
-
Name 5 Python Modules in the Standard Library and describe what they are mainly used for.
-
Name 5 external modules of Python and describe the main use cases of each of these modules
Data Analysis
1. Vehicle Dataset
Resources
Student Workbook
Vehicle Dataset
If you are having problems please refer to this document:
Data Analysis with Python Pandas Notebook
Instructions
Import all the libraries listed in the first cell. Make sure all modules are installed.
Use the provided data set to answer the following:
Use pandas to come up with:
- The titles and prices of 10 Cars with highest price
- The titles and prices of 5 Buses & Microbuses with highest price
- The titles and prices of 5 Trucks & Trailers with highest price
Plotting
Use matplotlib to come up with a plot indicating the top 10 brands that we have in the vehicle_dataset
Key performance Metrics:
-
Ensure all the plots have a Title
-
Ensure all plots have x labels and y labels where applicable
-
Your plots should be clearly visible. Change the size of your plot to a comfortable width and height.
-
Save all your plots
2. Time Series Data
Resources¶
- Student Notebook
- Clean Dataset
If you are having problems please refer to this document:
Instructions
Import all the libraries listed in the first cell. Make sure all modules are installed.
Use the data set provided to answer the following:
-
a) What is the lowest price for Safaricom (SCOM) b) What was the date when Safaricom had the lowest price?
-
a) What is the highest price Safaricom stock reached in the data b) What was the date when Safaricom stock recorded the highest price?
-
Create a line plot for Safaricom stock and verify if the information provided above is indeed correct.
-
Select one of the sectors provided (agric, comm, bank, const, energy, insur, invest, manu)
-
a) Use pandas to create a subset containing all the rows of the dataframe and only companies in your selected sector. Rename this dataframe to the sector_name_df
-
b) Using the subset for the sector, use matplotlib subplot to create subplots to fit all the sector stocks in one plot. One row can have a maximum of 3 charts.
- c) Using your sector DataFrame use the corr() DataFrame method to come up with a correlogram. Create a DataFrame for these correlations
- d) Use Seaborn to plot the correlation plot for your sector stocks.
Key performance Metrics:
- Go an extra step to produce charts that are visually appealing
- Ensure all the plots have a Title
- Ensure all plots have x labels and y labels where applicable
- Your plots should be clearly visible. Change the size of your plot to a comfortable width and height.
- Save all your plots
Last updated 2022-03-18 16:10:39 by Kennedy Waweru