Data Pipeline for analysis on simulated online store for computer components.

Description

Data Pipeline for analysis on simulated online store for computer components. This service will upload information from multiple sources, store data on a Cloud Data Base, clean and enrich it in order to generate granular indicators and insights on Sales results that will be placed on 3 different platforms: Visualization tool, second Cloud database (different provider) and Rest API to serve external apps.

Child issues

Issue Type Icon XDP-2 Create Workflow Diagram Priority: Medium Assignee:
Done
Issue Type Icon XDP-4 Python script to scrape Item descriptions and prices from NewEgg portal and then injecting info on Cloud Data Base Priority: Medium Assignee:
Done
Issue Type Icon XDP-5 Sign up to Lucid Chart Priority: Medium Assignee:
Done
Issue Type Icon XDP-6 Create the workflow with all the logos and Icons Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-7 Code the script Priority: Medium Assignee:
Done
Issue Type Icon XDP-8 Test the result by sending resultant data frame to Google Sheets Priority: Medium Assignee:
Done
Issue Type Icon XDP-9 Create instance on GCP - Big Query Priority: Medium Assignee:
Done
Issue Type Icon XDP-10 Send all data Frames to cloud DB instance (Big Query) Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-11 Create Rest API to enable external users to consume information from our Data Base Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-12 Outline the Stament of Work Document Priority: Medium Assignee:
Done
Issue Type Icon XDP-13 Code Python Script to consume information from Census.gov API Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-14 Code Python Script to scrape sales data from multiple PDF files, clean info and consolidate results on one single CSV file that will be sent to Google Drive Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-15 Clean tables on Google Sheets and upload data to Cloud DB as tables by using Python Script Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-16 Configure Big Query Instance, set up data base Model and perform SQL queries to capture insights Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-17 Set up Power Bi Instance and create visual Dashboard Priority: Medium Assignee:
To Do
Issue Type Icon XDP-18 Configure Backup SQL data base on Azure Priority: Medium Assignee:
To Do
Issue Type Icon XDP-19 Update Portfolio Website Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-20 Code The Script Priority: Medium Assignee:
Done
Issue Type Icon XDP-21 Enhance the Notebook by adding text to describe steps Priority: Medium Assignee:
Done
Issue Type Icon XDP-22 Test the script Priority: Medium Assignee:
Done
Issue Type Icon XDP-23 Enable connection to Big Query Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-24 Code the script back bone Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-25 Expand functionality Priority: Medium Assignee:
To Do
Issue Type Icon XDP-26 Test API with another script Priority: Medium Assignee:
To Do
Issue Type Icon XDP-27 Collect all Related Data Frame into G-sheets within Google Drive Priority: Medium Assignee:
To Do
Issue Type Icon XDP-28 Wrangle and perform action to Clean Tables Priority: Medium Assignee:
To Do
Issue Type Icon XDP-29 Create Python Script to grab Gsheets from Drive Folder, convert them into pandas Data Frames Priority: Medium Assignee:
Done
Issue Type Icon XDP-30 Make the script to Establish connection with Big Query and send the data frames as Tables Priority: Medium Assignee:
Done
Issue Type Icon XDP-31 Create instance on Big Query Priority: Medium Assignee:
Done
Issue Type Icon XDP-32 Test instance with scripts that are already operational Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-33 Import data from all established sources Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-34 Configure Joins and Keys according to ERD, Priority: Medium Assignee:
To Do
Issue Type Icon XDP-35 Perform SQL statements to find and capture insights to feed all outlets for end system Priority: Medium Assignee:
To Do
Issue Type Icon XDP-36 Create instance on Azure SQL Priority: Medium Assignee:
Done
Issue Type Icon XDP-37 Test data base and perform basic SQL operations Priority: Medium Assignee:
To Do
Issue Type Icon XDP-38 Code Script to extract data from Big Query and load data to Azure Backup DB Priority: Medium Assignee:
To Do
Issue Type Icon XDP-39 Define set of graphics and visual aids that we'll use Priority: Medium Assignee:
To Do
Issue Type Icon XDP-40 Invoke Big Query connector and configure pipeline Priority: Medium Assignee:
Done
Issue Type Icon XDP-41 Test visuals Priority: Medium Assignee:
To Do
Issue Type Icon XDP-42 Enable Power Bi public portal Priority: Medium Assignee:
Done
Issue Type Icon XDP-43 Update Website with each partial progress Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-44 Enhance Descriptions Priority: Medium Assignee:
To Do
Issue Type Icon XDP-45 Make sure items for portfolio project are displayed on the right order Priority: Medium Assignee:
To Do
Issue Type Icon XDP-46 Sign up for Confluence Priority: Medium Assignee:
Done
Issue Type Icon XDP-47 Create Template and link it to Jira Project Priority: Medium Assignee:
Done
Issue Type Icon XDP-48 Update content and wording Priority: Medium Assignee:
In Progress
Issue Type Icon XDP-49 Attach Jira's Road Map Priority: Medium Assignee:
Done
Issue Type Icon XDP-50 Publish document on Website Priority: Medium Assignee:
Done
Issue Type Icon XDP-51 Code the Script Priority: Medium Assignee:
Done
Issue Type Icon XDP-52 Enhance Notebook comments Priority: Medium Assignee:
Done

Activity