Data pipeline for analysis of a simulated online store for computer components

Description

This service is a data pipeline for a simulated online store that sells computer components. It ingests information from multiple sources, stores it in a cloud database, and cleans and enriches it to generate granular indicators and insights on sales results. The results are delivered to three platforms: a visualization tool, a second cloud database hosted by a different provider, and a REST API that serves external apps.
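
The clean, aggregate, and deliver stages described above could be sketched roughly as follows. This is only a minimal illustration using the Python standard library, not the project's actual scripts: the `Sale` record, the field names (`sku`, `category`, `units`, `unit_price`), the sample rows, and the three in-memory sinks standing in for the visualization tool, the backup database, and the REST API are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Sale:
    # Hypothetical record layout; the real tables may differ.
    sku: str
    category: str
    units: int
    unit_price: float


def clean(raw_rows):
    """Normalize text fields and skip rows that are malformed or have no units sold."""
    cleaned = []
    for row in raw_rows:
        try:
            sale = Sale(
                sku=row["sku"].strip().upper(),
                category=row["category"].strip().lower(),
                units=int(row["units"]),
                unit_price=float(row["unit_price"]),
            )
        except (KeyError, ValueError, TypeError, AttributeError):
            continue  # missing field or value that cannot be parsed
        if sale.units > 0 and sale.unit_price >= 0:
            cleaned.append(sale)
    return cleaned


def revenue_by_category(sales):
    """Aggregate one example indicator: total revenue per category."""
    totals = defaultdict(float)
    for sale in sales:
        totals[sale.category] += sale.units * sale.unit_price
    return dict(totals)


def publish(indicators, sinks):
    """Hand the same indicators to every output platform."""
    for sink in sinks:
        sink(indicators)


# Made-up sample input standing in for the scraped and imported sources.
raw_rows = [
    {"sku": " gpu-01 ", "category": "GPU", "units": "2", "unit_price": "300.00"},
    {"sku": "cpu-07", "category": "cpu", "units": "1", "unit_price": "250.50"},
    {"sku": "gpu-02", "category": "gpu ", "units": "1", "unit_price": "400.00"},
    {"sku": "ram-03", "category": "ram", "units": "n/a", "unit_price": "80.00"},
]

# In-memory stand-ins for the three delivery targets.
dashboard, backup_db, api_cache = [], [], []

indicators = revenue_by_category(clean(raw_rows))
publish(indicators, [dashboard.append, backup_db.append, api_cache.append])
```

Keeping each sink as a plain callable means a new delivery target can be added without touching the cleaning or aggregation steps.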

Child issues

XDP-2: Create workflow diagram (Priority: Medium, Status: Done)
XDP-4: Python script to scrape item descriptions and prices from the Newegg portal and load them into the cloud database (Priority: Medium, Status: Done)
XDP-5: Sign up for Lucidchart (Priority: Medium, Status: Done)
XDP-6: Create the workflow with all the logos and icons (Priority: Medium, Status: In Progress)
XDP-7: Code the script (Priority: Medium, Status: Done)
XDP-8: Test the result by sending the resulting DataFrame to Google Sheets (Priority: Medium, Status: Done)
XDP-9: Create instance on GCP BigQuery (Priority: Medium, Status: Done)
XDP-10: Send all DataFrames to the cloud DB instance (BigQuery) (Priority: Medium, Status: In Progress)
XDP-11: Create REST API to enable external users to consume information from our database (Priority: Medium, Status: In Progress)
XDP-12: Outline the Statement of Work document (Priority: Medium, Status: Done)
XDP-13: Code Python script to consume information from the Census.gov API (Priority: Medium, Status: In Progress)
XDP-14: Code Python script to scrape sales data from multiple PDF files, clean it, and consolidate the results into a single CSV file that will be sent to Google Drive (Priority: Medium, Status: In Progress)
XDP-15: Clean tables in Google Sheets and upload the data to the cloud DB as tables using a Python script (Priority: Medium, Status: In Progress)
XDP-16: Configure BigQuery instance, set up the database model, and run SQL queries to capture insights (Priority: Medium, Status: In Progress)
XDP-17: Set up Power BI instance and create visual dashboard (Priority: Medium, Status: To Do)
XDP-18: Configure backup SQL database on Azure (Priority: Medium, Status: To Do)
XDP-19: Update portfolio website (Priority: Medium, Status: In Progress)
XDP-20: Code the script (Priority: Medium, Status: Done)
XDP-21: Enhance the notebook by adding text to describe steps (Priority: Medium, Status: Done)
XDP-22: Test the script (Priority: Medium, Status: Done)
XDP-23: Enable connection to BigQuery (Priority: Medium, Status: In Progress)
XDP-24: Code the script backbone (Priority: Medium, Status: In Progress)
XDP-25: Expand functionality (Priority: Medium, Status: To Do)
XDP-26: Test API with another script (Priority: Medium, Status: To Do)
XDP-27: Collect all related DataFrames into Google Sheets within Google Drive (Priority: Medium, Status: To Do)
XDP-28: Wrangle and clean tables (Priority: Medium, Status: To Do)
XDP-29: Create Python script to grab Google Sheets from the Drive folder and convert them into pandas DataFrames (Priority: Medium, Status: Done)
XDP-30: Make the script establish a connection with BigQuery and send the DataFrames as tables (Priority: Medium, Status: Done)
XDP-31: Create instance on BigQuery (Priority: Medium, Status: Done)
XDP-32: Test instance with scripts that are already operational (Priority: Medium, Status: In Progress)
XDP-33: Import data from all established sources (Priority: Medium, Status: In Progress)
XDP-34: Configure joins and keys according to the ERD (Priority: Medium, Status: To Do)
XDP-35: Run SQL statements to find and capture insights to feed all outlets of the end system (Priority: Medium, Status: To Do)
XDP-36: Create instance on Azure SQL (Priority: Medium, Status: Done)
XDP-37: Test database and perform basic SQL operations (Priority: Medium, Status: To Do)
XDP-38: Code script to extract data from BigQuery and load it into the Azure backup DB (Priority: Medium, Status: To Do)
XDP-39: Define the set of graphics and visual aids that we'll use (Priority: Medium, Status: To Do)
XDP-40: Invoke BigQuery connector and configure pipeline (Priority: Medium, Status: Done)
XDP-41: Test visuals (Priority: Medium, Status: To Do)
XDP-42: Enable Power BI public portal (Priority: Medium, Status: Done)
XDP-43: Update website with each partial progress (Priority: Medium, Status: In Progress)
XDP-44: Enhance descriptions (Priority: Medium, Status: To Do)
XDP-45: Make sure items for the portfolio project are displayed in the right order (Priority: Medium, Status: To Do)
XDP-46: Sign up for Confluence (Priority: Medium, Status: Done)
XDP-47: Create template and link it to Jira project (Priority: Medium, Status: Done)
XDP-48: Update content and wording (Priority: Medium, Status: In Progress)
XDP-49: Attach Jira roadmap (Priority: Medium, Status: Done)
XDP-50: Publish document on website (Priority: Medium, Status: Done)
XDP-51: Code the script (Priority: Medium, Status: Done)
XDP-52: Enhance notebook comments (Priority: Medium, Status: Done)

Activity