- Toronto, On, Canada
- https://www.linkedin.com/in/andyphamto/
- @QuangAnhPham6
-
-
Top-Rentals-Cineplex Public
Applying data engineering techniques to create data pipeline with Azure Cloud Computing
-
-
-
spark-optimization Public
Hands-on experience optimizing PySpark code.
Jupyter Notebook UpdatedFeb 23, 2022 -
spark-mini-project Public
Using Spark transformations to solve traditional MapReduce data problems.
-
hadoop-mini-project Public
Using hadoop to utilize data from an automobile tracking platform that tracks the history of important incidents after the initial sale of a new vehicle.
-
-
kafka-mini-project-2 Public
create a streaming application as a simple real-time fraud detection system backed by Apache Kafka using a Python client.
-
-
airflow-mini-project1 Public
Utilize Docker and Apache Airflow to orchestrate the pipeline, exercise the DAG creation, uses of various operators (BashOperator, PythonOperator, etc), setting up order of operation of each task.
-
kubernetes-the-hard-way Public
Forked from kelseyhightower/kubernetes-the-hard-wayBootstrap Kubernetes the hard way on Google Cloud Platform. No scripts.
Apache License 2.0 UpdatedNov 26, 2021 -
-
-
EURO_CUP_2016_PostgreSQL Public
EURO CUP 2016 mini-project PostgreSQL schema and tables setup with solutions
UpdatedNov 8, 2021 -
banking-system Public
Python OOP - Banking System with MySQL database case study.
-
Python-Challenge-Questions Public
This repository consists of the collection of Python Challenges' solutions and some Python tips and fundamentals.
Jupyter Notebook UpdatedSep 24, 2021 -
Creating an interactive KPI Dashboard and other visualizations with Tableau π .
-
Statistical-Analysis Public
This repository focused on statistical analysis and exploration used on various data sets for personal and professional projects. π
-
SQL-Challenge-Questions Public
This repository consists of all the SQL solutions with intuitive explanations that I have done. (including supplemental readings relating to the tools used)
1 UpdatedJul 1, 2021 -
GoodVitamins Public
Applying NLP and unsupervised machine learning technique to quickly showing the top representative user reviews of vitamin products from iHerb π .
Jupyter Notebook UpdatedJun 24, 2021 -
A quick webscraping tutorial with BeautifulSoup and Pandas π·οΈ πΌ
-
Web-Scraping-with-Selenium Public
An intuitive tutorial of web scraping with Selenium. π·οΈ
Jupyter Notebook UpdatedJun 14, 2021 -
A quick and intuitive tutorial for a simple WebApp deployment with Streamlit and Heroku π» .
HTML UpdatedJun 10, 2021 -
-
Lessons-Learned-Data-Science-Interviews Public
Forked from gkamradt/Lessons-Learned-Data-Science-InterviewsLessons learned the hard way through over 30+ data science interviews
UpdatedOct 2, 2020 -
WebScraping Public
Forked from MariyaSha/WebScrapingCreate a database from scratch by extracting html elements from a webpage
Jupyter Notebook UpdatedJun 3, 2020 -
python_data_pipeline Public
Forked from nickmancol/python_data_pipelineA Simple Pure Python Data Pipeline to process a Data Stream
Python MIT License UpdatedFeb 25, 2020 -
textpack Public
Forked from lukewhyte/textpackGroup thousands of similar spreadsheet or database text entries in seconds
Python MIT License UpdatedFeb 12, 2020 -
Data-Science--Cheat-Sheet Public
Forked from kadnan/Data-Science--Cheat-SheetCheat Sheets
TeX UpdatedNov 22, 2019