samueltyh

Implementing Fivetran Data Source Connector with AWS Lambda

Basic background of AWS Lambda. Official developer guide from AWS: AWS Lambda is a serverless, event-driven compute service that lets you run code for virtually any type of application or backend s...

Useful gadget sharing - cron-job.org

Having resumed maintenance of side projects I built earlier, I started investigating pain points that needed improvement or could be divided i...

Run your own Apache Spark jobs in AWS EMR and S3

Recently, I participated in Udacity’s Nanodegree Program for Data Engineers. It was a bit like reviewing what I had done in the past and refreshing some tech stacks...

How to train a customized Named Entity Recognition (NER) model based on a spaCy pre-trained model

There are a bunch of online resources that teach you how to train your own NER model with spaCy, so I w...

Connect PostgreSQL in docker container with Azure Data Studio

Azure Data Studio is a cool product that can easily connect to MySQL (if you have already installed it on your system) and show what is in your data...

Setting both Celery and Flask inside the docker-compose

The issue I need to resolve is moving a heavy task to the background, then running it periodically or asynchronously. And the most import...