Basic background of AWS Lambda. Official developer guide from AWS: AWS Lambda is a serverless, event-driven compute service that lets you run code for virtually any type of application or backend s...
Useful gadget sharing - cron-job.org
Since I started maintaining the side projects I had built before, I began investigating pain points that needed to be improved or could be divided i...
Run your own Apache Spark jobs in AWS EMR and S3
Recently, I participated in Udacity’s Nanodegree Program for Data Engineers. It's kind of a way to review what I did in the past and refresh some tech stacks...
How to train a customized Name Entities Recognition (NER) model based on spaCy pre-trained model
There are a bunch of online resources that teach you how to train your own NER model with spaCy, so I w...
Connect PostgreSQL in docker container with Azure Data Studio
Azure Data Studio is a cool product that can easily connect to MySQL (if you already have it installed on your system) and show what is in your data...
Setting both Celery and Flask inside the docker-compose
The issue I needed to resolve was how to move a heavy task to the background and then run it periodically or asynchronously. And the most import...
- Terraform for Data Engineers: Automating Your Data Infrastructure
- How to train a customized Name Entities Recognition (NER) model based on spaCy pre-trained model
- Run your own Apache Spark jobs in AWS EMR and S3
- Implementing Fivetran Data Source Connector with AWS Lambda
- Learning and Takeaways from Kubesimplify Workshop