Run your own Apache Spark jobs in AWS EMR and S3
Run your own Apache Spark jobs in AWS EMR and S3 Recently, I participated in Udacity’s Nanodegree Program for Data Engineers. It’s kinda like to review what I did past and refresh some tech stacks...
Run your own Apache Spark jobs in AWS EMR and S3 Recently, I participated in Udacity’s Nanodegree Program for Data Engineers. It’s kinda like to review what I did past and refresh some tech stacks...
How to train a customized Name Entities Recognition (NER) model based on spaCy pre-trained model There are a bunch of online resources to teach you how to train your own NER model by spaCy, so I w...
Connect PostgreSQL in docker container with Azure Data Studio Azure Data Studio is a cool product that can easily connect MySQL (if you already installed in your system) and show what in your data...
Setting both Celery and Flask inside the docker-compose Due to the issue I need to resolve is that put the heavy task to background, then run it periodically or asynchronously. And the most import...