Extract, Transform, Load (ETL). All the scripts (glue and sagemaker) and custom libs.
git clone https://<your-git-user-name>@bitbucket.org/ashishgenerico/etl.git
Go inside the ./etl/ directory:
cd etl
Get the zeno_secrets.py file from your fellow team members and paste that file inside the secret directory.
For Glue jobs, create a python3.6 virtual env and activate it:
python3 -m venv etl_env
source etl_env/bin/activate
Install requirements.txt inside the virtual env:
pip install -r requirements.txt
Put your script in the glue-jobs/src/scripts/<your_script_name>/ folder.
For SageMaker jobs, create a python3.7 (or greater) virtual env and activate it:
python3 -m venv etl_env
source etl_env/bin/activate
Install requirements-ml.txt inside the virtual env:
pip install -r requirements-ml.txt
Put your script in the sagemaker-jobs/src/scripts/<your_script_name>/ folder.
sagemaker-jobs/src/scripts/experiments/
.\templates\templetes.json file
If the zeno_etl_libs custom library changes, then we need to publish it on PyPI, using the setup.py present in the etl folder:
python setup.py sdist bdist_wheel
twine upload dist/* --verbose -u <your-pypi-user-name> -p <your-pypi-password>
!pip install zeno_etl_libs==new.version.number
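After installing, it can be worth confirming that the notebook actually picked up the new release. The following is a minimal sketch, assuming Python 3.8 or later (for `importlib.metadata`); the helper name `installed_version` is hypothetical and is not part of zeno_etl_libs.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Compare the printed value with the version you just published.
print(installed_version("zeno_etl_libs"))
```

If the printed version is still the old one, restarting the notebook kernel after the install is usually the first thing to try.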
Add the folder (./zeno_etl_libs) to the python search path before import zeno_etl_libs to use it for local development:
import sys
sys.path.append('../../../..')
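The relative path '../../../..' is resolved against the current working directory, so it only works when the script is started from its own folder. Below is a sketch of a variant that anchors the path to the script file instead. The helper name `add_ancestor_to_path` is hypothetical, and the default of four levels is an assumption taken from the four `..` segments above.

```python
import sys
from pathlib import Path


def add_ancestor_to_path(start: str, levels_up: int = 4) -> str:
    """Append the directory `levels_up` levels above `start`'s folder to sys.path."""
    base = Path(start).resolve()
    # For a file, start counting from the folder that contains it.
    folder = base if base.is_dir() else base.parent
    root = str(folder.parents[levels_up - 1]) if levels_up > 0 else str(folder)
    if root not in sys.path:  # avoid duplicate entries on re-runs
        sys.path.append(root)
    return root
```

In a script this would be called as `add_ancestor_to_path(__file__)` before `import zeno_etl_libs`. In a notebook, where `__file__` is not defined, passing `os.getcwd()` gives the same behaviour as the relative path.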
export LC_ALL=en_US.utf-8 && export LANG=en_US.utf-8 && export PYTHONIOENCODING=utf8
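These variables set the locale and force UTF-8 for Python's standard streams. The sketch below is one way to check, from inside Python, what the interpreter actually picked up; the function name `encoding_report` is hypothetical.

```python
import locale
import os
import sys


def encoding_report() -> dict:
    """Collect the encoding-related settings visible to this interpreter."""
    return {
        "LC_ALL": os.environ.get("LC_ALL"),
        "LANG": os.environ.get("LANG"),
        "PYTHONIOENCODING": os.environ.get("PYTHONIOENCODING"),
        "stdout_encoding": getattr(sys.stdout, "encoding", None),
        "preferred_encoding": locale.getpreferredencoding(False),
    }


for name, value in encoding_report().items():
    print(f"{name}: {value}")
```

The exports only affect processes started from the same shell afterwards, so the check should be run from that shell.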
aws codebuild delete-project --name create-sagemaker-container-env_name-notebook-runner
Copy ./extra_dependency/Dockerfile to the virtual environment's site-packages at ./site-packages/sagemaker_run_notebook/container/Dockerfile; basically, add these extra commands to the file:
RUN apt-get update
RUN pip3 install --upgrade pip
RUN apt-get install -y gcc g++ build-essential python3-dev
run-notebook create-container env_name-notebook-runner --requirements requirements-ml.txt
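The Dockerfile edit above can also be scripted, so that it is easy to repeat after the virtual env is recreated. This is a minimal sketch, assuming the three RUN lines are simply appended to the end of the container Dockerfile (the steps above do not say where in the file they belong). The function name `append_extra_commands` is hypothetical, and the path in the usage comment must be adjusted to your own site-packages location.

```python
from pathlib import Path

# The extra commands listed in the steps above.
EXTRA_COMMANDS = [
    "RUN apt-get update",
    "RUN pip3 install --upgrade pip",
    "RUN apt-get install -y gcc g++ build-essential python3-dev",
]


def append_extra_commands(dockerfile: str, commands=EXTRA_COMMANDS) -> int:
    """Append any command not already in the Dockerfile; return how many were added."""
    path = Path(dockerfile)
    text = path.read_text()
    existing = {line.strip() for line in text.splitlines()}
    missing = [cmd for cmd in commands if cmd not in existing]
    if missing:
        # Make sure the appended commands start on a new line.
        prefix = "" if text.endswith("\n") or not text else "\n"
        path.write_text(text + prefix + "\n".join(missing) + "\n")
    return len(missing)


# Usage (the path is an assumption; point it at your own environment):
# append_extra_commands("./site-packages/sagemaker_run_notebook/container/Dockerfile")
```

Running it twice is harmless, since commands already present are skipped. If the Dockerfile installs the requirements before its last line, the commands need to go above that step instead of at the end.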