Data Validation with Pandera
docker-compose.yml filedocker compose updocker compose rmπ§ Local Development
conda create & conda installenvironment.ymlconda-lock fileDockerfiledocker-compose.ymlβοΈ CI/CD & Deployment
docker pull new imagedocker rungit pull locally to extract updated docker-compose.yml filedocker compose up locally to start the containerdocker compose rm locally to stop the container&& \ in Dockerfile?# build on top of template of minimal notebook
FROM quay.io/jupyter/minimal-notebook:afe30f0c9ad8
# copy all conda environment dependencies
COPY conda-linux-64.lock /tmp/conda-linux-64.lock
# copy my local python package files to pip install in docker
COPY pyproject.toml /tmp/pyproject.toml
COPY src /tmp/src
COPY README.md /tmp/README.md
# conda install all the other packages
RUN mamba update --quiet --file /tmp/conda-linux-64.lock \
&& mamba clean --all -y -f \
&& fix-permissions "${CONDA_DIR}" \
&& fix-permissions "/home/${NB_USER}"
# install openai using pip because the openai package insatlled from conda has bug
# also install my local AI_representation_bias_in_farming as a python package
# 2025-06-22: added gpt-image-1 to the list of models
RUN pip install openai==1.57.0 \
&& python -m pip install -e /tmp Image generated by OpenAI GPT-5


Check out this data cleaning script I created for the Moo4Feed R package
Image generated by OpenAI GPT-5
