You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

90 lines
2.4 KiB
Markdown

3 years ago
# Actes princiers -- refactoring datascience
3 years ago
3 years ago
## Project Name
human readable name : `Actes Princiers`
The project name 'Actes Princiers' has been applied to:
- The project title in `datascience/actes-princiers/README.md`
- The folder created for your project in `datascience/actes-princiers`
- The project's python package in `datascience/actes-princiers/src/actes_princiers`
A best-practice setup includes initialising git and creating a virtual environment before running 'pip install -r src/requirements.txt'
3 years ago
## Getting started
3 years ago
3 years ago
- Install the virtual environment : `python3 -m venv .venv`
- Enable the virtual environment : `source .venv/bin/activate`
3 years ago
- install kedro `pip install kedro`
3 years ago
- Install the packages and libraries `pip install -r src/requirements.txt`
3 years ago
3 years ago
**go to `actes-princiers`'s folder**
3 years ago
Then open a terminal in the `actes-princiers`'s folder
3 years ago
and launch jupyter : `kedro jupyter notebook`
or start the ipython prompt : `kedro ipython`
3 years ago
3 years ago
## Launching the pipelines
3 years ago
3 years ago
Open a terminal in the `actes-princiers`'s folder and launch kedro
3 years ago
`kedro run`
3 years ago
or launch a specific node in the pipeline with:
`kedro run --nodes=preprocess_html`
or a search by tags with:
`kedro run --tags=xsl`
3 years ago
## Visualizing the pipelines
`kedro viz`
3 years ago
## Building the docs
`./build-docs.sh docs`
the html built doc is `here <docs/build/html/>`_
3 years ago
## Developper's rules and guidelines
3 years ago
3 years ago
Declare any dependencies in `src/requirements.txt` for `pip` installation.
3 years ago
3 years ago
To install them, run: `pip install -r src/requirements.txt`
3 years ago
3 years ago
## tips
3 years ago
3 years ago
You need to reload Kedro variables by calling `%reload_kedro` in your notebook and re-run the code snippet
3 years ago
3 years ago
Par rapport aux bonnes pratiques kedro
------------------------------------------
Dans `actes-princiers/.gitignore`,
- les datas sont mises dans le dépôt git
- le datacatalog en local est placé dans le dépôt git
::
# ignore all local configuration
# conf/local/**
# ignore everything in the following folders
# data/**
3 years ago
## make a package for deployment
[package based deployment](https://docs.kedro.org/en/stable/deployment/single_machine.html#package-based)
If you prefer not to use containerisation, you can instead package your Kedro project using kedro package.
Run the following in your projects root directory:
kedro package
Kedro builds the package into the dist/ folder of your project, and creates a .whl file, which is a Python packaging format for binary distribution.