datascience
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
gwen 5002efb631 add new parameters syntax (kedro 0.18) 3 years ago
actes-princiers add new parameters syntax (kedro 0.18) 3 years ago
.gitignore install kedro 3 years ago
README.md add xml pipeline 3 years ago
config.yml install kedro 3 years ago

README.md

Actes princiers -- refactoring datascience

Project Name

human readable name : Actes Princiers

The project name 'Actes Princiers' has been applied to:

  • The project title in datascience/actes-princiers/README.md
  • The folder created for your project in datascience/actes-princiers
  • The project's python package in datascience/actes-princiers/src/actes_princiers

A best-practice setup includes initialising git and creating a virtual environment before running 'pip install -r src/requirements.txt'

Getting started

  • Install the virtual environment : python3 -m venv .venv
  • Enable the virtual environment : source .venv/bin/activate
  • install kedro pip install kedro
  • Install the packages and libraries pip install -r src/requirements.txt

Then open a terminal in the actes-princiers's folder and launch jupyter : kedro jupyter notebook or start the ipython prompt : kedro ipython

Launching the pipeline

Open a terminal in the actes-princiers's folder and launch kedro

kedro run

or launch a specific node in the pipeline with:

kedro run --nodes=preprocess_html

or a search by tags with:

kedro run --tags=xsl

Visualizing the pipelines

kedro viz

Developper's rules and guidelines

Declare any dependencies in src/requirements.txt for pip installation.

To install them, run: pip install -r src/requirements.txt

tips

You need to reload Kedro variables by calling %reload_kedro in your notebook and re-run the code snippet

Par rapport aux bonnes pratiques kedro

Dans actes-princiers/.gitignore,

  • les datas sont mises dans le dépôt git
  • le datacatalog en local est placé dans le dépôt git

::

# ignore all local configuration
# conf/local/**
# ignore everything in the following folders
# data/**

## make a package for deployment

package based deployment

If you prefer not to use containerisation, you can instead package your Kedro project using kedro package.

Run the following in your projects root directory:

kedro package

Kedro builds the package into the dist/ folder of your project, and creates a .whl file, which is a Python packaging format for binary distribution.