A simple Web application to explore Google Cloud Storage files with Google Cloud Dataprep by Trifacta.
This application creates all the necessary Dataprep objects (Dataset, Flow and Recipe) and generate URLs to the Dataprep interfaces.
This Web application use Python Flask Web framework and Dataprep REST API (https://api.trifacta.com/).
Python source code using Dataprep API can be found in create_dataprep_objects.py file.
-
Clone this repo
git clone https://github.com/victorcouste/dataprep-explorer.git
-
Install the requirements (in a virtual environment)
pip install -r requirements.txt
If need more details on Flask framework installation and configuration, see here
1/ Update 2 parameters in create_dataprep_objects.py Python file:
- DATAPREP_AUTH_TOKEN : The token to use API and to authenticate to Dataprep, it can be generated from Dataprep UI with a project's owner user
- DATAPREP_FOLDERID : The Dataprep folder flow ID where you want to generate flows
2/ Start the Flask Web app
In the Dataprep Explorer directory run:
export FLASK_APP=datataprep-explorer.py
flask run --port 5000
Now you must be able to go to http://127.0.0.1:5000/ and play with the application: