Final Project for the "Data Science" course ("Computational Management of Data"), Silvio Peroni @ University of Bologna, MA "DHDK".
The goal of the project is to develop a software that enables one to process data stored in different formats and to upload them into two distinct databases to query these databases simultaneously according to predefined operations. The software must be accompanied by a document (i.e., a Jupyter notebook) describing the data to process (their main characteristics and possible issues) and how the software has been organised (name of the files, where have been defined the various Python classes, etc.).
Please use the impl.py
file to use the software in the best possible manner. It includes all the classes and functions required for the optimal functioning of the software.
The main
branch of this repository contains the new code for the project, while the old-stuff
branch contains the previous version (stemming from the first version). The current code has been rebuilt from scratch almost completely by the two contributors.
- Manu Srivastava -
graphData_Manager.py
,objcreator.py
,auxiliary.py
,GenericQueryP.py
, testing and debugging - Orsola Maria Borrini -
ModelClasses.py
,relationalData_Manager.py
,auxiliary.py
,GenericQueryP.py
, testing and debugging