Feature/support cosmos #145

Draft
wants to merge 59 commits into
base: main

Conversation

Contributor

@hmoazam hmoazam commented Jan 6, 2023

Updating the implementation to support Cosmos (data source v2) events.

wjohnson and others added 5 commits December 12, 2022 10:08
* Adding tests/environment folder to store datasets and bicep templates for test sources
* Added scripts to create Databricks jobs and a notebook to mount storage on Databricks
* Making test environments more consistent across notebooks (secret scope, environment variables)
* A handful of tests were modified to correct mistakes not caught in source-controlled versions
* Added documentation for testing environment including what secrets are used and what they look like
* Adding requirements.txt file for environment deployment
* Hive tests should run without additional intervention (i.e. use CREATE IF NOT EXISTS)
* Removing production env deployment
* Remove the wasbs with parameters test
* After updating all jobdefs to be ready for upload, the run-tests script needed to look at .name instead of .settings.name
  * Unfortunately, when calling the jobs API, it returns a .settings.name which must be used
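
The `.name` versus `.settings.name` difference described in the last two bullets can be absorbed by a small accessor. This is a minimal sketch, assuming job definitions are handled as plain dictionaries; `job_name` is a hypothetical helper for illustration and is not taken from the repository's run-tests script.

```python
def job_name(job: dict) -> str:
    """Return the job name from a local job definition or a jobs API response."""
    # Responses from the jobs API nest the name under "settings".
    settings = job.get("settings")
    if isinstance(settings, dict) and "name" in settings:
        return settings["name"]
    # Upload-ready job definitions keep the name at the top level.
    return job["name"]
```

With a helper like this, the same lookup could be used for both the local job definition files and whatever the API returns.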
…s deploy is going

and event hub authorization rules deploying at the same time.

Co-authored-by: hmoazam <hannamoazam@microsoft.com>
@hmoazam hmoazam marked this pull request as draft January 6, 2023 21:15
@hmoazam hmoazam force-pushed the feature/supportCosmos branch from cd3bcdf to 6cf20d5 Compare January 12, 2023 21:20
@hmoazam hmoazam temporarily deployed to Integration January 18, 2023 17:44 — with GitHub Actions Inactive
@hmoazam hmoazam mentioned this pull request Jan 28, 2023
Bumps [cryptography](https://github.com/pyca/cryptography) from 38.0.4 to 39.0.1.
- [Release notes](https://github.com/pyca/cryptography/releases)
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)
- [Commits](pyca/cryptography@38.0.4...39.0.1)

---
updated-dependencies:
- dependency-name: cryptography
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
@hmoazam hmoazam force-pushed the feature/supportCosmos branch from 894914d to 636b13e Compare February 17, 2023 16:07
hmoazam and others added 16 commits April 28, 2023 16:40
* Adding bicep file

* Added unit and integration tests for Postgres. Updated LIMITATIONS.md to describe support and limitations
* Implemented support for Kusto by updating mappings.

* Implemented Kusto support and added unit and integration tests
* checking for resource set types as its own method
* checking for blob or data lake file path or filesystem types as its own method
* refactored checking for adf relationships for blob/data lake file path types to its own method
* Behavior change for blob/data lake file path types that match: follow the insert-into-first-position behavior for valid entities rather than returning a list with only the entity
* Renamed ResourceSet_QualifiedNames_Match to QualifiedNames_Match_After_Normalizing and added trimming of the trailing slash for comparisons
    * Using QualifiedNames_Match_After_Normalizing when comparing, as the dfs vs blob comparison was preventing folder paths from matching if they're using wasbs but scanned for Azure Data Lake Gen2 (abfss)
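
The normalization idea behind QualifiedNames_Match_After_Normalizing can be illustrated roughly as follows. This is a sketch under stated assumptions, not the PR's implementation: the function names are hypothetical Python stand-ins, and the only rules applied are the two mentioned above (treating the dfs and blob endpoints as equivalent, and trimming the trailing slash).

```python
def normalize_qualified_name(name: str) -> str:
    """Normalize a storage qualified name for comparison (illustrative rules only)."""
    # Assumption: the dfs and blob endpoints of one account refer to the same data.
    name = name.replace(".dfs.core.windows.net", ".blob.core.windows.net")
    # Trim trailing slashes so a folder path matches with or without them.
    return name.rstrip("/")


def qualified_names_match_after_normalizing(left: str, right: str) -> bool:
    """Compare two qualified names after applying the same normalization to both."""
    return normalize_qualified_name(left) == normalize_qualified_name(right)
```

Under these assumptions, `https://acct.dfs.core.windows.net/container/folder/` and `https://acct.blob.core.windows.net/container/folder` compare as equal, while paths to different folders still do not.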
…h matching fully qualified names

By pulling out the QualifiedNames_Match_After_Normalizing check and running it first, it is clear that all search result entities must have a matching FQN.

This puts a stronger emphasis on the Build_Searchable_QualifiedName and its ability to standardize the name.
…r the Log method in favor of using the ILogger directly
…e from a simple entity to purviewEntityToBeUpdated to make it more clear what's going to happen to this entity and not confuse it with other references to the word 'entity'
…hat may be deleted and adding comments to walk through relationship attribute extraction and replacement
…simpleEntity, AddToTable, FindQualifiedNameInPurview, CleanUnusedCustomEntities, and Name_To_Search
wjohnson and others added 29 commits April 28, 2023 16:40
Python script to remove spaces and added as an artifact to Build and Release (Closes #183)
Python script to update the arm template with the OlToPurviewMapping in a stringified json format (Closes #184)

Updated the arm template with the standardized response from the new python script to avoid conflicts later on
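
A "stringified json format" means the mapping is stored as a JSON string inside another JSON document. The sketch below shows the general technique only; the mapping content, the `stringify_mapping` helper, and the surrounding fragment are hypothetical, and the compact separators are a choice made for this example rather than something confirmed by the PR.

```python
import json


def stringify_mapping(mapping) -> str:
    """Serialize a mapping to a compact JSON string with no spaces after separators."""
    return json.dumps(mapping, separators=(",", ":"))


# Hypothetical mapping content, for illustration only.
example_mapping = [{"name": "example"}]

setting_value = stringify_mapping(example_mapping)

# Embedding the string in another JSON document escapes the inner quotes.
template_fragment = json.dumps({"OlToPurviewMapping": setting_value})
```

Producing the string with one script every time keeps the formatting stable, which is what avoids spurious diffs and merge conflicts in the template.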
Mount points that have a source with a subdirectory were failing to include the subdirectories

Introduced a new namespaceBodyPartsJoinedBySlashFrom mappings variable to capture any parts from
a certain position and concatenate them with a forward slash.
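
The behavior described for namespaceBodyPartsJoinedBySlashFrom (take every part from a given position onward and join them with a forward slash) can be sketched as below. This is an illustration under assumptions: the function name and the idea that the namespace body is already split into a list of parts are hypothetical, not taken from the PR's code.

```python
from typing import Sequence


def body_parts_joined_by_slash_from(parts: Sequence[str], start: int) -> str:
    """Join all parts from index `start` onward with a forward slash."""
    # Slicing past the end yields an empty sequence, so the result is "".
    return "/".join(parts[start:])
```

For example, with parts `["container", "sub", "dir"]` and a start position of 1, the subdirectories `sub/dir` are preserved instead of being dropped.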
Rebased to include updates to newdeploymenttemp.json
…ate LIMITATIONS, add integration test notebooks, and add integration test for additional inputs + cosmos
…d result to True, as now considering COMPLETE events with only outputs
@hmoazam hmoazam force-pushed the feature/supportCosmos branch from 636b13e to edd5657 Compare April 28, 2023 13:40
2 participants