-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat/drop unstructured constraint #303
Conversation
|
||
@pytest.mark.parametrize("chunker_file", chunker_files, ids=[path.name for path in chunker_files]) | ||
@pytest.mark.parametrize("strategy", ["basic", "by_title"]) | ||
def test_chunker_basic(chunker_file: Path, strategy: str): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
any way this can be recreated in the original Unstructured repo or somewhere, if it's being pesky here due to nltk? It's always hard to give up tests
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good.
Description
Recently the unstructured dependency changed how NLTK data was being downloaded. This dependency is not a hard requirement for this repo and so the constaint on that dependency was lifted to be able to pull in the latest, but the use of it was isolated to only the unit test CI job to prevent collisions with underlying dependencies that are needed elsewhere, rather than have it as a shared dependency with everything else using pytest.