Stanford harvest fails #266

Closed
aaron-collier opened this issue Sep 27, 2022 · 0 comments
Labels: bug (Something isn't working)

Comments

aaron-collier commented Sep 27, 2022

The Stanford harvest is failing with the following error:

Traceback (most recent call last):
  File "/home/airflow/.local/lib/python3.9/site-packages/airflow/operators/python.py", line 171, in execute
    return_value = self.execute_callable()
  File "/home/airflow/.local/lib/python3.9/site-packages/airflow/operators/python.py", line 189, in execute_callable
    return self.python_callable(*self.op_args, **self.op_kwargs)
  File "/home/airflow/.local/lib/python3.9/site-packages/dlme_airflow/harvester/source_harvester.py", line 17, in data_source_harvester
    dataframe_to_file(collection)
  File "/home/airflow/.local/lib/python3.9/site-packages/dlme_airflow/utils/dataframe.py", line 36, in dataframe_to_file
    source_df = collection.catalog.read().drop_duplicates(
  File "/home/airflow/.local/lib/python3.9/site-packages/dlme_airflow/drivers/iiif_json.py", line 120, in read
    self._load_metadata()
  File "/home/airflow/.local/lib/python3.9/site-packages/intake/source/base.py", line 285, in _load_metadata
    self._schema = self._get_schema()
  File "/home/airflow/.local/lib/python3.9/site-packages/dlme_airflow/drivers/iiif_json.py", line 110, in _get_schema
    self._open_collection()
  File "/home/airflow/.local/lib/python3.9/site-packages/dlme_airflow/drivers/iiif_json.py", line 25, in _open_collection
    collection_result = requests.get(self.collection_url).json()
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/api.py", line 73, in get
    return request("get", url, params=params, **kwargs)
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/api.py", line 59, in request
    return session.request(method=method, url=url, **kwargs)
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/sessions.py", line 573, in request
    prep = self.prepare_request(req)
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/sessions.py", line 484, in prepare_request
    p.prepare(
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/models.py", line 368, in prepare
    self.prepare_url(url, params)
  File "/home/airflow/.local/lib/python3.9/site-packages/requests/models.py", line 439, in prepare_url
    raise MissingSchema(
requests.exceptions.MissingSchema: Invalid URL 'None': No scheme supplied. Perhaps you meant http://None?

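This may be a configuration issue? The traceback shows that `self.collection_url` is `None` when `_open_collection` in `iiif_json.py` calls `requests.get`.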
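
One way to surface this earlier would be to check the URL before it reaches `requests.get`, so the failure names the misconfigured source instead of raising `MissingSchema` deep inside `requests`. The following is only a sketch: `require_collection_url` and its `source_name` parameter are hypothetical and not part of `dlme_airflow`, and it assumes the driver only ever needs an absolute http(s) URL.

```python
from urllib.parse import urlparse


def require_collection_url(collection_url, source_name="unknown"):
    """Return collection_url unchanged if it is an absolute http(s) URL.

    Hypothetical helper (not part of dlme_airflow): raises ValueError with a
    message naming the source, rather than letting requests fail later.
    """
    if collection_url is None:
        raise ValueError(
            f"collection_url is not set for source {source_name!r}; "
            "check the catalog configuration"
        )

    parsed = urlparse(str(collection_url))
    # A usable URL needs both a scheme and a host; the string 'None' has neither.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(
            f"collection_url {collection_url!r} for source {source_name!r} "
            "is not an absolute http(s) URL"
        )
    return collection_url
```

If something like this were called at the start of `_open_collection`, a missing value would fail with a message pointing at the catalog entry rather than at `requests`.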

aaron-collier added the bug label Sep 27, 2022